Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A RECURSIVE SEARCH METHOD FOR LYRICS ALIGNMENT
Centre for Digital Music, Queen Mary University of London, United Kingdom.
Royal College of Music in Stockholm, Department of Folk Music. Kungliga Musikhögskolan, Stockholm.ORCID iD: 0000-0002-4756-1441
Centre for Digital Music, Queen Mary University of London, United Kingdom.
2020 (English)In: https://www.music-ir.org/mirex/wiki/2020:MIREX2020_Results, 2020Conference paper, Published paper (Refereed)
Abstract [en]

Audio-to-lyrics transcription and alignment requires strong acoustic and language models that are trained on in-domain data and a well-adapted pronunciation model for singing. Even in the presence of such models, the length of audio segments for decoding remains a challenge. In this year’s MIREX submission, we present a recursive search method that splits the audio with respect to anchor- ing words for performing alignment on shorter audio seg- ments. The recursive is applied through gradually restrict- ing the language model and search space after each search iteration. We apply a final pass of forced alignment on the segmented audio to obtain timings for every word in the input song lyrics. According to the initial experiments, our system is robust to various musical genre while being executable on local machines with low memory and com- putational resources.

Place, publisher, year, edition, pages
2020.
Keywords [en]
automatic lyrics transcription, automatic speech recognition, audio-to-lyrics alignment, music information retrieval
National Category
Signal Processing Musicology
Identifiers
URN: urn:nbn:se:kmh:diva-3741OAI: oai:DiVA.org:kmh-3741DiVA, id: diva2:1502274
Conference
ISMIR 2020 https://www.music-ir.org/mirex/wiki/2020:MIREX2020_Results
Available from: 2020-11-19 Created: 2020-11-19 Last updated: 2020-12-16Bibliographically approved

Open Access in DiVA

No full text in DiVA

Search in DiVA

By author/editor
Ahlbäck, Sven
By organisation
Department of Folk Music
Signal ProcessingMusicology

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 576 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf