Improving Luxembourgish Speech Recognition with Cross-Lingual Speech Representations

Jan 1, 2023·

Nguyen, L. M.

Nayak, S.

Matt Coler

· 1 min read

PDF Code Project Open Access Cite

Abstract

Luxembourgish is a West Germanic language spoken by roughly 390,000 people, mainly in Luxembourg. It is one of Europe’s under-described and under-resourced languages, not extensively investigated in the context of speech recognition. We explore the self-supervised multilingual learning of Luxembourgish speech representations for the speech recognition downstream task. We show that learning cross-lingual representations is essential for low-resourced languages such as Luxembourgish. Learning cross-lingual representations and rescoring the output transcriptions with language modelling while using only 4 hours of labelled speech achieves a word error rate of 15.1% and improves our Transfer Learning baseline model relatively by 33.1% and absolutely by 7.5%. Increasing the amount of labelled speech to 14 hours yields a significant performance gain resulting in a 9.3% word error rate.

Type

Conference Paper

Publication

2022 IEEE Spoken Language Technology Workshop (SLT)

This paper presents approaches for improving automatic speech recognition (ASR) for Luxembourgish, a low-resource West Germanic language. We demonstrate that leveraging cross-lingual speech representations significantly enhances ASR performance, achieving a 33.1% relative improvement over baseline models with just 4 hours of labeled speech data.

The research highlights the effectiveness of self-supervised learning approaches for under-resourced languages and provides both models and datasets via Hugging Face for further research in this area.

Last updated on Jan 1, 2023