Improving Luxembourgish Speech Recognition with Cross-Lingual Speech Representations
Jan 1, 2023·,
·
1 min read
Nguyen, L. M.
Nayak, S.

Matt Coler
Abstract
Luxembourgish is a West Germanic language spoken by roughly 390,000 people, mainly in Luxembourg. It is one of Europe’s under-described and under-resourced languages, not extensively investigated in the context of speech recognition. We explore the self-supervised multilingual learning of Luxembourgish speech representations for the speech recognition downstream task. We show that learning cross-lingual representations is essential for low-resourced languages such as Luxembourgish. Learning cross-lingual representations and rescoring the output transcriptions with language modelling while using only 4 hours of labelled speech achieves a word error rate of 15.1% and improves our Transfer Learning baseline model relatively by 33.1% and absolutely by 7.5%. Increasing the amount of labelled speech to 14 hours yields a significant performance gain resulting in a 9.3% word error rate.
Type
Publication
2022 IEEE Spoken Language Technology Workshop (SLT)
This paper presents approaches for improving automatic speech recognition (ASR) for Luxembourgish, a low-resource West Germanic language. We demonstrate that leveraging cross-lingual speech representations significantly enhances ASR performance, achieving a 33.1% relative improvement over baseline models with just 4 hours of labeled speech data.
The research highlights the effectiveness of self-supervised learning approaches for under-resourced languages and provides both models and datasets via Hugging Face for further research in this area.
@inproceedings{nayak2023improving,
title={Improving Luxembourgish Speech Recognition with Cross-Lingual Speech Representations},
author={Nayak, Shekhar and Coler, Matt and others},
booktitle={2022 IEEE Spoken Language Technology Workshop (SLT)},
pages={792--797},
year={2023},
organization={IEEE}
}