Full Text

Turn on search term navigation

© 2024. This work is published under https://creativecommons.org/licenses/by-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

This paper presents newly created Latvian speech corpora aimed at advancing both linguistic research and speech technology development. Although multilingual models like XLSR R Whisper have reduced the amount of data needed for fine-tuning speech recognition models even for less-resourced languages, diverse and curated speech corpora remain essential. We provide an overview of several recent Latvian speech corpora, emphasizing their importance for both general-purpose and domain-specific use cases and comparing their design with previously created speech datasets for Latvian. We also introduce a common platform for analysing open-access Latvian speech corpora, and discuss initial evaluation and integration of speech recognition models fine-tuned on the new datasets for practical speech transcription and post-editing applications in research and industry. Finally, we present a competitive open-source speech recognition model for Latvian.

Details

Title
Recent Latvian Speech Corpora for Linguistic Research and Technology Development
Author
Auzina, Ilze 1 ; Gruzitis, Normunds 1 ; Dargis, Roberts 1 ; Rabante-busa, Guna 1 ; Gosko, Didzis 1 ; Vempers, Janis; Kivkucans, Raivis; Znotins, Artürs

 IMCS, University of Latvia, Raina blvd. 29, Riga, Latvia 
Pages
646-658
Publication year
2024
Publication date
2024
Publisher
University of Latvia
ISSN
22558942
e-ISSN
22558950
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3168800639
Copyright
© 2024. This work is published under https://creativecommons.org/licenses/by-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.