Content area

Abstract

Conference Title: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Conference Start Date: 2025 April 6

Conference End Date: 2025 April 11

Conference Location: Hyderabad, India

Automatic Speech Recognition (ASR) performance for low-resource languages is still far behind that of higher-resource languages such as English, due to a lack of sufficient labeled data. State-of-the-art methods deploy self-supervised transfer learning where a model pre-trained on large amounts of data is fine-tuned using little labeled data in a target low-resource language. In this paper, we present and examine a method for fine-tuning an SSL-based model in order to improve the performance for Frisian and its regional dialects (Clay Frisian, Wood Frisian, and South Frisian). We show that Frisian ASR performance can be improved by using multilingual (Frisian, Dutch, English and German) fine-tuning data and an auxiliary language identification task. In addition, our findings show that performance on dialectal speech suffers substantially, and, importantly, that this effect is moderated by the elicitation approach used to collect the dialectal data. Our findings also particularly suggest that relying solely on standard language data for ASR evaluation may underestimate real-world performance, particularly in languages with substantial dialectal variation.

Details

Title
Enhancing Standard and Dialectal Frisian ASR: Multilingual Fine-tuning and Language Identification for Improved Low-resource Performance
Author
Amooie, Reihaneh 1 ; De Vries, Wietse 1 ; Yun Hao 1 ; Dijkstra, Jelske 2 ; Coler, Matt 3 ; Wieling, Martijn 1 

 Center for Language and Cognition University of Groningen,Groningen,The Netherlands 
 Mercator European Research Centre Fryske Akademy,Friesland,The Netherlands 
 Speech Technology Lab University of Groningen,Friesland,The Netherlands 
Pages
1-5
Number of pages
5
Publication year
2025
Publication date
2025
Publisher
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Place of publication
Piscataway
Country of publication
United States
Source type
Conference Paper
Language of publication
English
Document type
Conference Proceedings
Publication history
 
 
Online publication date
2025-03-07
Publication history
 
 
   First posting date
07 Mar 2025
ProQuest document ID
3268870229
Document URL
https://www.proquest.com/conference-papers-proceedings/enhancing-standard-dialectal-frisian-asr/docview/3268870229/se-2?accountid=208611
Copyright
Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2025
Last updated
2025-11-06
Database
ProQuest One Academic