Content area

Abstract

In a bilingual and linguistically diverse country like India, where a significant portion of the population is fluent in multiple languages, the conventional bilingual Transformer neural network architecture faces challenges in accurately translating conversations that seamlessly switch between different languages. In this paper, we propose a multilingual automatic speech recognition system that can understand all intra-sentential terms and transcribe human speech into written text in English or any other language without making any grammatical mistakes. As a result, this method of translating Tanglish to Tamil or English works well. It is finished with the help of generative AI. Here, we use a generative pre-trained transformer model, which learns to predict the subsequent word in a language during the pre-training stage in order to get an understanding of language structure and semantics. The algorithm used here is long short-term memory (LSTM) plays a crucial role in speech to text by capturing temporal dependencies maintaining context and generating accurate transcriptions from audio inputs. We experimented on 50 Tamil–English agriculturally based data and found that the generative pre-trained transformer model can achieve an 84.37% relative accuracy rate even for short sentences and 73.98% relative accuracy rate for lengthy sentences in bilingual automatic speech recognition (ASR) performance.

Details

1009240
Business indexing term
Title
Generative AI-powered multilingual ASR for seamless language-mixing transcriptions
Author
Dash, Puspita 1   VIAFID ORCID Logo  ; Babu, Sruthi 1 ; Singaravel, Logeswari 1 ; Balasubramanian, Devadarshini 1 

 Sri Manakula Vinayagar Engineering College, Department of Information Technology, Madagadipet, India 
Volume
12
Issue
1
Pages
42
Publication year
2025
Publication date
Dec 2025
Publisher
Springer Nature B.V.
Place of publication
Cairo
Country of publication
Netherlands
e-ISSN
23147172
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-07-14
Milestone dates
2025-04-11 (Registration); 2024-08-03 (Received); 2025-04-10 (Accepted)
Publication history
 
 
   First posting date
14 Jul 2025
ProQuest document ID
3230018464
Document URL
https://www.proquest.com/scholarly-journals/generative-ai-powered-multilingual-asr-seamless/docview/3230018464/se-2?accountid=208611
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-07-15
Database
2 databases
  • Coronavirus Research Database
  • ProQuest One Academic