Content area

Abstract

The proposed hybrid AI-driven translation system’s architecture integrates phrase-based machine translation (PBMT) and neural machine translation (NMT) within a recursive learning framework. It provides a blueprint for institutions that digitize, translate, or teach under-resourced languages. Due to its ability to adapt to multilingual inputs and preserve cultural expressions, it is highly suitable for applications in education, community media, cultural preservation, and government-supported language revitalization initiatives.

This study presents a hybrid artificial intelligence model designed to enhance translation quality for low-resource languages, specifically targeting the Hakka language. The proposed model integrates phrase-based machine translation (PBMT) and neural machine translation (NMT) within a recursive learning framework. The methodology consists of three key stages: (1) initial translation using PBMT, where Hakka corpus data is structured into a parallel dataset; (2) NMT training with Transformers, leveraging the generated parallel corpus to train deep learning models; and (3) recursive translation refinement, where iterative translations further enhance model accuracy by expanding the training dataset. This study employs preprocessing techniques to clean and optimize the dataset, reducing noise and improving sentence segmentation. A BLEU score evaluation is conducted to compare the effectiveness of PBMT and NMT across various corpus sizes, demonstrating that while PBMT performs well with limited data, the Transformer-based NMT achieves superior results as training data increases. The findings highlight the advantages of a hybrid approach in overcoming data scarcity challenges for minority languages. This research contributes to machine translation methodologies by proposing a scalable framework for improving linguistic accessibility in under-resourced languages.

Details

1009240
Business indexing term
Title
Integrating Hybrid AI Approaches for Enhanced Translation in Minority Languages
Author
Chen-Chi, Chang 1   VIAFID ORCID Logo  ; Yu-Hsun, Lin 2   VIAFID ORCID Logo  ; Yun-Hsiang, Hsu 1 ; I-Hsin, Fan 3 

 Department of Culture Creativity and Digital Marketing, College of Hakka Studies, National United University, Miaoli 36063, Taiwan; [email protected] (C.-C.C.); [email protected] (Y.-H.H.) 
 Department of Business and Management, College of Management and Design, Ming Chi University of Technology, New Taipei 243303, Taiwan 
 Department of Cultural Tourism, College of Hakka Studies, National United University, Miaoli 36063, Taiwan; [email protected] 
Publication title
Volume
15
Issue
16
First page
9039
Number of pages
17
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-08-15
Milestone dates
2025-07-08 (Received); 2025-08-14 (Accepted)
Publication history
 
 
   First posting date
15 Aug 2025
ProQuest document ID
3243982299
Document URL
https://www.proquest.com/scholarly-journals/integrating-hybrid-ai-approaches-enhanced/docview/3243982299/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-11-07
Database
2 databases
  • Coronavirus Research Database
  • ProQuest One Academic