Content area
This study presents the first systematic evaluation of in-context learning for Tarifit machine translation, a low-resource Amazigh language spoken by 5 million people in Morocco and Europe. We assess three large language models (GPT-4, Claude-3.5, PaLM-2) across Tarifit–Arabic, Tarifit–French, and Tarifit–English translation using 1000 sentence pairs and 5-fold cross-validation. Results show that 8-shot similarity-based demonstration selection achieves optimal performance. GPT-4 achieved 20.2 BLEU for Tarifit–Arabic, 14.8 for Tarifit–French, and 10.9 for Tarifit–English. Linguistic proximity significantly impacts translation quality, with Tarifit–Arabic substantially outperforming other language pairs by 8.4 BLEU points due to shared vocabulary and morphological patterns. Error analysis reveals systematic issues with morphological complexity (42% of errors) and cultural terminology preservation (18% of errors). This work establishes baseline benchmarks for Tarifit translation and demonstrates the viability of in-context learning for morphologically complex low-resource languages, contributing to linguistic equity in AI systems.
Details
Language;
French language;
Dictionaries;
English language;
Datasets;
Errors;
Morphology;
Machine translation;
Terminology;
Endangered languages;
Morphological analysis;
Vocabulary;
Business metrics;
Grammar;
Linguistics;
Learning;
Large language models;
Arabic language;
Voice recognition;
Proximity;
Languages;
Preservation;
Berber languages;
Natural language processing;
Multilingualism;
Complexity;
Context;
Morphological complexity;
Language modeling;
Translation;
Cultural heritage
