Full text

Turn on search term navigation

Copyright University of Latvia 2016

Abstract

One major drawback of using Translation Memories (TMs) in phrase-based Machine Translation (MT) is that only continuous phrases are considered. In contrast, syntax-based MT allows phrasal discontinuity by learning translation rules containing non-terminals. In this paper, we combine a TM with syntax-based MT via sparse features. These features are extracted during decoding based on translation rules and their corresponding patterns in the TM. We have tested this approach by carrying out experiments on real English-Spanish industrial data. Our results show that these TM features significantly improve syntax-based MT. Our final system yields improvements of up to +3.1 BLEU, +1.6 METEOR, and -2.6 TER when compared with a state-of-the-art phrase-based MT system.

Details

Title
Combining Translation Memories and Syntax-Based: SMT Experiments with Real Industrial Data
Author
Li, Liangyou; Escartín, Carla Parra; Liu, Qun
Pages
165-177
Publication year
2016
Publication date
2016
Publisher
University of Latvia
ISSN
22558942
e-ISSN
22558950
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1812275912
Copyright
Copyright University of Latvia 2016