Machine Translation of Low-Resource Indo-European Languages

Type de ressource
Conference Paper
Auteurs/contributeurs
Title
Machine Translation of Low-Resource Indo-European Languages
Abstract
In this work, we investigate methods for the challenging task of translating between low- resource language pairs that exhibit some level of similarity. In particular, we consider the utility of transfer learning for translating between several Indo-European low-resource languages from the Germanic and Romance language families. In particular, we build two main classes of transfer-based systems to study how relatedness can benefit the translation performance. The primary system fine-tunes a model pre-trained on a related language pair and the contrastive system fine-tunes one pre-trained on an unrelated language pair. Our experiments show that although relatedness is not necessary for transfer learning to work, it does benefit model performance.
Date
2021-11
Proceedings Title
Proceedings of the Sixth Conference on Machine Translation
Conference Name
WMT 2021
Place
Online
Publisher
Association for Computational Linguistics
Pages
347–353
Accessed
02/08/2024 13:55
Library Catalog
ACLWeb
Référence
Chen, W.-R., & Abdul-Mageed, M. (2021). Machine Translation of Low-Resource Indo-European Languages. In L. Barrault, O. Bojar, F. Bougares, R. Chatterjee, M. R. Costa-jussa, C. Federmann, M. Fishel, A. Fraser, M. Freitag, Y. Graham, R. Grundkiewicz, P. Guzman, B. Haddow, M. Huck, A. J. Yepes, P. Koehn, T. Kocmi, A. Martins, M. Morishita, & C. Monz (Eds.), Proceedings of the Sixth Conference on Machine Translation (pp. 347–353). Association for Computational Linguistics. https://aclanthology.org/2021.wmt-1.41