Automatic Translation Between Kreol Morisien and English Using the Marian Machine Translation Framework
Article
Boodeea, Z. B. J., Pudaruth, S., Chooramun, N. and Sukhoo, A. 2025. Automatic Translation Between Kreol Morisien and English Using the Marian Machine Translation Framework. Informatics. 12 (1), p. Art. 16. https://doi.org/10.3390/informatics12010016
Authors | Boodeea, Z. B. J., Pudaruth, S., Chooramun, N. and Sukhoo, A. |
---|---|
Abstract | Kreol Morisien is a vibrant and expressive language that reflects the multicultural heritage of Mauritius. There are different versions of Kreol languages. While Kreol Morisien is spoken in Mauritius, Kreol Rodrige is spoken only in Rodrigues, and they are distinct languages. Being spoken by only about 1.5 million speakers in the world, Kreol Morisien falls in the category of under-resourced languages. Initially, Kreol Morisien lacked a formalised writing system, with many people using different spellings for the same words. The first step towards standardisation of writing Kreol Morisien was after the publication of the Kreol Morisien orthography in 2011 and Kreol Morisien grammar in 2012 by the Kreol Morisien Academy. Kreol Morisien obtained a national position in the year 2012 when it was introduced in educational organisations. This was a major breakthrough for Kreol Morisien to be recognised as a national language on the same level as English, French, and other oriental languages. By providing a means for Kreol Morisien speakers to connect with others, a translation system will help to preserve and strengthen the identity of the language and its speakers in an increasingly globalized world. The aim of this paper is to develop a translation system for Kreol Morisien and English. Thus, a dataset consisting of 50,000 parallel Kreol Morisien and English sentences was created, where 48,000 sentence pairs were used to train the models, while 1000 sentences were used for evaluation and another 1000 sentences were used for testing. Several machine translation systems such as statistical machine translation, open-source neural machine translation, a Transformer model with attention mechanism, and Marian machine translation are trained and evaluated. Our best model, using MarianMT, achieved a BLEU score of 0.62 for the translation of English to Kreol Morisien and a BLEU score of 0.58 for the translation of Kreol Morisien into English. To our knowledge, these are the highest BLEU scores that are available in the literature for this language pair. A high-quality translation tool for Kreol Morisien will facilitate its integration into digital platforms. This will make previously inaccessible knowledge more accessible, as the information can now be translated into the mother tongue of most Mauritians with reasonable accuracy. |
Journal | Informatics |
Journal citation | 12 (1), p. Art. 16 |
ISSN | 2227-9709 |
Year | 2025 |
Publisher | MDPI |
Publisher's version | License File Access Level Anyone |
Digital Object Identifier (DOI) | https://doi.org/10.3390/informatics12010016 |
Publication dates | |
Online | 10 Feb 2025 |
Publication process dates | |
Deposited | 04 Apr 2025 |
Copyright holder | © 2025 The Author |
https://repository.uel.ac.uk/item/8z4v9
Download files
45
total views18
total downloads17
views this month3
downloads this month