Automatic Translation Between Kreol Morisien and English Using the Marian Machine Translation Framework

Article


Boodeea, Z. B. J., Pudaruth, S., Chooramun, N. and Sukhoo, A. 2025. Automatic Translation Between Kreol Morisien and English Using the Marian Machine Translation Framework. Informatics. 12 (1), p. Art. 16. https://doi.org/10.3390/informatics12010016
Authors: Boodeea, Z. B. J., Pudaruth, S., Chooramun, N. and Sukhoo, A.
Abstract

Kreol Morisien is a vibrant and expressive language that reflects the multicultural heritage of Mauritius. Several Kreol languages exist: Kreol Morisien is spoken in Mauritius, whereas Kreol Rodrige is spoken only in Rodrigues, and the two are distinct languages. With only about 1.5 million speakers worldwide, Kreol Morisien is an under-resourced language. Initially, Kreol Morisien lacked a formalised writing system, and many people used different spellings for the same words. The first step towards a standardised written form came with the publication of the Kreol Morisien orthography in 2011 and the Kreol Morisien grammar in 2012 by the Kreol Morisien Academy. Kreol Morisien gained national standing in 2012, when it was introduced in educational institutions. This was a major breakthrough towards its recognition as a national language on the same level as English, French, and the oriental languages. By giving Kreol Morisien speakers a means to connect with others, a translation system will help to preserve and strengthen the identity of the language and its speakers in an increasingly globalised world. The aim of this paper is to develop a translation system for Kreol Morisien and English. To this end, a dataset of 50,000 parallel Kreol Morisien and English sentences was created: 48,000 sentence pairs were used to train the models, 1000 sentences were used for evaluation, and another 1000 sentences were used for testing. Several machine translation systems, namely statistical machine translation, open-source neural machine translation, a Transformer model with attention mechanism, and Marian machine translation, were trained and evaluated. Our best model, using MarianMT, achieved a BLEU score of 0.62 for translation from English to Kreol Morisien and a BLEU score of 0.58 for translation from Kreol Morisien to English. To our knowledge, these are the highest BLEU scores reported in the literature for this language pair. A high-quality translation tool for Kreol Morisien will facilitate its integration into digital platforms. This will make previously inaccessible knowledge more accessible, as information can now be translated into the mother tongue of most Mauritians with reasonable accuracy.
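The 48,000 / 1000 / 1000 partition described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `split_parallel_corpus`, the fixed random seed, the shuffling step, and the synthetic placeholder sentences are all assumptions made for illustration, since the abstract does not say how the pairs were assigned to each subset.

```python
import random


def split_parallel_corpus(pairs, n_train=48_000, n_eval=1_000, n_test=1_000, seed=0):
    """Shuffle sentence pairs and cut them into train / evaluation / test subsets.

    Hypothetical helper: shuffling with a fixed seed is an assumption,
    not something stated in the abstract.
    """
    if n_train + n_eval + n_test > len(pairs):
        raise ValueError("requested split sizes exceed the corpus size")
    shuffled = list(pairs)  # copy so the caller's list is left untouched
    random.Random(seed).shuffle(shuffled)
    train = shuffled[:n_train]
    evaluation = shuffled[n_train:n_train + n_eval]
    test = shuffled[n_train + n_eval:n_train + n_eval + n_test]
    return train, evaluation, test


# Placeholder corpus: synthetic strings standing in for real sentence pairs.
corpus = [(f"kreol sentence {i}", f"english sentence {i}") for i in range(50_000)]
train, evaluation, test = split_parallel_corpus(corpus)
print(len(train), len(evaluation), len(test))
```

Keeping the three subsets disjoint matters because BLEU measured on sentences the model saw during training would overstate translation quality.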

Journal: Informatics
Journal citation: 12 (1), p. Art. 16
ISSN: 2227-9709
Year: 2025
Publisher: MDPI
Publisher's version
License: CC BY 4.0
File access level: Anyone
Digital Object Identifier (DOI): https://doi.org/10.3390/informatics12010016
Publication dates
Online: 10 Feb 2025
Publication process dates
Deposited: 04 Apr 2025
Copyright holder: © 2025 The Author
Permalink: https://repository.uel.ac.uk/item/8z4v9

Download files


Publisher's version
informatics-12-00016.pdf
License: CC BY 4.0
File access level: Anyone

