An Effective Hybrid Approach Based on Machine Learning Techniques for Auto-Translation: Japanese to English
Sharif, S., Auwal, B., Maltby, H. and Al-Bayatti, A. 2021. An Effective Hybrid Approach Based on Machine Learning Techniques for Auto-Translation: Japanese to English. 3ICT 2021: International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies. Bahrain, University of Bahrain 29 - 30 Sep 2021 IEEE. https://doi.org/10.1109/3ICT53449.2021.9581629
|Sharif, S., Auwal, B., Maltby, H. and Al-Bayatti, A.
In recent years machine learning techniques have been able to perform tasks previously thought impossible or impractical such as image classification and natural language translation, as such this allows for the automation of tasks previously thought only possible by humans. This research work aims to test a naïve post processing grammar correction method using a Long Short Term Memory neural network to rearrange translated sentences from Subject Object Verb to Subject Verb Object. Here machine learning based techniques are used to successfully translate works in an automated fashion rather than manually and post processing translations to increase sentiment and grammar accuracy. The implementation of the proposed methodology uses a bounding box object detection model, optical character recognition model and a natural language processing model to fully translate manga without human intervention. The grammar correction experimentation tries to fix a common problem when machines translate between two natural languages that use different ordering, in this case from Japanese Subject Object Verb to English Subject Verb Object. For this experimentation 2 sequence to sequence Long Short Term Memory neural networks were developed, a character level and a word level model using word embedding to reorder English sentences from Subject Object Verb to Subject Verb Object. The results showed that the methodology works in practice and can automate the translation process successfully.
|Optical Character Recognition (OCR); Artificial Intelligence (AI); Natural Language Tool Kit (NLTK),; Long Short Term Memory (LSTM); Bilingual Evaluation Understudy (BLEU)
|3ICT 2021: International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies
|Accepted author manuscript
File Access Level
|13 Nov 2021
|Publication process dates
|02 Jul 2021
|13 Aug 2021
|2021 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT)
|Digital Object Identifier (DOI)
|© 2021 IEEE
|Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
4views this month
1downloads this month