Charles Translator: A Machine Translation System between Ukrainian and Czech
Authors:
Martin Popel,
Lucie Poláková,
Michal Novák,
Jindřich Helcl,
Jindřich Libovický,
Pavel Straňák,
Tomáš Krabač,
Jaroslava Hlaváčová,
Mariia Anisimova,
Tereza Chlaňová
Abstract:
We present Charles Translator, a machine translation system between Ukrainian and Czech, developed as part of a society-wide effort to mitigate the impact of the Russian-Ukrainian war on individuals and society. The system was developed in the spring of 2022 with the help of many language data providers in order to quickly meet the demand for such a service, which was not available at the time in the required quality. The translator was later implemented as an online web interface and as an Android app with speech input, both featuring Cyrillic-Latin script transliteration. Unlike other available systems, which use English as a pivot, the system translates directly and thus takes advantage of the typological similarity of the two languages. It uses the block back-translation method, which allows for efficient use of monolingual training data. The paper describes the development process, including data collection and implementation, presents an evaluation, mentions several use cases, and outlines possibilities for the further development of the system for educational purposes.
Submitted 10 April, 2024;
originally announced April 2024.
A Test Suite and Manual Evaluation of Document-Level NMT at WMT19
Authors:
Kateřina Rysová,
Magdaléna Rysová,
Tomáš Musil,
Lucie Poláková,
Ondřej Bojar
Abstract:
As the quality of machine translation rises and neural machine translation (NMT) moves from sentence-level to document-level translation, it is becoming increasingly difficult to evaluate the output of translation systems.
We provide a test suite for WMT19 aimed at assessing discourse phenomena of MT systems participating in the News Translation Task. We have manually checked the outputs and identified types of translation errors that are relevant to document-level translation.
Submitted 8 August, 2019;
originally announced August 2019.