Alva Manchego, Fernando and Shardlow, Matthew 2022. Towards readability-controlled machine translation of COVID-19 texts. Presented at: 23rd Annual Conference of the European Association for Machine Translation, Ghent, Belgium, 1-3 June 2022. Published in: Moniz, Helena, Macken, Lieve, Rufener, Andrew, Barrault, Loic, Costa-Jussa, Marta R., Declercq, Christophe, Koponen, Maarit, Kemp, Ellie, Pilos, Spyridon, Forcada, Mikel F., Scarton, Carolina, Van den Bogaert, Joachim, Daems, Joke, Tezcan, Arda, Vanroy, Bram and Fonteyne, Margot eds. Proceedings of the 23rd Annual Conference of the European Association for Machine Translation. European Association for Machine Translation, 287–288.
Official URL: https://aclanthology.org/2022.eamt-1.33
Abstract
This project investigates the capabilities of Machine Translation models for generating translations at varying levels of readability, focusing on texts related to COVID-19. Whilst it is possible to automatically translate this information, the resulting text may contain specialised terminology, or may be written in a style that is difficult for lay readers to understand. So far, we have collected a new dataset with manual simplifications for English and Spanish sentences in the TICO-19 dataset, as well as implemented baseline pipelines combining Machine Translation and Text Simplification models.
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Status: | Published |
| Schools: | Computer Science & Informatics |
| Publisher: | European Association for Machine Translation |
| Last Modified: | 04 Oct 2023 13:45 |
| URI: | https://orca.cardiff.ac.uk/id/eprint/161903 |