Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Towards readability-controlled machine translation of COVID-19 texts

Alva Manchego, Fernando and Shardlow, Matthew 2022. Towards readability-controlled machine translation of COVID-19 texts. Presented at: 23rd Annual Conference of the European Association for Machine Translation, Ghent, Belgium, 1-3 June 2022. Published in: Moniz, Helena, Macken, Lieve, Rufener, Andrew, Barrault, Loic, Costa-Jussa, Marta R., Declercq, Christophe, Koponen, Maarit, Kemp, Ellie, Pilos, Spyridon, Forcada, Mikel F., Scarton, Carolina, Van den Bogaert, Joachim, Daems, Joke, Tezcan, Arda, Vanroy, Bram and Fonteyne, Margot eds. Proceedings of the 23rd Annual Conference of the European Association for Machine Translation. European Association for Machine Translation, 287–288.

Full text not available from this repository.

Abstract

This project investigates the capabilities of Machine Translation models for generating translations at varying levels of readability, focusing on texts related to COVID-19. Whilst it is possible to automatically translate this information, the resulting text may contain specialised terminology, or may be written in a style that is difficult for lay readers to understand. So far, we have collected a new dataset with manual simplifications for English and Spanish sentences in the TICO-19 dataset, as well as implemented baseline pipelines combining Machine Translation and Text Simplification models.

Item Type: Conference or Workshop Item (Paper)
Status: Published
Schools: Computer Science & Informatics
Publisher: European Association for Machine Translation
Last Modified: 04 Oct 2023 13:45
URI: https://orca.cardiff.ac.uk/id/eprint/161903

Actions (repository staff only)

Edit Item Edit Item