Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Syntactic simplification for machine translation

Camacho Collados, Jose ORCID: https://orcid.org/0000-0003-1618-7239 2013. Syntactic simplification for machine translation. BULAG: Bulletin de Linguistique Appliquée et Générale 38

Full text not available from this repository.

Abstract

This paper presents a new rule-based Syntactic Simplification system for the Spanish language. The system basically consists of splitting complex sentences into several simple ones preserving its original meaning. It is envisioned to be a preprocessing tool for other Natural Language Processing applications such as Text Summarization, Information Extraction, Parsing and Machine Translation. According to the evaluation, the application of the syntactic rules proposed clearly improves Machine Translation from Spanish to far target languages such as Korean and Chinese. A parallel corpus of Spanish original and simplified texts has been manually built and validated by native Spanish speakers for this research. In addition, an experiment was carried out among two groups of Spanish learners comparing their reading comprehension of original and simplified texts. Their answers show a null correlation between simplification and text comprehension. The automatic Syntactic Simplification system has also been intrinsically evaluated obtaining promising results.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Computer Science & Informatics
Uncontrolled Keywords: Text Simplification; Syntactic Simplification; Natural Language Processing; Spanish; Machine Translation.
ISSN: 0758-6787
Last Modified: 23 Oct 2022 14:13
URI: https://orca.cardiff.ac.uk/id/eprint/113068

Actions (repository staff only)

Edit Item Edit Item