Camacho Collados, Jose ORCID: https://orcid.org/0000-0003-1618-7239, Pilehvar, Mohammad Taher, Collier, Nigel and Navigli, Roberto 2017. SemEval-2017 Task 2: Multilingual and cross-lingual semantic word similarity. Presented at: 11th International Workshop on Semantic Evaluations (SemEval-2017), Vancouver, Canada, 3rd-4th August 2017. Proceedings of the 11th International Workshop on Semantic Evaluations (SemEval-2017). Stroudsburg, PA: The Association for Computational Linguistics, pp. 15-26. 10.18653/v1/S17-2002
Abstract
This paper introduces a new task on Multilingual and Cross-lingual Semantic Word Similarity which measures the semantic similarity of word pairs within and across five languages: English, Farsi, German, Italian and Spanish. High-quality datasets were manually curated for the five languages with high inter-annotator agreements (consistently in the 0.9 ballpark). These were used for semi-automatic construction of ten cross-lingual datasets. 17 teams participated in the task, submitting 24 systems in subtask 1 and 14 systems in subtask 2. Results show that systems that combine statistical knowledge from text corpora, in the form of word embeddings, and external knowledge from lexical resources are best performers in both subtasks. More information can be found on the task website: http://alt.qcri.org/semeval2017/task2.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Computer Science & Informatics |
Publisher: | The Association for Computational Linguistics |
ISBN: | 978-1-945626-55-5 |
Last Modified: | 24 Oct 2022 07:04 |
URI: | https://orca.cardiff.ac.uk/id/eprint/114041 |