Camacho Collados, Jose ![]() |
Official URL: http://dx.doi.org/10.18653/v1/W16-2508
Abstract
We present a new framework for an intrinsic evaluation of word vector representations based on the outlier detection task. This task is intended to test the capability of vector space models to create semantic clusters in the space. We carried out a pilot study building a gold standard dataset and the results revealed two important features: human performance on the task is extremely high compared to the standard word similarity task, and state-of-the-art word embedding models, whose current shortcomings were highlighted as part of the evaluation, still have considerable room for improvement.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Computer Science & Informatics |
Publisher: | The Association for Computational Linguistics |
ISBN: | 978-1-945626-14-2 |
Last Modified: | 24 Oct 2022 07:04 |
URI: | https://orca.cardiff.ac.uk/id/eprint/114031 |
Actions (repository staff only)
![]() |
Edit Item |