Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Idiom–based features in sentiment analysis: cutting the Gordian knot

Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885, Williams, Lowri and Buerki, Andreas ORCID: https://orcid.org/0000-0003-2151-3246 2020. Idiom–based features in sentiment analysis: cutting the Gordian knot. IEEE Transactions on Affective Computing 11 (2) 10.1109/TAFFC.2017.2777842

[thumbnail of 08226982.pdf]
Preview
PDF - Published Version
Available under License Creative Commons Attribution.

Download (2MB) | Preview

Abstract

In this paper we describe an automated approach to enriching sentiment analysis with idiom–based features. Specifically, we automated the development of the supporting lexico–semantic resources, which include (1) a set of rules used to identify idioms in text and (2) their sentiment polarity classifications. Our method demonstrates how idiom dictionaries, which are readily available general pedagogical resources, can be adapted into purpose–specific computational resources automatically. These resources were then used to replace the manually engineered counterparts in an existing system, which originally outperformed the baseline sentiment analysis approaches by 17 percentage points on average, taking the F–measure from 40s into 60s. The new fully automated approach outperformed the baselines by 8 percentage points on average taking the F–measure from 40s into 50s. Although the latter improvement is not as high as the one achieved with the manually engineered features, it has got the advantage of being more general in a sense that it can readily utilize an arbitrary list of idioms without the knowledge acquisition overhead previously associated with this task, thereby fully automating the original approach.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Computer Science & Informatics
English, Communication and Philosophy
Subjects: P Language and Literature > P Philology. Linguistics
Q Science > QA Mathematics > QA76 Computer software
Additional Information: This work is licensed under a Creative Commons Attribution 3.0 License.
Publisher: IEEE
ISSN: 1949-3045
Date of First Compliant Deposit: 22 December 2017
Date of Acceptance: 21 November 2017
Last Modified: 21 Jan 2024 16:02
URI: https://orca.cardiff.ac.uk/id/eprint/106926

Citation Data

Cited 11 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics