Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Acronyms as an integral part of multi–word term recognition - A token of appreciation

Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885 2018. Acronyms as an integral part of multi–word term recognition - A token of appreciation. IEEE Access 6 , pp. 8351-8363. 10.1109/ACCESS.2018.2807122

[thumbnail of 08293774.pdf] PDF - Published Version
Available under License Creative Commons Attribution.

Download (5MB)

Abstract

Term conflation is the process of linking together different variants of the same term. In automatic term recognition approaches, all term variants should be aggregated into a single normalized term representative, which is associated with a single domain–specific concept as a latent variable. In a previous study, we described FlexiTerm, an unsupervised method for recognition of multi–word terms from a domain–specific corpus. It uses a range of methods to normalize three types of term variation – orthographic, morphological and syntactic variation. Acronyms, which represent a highly productive type of term variation, were not supported. In this study, we describe how the functionality of FlexiTerm has been extended to recognize acronyms and incorporate them into the term conflation process. The main contribution of this study is not acronym recognition per se, but rather its integration with other types of term variation into the term conflation process. We evaluated the effects of term conflation in the context of information retrieval as one of its most prominent applications. On average, relative recall increased by 32 percent points, whereas index compression factor increased by 7 percent points. Therefore, evidence suggests that integration of acronyms provides non–trivial improvement of term conflation.

Item Type: Article
Date Type: Published Online
Status: Published
Schools: Computer Science & Informatics
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
Additional Information: This work is licensed under a Creative Commons Attribution 3.0 License.
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
ISSN: 2169-3536
Date of First Compliant Deposit: 15 February 2018
Date of Acceptance: 13 February 2018
Last Modified: 03 May 2023 23:23
URI: https://orca.cardiff.ac.uk/id/eprint/109121

Citation Data

Cited 11 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics