Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885 2018. Acronyms as an integral part of multi–word term recognition - A token of appreciation. IEEE Access 6 , pp. 8351-8363. 10.1109/ACCESS.2018.2807122 |
PDF
- Published Version
Available under License Creative Commons Attribution. Download (5MB) |
Abstract
Term conflation is the process of linking together different variants of the same term. In automatic term recognition approaches, all term variants should be aggregated into a single normalized term representative, which is associated with a single domain–specific concept as a latent variable. In a previous study, we described FlexiTerm, an unsupervised method for recognition of multi–word terms from a domain–specific corpus. It uses a range of methods to normalize three types of term variation – orthographic, morphological and syntactic variation. Acronyms, which represent a highly productive type of term variation, were not supported. In this study, we describe how the functionality of FlexiTerm has been extended to recognize acronyms and incorporate them into the term conflation process. The main contribution of this study is not acronym recognition per se, but rather its integration with other types of term variation into the term conflation process. We evaluated the effects of term conflation in the context of information retrieval as one of its most prominent applications. On average, relative recall increased by 32 percent points, whereas index compression factor increased by 7 percent points. Therefore, evidence suggests that integration of acronyms provides non–trivial improvement of term conflation.
Item Type: | Article |
---|---|
Date Type: | Published Online |
Status: | Published |
Schools: | Computer Science & Informatics |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science Q Science > QA Mathematics > QA76 Computer software |
Additional Information: | This work is licensed under a Creative Commons Attribution 3.0 License. |
Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
ISSN: | 2169-3536 |
Date of First Compliant Deposit: | 15 February 2018 |
Date of Acceptance: | 13 February 2018 |
Last Modified: | 03 May 2023 23:23 |
URI: | https://orca.cardiff.ac.uk/id/eprint/109121 |
Citation Data
Cited 11 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |