Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

An approach to Tweets categorization by using machine learning classifiers in oil business

Aldahawi, Hanaa and Allen, Stuart Michael ORCID: 2015. An approach to Tweets categorization by using machine learning classifiers in oil business. Presented at: CICLing 2015: 16th International Conference on Intelligent Text Processing and Computational Linguistics, Cairo, Egypt, 14-20 April 2015. Published in: Gelbukh, Alexander ed. Computational Linguistics and Intelligent Text Processing: 16th International Conference, CICLing 2015, Cairo, Egypt, April 14-20, 2015, Proceedings, Part II. Lecture Notes in Computer Science. , vol.9042 Springer, pp. 535-546. 10.1007/978-3-319-18117-2_40

Full text not available from this repository.


The rapid growth in social media data has motivated the development of a real time framework to understand and extract the meaning of the data. Text categorization is a well-known method for understanding text. Text categorization can be applied in many forms, such as authorship detection and text mining by extracting useful information from documents to sort a set of documents automatically into predefined categories. Here, we propose a method for identifying those who posted the tweets into categories. The task is performed by extracting key features from tweets and subjecting them to a machine learning classifier. The research shows that this multi-classification task is very difficult, in particular the building of a domain-independent machine learning classifier. Our problem specifically concerned tweets about oil companies, most of which were noisy enough to affect the accuracy. The analytical technique used here provided structured and valuable information for oil companies.

Item Type: Conference or Workshop Item (Paper)
Date Type: Publication
Status: Published
Schools: Computer Science & Informatics
Subjects: H Social Sciences > HM Sociology
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Publisher: Springer
ISBN: 9783319181165
ISSN: 0302-9743
Last Modified: 01 Nov 2022 09:21

Citation Data

Cited 1 time in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item