Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Arabic event detection in social media

Alsaedi, Nasser and Burnap, Peter ORCID: 2015. Arabic event detection in social media. Presented at: 16th International Conference on Intelligent Text Processing and Computational Linguistics, Cairo, Egypt, 14-20 April 2015. Published in: Gelbukh, Alexander ed. Computational Linguistics and Intelligent Text Processing: 16th International Conference, CICLing 2015, Cairo, Egypt, April 14-20, 2015, Proceedings, Part I. Lecture Notes in Computer Science. Lecture Notes in Computer Science , vol.9041 Springer Verlag, pp. 384-401. 10.1007/978-3-319-18111-0_29

[thumbnail of Arabic Event Detection in Social Media.pdf]
PDF - Accepted Post-Print Version
Download (2MB) | Preview


Event detection is a concept that is crucial to the assurance of public safety surrounding real-world events. Decision makers use information from a range of terrestrial and online sources to help inform decisions that enable them to develop policies and react appropriately to events as they unfold. One such source of online information is social media. Twitter, as a form of social media, is a popular micro-blogging web application serving hundreds of millions of users. User-generated content can be utilized as a rich source of information to identify real-world events. In this paper, we present a novel detection framework for identifying such events, with a focus on ‘disruptive’ events using Twitter data. The approach is based on five steps; data collection, pre-processing, classification, clustering and summarization. We use a Naïve Bayes classification model and an Online Clustering method to validate our model over multiple real-world data sets. To the best of our knowledge, this study is the first effort to identify real-world events in Arabic from social media.

Item Type: Conference or Workshop Item (Paper)
Date Type: Publication
Status: Published
Schools: Computer Science & Informatics
Subjects: H Social Sciences > HM Sociology
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
Uncontrolled Keywords: Text mining, Information Extraction, Classification, Online Clustering, Machine Learning, Event detection.
Publisher: Springer Verlag
ISBN: 9783319181103
ISSN: 0302-9743
Date of First Compliant Deposit: 30 March 2016
Last Modified: 07 Nov 2023 03:50

Citation Data

Cited 31 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item


Downloads per month over past year

View more statistics