Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 

CANELC – constructing an e-language corpus

Knight, Dawn, Adolphs, Svenja and Carter, Ronald 2014. CANELC – constructing an e-language corpus. Corpora 9 (1), pp. 29-56. 10.3366/cor.2014.0050

PDF - Accepted Post-Print Version


This paper reports on the construction of the Cambridge and Nottingham e-language Corpus (CANELC). CANELC is a one-million word corpus of digital communication in English, taken from online discussion boards, blogs, tweets, e-mails and Short Message Services (SMS). The paper outlines the approaches used when planning the corpus: obtaining consent, collecting the data and compiling the corpus database. This is followed by a detailed analysis of some of the patterns of language used in the corpus. The analysis includes a discussion of the key words and phrases used, as well as the common themes and semantic associations connected with the data. These discussions form the basis of an investigation into how e-language operates in ways that are both similar to and different from spoken and written records of communication (as evidenced by the British National Corpus, BNC).

Item Type: Article
Date Type: Publication
Status: Published
Schools: English, Communication and Philosophy
Subjects: P Language and Literature > P Philology. Linguistics
Q Science > QA Mathematics > QA76 Computer software
Uncontrolled Keywords: blogs, tweets, SMS, discussion boards, e-language, corpus linguistics
Publisher: Edinburgh University Press
ISSN: 1749-5032
Date of First Compliant Deposit: 30 March 2016
Last Modified: 30 Oct 2022 04:19

Citation Data

Cited 18 times in Scopus.
