Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Adolphs, Svenja and Carter, Ronald 2014. CANELC – constructing an e-language corpus. Corpora 9 (1), pp. 29-56. 10.3366/cor.2014.0050
PDF - Accepted Post-Print Version (1MB)
Abstract
This paper reports on the construction of the Cambridge and Nottingham e-language Corpus (CANELC). CANELC is a one-million-word corpus of digital communication in English, taken from online discussion boards, blogs, tweets, e-mails and Short Message Services (SMS). The paper outlines the approaches used when planning the corpus: obtaining consent, collecting the data and compiling the corpus database. This is followed by a detailed analysis of some of the patterns of language used in the corpus. The analysis includes a discussion of the key words and phrases used, as well as the common themes and semantic associations connected with the data. These discussions form the basis of an investigation into how e-language operates in ways that are both similar to and different from spoken and written records of communication (as evidenced by the British National Corpus, BNC).
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | English, Communication and Philosophy |
Subjects: | P Language and Literature > P Philology. Linguistics; Q Science > QA Mathematics > QA76 Computer software |
Uncontrolled Keywords: | blogs, tweets, SMS, discussion boards, e-language, corpus linguistics |
Publisher: | Edinburgh University Press |
ISSN: | 1749-5032 |
Date of First Compliant Deposit: | 30 March 2016 |
Last Modified: | 30 Nov 2024 03:45 |
URI: | https://orca.cardiff.ac.uk/id/eprint/72349 |
Citation Data
Cited 21 times in Scopus.