Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

The video game dialogue corpus

Rennick, Stephanie and Roberts, Seán ORCID: https://orcid.org/0000-0001-5990-9161 2024. The video game dialogue corpus. Corpora 19 (1) , pp. 93-106.

[thumbnail of VGDC_CorpusDesign_Revision1.pdf]
Preview
PDF - Accepted Post-Print Version
Download (415kB) | Preview

Abstract

This paper presents the Video Game Dialogue Corpus, the first large-scale, consistently coded, open source corpus of dialogue from video games. It contains over 6.2 million words of English dialogue from fifty games in the Role Playing Game (rpg) genre. This includes games produced between 1985 and 2020, rated for children, teenagers and adults, and in both ‘Western’ and ‘Japanese’ sub-genres. The corpus design is described, including custom data formats for representing branching dialogue. We demonstrate the use of the corpus by comparing the dialogue of female and male characters, where we find reflections of gendered language in other media as well as patterns that seem specific to video games. We provide the source code for a ‘self-inflating corpus’ – a pipeline that obtains the data then processes and parses it into a standard format. This makes the corpus available for teaching and research purposes, providing the first such resource for empirical analysis of video game dialogue.

Item Type: Article
Date Type: Published Online
Status: Published
Schools: English, Communication and Philosophy
Subjects: P Language and Literature > P Philology. Linguistics
P Language and Literature > PR English literature
Publisher: Edinburgh University Press
ISSN: 1755-1676
Funders: Swiss National Science Foundation
Date of First Compliant Deposit: 7 March 2023
Date of Acceptance: 7 February 2023
Last Modified: 14 May 2024 16:30
URI: https://orca.cardiff.ac.uk/id/eprint/157527

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics