Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

CLASSIC utterance boundary: a chunking-based model of early naturalistic word segmentation

Cabiddu, Francesco, Bott, Lewis ORCID:, Jones, Gary and Gambi, Chiara ORCID: 2023. CLASSIC utterance boundary: a chunking-based model of early naturalistic word segmentation. Language Learning 73 (3) , pp. 942-975. 10.1111/lang.12559

[thumbnail of Language Learning - 2023 - Cabiddu - CLASSIC Utterance Boundary  A Chunking‐Based Model of Early Naturalistic Word.pdf]
PDF - Published Version
Available under License Creative Commons Attribution.

Download (1MB) | Preview
License URL:
License Start date: 2 February 2023


Word segmentation is a crucial step in children's vocabulary learning. While computational models of word segmentation can capture infants’ performance in small-scale artificial tasks, the examination of early word segmentation in naturalistic settings has been limited by the lack of measures that can relate models’ performance to developmental data. Here, we extended CLASSIC (Chunking Lexical and Sublexical Sequences in Children; Jones et al., 2021), a corpus-trained chunking model that can simulate several memory and phonological and vocabulary learning phenomena to allow it to perform word segmentation using utterance boundary information, and we have named this extended version CLASSIC utterance boundary (CLASSIC-UB). Further, we compared our model to the performance of children on a wide range of new measures, capitalizing on the link between word segmentation and vocabulary learning abilities. We showed that the combination of chunking and utterance-boundary information used by CLASSIC utterance boundary allowed a better prediction of English-learning children's output vocabulary than did other models.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Psychology
Publisher: Wiley
ISSN: 0023-8333
Date of First Compliant Deposit: 13 December 2022
Date of Acceptance: 9 December 2022
Last Modified: 30 Nov 2023 14:36

Actions (repository staff only)

Edit Item Edit Item


Downloads per month over past year

View more statistics