LLM-Commentator: Novel fine-tuning strategies of large language models for automatic commentary generation using football event data

Cook, Alec and Karakuş, Oktay

2024. LLM-Commentator: Novel fine-tuning strategies of large language models for automatic commentary generation using football event data. Knowledge-Based Systems 300 , 112219. 10.1016/j.knosys.2024.112219

[thumbnail of 1-s2.0-S0950705124008530-main.pdf]

Preview

PDF - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.
Download (3MB) | Preview

Official URL: http://dx.doi.org/10.1016/j.knosys.2024.112219

Abstract

Real-time commentary on football matches is a challenging task that requires precise and coherent descriptions of events as they unfold. Traditional methods often fall short in providing timely and accurate insights into the game. This study aims to explore the utilisation of innovative Large language model (LLM) techniques to develop an adept language model – dubbed LLM-Commentator – that can generate (near-) real-time commentary on football matches. The goal is to demonstrate that open-source language models, when fine-tuned with domain-specific data on consumer-grade hardware, can accurately depict football events from raw match data. Three distinct training strategies are employed to fine-tune the language models, addressing various challenges encountered in generating real-time football commentary. The study evaluates the efficacy of these models in producing coherent and accurate descriptions of unseen football events. Among the three strategies proposed, the Mixed Immediately Model emerges as particularly efficient in learning and adeptly handling challenging workloads. This suggests a promising future for simultaneous multi-task learning with compact, open-source language models in the context of real-time sports commentary. Additionally, the study highlights the practicality of utilising consumer-grade hardware for fine-tuning language models with specialised knowledge. The findings underscore the importance of customising training approaches and ensuring well-balanced datasets when fine-tuning language models for specific tasks. Moreover, they serve as a practical guide for broader accessibility to large language models and significantly contribute to the application of NLP in sports journalism, enabling more insightful and engaging real-time commentary on football matches.

Item Type:	Article
Date Type:	Publication
Status:	Published
Schools:	Schools > Computer Science & Informatics
Publisher:	Elsevier
ISSN:	0950-7051
Funders:	N/A
Projects:	N/A
Date of First Compliant Deposit:	6 August 2024
Date of Acceptance:	7 July 2024
Last Modified:	15 Aug 2024 13:45
URI:	https://orca.cardiff.ac.uk/id/eprint/171222

Actions (repository staff only)

Edit Item

Altmetric

Dimensions

Download Statistics

Downloads

Downloads per month over past year

View more statistics

CORE (COnnecting REpositories)