Cook, Alec and Karakuş, Oktay ORCID: https://orcid.org/0000-0001-8009-9319 2024. LLM-Commentator: Novel fine-tuning strategies of large language models for automatic commentary generation using football event data. Knowledge-Based Systems 300 , 112219. 10.1016/j.knosys.2024.112219 |
Preview |
PDF
- Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives. Download (3MB) | Preview |
Abstract
Real-time commentary on football matches is a challenging task that requires precise and coherent descriptions of events as they unfold. Traditional methods often fall short in providing timely and accurate insights into the game. This study aims to explore the utilisation of innovative Large language model (LLM) techniques to develop an adept language model – dubbed LLM-Commentator – that can generate (near-) real-time commentary on football matches. The goal is to demonstrate that open-source language models, when fine-tuned with domain-specific data on consumer-grade hardware, can accurately depict football events from raw match data. Three distinct training strategies are employed to fine-tune the language models, addressing various challenges encountered in generating real-time football commentary. The study evaluates the efficacy of these models in producing coherent and accurate descriptions of unseen football events. Among the three strategies proposed, the Mixed Immediately Model emerges as particularly efficient in learning and adeptly handling challenging workloads. This suggests a promising future for simultaneous multi-task learning with compact, open-source language models in the context of real-time sports commentary. Additionally, the study highlights the practicality of utilising consumer-grade hardware for fine-tuning language models with specialised knowledge. The findings underscore the importance of customising training approaches and ensuring well-balanced datasets when fine-tuning language models for specific tasks. Moreover, they serve as a practical guide for broader accessibility to large language models and significantly contribute to the application of NLP in sports journalism, enabling more insightful and engaging real-time commentary on football matches.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Computer Science & Informatics |
Publisher: | Elsevier |
ISSN: | 0950-7051 |
Funders: | N/A |
Date of First Compliant Deposit: | 6 August 2024 |
Date of Acceptance: | 7 July 2024 |
Last Modified: | 15 Aug 2024 13:45 |
URI: | https://orca.cardiff.ac.uk/id/eprint/171222 |
Actions (repository staff only)
Edit Item |