Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Bayes-xG: player and position correction on expected goals (xG) using Bayesian hierarchical approach

Scholtes, Alexander and Karakus, Oktay ORCID: https://orcid.org/0000-0001-8009-9319 2024. Bayes-xG: player and position correction on expected goals (xG) using Bayesian hierarchical approach. Frontiers in Sports and Active Living 6 , 1348983. 10.3389/fspor.2024.1348983

[thumbnail of fspor-06-1348983.pdf]
Preview
PDF - Published Version
Available under License Creative Commons Attribution.

Download (9MB) | Preview

Abstract

This study employs Bayesian methodologies to explore the influence of player or positional factors in predicting the probability of a shot resulting in a goal, measured by the expected goals (xG) metric. Utilising publicly available data from StatsBomb, Bayesian hierarchical logistic regressions are constructed, analysing approximately 10,000 shots from the English Premier League (for the years of 2003 and 2015) to ascertain whether positional or player-level effects impact xG. The findings reveal positional effects in a basic model that includes only distance to goal and shot angle as predictors, highlighting that strikers and attacking midfielders exhibit a higher likelihood of scoring. However, these effects diminish when more informative predictors are introduced. Nevertheless, even with additional predictors, player-level effects persist, indicating that certain players possess notable positive or negative xG adjustments, influencing their likelihood of scoring a given chance. The study extends its analysis to data from Spain’s La Liga ( ≈ 20 K shots from 1973 and 2004 to 2020) and Germany’s Bundesliga ( ≈ 7.5 K shots from 2015), yielding comparable results. Additionally, the paper assesses the impact of prior distribution choices on outcomes, concluding that the priors employed in the models provide sound results but could be refined to enhance sampling efficiency for constructing more complex and extensive models feasibly.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Computer Science & Informatics
Publisher: Frontiers Media
ISSN: 2624-9367
Funders: N/A
Date of First Compliant Deposit: 6 August 2024
Date of Acceptance: 16 May 2024
Last Modified: 09 Aug 2024 15:00
URI: https://orca.cardiff.ac.uk/id/eprint/171223

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics