Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Predicting type 1 diabetes in children using electronic health records in primary care in the UK: development and validation of a machine-learning algorithm

Daniel, Rhian ORCID: https://orcid.org/0000-0001-5649-9320, Jones, Hywel ORCID: https://orcid.org/0000-0001-8308-2002, Gregory, John W. ORCID: https://orcid.org/0000-0001-5189-3812, Shetty, Ambika, Francis, Nick, Paranjothy, Shantini and Townson, Julia ORCID: https://orcid.org/0000-0001-8679-3619 2024. Predicting type 1 diabetes in children using electronic health records in primary care in the UK: development and validation of a machine-learning algorithm. The Lancet Digital Health 6 (6) , e386-e395. 10.1016/S2589-7500(24)00050-5

[thumbnail of PIIS2589750024000505.pdf]
Preview
PDF - Published Version
Available under License Creative Commons Attribution Non-commercial.

Download (831kB) | Preview

Abstract

Background Children presenting to primary care with suspected type 1 diabetes should be referred immediately to secondary care to avoid life-threatening diabetic ketoacidosis. However, early recognition of children with type 1 diabetes is challenging. Children might not present with classic symptoms, or symptoms might be attributed to more common conditions. A quarter of children present with diabetic ketoacidosis, a proportion unchanged over 25 years. Our aim was to investigate whether a machine-learning algorithm could lead to earlier detection of type 1 diabetes in primary care. Methods We developed the predictive algorithm using Welsh primary care electronic health records (EHRs) linked to the Brecon Dataset, a register of children newly diagnosed with type 1 diabetes. Children were included from their first primary care record within the study period of Jan 1, 2000, to Dec 31, 2016, until either type 1 diabetes diagnosis, they turned 15 years of age, or study end. We developed an ensemble learner (SuperLearner) using 26 potential predictors. Validation of the algorithm was done in English EHRs from the Clinical Practice Research Datalink (primary care) and Hospital Episode Statistics, focusing on the ability of the algorithm to identify children who went on to develop type 1 diabetes and the time by which diagnosis could be anticipated. Findings The development dataset comprised 34 754 400 primary care contacts, relating to 952 402 children, and the validation dataset comprised 43 089 103 primary care contacts, relating to 1 493 328 children. Of these, 1829 (0·19%) children younger than 15 years in the development dataset, and 1516 (0·10%) in the validation dataset had a reliable date of type 1 diabetes diagnosis. If set to give an alert in 10% of contacts, an estimated 71·6% (95% CI 68·8–74·4) of the children with type 1 diabetes would receive an alert by the algorithm in the 90 days before diagnosis, with diagnosis anticipated, on average, by an estimated 9·34 days (95% CI 7·77–10·9). Interpretation If implemented into primary care settings, this predictive algorithm could substantially reduce the proportion of patients with new-onset type 1 diabetes presenting in diabetic ketoacidosis. Acceptability of alert thresholds should be explored in primary care.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Medicine
Centre for Trials Research (CNTRR)
Publisher: Elsevier
ISSN: 2589-7500
Funders: Diabetes UK
Date of First Compliant Deposit: 30 August 2024
Date of Acceptance: 1 June 2024
Last Modified: 02 Sep 2024 09:45
URI: https://orca.cardiff.ac.uk/id/eprint/171683

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics