Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Replicating a COVID-19 study in a national England database to assess the generalisability of research with regional electronic health record data

Williams, Richard, Jenkins, David, Bolton, Thomas, Heald, Adrian, Mizani, Mehrdad, Sperrin, Matthew, Peek, Niels, CVD-COVID-UK/COVID-IMPACT Consortium and Townson, Julia ORCID: https://orcid.org/0000-0001-8679-3619 2025. Replicating a COVID-19 study in a national England database to assess the generalisability of research with regional electronic health record data. BMJ Open 15 (4) , e093080. 10.1136/bmjopen-2024-093080

[thumbnail of e093080.full.pdf] PDF - Published Version
Available under License Creative Commons Attribution.

Download (722kB)

Abstract

Objectives To assess the degree to which we can replicate a study between a regional and a national database of electronic health record data in the UK. The original study examined the risk factors associated with hospitalisation following COVID-19 infection in people with diabetes. Design A replication of a retrospective cohort study. Setting Observational electronic health record data from primary and secondary care sources in the UK. The original study used data from a large, urbanised region (Greater Manchester Care Record, Greater Manchester, UK—2.8 m patients). This replication study used a national database covering the whole of England, UK (NHS England’s Secure Data Environment service for England, accessed via the BHF Data Science Centre’s CVD-COVID-UK/COVID-IMPACT Consortium—54 m patients). Participants Individuals with a diagnosis of type 1 diabetes or type 2 diabetes prior to a positive COVID-19 test result. The matched controls (3:1) were individuals who had a positive COVID-19 test result, but who did not have a diagnosis of diabetes on the date of their positive COVID-19 test result. Matching was done on age at COVID-19 diagnosis, sex and approximate date of COVID-19 test. Primary and secondary outcome measures Hospitalisation within 28 days of a positive COVID-19 test. Results We found that many of the effect sizes did not show a statistically significant difference, but that some did. Where effect sizes were statistically significant in the regional study, then they remained significant in the national study and the effect size was the same direction and of similar magnitude. Conclusions There is some evidence that the findings from studies in smaller regional datasets can be extrapolated to a larger, national setting. However, there were some differences, and therefore replication studies remain an essential part of healthcare research.

Item Type: Article
Date Type: Published Online
Status: Published
Schools: Schools > Medicine
Research Institutes & Centres > Centre for Trials Research (CNTRR)
Additional Information: For the full list of consortium members, please refer to the article PDF at http://dx.doi.org/10.1136/bmjopen-2024-093080
Publisher: BMJ Publishing Group
ISSN: 2044-6055
Date of First Compliant Deposit: 30 January 2026
Date of Acceptance: 24 March 2025
Last Modified: 30 Jan 2026 09:45
URI: https://orca.cardiff.ac.uk/id/eprint/184278

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics