Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Reducing defects in the datasets of clinical research studies: conformance with data quality metrics

Shaheeen, Naila A., Manezhi, Bipin, Thomas, Abin ORCID: https://orcid.org/0000-0002-8283-6762 and AlKelya, Mohammed 2019. Reducing defects in the datasets of clinical research studies: conformance with data quality metrics. BMC Medical Research Methodology 19 , 98. 10.1186/s12874-019-0735-7

[thumbnail of s12874-019-0735-7.pdf] PDF - Published Version
Available under License Creative Commons Attribution.

Download (1MB)

Abstract

Background A dataset is indispensable to answer the research questions of clinical research studies. Inaccurate data lead to ambiguous results, and the removal of errors results in increased cost. The aim of this Quality Improvement Project (QIP) was to improve the Data Quality (DQ) by enhancing conformance and minimizing data entry errors. Methods This is a QIP which was conducted in the Department of Biostatistics using historical datasets submitted for statistical data analysis from the department’s knowledge base system. Forty-five datasets received for statistical data analysis, were included at baseline. A 12-item checklist based on six DQ domains (i) completeness (ii) uniqueness (iii) timeliness (iv) accuracy (v) validity and (vi) consistency was developed to assess the DQ. The checklist was comprised of 12 items; missing values, un-coded values, miscoded values, embedded values, implausible values, unformatted values, missing codebook, inconsistencies with the codebook, inaccurate format, unanalyzable data structure, missing outcome variables, and missing analytic variables. The outcome was the number of defects per dataset. Quality improvement DMAIC (Define, Measure, Analyze, Improve, Control) framework and sigma improvement tools were used. Pre-Post design was implemented using mode of interventions. Pre-Post change in defects (zero, one, two or more defects) was compared by using chi-square test. Results At baseline, out of forty-five datasets; six (13.3%) datasets had zero defects, eight (17.8%) had one defect, and 31(69%) had ≥2 defects. The association between the nature of data capture (single vs. multiple data points) and defective data was statistically significant (p = 0.008). Twenty-one datasets were received during post-intervention for statistical data analysis. Seventeen (81%) had zero defects, two (9.5%) had one defect, and two (9.5%) had two or more defects. The proportion of datasets with zero defects had increased from 13.3 to 81%, whereas the proportion of datasets with two or more defects had decreased from 69 to 9.5% (p = < 0.001). Conclusion Clinical research study teams often have limited knowledge of data structuring. Given the need for good quality data, we recommend training programs, consultation with data experts prior to data structuring and use of electronic data capturing methods.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Medicine
Centre for Trials Research (CNTRR)
Publisher: BioMed Central
ISSN: 1471-2288
Date of First Compliant Deposit: 19 January 2022
Date of Acceptance: 15 April 2019
Last Modified: 24 May 2023 00:10
URI: https://orca.cardiff.ac.uk/id/eprint/146610

Citation Data

Cited 2 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics