Start Submission Become a Reviewer

Reading: Multisite Evaluation of a Data Quality Tool for Patient-Level Clinical Datasets

Download

A- A+
dyslexia friendly

Empirical research

Multisite Evaluation of a Data Quality Tool for Patient-Level Clinical Datasets

Authors:

Vojtech Huser ,

NIH (NLM)
X close

Frank J. DeFalco,

Janssen Research & Development
X close

Martijn Schuemie,

Janssen Research & Development, Epidemiology, Titusville, New Jersey, United States; and Observational Health Data Sciences and Informatics (OHDSI) New York, New York, United States
X close

Patrick B. Ryan,

Janssen Research & Development
X close

Ning Shang,

Department of Biomedical Informatics, Columbia University, New York, USA
X close

Mark Velez,

Department of Biomedical Informatics, Columbia University, New York, USA
X close

Rae Woong Park,

Department of Biomedical Informatics, Ajou University, Suwon, Korea
X close

Richard D. Boyce,

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA
X close

Jon Duke,

Regenstrief Institute, Indianapolis, IN
X close

Ritu Khare,

Department of Biomedical and Health Informatics, The Children's Hospital of Philadelphia
X close

Levon Utidjian,

Department of Biomedical and Health Informatics, The Children's Hospital of Philadelphia
X close

Charles Bailey

Department of Biomedical and Health Informatics, Department of Pediatrics, The Children's Hospital of Philadelphia
X close

Abstract

Introduction: Data quality and fitness for analysis are crucial if outputs of analyses of electronic health record data or administrative claims data should be trusted by the public and the research community.

Methods: We describe a data quality analysis tool (called Achilles Heel) developed by the Observational Health Data Sciences and Informatics Collaborative (OHDSI) and compare outputs from this tool as it was applied to 24 large healthcare datasets across seven different organizations.

Results: We highlight 12 data quality rules that identified issues in at least 10 of the 24 datasets and provide a full set of 71 rules identified in at least one dataset. Achilles Heel is a freely available software that provides a useful starter set of data quality rules with the ability to add additional rules. We also present results of a structured email-based interview of all participating sites that collected qualitative comments about the value of Achilles Heel for data quality evaluation.

Discussion: Our analysis represents the first comparison of outputs from a data quality tool that implements a fixed (but extensible) set of data quality rules. Thanks to a common data model, we were able to compare quickly multiple datasets originating from several countries in America, Europe and Asia.

How to Cite: Huser V, DeFalco FJ, Schuemie M, Ryan PB, Shang N, Velez M, et al.. Multisite Evaluation of a Data Quality Tool for Patient-Level Clinical Datasets. eGEMs (Generating Evidence & Methods to improve patient outcomes). 2016;4(1):24. DOI: http://doi.org/10.13063/2327-9214.1239
8
Views
Published on 30 Nov 2016.
Peer Reviewed

Downloads

  • PDF (EN)