Back to Search Start Over

Validation of electronic medical data: Identifying diabetes prevalence in general practice.

Authors :
Henderson J
Barnett S
Ghosh A
Pollack AJ
Hodgkins A
Win KT
Miller GC
Bonney A
Source :
Health information management : journal of the Health Information Management Association of Australia [Health Inf Manag] 2019 Jan; Vol. 48 (1), pp. 3-11. Date of Electronic Publication: 2018 Oct 03.
Publication Year :
2019

Abstract

Background:: Electronic medical records are increasingly used for research with limited external validation of their data.<br />Objective:: This study investigates the validity of electronic medical data (EMD) for estimating diabetes prevalence in general practitioner (GP) patients by comparing EMD with national Bettering the Evaluation and Care of Health (BEACH) data.<br />Method:: A "decision tree" was created using inclusion/exclusion of pre-agreed variables to determine the probability of diabetes in absence of diagnostic label, including diagnoses (coded/free-text diabetes, polycystic ovarian syndrome, impaired glucose tolerance, impaired fasting glucose), diabetic annual cycle of care (DACC), glycated haemoglobin (HbA1c) > 6.5%, and prescription (metformin, other diabetes medications). Via SQL query, cases were identified in EMD of five Illawarra and Southern Practice Network practices (30,007 active patients; from 2 years to January 2015). Patient-based Supplementary Analysis of Nominated Data (SAND) sub-studies from BEACH investigating diabetes prevalence (1172 GPs; 35,162 patients; November 2012 to February 2015) were comparison data. SAND results were adjusted for number of GP encounters per year, per patient, and then age-sex standardised to match age-sex distribution of EMD patients. Cluster-adjusted 95% confidence intervals (CIs) were calculated for both datasets.<br />Results:: EMD diabetes prevalence (T1 and/or T2) was 6.5% (95% CI: 4.1-8.9). Following age-sex standardisation, SAND prevalence, not significantly different, was 6.7% (95% CI: 6.3-7.1). Extracting only coded diagnosis missed 13.0% of probable cases, subsequently identified through the presence of metformin/other diabetes medications (*without other indicator variables) (6.1%), free-text diabetes label (3.8%), HbA1c result* (1.6%), DACC* (1.3%), and diabetes medications* (0.2%).<br />Discussion:: While complex, proxy variables can improve usefulness of EMD for research. Without their consideration, EMD results should be interpreted with caution.<br />Conclusion:: Enforceable, transparent data linkages in EMRs would resolve many problems with identification of diagnoses. Ongoing data quality improvement remains essential.

Details

Language :
English
ISSN :
1833-3575
Volume :
48
Issue :
1
Database :
MEDLINE
Journal :
Health information management : journal of the Health Information Management Association of Australia
Publication Type :
Academic Journal
Accession number :
30278786
Full Text :
https://doi.org/10.1177/1833358318798123