1. Development and Validation of an Algorithm for Identifying Patients with Hemophilia A in an Administrative Claims Database
- Author
-
Yaping Xu, William D Finkle, Jennifer G Lyons, Stephan Lanes, Greg Ridgeway, Sean D. Sullivan, Vibha C. A. Desai, and Paul G. Solari
- Subjects
Adult ,Male ,congenital, hereditary, and neonatal diseases and abnormalities ,Adolescent ,Databases, Factual ,Office visits ,Pharmacy ,030204 cardiovascular system & hematology ,Hemophilia A ,computer.software_genre ,Logistic regression ,Cohort Studies ,03 medical and health sciences ,0302 clinical medicine ,hemic and lymphatic diseases ,Covariate ,Humans ,Medicine ,030212 general & internal medicine ,Claims database ,Program Development ,Child ,Aged ,Retrospective Studies ,Database ,business.industry ,Health Policy ,Medical record ,Public Health, Environmental and Occupational Health ,Infant ,Middle Aged ,Administrative claims ,Logistic Models ,Child, Preschool ,Female ,Database research ,business ,Administrative Claims, Healthcare ,computer ,Algorithm ,Algorithms - Abstract
Background The accuracy with which hemophilia A can be identified in claims databases is unknown. Objective Develop and validate an algorithm using predictive modeling supported by machine learning to identify patients with hemophilia A in an administrative claims database. Methods We first created a screening algorithm using medical and pharmacy claims to identify potential hemophilia A patients in the US HealthCore Integrated Research Database between January 1, 2006 and April 30, 2015. Medical records for a random sample of patients were reviewed to confirm case status. In this validation sample, we used lasso logistic regression with cross-validation to select covariates in claims data and develop a predictive model to estimate the probability of being a confirmed hemophilia A case. Results The screening algorithm identified 2,252 patients and we reviewed medical records for 400 of these patients. The screening algorithm had a positive predictive value (PPV) of 65%. The predictive model identified 18 predictors of being a hemophilia A case or noncase. The strongest predictors of case status included male sex, factor VIII therapy, office visits for hemophilia A, and hospitalizations for hemophilia A. The strongest predictors of noncase status included hospitalizations for reasons other than hemophilia A and factor VIIa therapy. A probability threshold of ≥0.6 resulted in a PPV of 94.7% (95% CI: 92.0–97.5) and sensitivity of 94.4% (95% CI: 91.5–97.2). Conclusions We developed and validated an algorithm to identify hemophilia A cases in an administrative claims database with high sensitivity and high PPV.
- Published
- 2018
- Full Text
- View/download PDF