Back to Search Start Over

Machine learning methods for propensity and disease risk score estimation in high-dimensional data: a plasmode simulation and real-world data cohort analysis.

Authors :
Guo Y
Strauss VY
Català M
Jödicke AM
Khalid S
Prieto-Alhambra D
Source :
Frontiers in pharmacology [Front Pharmacol] 2024 Oct 28; Vol. 15, pp. 1395707. Date of Electronic Publication: 2024 Oct 28 (Print Publication: 2024).
Publication Year :
2024

Abstract

Introduction: Machine learning (ML) methods are promising and scalable alternatives for propensity score (PS) estimation, but their comparative performance in disease risk score (DRS) estimation remains unexplored.<br />Methods: We used real-world data comparing antihypertensive users to non-users with 69 negative control outcomes, and plasmode simulations to study the performance of ML methods in PS and DRS estimation. We conducted a cohort study using UK primary care records. Further, we conducted a plasmode simulation with synthetic treatment and outcome mimicking empirical data distributions. We compared four PS and DRS estimation methods: 1. Reference: Logistic regression including clinically chosen confounders. 2. Logistic regression with L1 regularisation (LASSO). 3. Multi-layer perceptron (MLP). 4. Extreme Gradient Boosting (XgBoost). Covariate balance, coverage of the null effect of negative control outcomes (real-world data) and bias based on the absolute difference between observed and true effects (for plasmode) were estimated. 632,201 antihypertensive users and nonusers were included.<br />Results: ML methods outperformed the reference method for PS estimation in some scenarios, both in terms of covariate balance and coverage/bias. Specifically, XgBoost achieved the best performance. DRS-based methods performed worse than PS in all tested scenarios.<br />Discussion: We found that ML methods could be reliable alternatives for PS estimation. ML-based DRS methods performed worse than PS ones, likely given the rarity of outcomes.<br />Competing Interests: Author VS was employed by Boehringer-Ingelheim The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.<br /> (Copyright © 2024 Guo, Strauss, Català, Jödicke, Khalid and Prieto-Alhambra.)

Details

Language :
English
ISSN :
1663-9812
Volume :
15
Database :
MEDLINE
Journal :
Frontiers in pharmacology
Publication Type :
Academic Journal
Accession number :
39529889
Full Text :
https://doi.org/10.3389/fphar.2024.1395707