Back to Search Start Over

Penalized logistic regression with low prevalence exposures beyond high dimensional settings.

Authors :
Doerken, Sam
Avalos, Marta
Lagarde, Emmanuel
Schumacher, Martin
Source :
PLoS ONE; 5/20/2019, Vol. 14 Issue 5, p1-14, 14p
Publication Year :
2019

Abstract

Estimating and selecting risk factors with extremely low prevalences of exposure for a binary outcome is a challenge because classical standard techniques, markedly logistic regression, often fail to provide meaningful results in such settings. While penalized regression methods are widely used in high-dimensional settings, we were able to show their usefulness in low-dimensional settings as well. Specifically, we demonstrate that Firth correction, ridge, the lasso and boosting all improve the estimation for low-prevalence risk factors. While the methods themselves are well-established, comparison studies are needed to assess their potential benefits in this context. This is done here using the dataset of a large unmatched case-control study from France (2005-2008) about the relationship between prescription medicines and road traffic accidents and an accompanying simulation study. Results show that the estimation of risk factors with prevalences below 0.1% can be drastically improved by using Firth correction and boosting in particular, especially for ultra-low prevalences. When a moderate number of low prevalence exposures is available, we recommend the use of penalized techniques. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
19326203
Volume :
14
Issue :
5
Database :
Complementary Index
Journal :
PLoS ONE
Publication Type :
Academic Journal
Accession number :
136532967
Full Text :
https://doi.org/10.1371/journal.pone.0217057