351. Measuring overlap in logistic regression
- Author
-
Christmann, Andreas, Rousseeuw, Peter J., Applied Mathematics, and Vrije Universiteit Brussel
- Subjects
Overlap ,separation ,ddc:519 ,linear discriminant analysis ,regression depth ,Linear discriminant analysis ,logistic regression ,outliers ,Logistic regression ,Separation ,probit regression ,Linear discriminant analysis,Logistic regression,Outliers,Overlap,Probit regression,Regression depth,Separation ,Probit regression ,overlap ,Outliers ,Regression depth - Abstract
In this paper we show that the recent notion of regression depth can be used as a data-analytic tool to measure the amount of separation between successes and failures in the binary response framework. Extending this algorithm allows us to compute the overlap in data sets which are commonly fitted by logistic regression models. The overlap is the number of observations that would need to be removed to obtain complete or quasicomplete separation, i.e. the situation where the logistic regression parameters are no longer identifiable and the maximum likelihood estimate does not exist. It turns out that the overlap is often quite small.
- Published
- 2001