1. Variable selection with the strong heredity constraint and its oracle property
- Author
-
Choi, Nam Hee, Li, William, and Zhu, Ji
- Subjects
Genetics -- Analysis ,Heredity -- Models ,Mathematics - Abstract
In this paper, we extend the LASSO method (Tibshirani 1996) for simultaneously fitting a regression model and identifying important interaction terms. Unlike most of the existing variable selection methods, our method automatically enforces the heredity constraint, that is, an interaction term can be included in the model only if the corresponding main terms are also included in the model. Furthermore, we extend our method m generalized linear models, and show that it performs as well as if the true model were given in advance, that is, the oracle properly as in Fun and Li (2001) and Fan and Peng (2004). The proof of the oracle property is given in online supplemental materials, Numerical results on both simulation data and real data indicate that our method lends lo remove irrelevant variables more effectively and provide better prediction performance than previous work (Yuan. Joseph, and Lin 2007 and Zhao. Rocha, and Yu 2009 as well as the classical LASSO method). KEY WORDS: Heredity structure; LASSO; Regularization.
- Published
- 2010