Multiple Imputation of Incomplete Categorical Data Using Latent Class Analysis
- Source :
- Sociological Methodology. 38:369-397
- Publication Year :
- 2008
- Publisher :
- SAGE Publications, 2008.
- Abstract :
- We propose using latent class analysis as an alternative to log-linear analysis for the multiple imputation of incomplete categorical data. Similar to log-linear models, latent class models can be used to describe complex association structures between the variables used in the imputation model. However, unlike log-linear models, latent class models can be used to build large imputation models containing more than a few categorical variables. To obtain imputations reflecting uncertainty about the unknown model parameters, we use a nonparametric bootstrap procedure as an alternative to the more common full Bayesian approach. The proposed multiple imputation method, which is implemented in Latent GOLD software for latent class analysis, is illustrated with two examples. In a simulated data example, we compare the new method to well-established methods such as maximum likelihood estimation with incomplete data and multiple imputation using a saturated log-linear model. This example shows that the proposed method yields unbiased parameter estimates and standard errors. The second example concerns an application using a typical social sciences data set. It contains 79 variables that are all included in the imputation model. The proposed method is especially useful for such large data sets because standard methods for dealing with missing data in categorical variables break down when the number of variables is so large.
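The procedure summarized in the abstract (resample the data, fit a latent class model to each resample, then draw the missing values from the fitted model) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the Latent GOLD implementation: the function names, the `-1` code for missing entries, the fixed number of classes, and the tiny smoothing constant are all hypothetical choices made for the example, and the EM routine simply skips missing entries, which presumes the data are missing at random.

```python
import numpy as np

MISSING = -1  # assumed code for a missing entry (not from the paper)


def posterior(X, pi, theta):
    """P(class | observed items) for every row; missing items are skipped."""
    log_p = np.tile(np.log(pi), (X.shape[0], 1))
    for j, th in enumerate(theta):
        obs = X[:, j] != MISSING
        # th has shape (classes, categories); pick the observed category per row.
        log_p[obs] += np.log(th[:, X[obs, j]]).T
    log_p -= log_p.max(axis=1, keepdims=True)  # numerical stability
    p = np.exp(log_p)
    return p / p.sum(axis=1, keepdims=True)


def fit_latent_class(X, n_cats, n_classes, n_iter=200, rng=None):
    """EM estimation of a latent class model from incomplete categorical data."""
    rng = np.random.default_rng(rng)
    J = X.shape[1]
    pi = np.full(n_classes, 1.0 / n_classes)
    # Random starting values for the class-conditional response probabilities.
    theta = [rng.dirichlet(np.ones(c), size=n_classes) for c in n_cats]
    for _ in range(n_iter):
        post = posterior(X, pi, theta)            # E-step
        pi = post.sum(axis=0) + 1e-6              # M-step: class sizes
        pi /= pi.sum()
        for j in range(J):                        # M-step: response probabilities
            counts = np.full((n_classes, n_cats[j]), 1e-6)  # tiny smoothing
            for c in range(n_cats[j]):
                counts[:, c] += post[X[:, j] == c].sum(axis=0)
            theta[j] = counts / counts.sum(axis=1, keepdims=True)
    return pi, theta


def multiply_impute(X, n_cats, n_classes, n_imputations=5, seed=0):
    """Return a list of completed copies of X, one per bootstrap replicate."""
    rng = np.random.default_rng(seed)
    n, J = X.shape
    completed = []
    for _ in range(n_imputations):
        # Nonparametric bootstrap: resample rows with replacement and refit,
        # so that each imputation reflects parameter uncertainty.
        boot = X[rng.integers(0, n, size=n)]
        pi, theta = fit_latent_class(boot, n_cats, n_classes, rng=rng)
        # Draw a class for each original row from its posterior membership.
        cum = posterior(X, pi, theta).cumsum(axis=1)
        classes = (rng.random((n, 1)) > cum).sum(axis=1)
        classes = np.minimum(classes, n_classes - 1)  # guard against round-off
        Xi = X.copy()
        for j in range(J):
            for i in np.flatnonzero(X[:, j] == MISSING):
                Xi[i, j] = rng.choice(n_cats[j], p=theta[j][classes[i]])
        completed.append(Xi)
    return completed
```

Calling `multiply_impute(X, n_cats, n_classes=3)` on an integer array `X` with categories coded `0, 1, ...` would return five completed data sets; each would then be analysed separately and the results pooled with the usual multiple imputation combining rules. In practice the number of classes would be chosen by model comparison rather than fixed in advance.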
- Subjects :
- Nonparametric bootstrap
Sociology and Political Science
Probabilistic latent semantic analysis
Bayesian probability
Latent class model
Software
Data mining
Imputation (statistics)
Latent variable model
Categorical variable
Mathematics
Details
- ISSN :
- 1467-9531 and 0081-1750
- Volume :
- 38
- Database :
- OpenAIRE
- Journal :
- Sociological Methodology
- Accession number :
- edsair.doi...........5cf890dce6ef1159d1f76e8009918a6f
- Full Text :
- https://doi.org/10.1111/j.1467-9531.2008.00202.x