Back to Search
Start Over
Mixture of Gaussians for distance estimation with missing data
- Source :
- Neurocomputing, Neurocomputing, Elsevier, 2014, 131, pp.32-42, Neurocomputing, 2014, 131, pp.32-42. ⟨10.1016/j.neucom.2013.07.050⟩, Neurocomputing, Elsevier, 2014, 131, pp.32-42. ⟨10.1016/j.neucom.2013.07.050⟩
- Publication Year :
- 2014
- Publisher :
- Elsevier BV, 2014.
-
Abstract
- Many data sets have missing values in practical application contexts, but the majority of commonly studied machine learning methods cannot be applied directly when there are incomplete samples. However, most such methods only depend on the relative differences between samples instead of their particular values, and thus one useful approach is to directly estimate the pairwise distances between all samples in the data set. This is accomplished by fitting a Gaussian mixture model to the data, and using it to derive estimates for the distances. A variant of the model for high-dimensional data with missing values is also studied. Experimental simulations confirm that the proposed method provides accurate estimates compared to alternative methods for estimating distances. In particular, using the mixture model for estimating distances is on average more accurate than using the same model to impute any missing values and then calculating distances. The experimental evaluation additionally shows that more accurately estimating distances lead to improved prediction performance for classification and regression tasks when used as inputs for a neural network.
- Subjects :
- distance estimation
Missing data
Cognitive Neuroscience
02 engineering and technology
01 natural sciences
010104 statistics & probability
Artificial Intelligence
0202 electrical engineering, electronic engineering, information engineering
0101 mathematics
Mathematics
Alternative methods
Estimation
mixture model
Artificial neural network
business.industry
Pattern recognition
Mixture model
Regression
Computer Science Applications
Data set
[ STAT.ME ] Statistics [stat]/Methodology [stat.ME]
020201 artificial intelligence & image processing
Pairwise comparison
Artificial intelligence
business
[STAT.ME]Statistics [stat]/Methodology [stat.ME]
Subjects
Details
- ISSN :
- 09252312
- Volume :
- 131
- Database :
- OpenAIRE
- Journal :
- Neurocomputing
- Accession number :
- edsair.doi.dedup.....ef4a4d2d1f30776b294df3deeb82b42e