Back to Search
Start Over
Supervised learning via smoothed Polya trees
- Source :
- Advances in Data Analysis and Classification. 13:877-904
- Publication Year :
- 2018
- Publisher :
- Springer Science and Business Media LLC, 2018.
-
Abstract
- We propose a generative classification model that extends Quadratic Discriminant Analysis (QDA) (Cox in J R Stat Soc Ser B (Methodol) 20:215–242, 1958) and Linear Discriminant Analysis (LDA) (Fisher in Ann Eugen 7:179–188, 1936; Rao in J R Stat Soc Ser B 10:159–203, 1948) to the Bayesian nonparametric setting, providing a competitor to MclustDA (Fraley and Raftery in Am Stat Assoc 97:611–631, 2002). This approach models the data distribution for each class using a multivariate Polya tree and realizes impressive results in simulations and real data analyses. The flexibility gained from further relaxing the distributional assumptions of QDA can greatly improve the ability to correctly classify new observations for models with severe deviations from parametric distributional assumptions, while still performing well when the assumptions hold. The proposed method is quite fast compared to other supervised classifiers and very simple to implement as there are no kernel tricks or initialization steps perhaps making it one of the more user-friendly approaches to supervised learning. This highlights a significant feature of the proposed methodology as suboptimal tuning can greatly hamper classification performance; e.g., SVMs fit with non-optimal kernels perform significantly worse.
- Subjects :
- Statistics and Probability
0303 health sciences
Computer science
business.industry
Applied Mathematics
Supervised learning
Initialization
Quadratic classifier
Linear discriminant analysis
Machine learning
computer.software_genre
01 natural sciences
Computer Science Applications
Support vector machine
010104 statistics & probability
03 medical and health sciences
Kernel (statistics)
Feature (machine learning)
Artificial intelligence
0101 mathematics
business
computer
030304 developmental biology
Parametric statistics
Subjects
Details
- ISSN :
- 18625355 and 18625347
- Volume :
- 13
- Database :
- OpenAIRE
- Journal :
- Advances in Data Analysis and Classification
- Accession number :
- edsair.doi...........455e91b93270fca4575d7690af6687fb
- Full Text :
- https://doi.org/10.1007/s11634-018-0344-z