Back to Search
Start Over
Semisupervised Multiclass Classification Problems With Scarcity of Labeled Data: A Theoretical Study.
- Source :
- IEEE Transactions on Neural Networks & Learning Systems; Dec2016, Vol. 27 Issue 12, p2602-2614, 13p
- Publication Year :
- 2016
-
Abstract
- In recent years, the performance of semisupervised learning (SSL) has been theoretically investigated. However, most of this theoretical development has focused on binary classification problems. In this paper, we take it a step further by extending the work of Castelli and Cover to the multiclass paradigm. In particular, we consider the key problem in SSL of classifying an unseen instance x into one of $K$ different classes, using a training data set sampled from a mixture density distribution and composed of $l$ labeled records and $u$ unlabeled examples. Even under the assumption of identifiability of the mixture and having infinite unlabeled examples, labeled records are needed to determine the $K$ decision regions. Therefore, in this paper, we first investigate the minimum number of labeled examples needed to accomplish that task. Then, we propose an optimal multiclass learning algorithm, which is a generalization of the optimal procedure proposed in the literature for binary problems. Finally, we make use of this generalization to study the probability of error when the binary class constraint is relaxed. [ABSTRACT FROM PUBLISHER]
- Subjects :
- MACHINE learning
CLASSIFICATION
Subjects
Details
- Language :
- English
- ISSN :
- 2162237X
- Volume :
- 27
- Issue :
- 12
- Database :
- Complementary Index
- Journal :
- IEEE Transactions on Neural Networks & Learning Systems
- Publication Type :
- Periodical
- Accession number :
- 119593056
- Full Text :
- https://doi.org/10.1109/TNNLS.2015.2498525