Semisupervised Multiclass Classification Problems With Scarcity of Labeled Data: A Theoretical Study.

Authors :
Ortigosa-Hernandez, Jonathan
Inza, Inaki
Lozano, Jose A.
Source :
IEEE Transactions on Neural Networks & Learning Systems; Dec 2016, Vol. 27, Issue 12, pp. 2602-2614, 13 pp.
Publication Year :
2016

Abstract

In recent years, the performance of semisupervised learning (SSL) has been theoretically investigated. However, most of this theoretical development has focused on binary classification problems. In this paper, we go a step further by extending the work of Castelli and Cover to the multiclass paradigm. In particular, we consider the key problem in SSL of classifying an unseen instance $x$ into one of $K$ different classes, using a training data set sampled from a mixture density distribution and composed of $l$ labeled records and $u$ unlabeled examples. Even when the mixture is identifiable and infinitely many unlabeled examples are available, labeled records are needed to determine the $K$ decision regions. We therefore first investigate the minimum number of labeled examples needed to accomplish that task. Then, we propose an optimal multiclass learning algorithm, which is a generalization of the optimal procedure proposed in the literature for binary problems. Finally, we use this generalization to study the probability of error when the binary class constraint is relaxed. [ABSTRACT FROM PUBLISHER]
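The setting in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: it assumes a one-dimensional Gaussian mixture whose components are already known exactly (standing in for the infinite-unlabeled-data limit), and it uses the few labeled records only to decide which class label belongs to which component, by scoring every assignment with the labeled-data log-likelihood. The names `log_gaussian`, `best_labeling`, and `classify` are hypothetical, as are the toy parameters.

```python
import math
from itertools import permutations


def log_gaussian(x, mean, std):
    """Log-density of a 1-D Gaussian; working in logs avoids underflow."""
    z = (x - mean) / std
    return -0.5 * z * z - math.log(std) - 0.5 * math.log(2 * math.pi)


def best_labeling(components, labeled):
    """Pick the class-to-component assignment that best explains the labeled data.

    components: list of (weight, mean, std), assumed known exactly.
    labeled:    list of (x, class_index) pairs.
    Returns a tuple perm where perm[c] is the component assigned to class c.
    """
    k = len(components)
    best_perm, best_ll = None, -math.inf
    # Brute force over all K! assignments; only sensible for small K.
    for perm in permutations(range(k)):
        ll = 0.0
        for x, c in labeled:
            weight, mean, std = components[perm[c]]
            ll += math.log(weight) + log_gaussian(x, mean, std)
        if ll > best_ll:
            best_perm, best_ll = perm, ll
    return best_perm


def classify(x, components, perm):
    """Assign x to the class whose component has the largest weighted density."""
    def score(c):
        weight, mean, std = components[perm[c]]
        return math.log(weight) + log_gaussian(x, mean, std)
    return max(range(len(components)), key=score)


# Toy mixture with K = 3 well-separated components (illustrative values only).
components = [(1 / 3, -5.0, 1.0), (1 / 3, 0.0, 1.0), (1 / 3, 5.0, 1.0)]
# Two labeled records, one from class 0 and one from class 1.
labeled = [(4.8, 0), (-5.2, 1)]
perm = best_labeling(components, labeled)
```

In this toy case the two labeled records fix the components of classes 0 and 1, and class 2 receives the remaining component by elimination, so `perm` is `(2, 0, 1)`. The brute-force search over assignments is chosen for clarity and grows factorially with the number of classes.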

Subjects

Subjects :
MACHINE learning
CLASSIFICATION

Details

Language :
English
ISSN :
2162-237X
Volume :
27
Issue :
12
Database :
Complementary Index
Journal :
IEEE Transactions on Neural Networks & Learning Systems
Publication Type :
Periodical
Accession number :
119593056
Full Text :
https://doi.org/10.1109/TNNLS.2015.2498525