Start Over

Learning Distributions by Their Density Levels: A Paradigm for Learning without a Teacher

Authors :: Michael Lindenbaum
Shai Ben-David
Source :: Journal of Computer and System Sciences. 55(1):171-182
Publication Year :: 1997
Publisher :: Elsevier BV, 1997.
Abstract: We propose a mathematical model for learning the high-density areas of an unknown distribution from (unlabeled) random points drawn according to this distribution. While this type of a learning task has not been previously addressed in the computational learnability literature, we believe that this it a rather basic problem that appears in many practical learning scenarios. From a statistical theory standpoint, our model may be viewed as a restricted instance of the fundamental issue of inferring information about a probability distribution from the random samples it generates. From a computational learning angle, what we propose is a few framework of unsupervised concept learning. The examples provided to the learner in our model are not labeled (and are not necessarily all positive or all negative). The only information about their membership is indirectly disclosed to the student through the sampling distribution. We investigate the basic features of the proposed model and provide lower and upper bounds on the sample complexity of such learning tasks. We prove that classes whose VC-dimension is finite are learnable in a very strong sense, while on the other hand,ε-covering numbers of a concept class impose lower bounds on the sample size needed for learning in our models. One direction of the proof involves a reduction of the density-level learnability to PAC learning with respect to fixed distributions (as well as some fundamental statistical lower bounds), while the sufficiency condition is proved through the introduction of a generic learning algorithm.

Subjects :: Computer Science::Machine Learning
Computer science
business.industry
Computer Networks and Communications
Algorithmic learning theory
Applied Mathematics
Probably approximately correct learning
Stability (learning theory)
Semi-supervised learning
Theoretical Computer Science
Computational learning theory
Computational Theory and Mathematics
Sample exclusion dimension
Unsupervised learning
Artificial intelligence
Instance-based learning
business

Details

ISSN :: 00220000
Volume :: 55
Issue :: 1
Database :: OpenAIRE
Journal :: Journal of Computer and System Sciences
Accession number :: edsair.doi.dedup.....7981922103258ae718062a4a24482ed9
Full Text :: https://doi.org/10.1006/jcss.1997.1507

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Learning Distributions by Their Density Levels: A Paradigm for Learning without a Teacher

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Learning Distributions by Their Density Levels: A Paradigm for Learning without a Teacher

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources