Back to Search
Start Over
Large margin classification with indefinite similarities
- Source :
- Machine Learning. 103:215-237
- Publication Year :
- 2016
- Publisher :
- Springer Science and Business Media LLC, 2016.
-
Abstract
- Classification with indefinite similarities has attracted attention in the machine learning community. This is partly due to the fact that many similarity functions that arise in practice are not symmetric positive semidefinite, i.e. the Mercer condition is not satisfied, or the Mercer condition is difficult to verify. Examples of such indefinite similarities in machine learning applications are ample including, for instance, the BLAST similarity score between protein sequences, human-judged similarities between concepts and words, and the tangent distance or the shape matching distance in computer vision. Nevertheless, previous works on classification with indefinite similarities are not fully satisfactory. They have either introduced sources of inconsistency in handling past and future examples using kernel approximation, settled for local-minimum solutions using non-convex optimization, or produced non-sparse solutions by learning in Krein spaces. Despite the large volume of research devoted to this subject lately, we demonstrate in this paper how an old idea, namely the 1-norm support vector machine (SVM) proposed more than 15 years ago, has several advantages over more recent work. In particular, the 1-norm SVM method is conceptually simpler, which makes it easier to implement and maintain. It is competitive, if not superior to, all other methods in terms of predictive accuracy. Moreover, it produces solutions that are often sparser than more recent methods by several orders of magnitude. In addition, we provide various theoretical justifications by relating 1-norm SVM to well-established learning algorithms such as neural networks, SVM, and nearest neighbor classifiers. Finally, we conduct a thorough experimental evaluation, which reveals that the evidence in favor of 1-norm SVM is statistically significant.
- Subjects :
- Similarity (geometry)
Artificial neural network
Linear programming
business.industry
Supervised learning
02 engineering and technology
Positive-definite matrix
Machine learning
computer.software_genre
k-nearest neighbors algorithm
Support vector machine
Artificial Intelligence
Margin (machine learning)
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Artificial intelligence
business
computer
Software
Mathematics
Subjects
Details
- ISSN :
- 15730565 and 08856125
- Volume :
- 103
- Database :
- OpenAIRE
- Journal :
- Machine Learning
- Accession number :
- edsair.doi...........a70861264cb5cb0b6e069f10cff59f0a
- Full Text :
- https://doi.org/10.1007/s10994-015-5542-8