1. Reducing Dimensionality in Multiple Instance Learning with a Filter Method
- Author
-
Zafra, A., Pechenizkiy, M., Sebastian Ventura, Corchado, E., Romay, Mg, Savio, Am, and Data Mining
- Subjects
Learning classifier system ,Wake-sleep algorithm ,Computer science ,business.industry ,Active learning (machine learning) ,Algorithmic learning theory ,Competitive learning ,Supervised learning ,Stability (learning theory) ,Online machine learning ,Multi-task learning ,Feature selection ,Semi-supervised learning ,Machine learning ,computer.software_genre ,Ensemble learning ,Generalization error ,Computational learning theory ,Unsupervised learning ,Artificial intelligence ,Instance-based learning ,Empirical risk minimization ,business ,computer ,Curse of dimensionality - Abstract
In this article, we describe a feature selection algorithm which can automatically find relevant features for multiple instance learning. Multiple instance learning is considered an extension of traditional supervised learning where each example is made up of several instances and there is no specific information about particular instance labels. In this scenario, traditional supervised learning can not be applied directly and it is necessary to design new techniques. Our approach is based on principles of the well-known Relief-F algorithm which is extended to select features in this new learning paradigm by modifying the distance, the difference function and computation of the weight of the features. Four different variants of this algorithm are proposed to evaluate their performance in this new learning framework. Experiment results using a representative number of different algorithms show that predictive accuracy improves significantly when a multiple instance learning classifier is learnt on the reduced data set.