Back to Search
Start Over
A Framework for Classification in Data Streams Using Multi-strategy Learning
- Source :
- Discovery Science ISBN: 9783319463063, DS
- Publication Year :
- 2016
- Publisher :
- Springer International Publishing, 2016.
-
Abstract
- Adaptive online learning algorithms have been successfully applied to fast-evolving data streams. Such streams are susceptible to concept drift, which implies that the most suitable type of classifier often changes over time. In this setting, a system that is able to seamlessly select the type of learner that presents the current “best” model holds much value. For example, in a scenario such as user profiling for security applications, model adaptation is of the utmost importance. We have implemented a multi-strategy framework, the so-called Tornado environment, which is able to run multiple and diverse classifiers simultaneously for decision making. In our framework, the current learner with the highest performance, at a specific point in time, is selected and the corresponding model is then provided to the user. In our implementation, we employ an Error-Memory-Runtime (EMR) measure which combines the error-rate, the memory usage and the runtime of classifiers as a performance indicator. We conducted experiments on synthetic and real-world datasets with the Hoeffding Tree, Naive Bayes, Perceptron, K-Nearest Neighbours and Decision Stumps algorithms. Our results indicate that our environment is able to adapt to changes and to continuously select the best current type of classifier, as the data evolve.
- Subjects :
- Concept drift
Computer science
Data stream mining
02 engineering and technology
Perceptron
computer.software_genre
Naive Bayes classifier
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Profiling (information science)
020201 artificial intelligence & image processing
Data mining
Performance indicator
Adaptive learning
Classifier (UML)
computer
Subjects
Details
- ISBN :
- 978-3-319-46306-3
- ISBNs :
- 9783319463063
- Database :
- OpenAIRE
- Journal :
- Discovery Science ISBN: 9783319463063, DS
- Accession number :
- edsair.doi...........933de5574a0c7273382e1df847dcc774