Back to Search
Start Over
E-CVFDT: An improving CVFDT method for concept drift data stream
- Source :
- 2013 International Conference on Communications, Circuits and Systems (ICCCAS).
- Publication Year :
- 2013
- Publisher :
- IEEE, 2013.
-
Abstract
- Distribution of data stream is always changed in the real world. This problem is usually defined as concept drift [1]. The state-of-the-art decision tree classification method CVFDT[2] can solve the concept drift problem well, but the efficiency is debased because of its general method of handling instances in CVFDT without considering the types of concept drift. In this paper, an algorithm called Efficient CVFDT (E-CVFDT) is proposed to improve the efficiency of CVFDT. E-CVFDT introduces cache mechanism and treats the instances in three kinds of concept drift respectively, i.e. accidental concept drift, gradual concept drift, instantaneously concept drift. Besides, in E-CVFDT, the cached instances which have similar attributes will be sent in batches to calculate the information gain calculation rather than in sequence adopted by CVFDT. The experiments are carried out on the MOA platform. The results show that E-CVFDT algorithm achieves not only better efficiency but also higher accuracy than CVFDT algorithm.
Details
- Database :
- OpenAIRE
- Journal :
- 2013 International Conference on Communications, Circuits and Systems (ICCCAS)
- Accession number :
- edsair.doi...........77742a9b71b0f2d35f6247bfd769b3c7