E-CVFDT: An improving CVFDT method for concept drift data stream

Authors :
Gang Liu
Cai-xia Liu
Qiao Liu
Zhiguang Qin
Hongrong Cheng
Source :
2013 International Conference on Communications, Circuits and Systems (ICCCAS).
Publication Year :
2013
Publisher :
IEEE, 2013.

Abstract

The distribution of a data stream is always changing in the real world, a problem usually defined as concept drift [1]. The state-of-the-art decision tree classification method CVFDT [2] handles concept drift well, but its efficiency suffers because it processes all instances in the same general way, without considering the type of concept drift. In this paper, an algorithm called Efficient CVFDT (E-CVFDT) is proposed to improve the efficiency of CVFDT. E-CVFDT introduces a cache mechanism and treats instances separately under three kinds of concept drift: accidental, gradual, and instantaneous. In addition, E-CVFDT sends cached instances with similar attributes in batches for the information gain calculation, rather than one at a time as CVFDT does. The experiments are carried out on the MOA platform. The results show that E-CVFDT achieves not only better efficiency but also higher accuracy than CVFDT.
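
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea of caching instances and computing information gain over a batch instead of per instance. The names `InstanceCache`, `capacity`, and `information_gain` are hypothetical, and flushing on a fixed buffer size is an assumption; the paper's grouping by similar attributes and its handling of the three drift types are not reproduced here.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(batch, attribute):
    """Information gain of splitting `batch` on `attribute`.

    `batch` is a list of (attributes_dict, label) pairs.
    """
    labels = [label for _, label in batch]
    partitions = defaultdict(list)
    for attrs, label in batch:
        partitions[attrs[attribute]].append(label)
    remainder = sum(
        len(part) / len(batch) * entropy(part) for part in partitions.values()
    )
    return entropy(labels) - remainder


class InstanceCache:
    """Hypothetical buffer: holds instances and releases them as one batch.

    Assumption: the batch is released when `capacity` instances are cached.
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.buffer = []

    def add(self, attrs, label):
        """Cache one instance; return the full batch when the buffer fills."""
        self.buffer.append((attrs, label))
        if len(self.buffer) >= self.capacity:
            batch, self.buffer = self.buffer, []
            return batch
        return None


stream = [
    ({"outlook": "sunny", "windy": True}, "no"),
    ({"outlook": "sunny", "windy": False}, "no"),
    ({"outlook": "rainy", "windy": True}, "yes"),
    ({"outlook": "rainy", "windy": False}, "yes"),
]

cache = InstanceCache(capacity=4)
gains = {}
for attrs, label in stream:
    batch = cache.add(attrs, label)
    if batch is not None:
        # One gain computation per attribute for the whole batch.
        gains = {name: information_gain(batch, name) for name in ("outlook", "windy")}
```

In this toy stream, `outlook` separates the two classes perfectly while `windy` carries no information, so the batch computation ranks `outlook` as the better split attribute.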

Details

Database :
OpenAIRE
Journal :
2013 International Conference on Communications, Circuits and Systems (ICCCAS)
Accession number :
edsair.doi...........77742a9b71b0f2d35f6247bfd769b3c7