Back to Search Start Over

Learning accurate very fast decision trees from uncertain data streams.

Authors :
Liang, Chunquan
Zhang, Yang
Shi, Peng
Hu, Zhengguo
Source :
International Journal of Systems Science. Dec2015, Vol. 46 Issue 16, p3032-3050. 19p.
Publication Year :
2015

Abstract

Most existing works on data stream classification assume the streaming data is precise and definite. Such assumption, however, does not always hold in practice, since data uncertainty is ubiquitous in data stream applications due to imprecise measurement, missing values, privacy protection, etc. The goal of this paper is to learn accurate decision tree models from uncertain data streams for classification analysis. On the basis of very fast decision tree (VFDT) algorithms, we proposed an algorithm for constructing anuncertainVFDTtree withclassifiers at tree leaves (uVFDTc). The uVFDTc algorithm can exploit uncertain information effectively and efficiently in both the learning and the classification phases. In the learning phase, it uses Hoeffding bound theory to learn from uncertain data streams and yield fast and reasonable decision trees. In the classification phase, at tree leaves it uses uncertain naive Bayes (UNB) classifiers to improve the classification performance. Experimental results on both synthetic and real-life datasets demonstrate the strong ability of uVFDTc to classify uncertain data streams. The use of UNB at tree leaves has improved the performance of uVFDTc, especially the any-time property, the benefit of exploiting uncertain information, and the robustness against uncertainty. [ABSTRACT FROM PUBLISHER]

Details

Language :
English
ISSN :
00207721
Volume :
46
Issue :
16
Database :
Academic Search Index
Journal :
International Journal of Systems Science
Publication Type :
Academic Journal
Accession number :
108329752
Full Text :
https://doi.org/10.1080/00207721.2014.895877