Back to Search Start Over

Evaluation of an associative classifier based on position-constrained frequent/closed subtree mining

Authors :
Michael Hecker
Andrea Tagarelli
Fedja Hadzic
Dang Bach Bui
Source :
Journal of Intelligent Information Systems. 45:397-421
Publication Year :
2014
Publisher :
Springer Science and Business Media LLC, 2014.

Abstract

Tree-structured data are popular in many domains making structural classification an important task. In this paper, an associative classification method is introduced based on a structure preserving flat representation of trees. A major difference to traditional tree mining techniques is that subtrees are constrained by the position in the original trees, leading to a drastic reduction in the number of rules generated, especially with data having great structural variation among tree instances. This characteristic would be desired in the current status of frequent pattern mining, where excessive patterns hinder the practical use of results. However the question remains whether this reduction comes at a high cost in accuracy and coverage rate reduction. We explore this aspect and compare the approach with a state-of-the-art structural classifier based on same subtree type, but not positional constrained in any way. We investigate the effect of using different types of frequent pattern (frequent or closed), or subtree types (induced, embedded or embedded-plus-disconnected subtrees) to the performance of the two classifiers. Different rule strength measures such as confidence, weighted confidence and likelihood are also examined in our study. The experiments on three real-world data sets reveal important similarities and differences between the methods.

Details

ISSN :
15737675 and 09259902
Volume :
45
Database :
OpenAIRE
Journal :
Journal of Intelligent Information Systems
Accession number :
edsair.doi...........3f32d37d9e963d6cf76b9badae7f842c
Full Text :
https://doi.org/10.1007/s10844-014-0312-9