Back to Search Start Over

Efficiently mining cohesion-based patterns and rules in event sequences.

Authors :
Cule, Boris
Feremans, Len
Goethals, Bart
Source :
Data Mining & Knowledge Discovery; Jul2019, Vol. 33 Issue 4, p1125-1182, 58p
Publication Year :
2019

Abstract

Discovering patterns in long event sequences is an important data mining task. Traditionally, research focused on frequency-based quality measures that allow algorithms to use the anti-monotonicity property to prune the search space and efficiently discover the most frequent patterns. In this work, we step away from such measures, and evaluate patterns using cohesion — a measure of how close to each other the items making up the pattern appear in the sequence on average. We tackle the fact that cohesion is not an anti-monotonic measure by developing an upper bound on cohesion in order to prune the search space. By doing so, we are able to efficiently unearth rare, but strongly cohesive, patterns that existing methods often fail to discover. Furthermore, having found the occurrences of cohesive itemsets in the input sequence, we use them to discover the representative sequential patterns and the dominant partially ordered episodes, without going through the computationally expensive candidate generation procedures typically associated with sequential pattern and episode mining. Experiments show that our method efficiently discovers important patterns that existing state-of-the-art methods fail to discover. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13845810
Volume :
33
Issue :
4
Database :
Complementary Index
Journal :
Data Mining & Knowledge Discovery
Publication Type :
Academic Journal
Accession number :
136939267
Full Text :
https://doi.org/10.1007/s10618-019-00628-0