1. Event abstraction for process mining using supervised learning techniques
- Author
-
Tax, N., Sidorova, N., Haakma, R., van der Aalst, W.M.P., Bi, Y., Kapoor, S., Bhatia, R., and Process Science
- Subjects
Process modeling ,business.industry ,Event (computing) ,Computer science ,Feature vector ,Supervised learning ,Process mining ,02 engineering and technology ,Machine learning ,computer.software_genre ,Conformance checking ,Business process discovery ,Event abstraction ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Abstraction (linguistics) ,Probabilistic graphical models - Abstract
Process mining techniques focus on extracting insight in processes from event logs. In many cases, events recorded in the event log are too fine-grained, causing process discovery algorithms to discover incomprehensible process models or process models that are not representative of the event log. We show that when process discovery algorithms are only able to discover an unrepresentative process model from a low-level event log, structure in the process can in some cases still be discovered by first abstracting the event log to a higher level of granularity. This gives rise to the challenge to bridge the gap between an original low-level event log and a desired high-level perspective on this log, such that a more structured or more comprehensible process model can be discovered. We show that supervised learning can be leveraged for the event abstraction task when annotations with high-level interpretations of the low-level events are available for a subset of the sequences (i.e., traces). We present a method to generate feature vector representations of events based on XES extensions, and describe an approach to abstract events in an event log with Condition Random Fields using these event features. Furthermore, we propose a sequence-focused metric to evaluate supervised event abstraction results that fits closely to the tasks of process discovery and conformance checking. We conclude this paper by demonstrating the usefulness of supervised event abstraction for obtaining more structured and/or more comprehensible process models using both real life event data and synthetic event data.
- Published
- 2018