Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Markku Hinkka

Discovering Business Area Effects to Process Mining Analysis Using Clustering and Influence Analysis

Mar 18, 2020

Teemu Lehto, Markku Hinkka

Figure 1 for Discovering Business Area Effects to Process Mining Analysis Using Clustering and Influence Analysis

Figure 2 for Discovering Business Area Effects to Process Mining Analysis Using Clustering and Influence Analysis

Figure 3 for Discovering Business Area Effects to Process Mining Analysis Using Clustering and Influence Analysis

Figure 4 for Discovering Business Area Effects to Process Mining Analysis Using Clustering and Influence Analysis

Abstract:A common challenge for improving business processes in large organizations is that business people in charge of the operations are lacking a fact-based understanding of the execution details, process variants, and exceptions taking place in business operations. While existing process mining methodologies can discover these details based on event logs, it is challenging to communicate the process mining findings to business people. In this paper, we present a novel methodology for discovering business areas that have a significant effect on the process execution details. Our method uses clustering to group similar cases based on process flow characteristics and then influence analysis for detecting those business areas that correlate most with the discovered clusters. Our analysis serves as a bridge between BPM people and business, people facilitating the knowledge sharing between these groups. We also present an example analysis based on publicly available real-life purchase order process data.

* 12 pages. Paper accepted in 23rd International Conference on Business Information Systems (BIS 2020) to be published in a proceedings edition of the Lecture Notes in Business Information Processing

Via

Access Paper or Ask Questions

Exploiting Event Log Data-Attributes in RNN Based Prediction

Apr 15, 2019

Markku Hinkka, Teemu Lehto, Keijo Heljanko

Figure 1 for Exploiting Event Log Data-Attributes in RNN Based Prediction

Figure 2 for Exploiting Event Log Data-Attributes in RNN Based Prediction

Figure 3 for Exploiting Event Log Data-Attributes in RNN Based Prediction

Figure 4 for Exploiting Event Log Data-Attributes in RNN Based Prediction

Abstract:In predictive process analytics, current and historical process data in event logs are used to predict future. E.g., to predict the next activity or how long a process will still require to complete. Recurrent neural networks (RNN) and its subclasses have been demonstrated to be well suited for creating prediction models. Thus far, event attributes have not been fully utilized in these models. The biggest challenge in exploiting them in prediction models is the potentially large amount of event attributes and attribute values. We present a novel clustering technique which allows for trade-offs between prediction accuracy and the time needed for model training and prediction. As an additional finding, we also found that this clustering method combined with having raw event attribute values provides even better prediction accuracy at the cost of additional time required for training and prediction. We also built a highly configurable test framework that can be used to efficiently evaluate different prediction approaches and parameterizations.

Via

Access Paper or Ask Questions

Classifying Process Instances Using Recurrent Neural Networks

Sep 16, 2018

Markku Hinkka, Teemu Lehto, Keijo Heljanko, Alexander Jung

Figure 1 for Classifying Process Instances Using Recurrent Neural Networks

Figure 2 for Classifying Process Instances Using Recurrent Neural Networks

Figure 3 for Classifying Process Instances Using Recurrent Neural Networks

Figure 4 for Classifying Process Instances Using Recurrent Neural Networks

Abstract:Process Mining consists of techniques where logs created by operative systems are transformed into process models. In process mining tools it is often desired to be able to classify ongoing process instances, e.g., to predict how long the process will still require to complete, or to classify process instances to different classes based only on the activities that have occurred in the process instance thus far. Recurrent neural networks and its subclasses, such as Gated Recurrent Unit (GRU) and Long Short-Term Memory (LSTM), have been demonstrated to be able to learn relevant temporal features for subsequent classification tasks. In this paper we apply recurrent neural networks to classifying process instances. The proposed model is trained in a supervised fashion using labeled process instances extracted from event log traces. This is the first time we know of GRU having been used in classifying business process instances. Our main experimental results shows that GRU outperforms LSTM remarkably in training time while giving almost identical accuracies to LSTM models. Additional contributions of our paper are improving the classification model training time by filtering infrequent activities, which is a technique commonly used, e.g., in Natural Language Processing (NLP).

* Proceedings of the BPM 2018 Workshops

Via

Access Paper or Ask Questions

Structural Feature Selection for Event Logs

May 17, 2018

Markku Hinkka, Teemu Lehto, Keijo Heljanko, Alexander Jung

Figure 1 for Structural Feature Selection for Event Logs

Figure 2 for Structural Feature Selection for Event Logs

Figure 3 for Structural Feature Selection for Event Logs

Figure 4 for Structural Feature Selection for Event Logs

Abstract:We consider the problem of classifying business process instances based on structural features derived from event logs. The main motivation is to provide machine learning based techniques with quick response times for interactive computer assisted root cause analysis. In particular, we create structural features from process mining such as activity and transition occurrence counts, and ordering of activities to be evaluated as potential features for classification. We show that adding such structural features increases the amount of information thus potentially increasing classification accuracy. However, there is an inherent trade-off as using too many features leads to too long run-times for machine learning classification models. One way to improve the machine learning algorithms' run-time is to only select a small number of features by a feature selection algorithm. However, the run-time required by the feature selection algorithm must also be taken into account. Also, the classification accuracy should not suffer too much from the feature selection. The main contributions of this paper are as follows: First, we propose and compare six different feature selection algorithms by means of an experimental setup comparing their classification accuracy and achievable response times. Second, we discuss the potential use of feature selection results for computer assisted root cause analysis as well as the properties of different types of structural features in the context of feature selection.

* Extended version of a paper published in the proceedings of the BPM 2017 workshops

Via

Access Paper or Ask Questions