Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Natalia Sidorova

In System Alignments we Trust! Explainable Alignments via Projections

Jan 24, 2025

Dominique Sommers, Natalia Sidorova, Boudewijn van Dongen

Figure 1 for In System Alignments we Trust! Explainable Alignments via Projections

Figure 2 for In System Alignments we Trust! Explainable Alignments via Projections

Figure 3 for In System Alignments we Trust! Explainable Alignments via Projections

Figure 4 for In System Alignments we Trust! Explainable Alignments via Projections

Abstract:Alignments are a well-known process mining technique for reconciling system logs and normative process models. Evidence of certain behaviors in a real system may only be present in one representation - either a log or a model - but not in the other. Since for processes in which multiple entities, like objects and resources, are involved in the activities, their interactions affect the behavior and are therefore essential to take into account in the alignments. Additionally, both logged and modeled representations of reality may be imprecise and only partially represent some of these entities, but not all. In this paper, we introduce the concept of "relaxations" through projections for alignments to deal with partially correct models and logs. Relaxed alignments help to distinguish between trustworthy and untrustworthy content of the two representations (the log and the model) to achieve a better understanding of the underlying process and expose quality issues.

Via

Access Paper or Ask Questions

Discovering More Precise Process Models from Event Logs by Filtering Out Chaotic Activities

Nov 03, 2017

Niek Tax, Natalia Sidorova, Wil M. P. van der Aalst

Figure 1 for Discovering More Precise Process Models from Event Logs by Filtering Out Chaotic Activities

Figure 2 for Discovering More Precise Process Models from Event Logs by Filtering Out Chaotic Activities

Figure 3 for Discovering More Precise Process Models from Event Logs by Filtering Out Chaotic Activities

Figure 4 for Discovering More Precise Process Models from Event Logs by Filtering Out Chaotic Activities

Abstract:Process Discovery is concerned with the automatic generation of a process model that describes a business process from execution data of that business process. Real life event logs can contain chaotic activities. These activities are independent of the state of the process and can, therefore, happen at rather arbitrary points in time. We show that the presence of such chaotic activities in an event log heavily impacts the quality of the process models that can be discovered with process discovery techniques. The current modus operandi for filtering activities from event logs is to simply filter out infrequent activities. We show that frequency-based filtering of activities does not solve the problems that are caused by chaotic activities. Moreover, we propose a novel technique to filter out chaotic activities from event logs. We evaluate this technique on a collection of seventeen real-life event logs that originate from both the business process management domain and the smart home environment domain. As demonstrated, the developed activity filtering methods enable the discovery of process models that are more behaviorally specific compared to process models that are discovered using standard frequency-based filtering.

* Journal of Intelligent Information Systems, (2018), 1-33

Via

Access Paper or Ask Questions

Generating Time-Based Label Refinements to Discover More Precise Process Models

Oct 31, 2017

Niek Tax, Emin Alasgarov, Natalia Sidorova, Wil M. P. van der Aalst, Reinder Haakma

Figure 1 for Generating Time-Based Label Refinements to Discover More Precise Process Models

Figure 2 for Generating Time-Based Label Refinements to Discover More Precise Process Models

Figure 3 for Generating Time-Based Label Refinements to Discover More Precise Process Models

Figure 4 for Generating Time-Based Label Refinements to Discover More Precise Process Models

Abstract:Process mining is a research field focused on the analysis of event data with the aim of extracting insights related to dynamic behavior. Applying process mining techniques on data from smart home environments has the potential to provide valuable insights in (un)healthy habits and to contribute to ambient assisted living solutions. Finding the right event labels to enable the application of process mining techniques is however far from trivial, as simply using the triggering sensor as the label for sensor events results in uninformative models that allow for too much behavior (overgeneralizing). Refinements of sensor level event labels suggested by domain experts have been shown to enable discovery of more precise and insightful process models. However, there exists no automated approach to generate refinements of event labels in the context of process mining. In this paper we propose a framework for the automated generation of label refinements based on the time attribute of events, allowing us to distinguish behaviourally different instances of the same event type based on their time attribute. We show on a case study with real life smart home event data that using automatically generated refined labels in process discovery, we can find more specific, and therefore more insightful, process models. We observe that one label refinement could have an effect on the usefulness of other label refinements when used together. Therefore, we explore four strategies to generate useful combinations of multiple label refinements and evaluate those on three real life smart home event logs.

Via

Access Paper or Ask Questions

Guided Interaction Exploration in Artifact-centric Process Models

Jun 07, 2017

Maikel L. van Eck, Natalia Sidorova, Wil M. P. van der Aalst

Figure 1 for Guided Interaction Exploration in Artifact-centric Process Models

Figure 2 for Guided Interaction Exploration in Artifact-centric Process Models

Figure 3 for Guided Interaction Exploration in Artifact-centric Process Models

Figure 4 for Guided Interaction Exploration in Artifact-centric Process Models

Abstract:Artifact-centric process models aim to describe complex processes as a collection of interacting artifacts. Recent development in process mining allow for the discovery of such models. However, the focus is often on the representation of the individual artifacts rather than their interactions. Based on event data we can automatically discover composite state machines representing artifact-centric processes. Moreover, we provide ways of visualizing and quantifying interactions among different artifacts. For example, we are able to highlight strongly correlated behaviours in different artifacts. The approach has been fully implemented as a ProM plug-in; the CSM Miner provides an interactive artifact-centric process discovery tool focussing on interactions. The approach has been evaluated using real life data sets, including the personal loan and overdraft process of a Dutch financial institution.

* 10 pages, 4 figures, to be published in proceedings of the 19th IEEE Conference on Business Informatics, CBI 2017

Via

Access Paper or Ask Questions

Mining Process Model Descriptions of Daily Life through Event Abstraction

May 25, 2017

Niek Tax, Natalia Sidorova, Reinder Haakma, Wil M. P. van der Aalst

Figure 1 for Mining Process Model Descriptions of Daily Life through Event Abstraction

Figure 2 for Mining Process Model Descriptions of Daily Life through Event Abstraction

Figure 3 for Mining Process Model Descriptions of Daily Life through Event Abstraction

Figure 4 for Mining Process Model Descriptions of Daily Life through Event Abstraction

Abstract:Process mining techniques focus on extracting insight in processes from event logs. Process mining has the potential to provide valuable insights in (un)healthy habits and to contribute to ambient assisted living solutions when applied on data from smart home environments. However, events recorded in smart home environments are on the level of sensor triggers, at which process discovery algorithms produce overgeneralizing process models that allow for too much behavior and that are difficult to interpret for human experts. We show that abstracting the events to a higher-level interpretation can enable discovery of more precise and more comprehensible models. We present a framework for the extraction of features that can be used for abstraction with supervised learning methods that is based on the XES IEEE standard for event logs. This framework can automatically abstract sensor-level events to their interpretation at the human activity level, after training it on training data for which both the sensor and human activity events are known. We demonstrate our abstraction framework on three real-life smart home event logs and show that the process models that can be discovered after abstraction are more precise indeed.

* Studies in Computational Intelligence, 751 (2017) 83-104
* arXiv admin note: substantial text overlap with arXiv:1606.07283

Via

Access Paper or Ask Questions

The Imprecisions of Precision Measures in Process Mining

May 16, 2017

Niek Tax, Xixi Lu, Natalia Sidorova, Dirk Fahland, Wil M. P. van der Aalst

Figure 1 for The Imprecisions of Precision Measures in Process Mining

Figure 2 for The Imprecisions of Precision Measures in Process Mining

Figure 3 for The Imprecisions of Precision Measures in Process Mining

Figure 4 for The Imprecisions of Precision Measures in Process Mining

Abstract:In process mining, precision measures are used to quantify how much a process model overapproximates the behavior seen in an event log. Although several measures have been proposed throughout the years, no research has been done to validate whether these measures achieve the intended aim of quantifying over-approximation in a consistent way for all models and logs. This paper fills this gap by postulating a number of axioms for quantifying precision consistently for any log and any model. Further, we show through counter-examples that none of the existing measures consistently quantifies precision.

* Information Processing Letters, 135 (2018), 1-8

Via

Access Paper or Ask Questions

Mining Local Process Models

May 16, 2017

Niek Tax, Natalia Sidorova, Reinder Haakma, Wil M. P. van der Aalst

Figure 1 for Mining Local Process Models

Figure 2 for Mining Local Process Models

Figure 3 for Mining Local Process Models

Figure 4 for Mining Local Process Models

Abstract:In this paper we describe a method to discover frequent behavioral patterns in event logs. We express these patterns as \emph{local process models}. Local process model mining can be positioned in-between process discovery and episode / sequential pattern mining. The technique presented in this paper is able to learn behavioral patterns involving sequential composition, concurrency, choice and loop, like in process mining. However, we do not look at start-to-end models, which distinguishes our approach from process discovery and creates a link to episode / sequential pattern mining. We propose an incremental procedure for building local process models capturing frequent patterns based on so-called process trees. We propose five quality dimensions and corresponding metrics for local process models, given an event log. We show monotonicity properties for some quality dimensions, enabling a speedup of local process model discovery through pruning. We demonstrate through a real life case study that mining local patterns allows us to get insights in processes where regular start-to-end process discovery techniques are only able to learn unstructured, flower-like, models.

* Journal of Innovation in Digital Ecosystems volume 3 issue 2 (2016), pages 183-196
* Published in Elsevier's Journal of Innovation in Digital Ecosystems, Special Issue on Data Mining

Via

Access Paper or Ask Questions

Interest-Driven Discovery of Local Process Models

Mar 21, 2017

Niek Tax, Benjamin Dalmas, Natalia Sidorova, Wil M P van der Aalst, Sylvie Norre

Figure 1 for Interest-Driven Discovery of Local Process Models

Figure 2 for Interest-Driven Discovery of Local Process Models

Figure 3 for Interest-Driven Discovery of Local Process Models

Figure 4 for Interest-Driven Discovery of Local Process Models

Abstract:Local Process Models (LPM) describe structured fragments of process behavior occurring in the context of less structured business processes. Traditional LPM discovery aims to generate a collection of process models that describe highly frequent behavior, but these models do not always provide useful answers for questions posed by process analysts aiming at business process improvement. We propose a framework for goal-driven LPM discovery, based on utility functions and constraints. We describe four scopes on which these utility functions and constrains can be defined, and show that utility functions and constraints on different scopes can be combined to form composite utility functions/constraints. Finally, we demonstrate the applicability of our approach by presenting several actionable business insights discovered with LPM discovery on two real life data sets.

* Information Systems (2018), 1-31
* submitted to the International Conference on Business Process Management (BPM) 2017

Via

Access Paper or Ask Questions

Heuristic Approaches for Generating Local Process Models through Log Projections

Oct 10, 2016

Niek Tax, Natalia Sidorova, Wil M. P. van der Aalst, Reinder Haakma

Figure 1 for Heuristic Approaches for Generating Local Process Models through Log Projections

Figure 2 for Heuristic Approaches for Generating Local Process Models through Log Projections

Figure 3 for Heuristic Approaches for Generating Local Process Models through Log Projections

Figure 4 for Heuristic Approaches for Generating Local Process Models through Log Projections

Abstract:Local Process Model (LPM) discovery is focused on the mining of a set of process models where each model describes the behavior represented in the event log only partially, i.e. subsets of possible events are taken into account to create so-called local process models. Often such smaller models provide valuable insights into the behavior of the process, especially when no adequate and comprehensible single overall process model exists that is able to describe the traces of the process from start to end. The practical application of LPM discovery is however hindered by computational issues in the case of logs with many activities (problems may already occur when there are more than 17 unique activities). In this paper, we explore three heuristics to discover subsets of activities that lead to useful log projections with the goal of speeding up LPM discovery considerably while still finding high-quality LPMs. We found that a Markov clustering approach to create projection sets results in the largest improvement of execution time, with discovered LPMs still being better than with the use of randomly generated activity sets of the same size. Another heuristic, based on log entropy, yields a more moderate speedup, but enables the discovery of higher quality LPMs. The third heuristic, based on the relative information gain, shows unstable performance: for some data sets the speedup and LPM quality are higher than with the log entropy based method, while for other data sets there is no speedup at all.

* Proceedings of the IEEE Symposium Series on Computational Intelligence (SSCI), (2016) 1-8
* paper accepted and to appear in the proceedings of the IEEE Symposium on Computational Intelligence and Data Mining (CIDM), special session on Process Mining, part of the Symposium Series on Computational Intelligence (SSCI)

Via

Access Paper or Ask Questions

On Generation of Time-based Label Refinements

Sep 12, 2016

Niek Tax, Emin Alasgarov, Natalia Sidorova, Reinder Haakma

Figure 1 for On Generation of Time-based Label Refinements

Figure 2 for On Generation of Time-based Label Refinements

Figure 3 for On Generation of Time-based Label Refinements

Figure 4 for On Generation of Time-based Label Refinements

Abstract:Process mining is a research field focused on the analysis of event data with the aim of extracting insights in processes. Applying process mining techniques on data from smart home environments has the potential to provide valuable insights in (un)healthy habits and to contribute to ambient assisted living solutions. Finding the right event labels to enable application of process mining techniques is however far from trivial, as simply using the triggering sensor as the label for sensor events results in uninformative models that allow for too much behavior (overgeneralizing). Refinements of sensor level event labels suggested by domain experts have shown to enable discovery of more precise and insightful process models. However, there exist no automated approach to generate refinements of event labels in the context of process mining. In this paper we propose a framework for automated generation of label refinements based on the time attribute of events. We show on a case study with real life smart home event data that behaviorally more specific, and therefore more insightful, process models can be found by using automatically generated refined labels in process discovery.

* Accepted at CS&P workshop 2016 Overlap in preliminaries with arXiv:1606.07259

Via

Access Paper or Ask Questions