Automated Process Discovery: A Literature Review and a Comparative Evaluation with Domain Experts

Allar Soo
Process mining methods allow analysts to use logs of historical executions of business processes in order to gain knowledge about the actual performance of these processes.
One of the most widely studied process mining operations is automated process discovery.
An event log is taken as input by an automated process discovery method and produces a business process model as output that captures the control-flow relations between tasks that are described by the event log.
Several automated process discovery methods have been proposed in the past two decades, striking different tradeoffs between scalability, accuracy and complexity of the resulting models.
So far, automated process discovery methods have been evaluated in an ad hoc manner, with different authors employing different datasets, experimental setups, evaluation measures and baselines, often leading to incomparable conclusions and sometimes unreproducible results due to the use of non-publicly available datasets.
In this setting, this thesis provides a systematic review of automated process discovery methods and a systematic comparative evaluation of existing implementations of these methods with domain experts by using a real-life event log extracted from a international software engineering company and four quality metrics.
The review and evaluation results highlight gaps and unexplored tradeoffs in the field in the context of four business process model quality metrics.
The results of this master thesis allows researchers to improve the lacks in the automated process discovery methods and also answers question about the usability of process discovery techniques in industry.
Graduation Thesis language
Graduation Thesis type
Master - Software Engineering
Fabrizio Maria Maggi, Fredrik Payman Milani
Defence year