Control-flow based anomaly detection in the bug-fixing process of open-source projects
In the past few years, substantial research has been conducted to find anomalies in real-world business processes. Existing research uses either process mining techniques or discrete sequence-based anomaly detection techniques. The bug-fixing process of various open-source projects has previously been analyzed with process mining techniques to discover process inefficiencies. These works rely on generic process mining tools to create the process models, and they do not evaluate the performance of their proposed conformance checking algorithms. In addition, discrete sequence-based analogy and anomaly detection techniques have not been discussed in the context of the bug-fixing process. In this paper, we report a bug-fixing process dataset for 30 Apache open-source projects that use the JIRA bug tracking system for bug reporting. This real-world dataset is analyzed to discover the anomalous process sequences and the root cause of anomalies. The contributions of this paper include (i) a formalized approach for pre-processing and transforming the bug report history data from bug tracking systems into event logs suitable for process analysis; (ii) a process mining based anomaly detection framework for bug-fixing processes that comprises our proposed algorithms for process discovery and conformance checking; and (iii) an artificial labelled process dataset available at the Mendeley open-source dataset repository (doi:10.17632/5yb2xv93w3.1). © 2020 Association for Computing Machinery.
Journal | ACM International Conference Proceeding Series |
---|---|
Publisher | Association for Computing Machinery |
Open Access | No |