A Framework for the Discovery of Predictive Fix-time Models

Francesco Folino, Massimo Guarascio, Luigi Pontieri

2014

Abstract

Fix-time prediction is a key task in bug tracking systems, which has been recently faced through the definition of inductive learning methods, trained to estimate the time needed to solve a case at the moment when it is reported. And yet, the actions performed on a bug along its life can help refine the prediction of its (remaining) fix time, possibly with the help of Process Mining techniques. However, typical bug-tracking systems lack any task-oriented description of the resolution process, and store fine-grain records, just capturing bug attributes’ updates. Moreover, no general approach has been proposed to support the definition of derived data, which can help improve considerably fix-time predictions. A new methodological framework for the analysis of bug repositories is presented here, along with an associated toolkit, leveraging two kinds of tools: (i) a combination of modular and flexible data-transformation mechanisms, for producing an enhanced process-oriented view of log data, and (ii) a series of ad-hoc induction techniques, for extracting a prediction model out of such a view. Preliminary results on the bug repository of a real project confirm the validity of our proposal and, in particular, of our log transformation methods.

References

  1. Anbalagan, P. and Vouk, M. (2009). On predicting the time taken to correct bug reports in open source projects. In Proc. of Int. Conf. on Software Maintenance (ICSM'09), pages 523-526.
  2. Bhattacharya, P. and Neamtiu, I. (2011). Bug-fix time prediction models: can we do better? In Proc. of 8th Intl. Conf. on Mining Software Repositories (MSR'11), pages 207-210.
  3. Breiman, L. (2001). Random forests. Machine Learning, 45(1):5-32.
  4. Costa, G., Guarascio, M., Manco, G., Ortale, R., and Ritacco, E. (2009). Rule learning with probabilistic smoothing. In Proc. of 11th Int. Conf. on Data Wareh. and Knowl. Discovery (DaWaK'09), pages 428-440.
  5. Folino, F., Guarascio, M., and Pontieri, L. (2012). Discovering context-aware models for predicting business process performances. In Proc. of 20th Intl. Conf. on Cooperative Inf. Systems (CoopIS'12), pages 287-304.
  6. Folino, F., Guarascio, M., and Pontieri, L. (2013). A data-adaptive trace abstraction approach to the prediction of business process performances. In Proc. of 15th Intl. Conf. on Enterprise Information Systems (ICEIS'13), pages 56-65.
  7. Giger, E., Pinzger, M., and Gall, H. (2010). Predicting the fix time of bugs. In Proc. of 2nd Intl. Workshop on Recommendation Systems for Software Engineering (RSSE'10), pages 52-56.
  8. Hooimeijer, P. and Weimer, W. (2007). Modeling bug report quality. In Proc. of 22nd IEEE/ACM Intl. Conf. on Automated Software Engin. (ASE'07), pages 34-43.
  9. Marks, L., Zou, Y., and Hassan, A. E. (2011). Studying the fix-time for bugs in large open source projects. In Proc. of 7th Intl. Conf. on Predictive Models in Software Engineering (Promise'11), pages 11:1-11:8.
  10. Panjer, L. (2007). Predicting eclipse bug lifetimes. In Proc. of 4th Intl. Workshop on Mining Software Repositories (MSR'07), pages 29-.
  11. Quinlan, J. R. (1993). C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA.
  12. Tan, P.-N., Steinbach, M., and Kumar, V. (2005). Introduction to Data Mining. Addison-Wesley Longman.
  13. van der Aalst, W., van Dongen, B., Herbst, J., Maruster, L., Schimm, G., and Weijters, A. (2003). Workflow mining: a survey of issues and approaches. Data & Knowledge Engineering, 47(2):237-267.
  14. van der Aalst, W. M. P., Schonenberg, M. H., and Song, M. (2011). Time prediction based on process mining. Information Systems, 36(2):450-475.
Download


Paper Citation


in Harvard Style

Folino F., Guarascio M. and Pontieri L. (2014). A Framework for the Discovery of Predictive Fix-time Models . In Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-758-027-7, pages 99-108. DOI: 10.5220/0004897400990108


in Bibtex Style

@conference{iceis14,
author={Francesco Folino and Massimo Guarascio and Luigi Pontieri},
title={A Framework for the Discovery of Predictive Fix-time Models},
booktitle={Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2014},
pages={99-108},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004897400990108},
isbn={978-989-758-027-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - A Framework for the Discovery of Predictive Fix-time Models
SN - 978-989-758-027-7
AU - Folino F.
AU - Guarascio M.
AU - Pontieri L.
PY - 2014
SP - 99
EP - 108
DO - 10.5220/0004897400990108