
Subsequently, the classification phase, aimed at
predicting the outcome of emergency department vis-
its, demonstrated the effectiveness of various Ma-
chine Learning algorithms. Specifically, the Random-
Forest and ExtraTrees classifiers consistently showed
the best performance, both in the configuration with
eight outcome classes and the reduced five-class ver-
sion. These results emphasize the potential of us-
ing pre-processed clinical data and machine learning
techniques to develop accurate predictive models in
healthcare settings.
Despite the promising performance achieved, fu-
ture research could further explore the integration be-
tween insights derived from Process Mining (e.g.,
bottlenecks or process variability) and the features
used in Machine Learning models. This could help
improve predictive capacity and provide more contex-
tualized information to support clinical and manage-
rial decision-making in the emergency context.
REFERENCES
A., G., L., A., L., G., J., H., C., I. P., and E., M. R. S. H.
(2000). Physiobank, physiotoolkit, and physionet:
Components of a new research resource for complex
physiologic signals. PhysioNet.
Aversano, L., Iammarino, M., Madau, A., Montano, D., and
Verdone, C. (2025a). Repairing missing activity la-
bels in healthcare process logs: A machine learning
approach. In Innovation in Medicine and Healthcare
(KES InMed 2024), pages 91–101. Springer.
Aversano, L., Iammarino, M., Madau, A., Pirlo, G., and
Semeraro, G. (2025b). Process mining applications
in healthcare: a systematic literature review. PeerJ
Computer Science, 11:e2613.
Aversano, L., Iammarino, M., Madau, A., Pirlo, G., and
Semeraro, G. (2025c). What time is it? finding which
temporal features are more useful for next activity pre-
diction. IEEE Open Journal of the Computer Society,
6:261–271.
Breiman, L. (2001). Random forests. Machine Learning,
45(1):5–32.
Chawla, N. V., Bowyer, K. W., Hall, L. O., and Kegelmeyer,
W. P. (2002). Smote: Synthetic minority over-
sampling technique. Journal of Artificial Intelligence
Research, 16:321–357.
Chen, T. and Guestrin, C. (2016). Xgboost: A scalable
tree boosting system. In Proceedings of the 22nd
ACM SIGKDD International Conference on Knowl-
edge Discovery and Data Mining, pages 785–794.
ACM.
Choi, E., Bahadori, M. T., Kulas, J. A., Schuetz, A., Stew-
art, W. F., and Sun, J. (2017). Retain: An interpretable
predictive model for healthcare using reverse time at-
tention mechanism.
Coscia, A., Dentamaro, V., Galantucci, S., Maci, A., and
Pirlo, G. (2024). Progesi: a proxy grammar to enhance
web application firewall for sql injection prevention.
IEEE Access.
De Roock, E. and Martin, N. (2022). Process mining in
healthcare – an updated perspective on the state of the
art. Journal of Biomedical Informatics, 127:103995.
Dentamaro, V., Convertini, V. N., Galantucci, S., Giglio,
P., Palmisano, T., Pirlo, G., et al. (2021). Ensemble
consensus: An unsupervised algorithm for anomaly
detection in network security data. In ITASEC, pages
309–318.
Dorogush, A. V., Ershov, V., and Gulin, A. (2018). Cat-
boost: Gradient boosting with categorical features
support. In 2018 IEEE International Conference on
Data Mining (ICDM), pages 660–669. IEEE.
Freund, Y. and Schapire, R. E. (1997). A decision-theoretic
generalization of on-line learning and an application
to boosting. Journal of Computer and System Sci-
ences, 55(1):119–139.
Friedman, J. H. (2001). Greedy function approximation: A
gradient boosting machine. The Annals of Statistics,
29(5):1189–1232.
Gattulli, V., Impedovo, D., Pirlo, G., and Semeraro, G.
(2023). Handwriting task-selection based on the anal-
ysis of patterns in classification results on alzheimer
dataset. In DSTNDS, pages 18–29.
Geurts, P., Ernst, D., and Wehenkel, L. (2006). Extremely
randomized trees. Machine Learning, 63(1):3–42.
Ghasemi, M. and Amyot, D. (2016). Process mining in
healthcare: A systematised literature review. Inter-
national Journal of Electronic Healthcare, 9:60.
Quinlan, J. R. (1986). Induction of decision trees. Machine
Learning, 1(1):81–106.
Rebuge
´
A. and Ferreira, D. R. (2012). Business process
analysis in healthcare environments: A methodol-
ogy based on process mining. Information Systems,
37(2):99–116.
Topol, E. J. (2019). Deep Medicine: How Artificial Intel-
ligence Can Make Healthcare Human Again. Basic
Books.
Van Der Aalst, W. M. (2019). A practitioner’s guide to
process mining: Limitations of the directly-follows
graph.
van der Aalst, W. M. P. (2016). Process Mining: Data Sci-
ence in Action (2nd ed.). Springer.
van der Heijden, M., Luitse, J. S. K., de Keizer, N. F.,
and Cornet, R. (2020). Process mining in emergency
medicine: A systematic review. Journal of the Amer-
ican Medical Informatics Association, 27(9):1471–
1483.
Wei, J., He, Z., Ouyang, C., and Moreira, C. (2023). Mim-
icel: Mimic-iv event log for emergency department
(version 2.1.0). Accessed: 2025-04-02.
Xie, F., Zhou, J., Lee, J. W., Tan, M., Li, S., Rajnth-
ern, L. S., Chee, M. L., Chakraborty, B., Wong, A.-
K. I., Dagan, A., Ong, M. E. H., Gao, F., and Liu,
N. (2022). Benchmarking emergency department pre-
diction models with machine learning and public elec-
tronic health records. Scientific Data, 9(1).
Process Mining and Machine Learning for Predicting Clinical Outcomes in Emergency Care: A Study on the MIMICEL Dataset
799