Predicting Trains Delays using a Two-level Machine Learning Approach

Hassiba Laifa, Raoudha Khcherif, Henda Ben Ghezala

2022

Abstract

Train delay is a critical problem in railway systems. A previous prediction of delays is a critical issue advantageous for passengers to re-plan their journeys more reliably. It is also essential for railway operators to control the feasibility of timetable realization for more efficient train schedules. This paper aims to present a novel two-level Light Gradient Boosting Machine (LightGBM) approach that combines classification and regression in a hybrid model. It was proposed to predict passenger train delays on the Tunisian railway. The first level indicates the class of delay, where the delays are divided into intervals of 5 minutes ([0,5], [6,10], …, [>60]), 13 classes in total were obtained. The second level then predicts the actual delay in minutes, considering the expected delay class at the first level. This model was trained and tested based on the historical data of train operation collected by the Tunisian National Railways Company (SNCFT) and infrastructure characteristics. Our methodology consists of the following phases: data collection, data cleaning, complete data analysis, feature engineering, modeling and evaluation. The obtained results indicate that the two-level approach based on the LightGBM model outperforms the one-level method. It also outperformed the benchmark models.

Download


Paper Citation


in Harvard Style

Laifa H., Khcherif R. and Ben Ghezala H. (2022). Predicting Trains Delays using a Two-level Machine Learning Approach. In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART, ISBN 978-989-758-547-0, pages 737-744. DOI: 10.5220/0010898300003116


in Bibtex Style

@conference{icaart22,
author={Hassiba Laifa and Raoudha Khcherif and Henda Ben Ghezala},
title={Predicting Trains Delays using a Two-level Machine Learning Approach},
booktitle={Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART,},
year={2022},
pages={737-744},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010898300003116},
isbn={978-989-758-547-0},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART,
TI - Predicting Trains Delays using a Two-level Machine Learning Approach
SN - 978-989-758-547-0
AU - Laifa H.
AU - Khcherif R.
AU - Ben Ghezala H.
PY - 2022
SP - 737
EP - 744
DO - 10.5220/0010898300003116