in the aviation sector and other domains requiring
accurate forecasting.
4 CONCLUSIONS
Accurate forecasting of aircraft delays is pivotal for
mitigating financial losses and enhancing passenger
satisfaction within the aviation industry. This study
addresses the critical need for reliable delay
predictions by evaluating three advanced machine
learning techniques: NN, Wide & Deep Learning, and
CatBoost. These techniques were assessed using a
comprehensive Kaggle dataset, with performance
metrics including accuracy and AUC as key
indicators of model efficacy. The study finds that
CatBoost outperforms both NN and Wide & Deep
Learning models, achieving the highest accuracy and
AUC scores. This demonstrates CatBoost’s superior
capability in managing categorical features and
handling complex data interactions effectively. The
NN model, while useful, showed limitations in its
ability to capture intricate patterns compared to
CatBoost. The Wide & Deep model, though
beneficial in combining different learning
approaches, did not surpass CatBoost’s performance
in this context. Despite these valuable insights, the
study has certain limitations. The evaluation was
confined to a specific Kaggle dataset, and the models’
performance may vary with different datasets or
problem domains. The study did not explore the
potential benefits of further hyperparameter tuning,
feature engineering, or the integration of additional
machine-learning techniques. Future research should
consider exploring additional datasets to validate the
generalizability of the findings. Investigating
advanced model variations, such as hybrid
approaches combining CatBoost with other
techniques and refining feature engineering practices,
could yield further improvements. Additionally,
incorporating real-time data and dynamic models
may enhance forecasting accuracy and applicability
in operational settings. In conclusion, this study
underscores the importance of selecting and refining
predictive models to enhance flight delay forecasting.
CatBoost's superior performance in this study
provides a valuable reference for future research and
practical applications in the aviation industry.
Continued advancements in machine learning
techniques and their applications will improve the
sector's operational efficiency and passenger
experience.
REFERENCES
Cheng, H. T, et al., 2016. Wide & deep learning for
recommender systems. In Proceedings of the first
workshop on deep learning for recommender systems
(pp. 7-10).
Dorogush, A. V, Ershov, V, Gulin, A., 2018. CatBoost:
gradient boosting with support for categorical features.
arXiv preprint arXiv:1810.11363.
Friedman, J. H., 2001. Greedy function approximation: a
gradient boosting machine. Annals of statistics, 1189-
1232.
Khan, W. A, Chung, S. H, Eltoukhy, A. E, Khurshid, F.,
2024. A novel parallel series data-driven model for
IATA-coded flight delays prediction and features
analysis. Journal of Air Transport Management, 114,
102488.
Krizhevsky, A, Sutskever, I, Hinton, G. E., 2012. Imagenet
classification with deep convolutional neural networks.
Advances in neural information processing systems, 25.
LeCun, Y, Bengio, Y, Hinton, G., 2015. Deep learning.
nature, 521(7553), 436-444.
LeCun, Y, Bottou, L, Orr, G. B, Müller, K. R., 2002.
Efficient backprop. In Neural networks: Tricks of the
trade (pp. 9-50). Berlin, Heidelberg: Springer Berlin
Heidelberg.
Mamdouh, M, Ezzat, M, Hefny, H., 2024. Improving flight
delays prediction by developing attention-based
bidirectional LSTM network. Expert Systems with
Applications, 238, 121747.
McCulloch, W. S, Pitts, W., 1943. A logical calculus of the
ideas immanent in nervous activity. The bulletin of
mathematical biophysics, 5, 115-133.
Rosenblatt, F., 1958. The perceptron: a probabilistic model
for information storage and organization in the brain.
Psychological review, 65(6), 386.
Rumelhart, D. E, Hinton, G. E, Williams, R. J., 1986.
Learning representations by back-propagating errors.
nature, 323(6088), 533-536.
Vaswani, A., 2017. Attention is all you need. Advances in
Neural Information Processing Systems