3.6 Discussion
Compared with similar articles, this paper has some
advantages, mainly in the following aspects. In the
data processing part, principal component analysis
was used to make the subsequent processing more
energy efficient (Liu, 2022; Alharithi et al., 2025). In
feature screening, the results of two models were
combined, which may reduce the effect of one model
bias (Salah, Lincy, & Al, 2024).
However, there is still some space left to improve
in the article. For example, in the result of the
prediction model, the accuracy of the random forest
model is not particularly high. More models can be
chosen for prediction. For example, a study of an
integrated approach that incorporates Convolutional
Neural Networks (CNN) and Long Short-Term
Memory (LSTM) showed that it can outperform the
average accuracy of traditional machine learning
models by about 10 percent when predicting customer
churn risk (Park et al., 2022). Besides, the content of
the dataset could be further supplemented. For
example, a user comment module could be added to
process natural language so as to better understand
the users’ emotional bias (Kowalski, Esteve, &
Mikhaylov, 2020). Additionally, random forest itself
is not a interpretable model, and it is often difficult to
understand how the model makes its predictions.
Remedying this may require some auxiliary
optimizations such as the LIME interpretation
technique (Ribeiro, Singh, & Guestrin, 2016). These
techniques may be more capable of appearing
justified and convincing people emotionally.
4 CONCLUSIONS
By comparing the logistic regression algorithm and
random forest algorithm on the airline passenger
satisfaction prediction model, this study found that
the random forest algorithm has advantages in all
aspects, with a relatively high precision of over 90%.
The study then used the feature importance of random
forests to filter out several factors to be the most
influential features of satisfaction, showing that the
immediate experience of service quality, especially in
the field of online booking, has more predictive value
than the inherent user attributes. Thus, this study
provided direction for airlines to optimize their
services. The discussion of satisfaction models in this
study can be similarly applicable to the analysis of
other service industries.
Subsequent research can integrate more real-time
data (e.g., flight dynamics, user feedback) to build a
dynamic prediction system and introduce models
such as deep learning models to further improve the
performance. In addition, interpretable methods, such
as Shapley additive explanations(SHAP) values, can
be combined to better explain the mechanism of
feature effects, assist in the formulation of
differentiated service strategies, and promote the
development of aviation services in the direction of
precision and personalization.
REFERENCES
Alharithi, M., Almetwally, E. M., Alotaibi, O., Eid, M. M.,
El - kenawy, E. S. M., & Elnazer, A. A. 2025. A
comparative study of statistical and intelligent
classification models for predicting airlines passenger
management satisfaction. Alexandria Engineering
Journal, 119: 99-110.
Eshaghi, M. S., Afshardoost, M., Lohmann, G., & Moyle,
B. D. 2024. Drivers and outcomes of airline passenger
satisfaction: A Meta-analysis. Journal of the Air
Transport Research Society, 3, 100034.
Kaggle. 2020. Airline Passenger Satisfaction.
https://www.kaggle.com/datasets/teejmahal20/airline-
passenger-satisfaction
Kowalski, R., Esteve, M., & Jankin Mikhaylov, S.J. 2020.
Improving public services by mining citizen feedback:
An application of natural language processing. Public
Administration, 98(4), 1011 - 1026.
Leon, S., & Dixon, S. 2023. Airline satisfaction and
loyalty: Assessing the influence of personality, trust
and service quality. Journal of Air Transport
Management, 113, 102487.
Liu, Y.B. 2022. Prediction of Airline Passenger Satisfaction
Based on Machine Learning. Technology Venture
Monthly, 35(4), 142-145.
Park, S.-H., Kim, M.-Y., Kim, Y.-J., & Park, Y.-H. 2022.
A Deep Learning Approach to Analyze Airline
Customer Propensities: The Case of South Korea.
Applied Sciences, 12(4), 1916.
Ribeiro, M. T., Singh, S., & Guestrin, C. 2016. "Why
Should I Trust You?" Explaining the Predictions of Any
Classifier. 22nd ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining
(KDD), San Francisco, CA., 1135 - 1144.
Ruiz, E., Yushimito, W. F., Aburto, L., & de la Cruz, R.
2024. Predicting passenger satisfaction in public
transportation using machine learning models.
Transportation Research Part A, 181, 103995.
Salah-Ud-Din, M., Lincy.S.S, B. T., & Al Ali, H. 2024.
Exploratory Data Analysis and Prediction of Passenger
Satisfaction with Airline services. New Trends in Civil
Aviation, NTCA 2024, 295-302
Xiang, Z. & Gretzel, U. 2010. Role of Social Media in
Online Travel Information Search. Tourism
Management, 31(2), 179-188.