Figure 2: Feature Importance Scores (Photo/Picture credit: Original).
5 CONCLUSIONS
This study aimed to predict Walmart's sales volume
and assess which model better supports inventory
control, supply chain management, and mitigating the
bullwhip effect. Polynomial Regression and RF
regression were evaluated for prediction accuracy and
generalization ability. The results indicate that while
both models perform well, there are notable
differences. RF demonstrated superior performance
on the training set with lower RMSE and values
closer to 1. However, on the test set, Polynomial
Regression outperformed RF, with smaller RMSE
values and values nearer to 1. This suggests that
Polynomial Regression offers stronger generalization
capabilities. Cross-validation further confirmed that
Polynomial Regression maintains a higher average
value, indicating better prediction performance across
various scenarios. For retail supply chain
management, selecting a model with strong
generalization is crucial. Although RF shows better
fitting on training data, Polynomial Regression's
superior generalization makes it more suitable for
predicting sales in dynamic environments.
Nonetheless, this does not discount the potential of
RF or other models. Exploring additional data science
methods can address overfitting and enhance
generalization. Future research should integrate
supply chain management tools and strategies, and
evaluate a broader range of models - including LSTM
and other machine learning and deep learning
techniques - to improve prediction accuracy and
supply chain effectiveness.
REFERENCES
Biau, G., & Scornet, E., 2016. A random forest-guided tour.
Test, 25, 197-227.
Boone, T., Ganeshan, R., Jain, A., Sanders, N. R., 2019.
Forecasting sales in the supply chain: Consumer
analytics in the big data era. International journal of
forecasting, 35(1), 170-180.
Carbonneau, R., Laframboise, K., & Vahidov, R. (2008).
Application of machine learning techniques for supply
chain demand forecasting. European journal of
operational research, 184(3), 1140-1154.
Darlington, R. B., Hayes, A. F., 2016. Regression analysis
and linear models: Concepts, applications, and
implementation. Guilford Publications.
Koumanakos, D. P., 2008. The effect of inventory
management on firm performance. International
journal of productivity and performance
management, 57(5), 355-369.
Hastie, T., Tibshirani, R., Friedman, J., Hastie, T.,
Tibshirani, R., Friedman, J., 2009. Random forests. The
elements of statistical learning: Data mining, inference,
and prediction, 587-604.
Heiberger, R. M., Neuwirth, E., Heiberger, R. M.,
Neuwirth, E., 2009. Polynomial regression. R Through
Excel: A Spreadsheet Interface for Statistics, Data
Analysis, and Graphics, 269-284.
Ramos, P., Oliveira, J. M., 2023. Robust Sales forecasting
Using Deep Learning with Static and Dynamic
Covariates. Applied System Innovation, 6(5), 85.
Rigatti, S. J., 2017. Random forest. Journal of Insurance
Medicine, 47(1), 31-39.
Wang, X., Disney, S. M., 2016. The bullwhip effect:
Progress, trends and directions. European Journal of
Operational Research, 250(3), 691-701.