
to time series analysis stands as a robust framework
for businesses seeking actionable intelligence from
their sales data.
5 CONCLUSION
This project presents a comprehensive approach to
predictive modeling. It seamlessly integrates data
exploration, advanced regression modeling with Cat-
Boost, XGBoost, LightGBM and AdaBoost, thought-
ful validation strategies, hyperparameter optimiza-
tion, and extensive feature engineering. Through this
iterative process, XGBoost emerged as the standout
performer, showcasing its efficacy in predicting sales
quantities and providing valuable insights into the un-
derlying dynamics of the dataset. In conclusion, big
data analytics and time series analysis are indispens-
able tools for e-commerce businesses seeking to un-
cover hidden insights, make informed decisions, and
drive revenue growth. The identification of XGBoost
as the most effective model adds a crucial layer to
the project’s significance, emphasizing its prowess in
handling the complexities of the given dataset.
ACKNOWLEDGEMENTS
The authors thank Amrita School of Computing, Ben-
galuru, for their support and resources in conducting
this study.
REFERENCES
A.S. Nambiar, K. Likhita, K. P. and Panda, N. (2023). Ex-
ploring the power of deep learning for seamless back-
ground audio generation in videos. In 2023 14th Inter-
national Conference on Computing Communication
and Networking Technologies (ICCCNT), pages 1–7.
IEEE.
B. Singh, P. Kumar, N. S. and Sharma, K. (2020).
Sales forecast for amazon sales with time series
modeling. In 2020 First International Confer-
ence on Power, Control and Computing Technologies
(ICPC2T), pages 38–43. IEEE.
C.C.F.C. Chee, K.L. Chiew, I. S. and Jing, E. (2022). Data
analytics approach for short-term sales forecasts using
limited information in e-commerce marketplace. Acta
Informatica Pragensia, 11(3):309–323.
D. Wei, P. Geng, L. Y. and Shuaipeng, S. (2014). A pre-
diction study on e-commerce sales based on structure
time series model and web search data. In The 26th
Chinese Control and Decision Conference (CCDC),
pages 5346–5351. IEEE.
K. Bandara, P. Shi, C. B. H. H. Q. T. and Seaman, B.
(2019). Sales demand forecast in e-commerce using a
long short-term memory neural network methodology.
In Neural Information Processing: 26th International
Conference, ICONIP 2019, Sydney, NSW, Australia,
December 12–15, 2019, Proceedings, Part III, pages
462–474. Springer International Publishing.
K.A. Xavier, C. Manjunath, M. M. V. G. N. J. and Bal-
amurugan, M. (2023). Analytical methods of ma-
chine learning model for e-commerce sales analysis
and prediction. In 2023 IEEE International Confer-
ence on Contemporary Computing and Communica-
tions (InC4), pages 1–6. IEEE.
Kumar, R. (2023). Hybrid machine learning method for
product sales forecasting in e-commerce. In 2023 4th
International Conference on Smart Electronics and
Communication (ICOSEC), pages 781–787. IEEE.
Makkar, S. and Jaiswal, S. (2022). Predictive analytics on
e-commerce annual sales. In Proceedings of Data An-
alytics and Management: ICDAM 2021, Volume 1,
pages 557–567. Springer Singapore.
P. Amitasree, G. V. and Devi, V. (2021). Electricity con-
sumption forecasting using machine learning. In 2021
2nd International Conference on Smart Electronics
and Communication (ICOSEC), pages 1–8. IEEE.
Pan, H. and Zhou, H. (2020). Study on convolutional neural
network and its application in data mining and sales
forecasting for e-commerce. Electronic Commerce
Research, 20(2):297–320.
R. Gayathri, P.B. Pati, T. S. and Nair, R. (2022). A frame-
work for the prediction of diabetes mellitus using
hyper-parameter tuned xgboost classifier. In 2022
13th International Conference on Computing Com-
munication and Networking Technologies (ICCCNT),
pages 1–5. IEEE.
Ravindran, N. and Gopalakrishnan, P. (2018). Predictive
analysis for healthcare sector using big data technol-
ogy. In 2018 Second International Conference On
Green Computing And Internet Of Things (ICGCIoT),
pages 326–331. IEEE.
S. Aswin, P. G. and Vinayakumar, R. (2018). Deep learn-
ing models for the prediction of rainfall. In 2018 In-
ternational Conference on Communication and Signal
Processing (ICCSP), pages 0657–0661. IEEE.
S. Narendranath, S. Khare, D. G. and Jyotishi, A. (2018).
Characteristics of ‘escaping’ and ‘falling into’ poverty
in india: An analysis of ihds panel data using machine
learning approach. In 2018 International Conference
on Advances in Computing, Communications and In-
formatics (ICACCI), pages 1391–1397. IEEE.
Sharma, G. and Patil, S. (2023). Big Data Analysis for Rev-
enue and Sales Prediction using Support Vector Re-
gression with Auto-regressive Integrated Moving Av-
erage, volume 15. SAMRIDDHI: A Journal of Phys-
ical Sciences, Engineering and Technology.
Shih, Y. and Lin, M. (2019). A lstm approach for sales
forecasting of goods with short-term demands in e-
commerce. In Intelligent Information and Database
Systems: 11th Asian Conference, ACIIDS 2019, Yo-
gyakarta, Indonesia, April 8–11, 2019, Proceedings,
INCOFT 2025 - International Conference on Futuristic Technology
358