applied to make sentimental analysis of tweets in real
time. The system uses PySpark for the data
preprocessing and machine learning algorithm,
Django for building an appealing web user interface,
Kafka for effective real time stream processing, and
MongoDB for storing big amounts of data in
optimized manners makes the present system a
perfect, efficient and effective solution for the
sentiment analysis.
As a result of the process of model selection
during model evaluation, the selected logistic
regression model offers accurate and timely
classification of tweet sentiments. The employment
of real-time streaming guarantees that users can
receive sentiment predictions with least delay,
making the application fast and efficient. Also, the
use of web interface is rather convenient to interact
with, as well as entering and analyzing tweets which
enable users to get the idea of sentiments’
distributions. Not only does the project prove the
possibility of carrying out real-time sentiment
analysis, but also the need for combining machine
learning with big data and web frameworks. The
system proposes complex factors of performance:
scalability, operational productivity and adaptability,
which allows considering it as a perspective for usage
in such fields as social media monitoring, market and
trend analysis.
Two suggestions for future work are the
expansion of the current system with features such as
multilingual sentiment analysis, topic extraction and
the addition of an emotion recognition feature. An
uplift in model performance, employing state-of-art
deep learning techniques, can act as a further
enhancement of the proposed system. In sum,
implementing this project means a major shift
towards utilizing real time sentiment analysis for real
life application which will prove as handy and helpful
for the purpose of meaningful decision making in a
constantly evolving digital landscape.
REFERENCES
A. Kc and R. Sumathi, “Analyzing twitter sentiments with
big data,” in 2018 2nd International Conference on
Trends in Electronics and Informatics (ICOEI), pp.
989–994, IEEE, 2018.
A. Alqarni and A. Rahman, “Arabic tweets-based sentiment
analysis to investigate the impact of covid-19 in ksa: a
deep learning approach,” Big Data and Cognitive
Computing, vol. 7, no. 1, p. 16, 2023.
A. Ismail, F. H. Sazali, S. N. A. Jawaddi, and S. Mutalib,
“Stream etl framework for twitter-based sentiment
analysis: Leveraging big data technologies,” Expert
Systems with Applications, vol. 261, p. 125523, 2025.
B. Yadranjiaghdam, S. Yasrobi, and N. Tabrizi,
“Developing a real-time data analytics framework for
twitter streaming data,” in 2017 IEEE International
Congress on Big Data (BigData Congress), pp. 329–
336, IEEE, 2017.
Bikku, T., Rao, N.S. and Akepogu, A.R., 2016. “Hadoop
based feature selection and decision-making models on
big data”. Indian Journal of Science and Technology,
9(10), pp.1-6.
C. Verma and R. Pandey, “Big data representation for grade
analysis through hadoop framework,” in 2016 6th
international conference-cloud system and big data
engineering (Confluence), pp. 312–315, IEEE, 2016.
Dr. Tripty Singh and S, D. V., “A modern data architecture
with apache Hadoop”, in 2015 International Conference
on Green Computing and Internet of Things
(ICGCIoT), Greater Noida on the beautiful campus of
Galgotias Institutions , 2015.
G. P. A. Mary, M. Hema, R. Maheshprabhu, and M. N.
Guptha, “Sentimental analysis of twitter data using
machine learning algorithms,” in 2021 International
Conference on Forensics, Analytics, Big Data, Security
(FABS), vol. 1, pp. 1–5, IEEE, 2021.
G.S. Ganesh, Yazhini Samyukkktha Ramanikaran,
M.Sindhuja, G. Rajesh Kannan, J. Jeyalakshmi, “Big
data in visual analytics to supplement digital image
processing using Hadoop”, Journal of Chemical and
Pharmaceutical Sciences,JCHPS., Special Issue
9,pp.54-56,Dec,2016.(SCOPUS) ISSN: 0974-2115
J. H. Joloudari, S. Hussain, M. A. Nematollahi, R. Bagheri,
F. Fazl, R. Alizadehsani, R. Lashgari, and A. Talukder,
“Bert-deep cnn: State of the art for sentiment analysis
of covid-19 tweets,” Social Network Analysis and
Mining, vol. 13, no. 1, p. 99, 2023.
K. S. Madhu, B. C. Reddy, C. Damarukanadhan, M.
Polireddy, and N. Ravinder, “Real time sentimental
analysis on twitter,” in 2021 6th International
Conference on Inventive Computation Technologies
(ICICT), pp. 1030–1034, IEEE, 2021.
K. Fahd, S. Parvin, and A. de Souza-Daw, “A framework
for real-time sentiment analysis of big data generated
by social media platforms,” in 2021 31st international
telecommunication networks and applications confere-
nce (ITNAC), pp. 30–33, IEEE, 2021.
R. Shree, T. Choudhury, S. C. Gupta, and P. Kumar,
“Kafka: The modern platform for data management and
analysis in big data domain,” in 2017 2nd international
conference on telecommunication and networks (TEL-
NET), pp. 1–5, IEEE, 2017.
R. G., K. Prasath, N., and Rajasundari, T., “Sentimental
Analysis on Tamil News Feed”, Journal of Advanced
Research in Dynamical and Control Systems, vol. 9, pp.
397-399, 2017.
R. Singh and R. Singh, “Applications of sentiment analysis
and machine learning techniques in disease outbreak
prediction–a review,” Materials Today: Proceedings,
vol. 81, pp. 1006–1011, 2023.