
6 CONCLUSION AND FUTURE
WORK
In conclusion, this paper proposed a real-time stream-
ing prediction system for suicidal ideation prediction
of users’ posts on social networks using a big data
analytics environment—the work methodology anal-
ysis of social media content with two-phase batch pro-
cessing and real time streaming prediction. Our sys-
tem applied two types of datasets. Reddit’s historical
big data are used for model building, while Twitter
streams big data have been used for real-time stream-
ing prediction.
Our proposed methodology for building binary
classification models was evaluated using various as-
sessment metrics and showed high levels of accuracy
and AUC scores with stable Recall and Precision. The
experimental results of the batch processing phase
revealed that the MLP classifier achieved the high-
est classification accuracy of 93.47% on an unseen
dataset and was used for the real-time streaming pre-
diction phase.
According to the results of various testing scenar-
ios, we can conclude that the features retrieved from
stream data could accurately determine the suicidal
ideation of users in real time. The developed system
might also assist public health professionals with lim-
ited resources in determining and controlling suicidal
ideation and preparing preventative steps to save lives.
Multiple languages, such as Turkish and Arabic, can
be added for future work. To deal with such datasets,
which require sequential information and local fea-
ture engineering, we may use Ensemble LSTM and
CNN models for better performance. We also plan to
develop a web or mobile interface as a text-analysis
tool to detect the individual’s health status.
REFERENCES
Agarwal, G., Dinkar, S. K., and Agarwal, A. (2024). Bi-
narized spiking neural networks optimized with no-
madic people optimization-based sentiment analysis
for social product recommendation. Knowledge and
Information Systems, 66(2):933–958.
Alada
˘
g, A. E., Muderrisoglu, S., Akbas, N. B., Zahma-
cioglu, O., and Bingol, H. O. (2018). Detecting suici-
dal ideation on forums: proof-of-concept study. Jour-
nal of medical Internet research, 20(6):e9840.
Alamoodi, A. H., Zaidan, B. B., Zaidan, A. A., Albahri,
O. S., Mohammed, K. I., Malik, R. Q., Almahdi,
E. M., Chyad, M. A., Tareq, Z., Albahri, A. S.,
et al. (2021). Sentiment analysis and its applica-
tions in fighting covid-19 and infectious diseases: A
systematic review. Expert systems with applications,
167:114155.
Aldhyani, T. H., Alsubari, S. N., Alshebami, A. S., Alkah-
tani, H., and Ahmed, Z. A. (2022). Detecting and an-
alyzing suicidal ideation on social media using deep
learning and machine learning models. International
journal of environmental research and public health,
19(19):12635.
Allayla, M. A. and Ayvaz, S. (2023). A Hybrid and Scal-
able Sentiment Analysis Framework: Case of Russo-
Ukrainian War. In 2023 3rd International Scientific
Conference of Engineering Sciences (ISCES), pages
13–18. IEEE.
Ayvaz, S. and Shiha, M. O. (2018). A scalable streaming big
data architecture for real-time sentiment analysis. In
Proceedings of the 2018 2nd international conference
on cloud and big data computing, pages 47–51.
Baghdadi, N. A., Malki, A., Balaha, H. M., AbdulAzeem,
Y., Badawy, M., and Elhosseini, M. (2022). An
optimized deep learning approach for suicide detec-
tion through Arabic tweets. PeerJ Computer Science,
8:e1070.
Birjali, M., Beni-Hssane, A., and Erritali, M. (2017). Ma-
chine learning and semantic sentiment analysis based
algorithms for suicide sentiment prediction in social
networks. Procedia Computer Science, 113:65–72.
Breiman, L. (2001). Random Forests. Machine Learning,
45(1):5–32.
Brownlee, J. (2017). Deep learning for natural language
processing: develop deep learning models for your
natural language problems. Machine Learning Mas-
tery.
Carson, N. J., Mullin, B., Sanchez, M. J., Lu, F., Yang,
K., Menezes, M., and Cook, B. L. (2019). Identifi-
cation of suicidal behavior among psychiatrically hos-
pitalized adolescents using natural language process-
ing and machine learning of electronic health records.
PloS one, 14(2):e0211116.
Chatterjee, M., Kumar, P., Samanta, P., and Sarkar, D.
(2022). Suicide ideation detection from online social
media: A multi-modal feature based technique. In-
ternational Journal of Information Management Data
Insights, 2(2):100103.
Deshpande, K. and Rao, M. (2022). An Open-Source
Framework Unifying Stream and Batch Processing. In
Inventive Computation and Information Technologies,
pages 607–630. Springer.
Desu, V., Komati, N., Lingamaneni, S., and Shaik, F.
(2022). Suicide and Depression Detection in Social
Media Forums. In Smart Intelligent Computing and
Applications, Volume 2: Proceedings of Fifth Interna-
tional Conference on Smart Computing and Informat-
ics (SCI 2021), pages 263–270. Springer.
Gijzen, M. W., Rasing, S. P., Creemers, D. H., Smit, F., En-
gels, R. C., and De Beurs, D. (2021). Suicide ideation
as a symptom of adolescent depression. a network
analysis. Journal of Affective Disorders, 278:68–77.
Goel, A., Gautam, J., and Kumar, S. (2016). Real time
sentiment analysis of tweets using Naive Bayes. In
2016 2nd International Conference on Next Gener-
ation Computing Technologies (NGCT), pages 257–
261. IEEE.
DATA 2025 - 14th International Conference on Data Science, Technology and Applications
142