
proposed system provides a new approach, which is
accurate, flexible and transparent. By combining
making continual learning, the multilanguage adapter
modules with the hierarchical multi-label
classification the model is able to generalize well on
diverse and emerging modes of hate speech
including subtle, implicit, context dependent hate
speech expressions.
The system’s low latency, which is implemented
based on more compact BERT versions and ONNX
optimization, makes the system feasible for real-time
deployment on cloud and edge. And there are
explainability-based tools powered by SHAP and
LIME which offer valuable explanations for model
decisions, enabling trust and interpretability for
moderators and users of the platform. Continuous
expansion of hate lexicon and retraining in real-time
help in making the model well-equipped to new and
emerging hate terms.
Through extensive empirical studies, we can
conclusively prove the efficiency of our framework
compared to traditional baselines of precision, recall,
F1-score, and fairness, with strong transferability
across different languages and platforms. This work
establishes a new state of the art for designing
intelligent, ethical, and high-performance models in
hate speech detection, thereby enabling safer and
more inclusive digital spaces. Possible future
improvements are interfacing with multimedia
content analysis and reinforcement learning to
include feedback of moderator.
REFERENCES
Almaliki, M., Almars, A. M., Gad, I., et al. (2022). ABMM:
Arabic BERT-Mini Model for Hate-Speech Detection
on Social Media. Journal of King Saud University-
Computer and Information Sciences, 34(7), 4335–
4344. https://doi.org/10.1016/j.jksuci.2022.05.006
Caselli, T., Basile, V., Mitrović, J., & Granitzer, M. (2021).
HateBERT: Retraining BERT for Abusive Language
Detection in English. In Proceedings of the 5th
Workshop on Online Abuse and Harms (WOAH 2021)
(pp. 17 25). Association for ComputationalLinguistics.
https://aclanthology.org/2021.woah-1.3/
ACL Anthology+9ACL Anthology+9ACLAnthology+
9
Das, S., Mandal, P., & Chatterji, S. (2021). Probabilistic
Impact Score Generation using Ktrain-BERT to
Identify Hate Words from Twitter Discussions. arXiv
preprint arXiv:2111.12939.https://arxiv.org/abs/2111.1
2939arXiv+1GitHub+1
El-Sayed, A., & Nasr, O. (2024). AAST-NLP at
Multimodal Hate Speech Event Detection 2024: A
Multimodal Approach for Classification of Text-
Embedded Images Based on CLIP and BERT-Based
Models. In Proceedings of the 7th Workshop on
Challenges and Applications of Automated Extraction
of Socio-political Events from Text (CASE 2024) (pp.
139–144). Association for Computational Linguistics.
https://aclanthology.org/2024.case-1.19/ACL
Anthology+2ACL Anthology+2ACL Anthology+2
Fillies, J., & Paschke, A. (2025). Youth language and
emerging slurs: Tackling bias in BERT-based hate
speech detection. AI and Ethics, 12 March 2025.
https://link.springer.com/article/10.1007/s43681-025-
00701-z SpringerLink
Guragain, A., Poudel, N., Piryani, R., & Khanal, B. (2025).
NLPineers@ NLU of Devanagari Script Languages
2025: Hate Speech Detection using Ensembling of
BERTbased models. arXiv preprintarXiv:2412.08163.
https://arxiv.org/abs/2412.08163
ACL Anthology+3arXiv+3arXiv+3
Jahan, M. S., Oussalah, M., Beddia, D. R., Mim, J. K., &
Arhab, N. (2024). A Comprehensive Study on NLP
Data Augmentation for Hate Speech Detection: Legacy
Methods, BERT, and LLMs. arXiv preprint
arXiv:2404.00303. https://arxiv.org/abs/2404.00303
arXiv
Mathew, B., Saha, P., Yimam, S. M., Biemann, C., Goyal,
P., & Mukherjee, A. (2021). HateXplain: A Benchmark
Dataset for Explainable Hate Speech Detection. arXiv
preprint arXiv:2012.10289.https://arxiv.org/abs/2012.1
0289Hugging Face
Patil, H., Velankar, A., & Joshi, R. (2022). L3Cube-
MahaHate: A Tweet-based Marathi Hate Speech
Detection Dataset and BERT Models. In Proceedings of
the Third Workshop on Threat, Aggression and
Cyberbullying (TRAC 2022) (pp. 1–9). Association for
Computational Linguistics. https://aclanthology.org/20
22.trac1.1/ACL Anthology+5ACLAnthology+5jai.fro
nt-sci.com+5
Pendzel, S., Wullach, T., Adler, A., & Minkov, E. (2023).
Generative AI for Hate Speech Detection: Evaluation
and Findings. arXiv preprint arXiv:2311.09993.
https://arxiv.org/abs/2311.09993arXiv
Rajput, G., Punn, N. S., Sonbhadra, S. K., & Agarwal, S.
(2021). Hate Speech Detection Using Static BERT
Embeddings. arXiv preprint arXiv:2106.15537.https://
arxiv.org/abs/2106.15537arXiv+1Academia+1
Schneider, N., Shouei, S., Ghantous, S., & Feldman, E.
(2023). Hate Speech Targets Detection in Parler using
BERT. arXiv preprint arXiv:2304.01179 https://arxiv.
org/abs/2304.01179arXiv
Schneider, N., Shouei, S., Ghantous, S., & Feldman, E.
(2023). Hate Speech Targets Detection in Parler using
BERT. arXiv preprint arXiv:2304.01179 https://arxiv.
org/abs/2304.01179arXiv
Singhal, K., & Bedi, J. (2024). Transformers at HSD-2Lang
2024: Hate Speech Detection in Arabic and Turkish
Tweets Using BERT Based Architectures. In
Proceedings of the 7th Workshop on Challenges and
Applications of Automated Extraction of Socio-
political Events from Text (CASE 2024) (pp. 190–194).
Association for Computational Linguistics.
Explainable, Cross-Platform, Multilingual BERT-Based Framework with Continual Learning for Realâ
˘
A
´
STime, Fineâ
˘
A
´
SGrained Hate
Speech Detection and Filtering on Online Social Platforms
129