analyze this content is sentiment analysis, widely
used in previous research (Fortuna et al.; Aluru et al.).
1.1 Background
Social media operates as both a communication and
an information-sharing platform due to its global
reach (Patil et al.). However, the same digital
connectedness allows harmful content like memes
containing hate speech or provocative humor, and
cyberbullying messages to spread rapidly (Toraman
et al.; Antypas & Camacho-Collados). Cyberbullying
involves deliberate acts of harassment or threats using
digital tools, while offensive memes often relay
derogatory and hateful messages with even greater
virality (Sabat et al.; Zhou et al.). Such content affects
users’ mental health, undermines platform standards,
and threatens the credibility of social media
businesses (Velankar et al.; Akuma et al.). Given the
volume of textual content online, it is infeasible for
human moderators to review every post, even with
filtering rules (Mullah & Zainon; Rabiul Awal et al.).
This calls for real-time, automated detection systems
to mitigate the spread of harmful material (Gravano
et al.; Malik et al.).
2 LITERATURE REVIEW
This research applies a Deep Convolutional Neural
Network (DCNN) to detect hate speech and offers a
multilingual study using 16 datasets across 9
languages (Roy et al.; Aluru et al.). While hate speech
has long been studied in humanities, its exploration in
computer science remains nascent (Alkomah & Ma;
Fortuna et al.). One key proposal is DeepHate, a deep
learning model leveraging high-dimensional text
representations for robust detection (Cao et al.).
Another approach, BiCHAT, combines BiLSTM,
CNN, and hierarchical attention for nuanced
detection (Khan et al. 2022a), while the AngryBERT
model jointly learns target and emotion contexts
(Rabiul Awal et al.). A BERT-based transfer learning
technique has also demonstrated success in detecting
hate speech from social media platforms (Mozafari et
al.).
The recently proposed HCovBi-Caps model
integrates convolutional layers, BiGRU, and capsule
networks to improve hate speech detection,
particularly considering its abstract nature (Khan et
al. 2022b). Further, methods like TF-IDF and Bag of
Words have been evaluated in combination with
machine learning models for hate classification on
live tweets (Akuma et al.). The cross-domain
transferability of hate speech detection models is
examined by Toraman et al., while Antypas &
Camacho-Collados evaluate robustness across
datasets. Meanwhile, context-based methods (Mosca
et al.) and sentiment-sharing models (Zhou et al.)
have enhanced semantic understanding in hate speech
systems. Moreover, offensive meme detection,
focusing on pixel-level content, pushes hate
moderation into multimodal dimensions (Sabat et al.).
3 EXISTING METHODOLOGIES
And even if individuals are communicating
asynchronously over the Internet, social networking
as a stylish and, most importantly, unobtrusive way
for people to show their respect to each other’s
individual opinions and beliefs is already a part of
everyday life. Today, the grown man who expresses
hatred through politics, misogyny, and racial
prejudice has harassed and mistreated other people.
Social networks are also gaining popularity among
users for online tyrant, including blackmail. Myspace
is a common social networking site through which
people can comfortably join one or many societies
that they find interesting Due to the sheer amount of
information that exists in SNSs, researchers have
been able to conduct extensive research in a number
of fields. Emotion analysis is one of the areas of
research that receive a lot of attention and primarily
use data from social media. Recommending filters
arrange the offered content list in relation to the
similarity between the desired item content and the
end user’s interests; whereas, collaborative filters
offer items based upon comparison with other like-
minded individuals. Since content-based filtering
processes are more or less focused on text-based
documents, Filtering, which categorizes the mass
incoming texts into relevant and irrelevant, must truly
be regarded as a single tag classification. Multi-label
text categorization is an advanced form of filter that
will categorize the communications partly by topics
and categories. The classifier generates automatically
according with the learning need from the set of
previously classified sample in content-based
filtering paradigm from the machine learning (ML)
framework, identified that Bag of Words (BoW)
method perform better when compare with complex
text representation method that may have better
statistical features but lower semantics.