Hate Speech Detection using Word Embedding and Deep Learning in the Arabic Language Context

Hossam Faris; Ibrahim Aljarah; Maria Habib; Pedro Castillo

doi:10.5220/0008954004530460

Hate Speech Detection using Word Embedding and Deep Learning in the Arabic Language Context

Hossam Faris, Ibrahim Aljarah, Maria Habib, Pedro Castillo

2020

Abstract

Hate speech over online social networks is a worldwide problem that leads for diminishing the cohesion of civil societies. The rapid spread of social media websites is accompanied with an increasing number of social media users which showed a higher rate of hate speech, as well. The objective of this paper is to propose a smart deep learning approach for the automatic detection of cyber hate speech. Particularly, the detection of hate speech on Twitter on the Arabic region. Hence, a dataset is collected from Twitter that captures the hate expressions in different topics at the Arabic region. A set of features extracted from the dataset based on a word embedding mechanism. The word embeddings fed into a deep learning framework. The implemented deep learning approach is a hybrid of convolutional neural network (CNN) and long short-term memory (LSTM) network. The proposed approach achieved good results in classifying tweets as Hate or Normal regarding accuracy, precision, recall, and F1 measure.

Download

Paper Citation

in Harvard Style

Faris H., Aljarah I., Habib M. and Castillo P. (2020). Hate Speech Detection using Word Embedding and Deep Learning in the Arabic Language Context. In Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-397-1, pages 453-460. DOI: 10.5220/0008954004530460

in Bibtex Style

@conference{icpram20,
author={Hossam Faris and Ibrahim Aljarah and Maria Habib and Pedro Castillo},
title={Hate Speech Detection using Word Embedding and Deep Learning in the Arabic Language Context},
booktitle={Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2020},
pages={453-460},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008954004530460},
isbn={978-989-758-397-1},
}

in EndNote Style

TY - CONF

JO - Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Hate Speech Detection using Word Embedding and Deep Learning in the Arabic Language Context
SN - 978-989-758-397-1
AU - Faris H.
AU - Aljarah I.
AU - Habib M.
AU - Castillo P.
PY - 2020
SP - 453
EP - 460
DO - 10.5220/0008954004530460