loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Lucas Cabral 1 ; José Maria Monteiro 1 ; José Wellington Franco da Silva 1 ; César Lincoln Mattos 1 and Pedro Jorge Chaves Mourão 2

Affiliations: 1 Computer Science Department, Federal University of Ceará, Fortaleza, Ceará, Brazil ; 2 State University of Ceará, Fortaleza, Brazil

Keyword(s): Misinformation Detection, Fake News Detection, Natural Language Processing, WhatsApp, Social Media.

Abstract: In the past few years, the large-scale dissemination of misinformation through social media has become a critical issue, harming the trustworthiness of legit information, social stability, democracy and public health. Thus, developing automated misinformation detection methods has become a field of high interests both in academia and in industry. In many developing countries such as Brazil, India, and Mexico, one of the primary sources of misinformation is the messaging application WhatsApp. Despite this scenario, due to the private messaging nature of WhatsApp, there still few methods of misinformation detection developed specifically for this platform. In this work we present the FakeWhatsApp.BR, a dataset of WhatsApp messages in Brazilian Portuguese, collected from Brazilian public groups and manually labeled. Besides, we evaluated a series of misinformation classifiers combining Natural Language Processing-based techniques of feature extraction and a set of well-know machine lear ning algorithms, totaling 108 different scenarios. Our best result achieved a F1 score of 0.73, and the analysis of errors indicates that they occur mainly due to the predominance of short texts that accompany media files. When texts with less than 50 words are filtered, the F1 score rises to 0.87. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 54.242.96.240

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Cabral, L.; Monteiro, J.; Franco da Silva, J.; Mattos, C. and Mourão, P. (2021). FakeWhastApp.BR: NLP and Machine Learning Techniques for Misinformation Detection in Brazilian Portuguese WhatsApp Messages. In Proceedings of the 23rd International Conference on Enterprise Information Systems - Volume 1: ICEIS; ISBN 978-989-758-509-8; ISSN 2184-4992, SciTePress, pages 63-74. DOI: 10.5220/0010446800630074

@conference{iceis21,
author={Lucas Cabral. and José Maria Monteiro. and José Wellington {Franco da Silva}. and César Lincoln Mattos. and Pedro Jorge Chaves Mourão.},
title={FakeWhastApp.BR: NLP and Machine Learning Techniques for Misinformation Detection in Brazilian Portuguese WhatsApp Messages},
booktitle={Proceedings of the 23rd International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2021},
pages={63-74},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010446800630074},
isbn={978-989-758-509-8},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the 23rd International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - FakeWhastApp.BR: NLP and Machine Learning Techniques for Misinformation Detection in Brazilian Portuguese WhatsApp Messages
SN - 978-989-758-509-8
IS - 2184-4992
AU - Cabral, L.
AU - Monteiro, J.
AU - Franco da Silva, J.
AU - Mattos, C.
AU - Mourão, P.
PY - 2021
SP - 63
EP - 74
DO - 10.5220/0010446800630074
PB - SciTePress