Unsupervised Topic Extraction from Twitter: A Feature-pivot Approach

Nada A. GabAllah, Ahmed Rafea

2019

Abstract

Extracting topics from textual data has been an active area of research with many applications in our daily life. The digital content is increasing every day, and recently it has become the main source of information in all domains. Organizing and categorizing related topics from this data is a crucial task to get the best benefit out of this massive amount of information. In this paper we are presenting a feature-pivot based approach to extract topics from tweets. The approach is applied on a Twitter dataset in Egyptian dialect from four different domains. We are comparing our results to a document-pivot based approach and investigate which approach performs better to extract the topics in the underlying datasets. By applying t-test on recall, precision, and F1 measure values for both approaches on different datasets from different domains we confirmed our hypothesis that feature-pivot approach performs better in extracting topics from Egyptian dialect tweets in the datasets in question.

Download


Paper Citation


in Harvard Style

GabAllah N. and Rafea A. (2019). Unsupervised Topic Extraction from Twitter: A Feature-pivot Approach.In Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-758-386-5, pages 185-192. DOI: 10.5220/0007959001850192


in Bibtex Style

@conference{webist19,
author={Nada GabAllah and Ahmed Rafea},
title={Unsupervised Topic Extraction from Twitter: A Feature-pivot Approach},
booktitle={Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2019},
pages={185-192},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007959001850192},
isbn={978-989-758-386-5},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - Unsupervised Topic Extraction from Twitter: A Feature-pivot Approach
SN - 978-989-758-386-5
AU - GabAllah N.
AU - Rafea A.
PY - 2019
SP - 185
EP - 192
DO - 10.5220/0007959001850192