loading
Documents

Research.Publish.Connect.

Paper

Authors: Amira Shoukry and Ahmed Rafea

Affiliation: Department of Computer Science and Engineering, The American University in Cairo (AUC), Cairo and Egypt

ISBN: 978-989-758-386-5

ISSN: 2184-3252

Keyword(s): Arabic Sentiment Analysis, Arabic Sentiment Lexicons, Domain-specific, Egyptian Dialect, Gulf Dialect, Arabic Opinion Mining.

Related Ontology Subjects/Areas/Topics: Enterprise Information Systems ; Recommendation Systems ; Social Media Analytics ; Society, e-Business and e-Government ; Software Agents and Internet Computing ; Web Information Systems and Technologies

Abstract: Given the sacristy of the Arabic sentiment lexicon especially for the Egyptian and Gulf dialects, together with the fact that a word’s sentiment depends mostly on the domain in which it is used, we present SATALex which is a two-part sentiment lexicon covering the telecom domain for the Egyptian and Gulf Arabic dialects. The Egyptian sentiment lexicon contains close to 1.5 thousand Egyptian words and compound phrases, while the Gulf sentiment lexicon contains close to 3.5 thousand Gulf words and compound phrases. The development of the presented lexicons has taken place iteratively, in each iteration manual annotators analyzed tweets for the corresponding dialect to try to extract as many domain specific words as possible and measure their effect on the performance of the classification. The result are lexicons which are more focused and related to the telecom domain more than any translated or general-purpose sentiment lexicon. To demonstrate the effectiveness of these built lexicons and how directly they can impact the task of sentiment analysis, we compared their performance to one of the biggest publicly available sentiment lexicon (WeightedNileULex) using Semantic Orientation (SO) approach on telecom test datasets; one for each dialect. The experiments show that using SATALex lexicons improved the results over the publicly available lexicon. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.231.220.225

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Shoukry, A. and Rafea, A. (2019). SATALex: Telecom Domain-specific Sentiment Lexicons for Egyptian and Gulf Arabic Dialects.In Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-758-386-5, ISSN 2184-3252, pages 169-176. DOI: 10.5220/0007950401690176

@conference{webist19,
author={Amira Shoukry. and Ahmed Rafea.},
title={SATALex: Telecom Domain-specific Sentiment Lexicons for Egyptian and Gulf Arabic Dialects},
booktitle={Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2019},
pages={169-176},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007950401690176},
isbn={978-989-758-386-5},
}

TY - CONF

JO - Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - SATALex: Telecom Domain-specific Sentiment Lexicons for Egyptian and Gulf Arabic Dialects
SN - 978-989-758-386-5
AU - Shoukry, A.
AU - Rafea, A.
PY - 2019
SP - 169
EP - 176
DO - 10.5220/0007950401690176

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.