loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Bilel Elayeb 1 ; 2 ; Mohamed Firas Ettih 3 and Raja Ayed 4 ; 2

Affiliations: 1 Liwa College of Technology, P.O. Box 41009, Abu Dhabi, U.A.E. ; 2 RIADI Research Laboratory, ENSI, Manouba University, Tunisia ; 3 Université Paris-Est Créteil, Paris 12 Val de Marne, France ; 4 Faculty of Economics and Management of Nabeul, Carthage University, Tunisia

Keyword(s): Morphological Disambiguation, Arabic Text, Machine-Learning Algorithms, Data Transformation, Morphological Feature, Classification.

Abstract: Arabic language is characterized by its complexity and its morphological and orthographic variations including syntactic and semantic diversity of a word. This specificity may cause Arabic morphological ambiguity. We present in this paper a new architecture for morphological disambiguation of Arabic texts. The latter can be treated as a classification problem where the set of morphological features’ values represent classes, and a classification algorithm is used to assign a class to each word’s occurrence based on the context. The first step consists of identifying the correct morphological analysis of a non-vocalized Arabic word using the morphological dependencies extracted from the corpus of vocalized texts. Then, we propose a method of transforming imperfect training datasets into perfect data having precise attributes and certain classes. We experiment this architecture on a set of machine-learning classifiers using a corpus of classic Arabic texts. Results highlight some stati stically significant improvement of SVM and Naïve Bayes classifiers in terms of disambiguation rate. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.118.145.114

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Elayeb, B.; Ettih, M. and Ayed, R. (2022). Experimenting Machine-Learning Algorithms for Morphological Disambiguation of Arabic Texts. In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART; ISBN 978-989-758-547-0; ISSN 2184-433X, SciTePress, pages 851-862. DOI: 10.5220/0010917300003116

@conference{icaart22,
author={Bilel Elayeb. and Mohamed Firas Ettih. and Raja Ayed.},
title={Experimenting Machine-Learning Algorithms for Morphological Disambiguation of Arabic Texts},
booktitle={Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
year={2022},
pages={851-862},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010917300003116},
isbn={978-989-758-547-0},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART
TI - Experimenting Machine-Learning Algorithms for Morphological Disambiguation of Arabic Texts
SN - 978-989-758-547-0
IS - 2184-433X
AU - Elayeb, B.
AU - Ettih, M.
AU - Ayed, R.
PY - 2022
SP - 851
EP - 862
DO - 10.5220/0010917300003116
PB - SciTePress