Overview of Arabic Sentence Corpora

Hussein Awdeh, Adelle Abdallah, Gilles Bernard, Mohammad Hajjar

2021

Abstract

The Arabic corpus, specifically the gold standard corpus is an important part of The Arabic Natural Language Processing. Described as a very large collection of texts stored on a computer, a corpus is considered as the most important source for semantic and syntax research and it can be a single language, a monolingual Corpus, or a multilingual Corpus. Then, an easy access to available corpora is highly needed in the Natural Language process (NLP) research community especially for language such as Arabic. Currently, there is no easy way to access to a comprehensive and updated list of available Arabic corpora. Our study in this paper, aims to present the results of a recent survey conducted to identify the list of the available Arabic corpora classified into categories and their resources.

Download


Paper Citation


in Harvard Style

Awdeh H., Abdallah A., Bernard G. and Hajjar M. (2021). Overview of Arabic Sentence Corpora. In Proceedings of the 13th International Joint Conference on Computational Intelligence (IJCCI 2021) - Volume 1: NCTA; ISBN 978-989-758-534-0, SciTePress, pages 285-292. DOI: 10.5220/0010651200003063


in Bibtex Style

@conference{ncta21,
author={Hussein Awdeh and Adelle Abdallah and Gilles Bernard and Mohammad Hajjar},
title={Overview of Arabic Sentence Corpora},
booktitle={Proceedings of the 13th International Joint Conference on Computational Intelligence (IJCCI 2021) - Volume 1: NCTA},
year={2021},
pages={285-292},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010651200003063},
isbn={978-989-758-534-0},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Joint Conference on Computational Intelligence (IJCCI 2021) - Volume 1: NCTA
TI - Overview of Arabic Sentence Corpora
SN - 978-989-758-534-0
AU - Awdeh H.
AU - Abdallah A.
AU - Bernard G.
AU - Hajjar M.
PY - 2021
SP - 285
EP - 292
DO - 10.5220/0010651200003063
PB - SciTePress