loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Imen Ben Cheikh and Zeineb Zouaoui

Affiliation: LaTICE Research lab,University of Tunis and ESSTT, Tunisia

ISBN: 978-989-8565-41-9

Keyword(s): Natural Language Processing, Arabic Writing Recognition, Large Vocabulary, Hidden Markov Models, Canonical Vocabulary, Linguistic Properties, Viterbi Algorithm.

Related Ontology Subjects/Areas/Topics: Applications ; Artificial Intelligence ; Classification ; Information Retrieval and Learning ; Knowledge Engineering and Ontology Development ; Knowledge-Based Systems ; Natural Language Processing ; Pattern Recognition ; Stochastic Methods ; Symbolic Systems ; Theory and Methods

Abstract: The complexity of the recognition process is strongly related to language, the type of writing and the vocabulary size. Our work represents a contribution to a system of recognition of large canonical Arabic vocabulary of decomposable words derived from tri-consonantal roots. This system is based on a collaboration of three morphological classifiers specialized in the recognition of roots, schemes and conjugations. Our work deals with the first classifier. It is about proposing a root classifier based on 101 Hidden Markov Models, used to classify 101 tri-consonantal roots. The models have the same architecture endowed with Arabic linguistic knowledge. The proposed system deals, up to now, with a vocabulary of 5757 words. It has been learned then tested using a total of more than 17000 samples of printed words. Obtained results are satisfying and the top2 recognition rate reached 96%.

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 34.204.194.190

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Ben Cheikh, I. and Zouaoui, Z. (2013). HMM based Classifier for the Recognition of Roots of a Large Canonical Arabic Vocabulary.In Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-8565-41-9, pages 244-252. DOI: 10.5220/0004335202440252

@conference{icpram13,
author={Imen Ben Cheikh. and Zeineb Zouaoui.},
title={HMM based Classifier for the Recognition of Roots of a Large Canonical Arabic Vocabulary},
booktitle={Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2013},
pages={244-252},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004335202440252},
isbn={978-989-8565-41-9},
}

TY - CONF

JO - Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - HMM based Classifier for the Recognition of Roots of a Large Canonical Arabic Vocabulary
SN - 978-989-8565-41-9
AU - Ben Cheikh, I.
AU - Zouaoui, Z.
PY - 2013
SP - 244
EP - 252
DO - 10.5220/0004335202440252

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.