loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Tilahun Yeshambel 1 ; Josiane Mothe 2 and Yaregal Assabie 3

Affiliations: 1 IT PhD Program, Addis Ababa University, Addis Ababa, Ethiopia ; 2 INSPE, Univ. de Toulouse, IRIT, UMR5505 CNRS, Toulouse, France ; 3 Department of Computer Science, Addis Ababa University, Addis Ababa, Ethiopia

Keyword(s): Adhoc Retrieval, Amharic, Complex Morphology, Stem, Root.

Abstract: Amharic is the official language of the government of Ethiopia currently having an estimated population of over 110 million. Like other Semitic languages, Amharic is characterized by complex morphology where thousands of words are generated from a single root form through inflection and derivation. This has made the development of tools for Amharic natural language processing a non-trivial task. Amharic adhoc retrieval faces difficulties due to the complex morphological structure of the language. In this paper, the impact of morphological features on the representation of Amharic documents and queries for adhoc retrieval is investigated. We analyze the effects of stem-based and root-based approaches on Amharic adhoc retrieval effectiveness. Various experiments are conducted on TREC-like Amharic information retrieval test collection using standard evaluation framework and measures. The findings show that a root-based approach outperforms the conventional stem-based approach that preva ils in many other languages. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.131.13.194

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Yeshambel, T.; Mothe, J. and Assabie, Y. (2020). Amharic Document Representation for Adhoc Retrieval. In Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020) - KDIR; ISBN 978-989-758-474-9; ISSN 2184-3228, SciTePress, pages 124-134. DOI: 10.5220/0010177301240134

@conference{kdir20,
author={Tilahun Yeshambel. and Josiane Mothe. and Yaregal Assabie.},
title={Amharic Document Representation for Adhoc Retrieval},
booktitle={Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020) - KDIR},
year={2020},
pages={124-134},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010177301240134},
isbn={978-989-758-474-9},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020) - KDIR
TI - Amharic Document Representation for Adhoc Retrieval
SN - 978-989-758-474-9
IS - 2184-3228
AU - Yeshambel, T.
AU - Mothe, J.
AU - Assabie, Y.
PY - 2020
SP - 124
EP - 134
DO - 10.5220/0010177301240134
PB - SciTePress