Authors: Gordana Pavlović-Lažetić and Jelena Graovac

Affiliation: University of Belgrade, Serbia

Keyword(s): Document classification, Wordnet, SWN, Ontology, Proper name.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Symbolic Systems

Abstract: Document classification based on the lexical-semantic network wordnet is presented. Two types of document classification in Serbian were tested: classification based on chosen concepts from the Serbian WordNet (SWN), and proper names-based classification. Conceptual document classification criteria are constructed from hierarchies rooted in a set of chosen concepts (first case) or from hierarchies rooted in some of the proper names' hypernyms (second case). A classifier of the first type is trained and then tested on an indexed and already classified Ebart corpus of Serbian newspapers (476,917 articles). Precision, recall and F-measure show that this type of classification is promising although incomplete, mainly due to SWN incompleteness. In the context of proper names-based classification, a proper names ontology based on the SWN is presented in the paper. A distance-based similarity measure is defined, based on Euclidean and Manhattan distances. Classification of a subset of the Contemporary Serbian Language Corpus is presented.
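
The abstract mentions a distance-based similarity measure built on Euclidean and Manhattan distances but does not give its exact form. The following is only a minimal sketch of how such a measure might be used for classification, assuming documents and categories are represented as equal-length vectors of concept counts. The names `similarity` and `classify`, the mapping 1 / (1 + d), and the sample vectors are illustrative assumptions, not the authors' method.

```python
import math


def euclidean(u, v):
    """Euclidean (L2) distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def manhattan(u, v):
    """Manhattan (L1) distance between two equal-length vectors."""
    return sum(abs(a - b) for a, b in zip(u, v))


def similarity(u, v, distance=euclidean):
    # Assumed mapping from a distance to a similarity in (0, 1];
    # the paper's actual definition may differ.
    return 1.0 / (1.0 + distance(u, v))


def classify(doc_vector, category_profiles, distance=euclidean):
    # Return the name of the category whose (hypothetical) profile
    # vector is most similar to the document vector.
    return max(
        category_profiles,
        key=lambda name: similarity(doc_vector, category_profiles[name], distance),
    )


# Made-up concept-count vectors, for illustration only.
profiles = {"sport": [5, 0, 1], "politics": [0, 4, 2]}
label = classify([4, 1, 1], profiles, distance=manhattan)
```

Passing `distance=manhattan` or `distance=euclidean` switches the underlying metric without changing the rest of the sketch.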

CC BY-NC-ND 4.0

Paper citation in several formats:
Pavlović-Lažetić, G. and Graovac, J. (2010). ONTOLOGY-DRIVEN CONCEPTUAL DOCUMENT CLASSIFICATION. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2010) - KDIR; ISBN 978-989-8425-28-7; ISSN 2184-3228, SciTePress, pages 383-386. DOI: 10.5220/0003063903830386

@conference{kdir10,
author={Gordana Pavlović{-}Lažetić and Jelena Graovac},
title={ONTOLOGY-DRIVEN CONCEPTUAL DOCUMENT CLASSIFICATION},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2010) - KDIR},
year={2010},
pages={383-386},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003063903830386},
isbn={978-989-8425-28-7},
issn={2184-3228},
}

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2010) - KDIR
TI - ONTOLOGY-DRIVEN CONCEPTUAL DOCUMENT CLASSIFICATION
SN - 978-989-8425-28-7
IS - 2184-3228
AU - Pavlović-Lažetić, G.
AU - Graovac, J.
PY - 2010
SP - 383
EP - 386
DO - 10.5220/0003063903830386
PB - SciTePress