loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Agnieszka Mykowiecka and Malgorzata Marciniak

Affiliation: Polish Academy of Sciences, Poland

ISBN: 978-989-8565-30-3

Keyword(s): Terminology Extraction, Term Clustering, Medical Data, Ontology.

Related Ontology Subjects/Areas/Topics: Applications ; Artificial Intelligence ; Domain Analysis and Modeling ; Knowledge Engineering and Ontology Development ; Knowledge-Based Systems ; Natural Language Processing ; Pattern Recognition ; Symbolic Systems

Abstract: The paper presents the first results of clustering terms extracted from hospital discharge documents written in Polish. The aim of the task is to prepare data for an ontology reflecting the domain of documents. To begin, the characteristic of the language of texts, which differs significantly from general Polish, is given. Then, we describe the method of term extraction. In the process of finding related terms, we use lexical and syntactical information. We define term similarity based on: term contexts; coordinated sequences of terms; words that are parts of terms, e.g. their heads and modifiers. Then we performed several experiments with hierarchical clustering of the 300 most frequent terms. Finally, we describe the results and present an evaluation that compares the results with manually obtained groups.

PDF ImageFull Text

Download
Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 35.171.183.163

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Mykowiecka, A. and Marciniak, M. (2012). Clustering of Medical Terms based on Morpho-syntactic Features.In Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2012) ISBN 978-989-8565-30-3, pages 214-219. DOI: 10.5220/0004137502140219

@conference{keod12,
author={Agnieszka Mykowiecka. and Malgorzata Marciniak.},
title={Clustering of Medical Terms based on Morpho-syntactic Features},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2012)},
year={2012},
pages={214-219},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004137502140219},
isbn={978-989-8565-30-3},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2012)
TI - Clustering of Medical Terms based on Morpho-syntactic Features
SN - 978-989-8565-30-3
AU - Mykowiecka, A.
AU - Marciniak, M.
PY - 2012
SP - 214
EP - 219
DO - 10.5220/0004137502140219

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.