loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Matthias Pfaff 1 and Helmut Krcmar 2

Affiliations: 1 fortiss GmbH An-Institut Technische Universität München, Germany ; 2 Technische Universität München, Germany

ISBN: 978-989-758-096-3

Keyword(s): IT Benchmarking, Natural Language Processing, Heterogeneous Data, Semantic Data Integration, Ontologies.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Artificial Intelligence and Decision Support Systems ; Coupling and Integrating Heterogeneous Data Sources ; Data Engineering ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Information Systems Analysis and Specification ; Knowledge Engineering and Ontology Development ; Knowledge-Based Systems ; Natural Language Interfaces to Intelligent Systems ; Ontologies and the Semantic Web ; Ontology Engineering ; Sensor Networks ; Signal Processing ; Soft Computing ; Symbolic Systems

Abstract: In the domain of IT benchmarking collected data are often stored in natural language text and therefore intrinsically unstructured. To ease data analysis and data evaluations across different types of IT benchmarking approaches a semantic representation of this information is crucial. Thus, the identification of conceptual (semantical) similarities is the first step in the development of an integrative data management in this domain. As an ontology is a specification of such a conceptualization an association of terms, relations between terms and related instances must be developed. Building on previous research we present an approach for an automated term extraction by the use of natural language processing (NLP) techniques. Terms are automatically extracted out of existing IT benchmarking documents leading to a domain specific dictionary. These extracted terms are representative for each document and describe the purpose and content of each file and server as a basis for the ontolog y development process in the domain of IT benchmarking. (More)

PDF ImageFull Text

Download
Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 54.242.193.41

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Pfaff, M. and Krcmar, H. (2015). Natural Language Processing Techniques for Document Classification in IT Benchmarking - Automated Identification of Domain Specific Terms.In Proceedings of the 17th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-758-096-3, pages 360-366. DOI: 10.5220/0005462303600366

@conference{iceis15,
author={Matthias Pfaff. and Helmut Krcmar.},
title={Natural Language Processing Techniques for Document Classification in IT Benchmarking - Automated Identification of Domain Specific Terms},
booktitle={Proceedings of the 17th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2015},
pages={360-366},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005462303600366},
isbn={978-989-758-096-3},
}

TY - CONF

JO - Proceedings of the 17th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - Natural Language Processing Techniques for Document Classification in IT Benchmarking - Automated Identification of Domain Specific Terms
SN - 978-989-758-096-3
AU - Pfaff, M.
AU - Krcmar, H.
PY - 2015
SP - 360
EP - 366
DO - 10.5220/0005462303600366

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.