Natural Language Processing Techniques for Document Classification in IT Benchmarking - Automated Identification of Domain Specific Terms

Matthias Pfaff; Helmut Krcmar

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Natural Language Processing Techniques for Document Classification in IT Benchmarking - Automated Identification of Domain Specific Terms

Topics: Coupling and Integrating Heterogeneous Data Sources; Data Mining; Natural Language Interfaces to Intelligent Systems; Ontology Engineering

In Proceedings of the 17th International Conference on Enterprise Information Systems - Volume 2: ICEIS, 360-366, 2015 , Barcelona, Spain

Authors: Matthias Pfaff ¹ and Helmut Krcmar ²

Affiliations: ¹ fortiss GmbH An-Institut Technische Universität München, Germany ; ² Technische Universität München, Germany

Keyword(s): IT Benchmarking, Natural Language Processing, Heterogeneous Data, Semantic Data Integration, Ontologies.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Artificial Intelligence and Decision Support Systems ; Coupling and Integrating Heterogeneous Data Sources ; Data Engineering ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Information Systems Analysis and Specification ; Knowledge Engineering and Ontology Development ; Knowledge-Based Systems ; Natural Language Interfaces to Intelligent Systems ; Ontologies and the Semantic Web ; Ontology Engineering ; Sensor Networks ; Signal Processing ; Soft Computing ; Symbolic Systems

Abstract: In the domain of IT benchmarking collected data are often stored in natural language text and therefore intrinsically unstructured. To ease data analysis and data evaluations across different types of IT benchmarking approaches a semantic representation of this information is crucial. Thus, the identification of conceptual (semantical) similarities is the first step in the development of an integrative data management in this domain. As an ontology is a specification of such a conceptualization an association of terms, relations between terms and related instances must be developed. Building on previous research we present an approach for an automated term extraction by the use of natural language processing (NLP) techniques. Terms are automatically extracted out of existing IT benchmarking documents leading to a domain specific dictionary. These extracted terms are representative for each document and describe the purpose and content of each file and server as a basis for the ontolo gy development process in the domain of IT benchmarking. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.12

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Pfaff, M., Krcmar and H. (2015). Natural Language Processing Techniques for Document Classification in IT Benchmarking - Automated Identification of Domain Specific Terms. In Proceedings of the 17th International Conference on Enterprise Information Systems - Volume 2: ICEIS; ISBN 978-989-758-096-3; ISSN 2184-4992, SciTePress, pages 360-366. DOI: 10.5220/0005462303600366

@conference{iceis15,
author={Matthias Pfaff and Helmut Krcmar},
title={Natural Language Processing Techniques for Document Classification in IT Benchmarking - Automated Identification of Domain Specific Terms},
booktitle={Proceedings of the 17th International Conference on Enterprise Information Systems - Volume 2: ICEIS},
year={2015},
pages={360-366},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005462303600366},
isbn={978-989-758-096-3},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the 17th International Conference on Enterprise Information Systems - Volume 2: ICEIS
TI - Natural Language Processing Techniques for Document Classification in IT Benchmarking - Automated Identification of Domain Specific Terms
SN - 978-989-758-096-3
IS - 2184-4992
AU - Pfaff, M.
AU - Krcmar, H.
PY - 2015
SP - 360
EP - 366
DO - 10.5220/0005462303600366
PB - SciTePress