The GENIE Project - A Semantic Pipeline for Automatic Document Categorisation

Angel L. Garrido; Maria G. Buey; Sandra Escudero; Alvaro Peiro; Sergio Ilarri; Eduardo Mena

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

The GENIE Project - A Semantic Pipeline for Automatic Document Categorisation

Topics: Context Aware Media Tagging; Databases and Datawarehouses; Knowledge Management; Ontology and the Semantic Web; Text Mining

In Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, 161-171, 2014 , Barcelona, Spain

Authors: Angel L. Garrido ; Maria G. Buey ; Sandra Escudero ; Alvaro Peiro ; Sergio Ilarri and Eduardo Mena

Affiliation: University of Zaragoza, Spain

Keyword(s): Knowledge Management, Text Mining, Ontologies, Linguistics.

Related Ontology Subjects/Areas/Topics: Biomedical Engineering ; Context Aware Media Tagging ; Data Engineering ; Databases and Datawarehouses ; Enterprise Information Systems ; Health Information Systems ; Information Systems Analysis and Specification ; Internet Technology ; Knowledge Management ; Mobile Information Systems ; Ontologies and the Semantic Web ; Ontology and the Semantic Web ; Society, e-Business and e-Government ; Web Information Systems and Technologies ; Web Interfaces and Applications

Abstract: Automatic text categorisation systems is a type of software that every day it is receiving more interest, due not only to its use in documentaries environments but also to its possible application to tag properly documents on the Web. Many options have been proposed to face this subject using statistical approaches, natural language processing tools, ontologies and lexical databases. Nevertheless, there have been no too many empirical evaluations comparing the influence of the different tools used to solve these problems, particularly in a multilingual environment. In this paper we propose a multi-language rule-based pipeline system for automatic document categorisation and we compare empirically the results of applying techniques that rely on statistics and supervised learning with the results of applying the same techniques but with the support of smarter tools based on language semantics and ontologies, using for this purpose several corpora of documents. GENIE is being applied to real environments, which shows the potential of the proposal. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.133.131.168

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Garrido, A.; Buey, M.; Escudero, S.; Peiro, A.; Ilarri, S. and Mena, E. (2014). The GENIE Project - A Semantic Pipeline for Automatic Document Categorisation. In Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST; ISBN 978-989-758-024-6; ISSN 2184-3252, SciTePress, pages 161-171. DOI: 10.5220/0004750601610171

@conference{webist14,
author={Angel L. Garrido. and Maria G. Buey. and Sandra Escudero. and Alvaro Peiro. and Sergio Ilarri. and Eduardo Mena.},
title={The GENIE Project - A Semantic Pipeline for Automatic Document Categorisation},
booktitle={Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2014},
pages={161-171},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004750601610171},
isbn={978-989-758-024-6},
issn={2184-3252},
}

TY - CONF

JO - Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - The GENIE Project - A Semantic Pipeline for Automatic Document Categorisation
SN - 978-989-758-024-6
IS - 2184-3252
AU - Garrido, A.
AU - Buey, M.
AU - Escudero, S.
AU - Peiro, A.
AU - Ilarri, S.
AU - Mena, E.
PY - 2014
SP - 161
EP - 171
DO - 10.5220/0004750601610171
PB - SciTePress