loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Dario De Nart and Carlo Tasso

Affiliation: University of Udine, Italy

ISBN: 978-989-758-024-6

Keyword(s): Keyphrase Extraction, Keyphrase Inference, Information Extraction, Text Classification, Text Summarization.

Related Ontology Subjects/Areas/Topics: Metadata and Metamodeling ; Ontology and the Semantic Web ; Web Information Systems and Technologies ; Web Interfaces and Applications

Abstract: The annotation of documents and web pages with semantic metatdata is an activity that can greatly increase the accuracy of Information Retrieval and Personalization systems, but the growing amount of text data available is too large for an extensive manual process. On the other hand, automatic keyphrase generation, a complex task involving Natural Language Processing and Knowledge Engineering, can significantly support this activity. Several different strategies have been proposed over the years, but most of them require extensive training data, which are not always available, suffer high ambiguity and differences in writing style, are highly domain-specific, and often rely on a well-structured knowledge that is very hard to acquire and encode. In order to overcome these limitations, we propose in this paper an innovative domain-independent approach that consists of an unsupervised keyphrase extraction phase and a subsequent keyphrase inference phase based on loosely structured, colla borative knowledge such as Wikipedia, Wordnik, and Urban Dictionary. This double layered approach allows us to generate keyphrases that both describe and classify the text. (More)

PDF ImageFull Text

Download
Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 54.146.195.24

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
De Nart D. and Tasso C. (2014). A Domain Independent Double Layered Approach to Keyphrase Generation.In Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-758-024-6, pages 305-312. DOI: 10.5220/0004855303050312

@conference{webist14,
author={Dario De Nart and Carlo Tasso},
title={A Domain Independent Double Layered Approach to Keyphrase Generation},
booktitle={Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2014},
pages={305-312},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004855303050312},
isbn={978-989-758-024-6},
}

TY - CONF

JO - Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - A Domain Independent Double Layered Approach to Keyphrase Generation
SN - 978-989-758-024-6
AU - De Nart D.
AU - Tasso C.
PY - 2014
SP - 305
EP - 312
DO - 10.5220/0004855303050312

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.