loading
Papers

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Karolis Kleiza ; Patrick Klein and Klaus-Dieter Thoben

Affiliation: BIBA - Bremer Institut für Produktion und Logistik GmbH, Germany

ISBN: 978-989-8425-28-7

Keyword(s): Similarity Computation, Word Similarity, Probabilistic Topic Model, Latent Dirichlet Allocation, Word Highlighting.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Mining Text and Semi-Structured Data ; Pre-Processing and Post-Processing for Data Mining ; Symbolic Systems ; Visual Data Mining and Data Visualization

Abstract: This paper gives at first an introduction to similarity computation and text summarization of documents by usage of a probabilistic topic model, especially Latent Dirichlet Allocation (LDA). Afterwards it provides a discussion about the need of a better understanding for the reason and transparency at all for the end-user why documents with a computed similarity actually are similar to a given search query. The authors propose for that an approach to identify and highlight words with respect to their semantic relevance directly within documents and provide a theoretical background as well as an adequate visual assignment for that approach.

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.81.29.226

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Kleiza, K.; Klein, P. and Thoben, K. (2010). SEMANTIC IDENTIFICATION AND VISUALIZATION OF SIGNIFICANT WORDS WITHIN DOCUMENTS - Approach to Visualize Relevant Words within Documents to a Search Query by Word Similarity Computation.In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010) ISBN 978-989-8425-28-7, pages 481-486. DOI: 10.5220/0003099004810486

@conference{kdir10,
author={Karolis Kleiza. and Patrick Klein. and Klaus{-}Dieter Thoben.},
title={SEMANTIC IDENTIFICATION AND VISUALIZATION OF SIGNIFICANT WORDS WITHIN DOCUMENTS - Approach to Visualize Relevant Words within Documents to a Search Query by Word Similarity Computation},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)},
year={2010},
pages={481-486},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003099004810486},
isbn={978-989-8425-28-7},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)
TI - SEMANTIC IDENTIFICATION AND VISUALIZATION OF SIGNIFICANT WORDS WITHIN DOCUMENTS - Approach to Visualize Relevant Words within Documents to a Search Query by Word Similarity Computation
SN - 978-989-8425-28-7
AU - Kleiza, K.
AU - Klein, P.
AU - Thoben, K.
PY - 2010
SP - 481
EP - 486
DO - 10.5220/0003099004810486

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.