loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Haidi Badr 1 ; Nayer Wanas 2 and Magda Fayek 3

Affiliations: 1 Electronics Researches Institute, Egypt ; 2 Cairo Microsoft Innovation Lab, Egypt ; 3 Cairo University, Egypt

ISBN: 978-989-8425-28-7

ISSN: 2184-3228

Keyword(s): LSA, Automatic dimension reduction ratio, Document-summarization.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Mining Text and Semi-Structured Data ; Symbolic Systems

Abstract: The role of text summarization algorithms is increasing in many applications; especially in the domain of information retrieval. In this work, we propose a generic single-document summarizer which is based on using the Latent Semantic Analysis (LSA). Generally in LSA, determining the dimension reduction ratio is usually performed experimentally which is data and document dependent. In this work, we propose a new approach to determine the dimension reduction ratio, DRr, automatically to overcome the manual determination problems. The proposed approach is tested using two benchmark datasets; namely DUC02 and LDC2008T19. The experimental results illustrate that the dimension reduction ratio obtained automatically improves the quality of the text summarization while providing a more optimal value for the DRr.

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 35.172.165.53

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Badr, H.; Wanas, N. and Fayek, M. (2010). AUTOLSA: AUTOMATIC DIMENSION REDUCTION OF LSA FOR SINGLE-DOCUMENT SUMMARIZATION.In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010) ISBN 978-989-8425-28-7, ISSN 2184-3228, pages 444-448. DOI: 10.5220/0003091904440448

@conference{kdir10,
author={Haidi Badr. and Nayer Wanas. and Magda Fayek.},
title={AUTOLSA: AUTOMATIC DIMENSION REDUCTION OF LSA FOR SINGLE-DOCUMENT SUMMARIZATION},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)},
year={2010},
pages={444-448},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003091904440448},
isbn={978-989-8425-28-7},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)
TI - AUTOLSA: AUTOMATIC DIMENSION REDUCTION OF LSA FOR SINGLE-DOCUMENT SUMMARIZATION
SN - 978-989-8425-28-7
AU - Badr, H.
AU - Wanas, N.
AU - Fayek, M.
PY - 2010
SP - 444
EP - 448
DO - 10.5220/0003091904440448

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.