Authors:
Sara Botelho Silveira (1) and Antonio Branco (2)
Affiliations:
(1) University of Lisbon, Portugal; (2) Universidade de Lisboa, Portugal
Keyword(s):
Multi-document Summarization, Sentence Simplification, Sentence Compression.
Related Ontology Subjects/Areas/Topics:
Applications; Artificial Intelligence; Knowledge Engineering and Ontology Development; Knowledge-Based Systems; Natural Language Processing; Pattern Recognition; Symbolic Systems
Abstract:
Multi-document summarization aims to create a single summary from the information conveyed by a collection of texts. Once the candidate sentences have been identified and ordered, the next step is to select which of them will be included in the summary. In this paper, we propose an approach that uses sentence simplification, both lexical and syntactic, to improve the compression step of the summarization process. Simplification is performed by removing specific sentential constructions that convey information considered less relevant to the general message of the summary. The rationale is that sentence simplification not only removes expendable information but also makes room for further relevant data in the summary.