Authors:
Jung Song Lee
;
Han Hee Hahm
;
Seong Soo Chang
and
Soon Cheol Park
Affiliation:
Chonbuk National University, Korea, Republic of
Keyword(s):
Automatic Document Summarization, Sentence Clustering, Extractive Summarization, Multi-objective Genetic Algorithm, NSGA-II, SPEA2, Normalized Google Distance.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Computational Intelligence
;
Evolutionary Computing
;
Evolutionary Multiobjective Optimization
;
Genetic Algorithms
;
Informatics in Control, Automation and Robotics
;
Intelligent Control Systems and Optimization
;
Soft Computing
Abstract:
In this paper, automatic document summarization using the sentence clustering algorithms, NSGA-II and SPEA2, is proposed. These algorithms are very effective to extract the most important and non-redundant sentences from a document. Using these, we cluster similar sentences as many groups as we need and extract the most important sentence in each group. After clustering, we rearrange the extracted sentences in the same order as in the document to generate readable summary. We tested this technique with two of the open benchmark datasets, DUC01 and DUC02. To evaluate the performances, we used F-measure and ROUGE. The experimental results show the performances of these MOGAs, NSGA-II and SPEA2, are better than those of the existing algorithms.