A Graph-based Approach at Passage Level to Investigate the Cohesiveness of Documents

Ghulam Sarwar, Colm O’Riordan

2021

Abstract

Approaches involving the representation of documents as a series of passages have been used in the past to improve the performance of ad-hoc retrieval systems. In this paper, we represent the top returned passages as a graph with each passage corresponding to a vertex. We connected the vertices (passages) that belongs to the same document to form a graph. The underlying intuition behind this approach is to identify some measure of the cohesiveness of the documents. We introduce a graph-based approach at the passage level to calculate the cohesion score of each document. The scores for both relevant and non-relevant documents are compared, and we illustrate that the cohesion score differs for relevant and non-relevant. Moreover, we also re-ranked the documents by applying the cohesion score with a document similarity score to inspect its impact on the system’s performance.

Download


Paper Citation


in Harvard Style

Sarwar G. and O’Riordan C. (2021). A Graph-based Approach at Passage Level to Investigate the Cohesiveness of Documents. In Proceedings of the 10th International Conference on Data Science, Technology and Applications - Volume 1: DATA, ISBN 978-989-758-521-0, pages 115-123. DOI: 10.5220/0010619101150123


in Bibtex Style

@conference{data21,
author={Ghulam Sarwar and Colm O’Riordan},
title={A Graph-based Approach at Passage Level to Investigate the Cohesiveness of Documents},
booktitle={Proceedings of the 10th International Conference on Data Science, Technology and Applications - Volume 1: DATA,},
year={2021},
pages={115-123},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010619101150123},
isbn={978-989-758-521-0},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 10th International Conference on Data Science, Technology and Applications - Volume 1: DATA,
TI - A Graph-based Approach at Passage Level to Investigate the Cohesiveness of Documents
SN - 978-989-758-521-0
AU - Sarwar G.
AU - O’Riordan C.
PY - 2021
SP - 115
EP - 123
DO - 10.5220/0010619101150123