Machine Learning-based Query Augmentation for SPARQL Endpoints

Mariano Rico, Rizkallah Touma, Anna Queralt, María S. Pérez

2018

Abstract

Linked Data repositories have become a popular source of publicly-available data. Users accessing this data through SPARQL endpoints usually launch several restrictive yet similar consecutive queries, either to find the information they need through trial-and-error or to query related resources. However, instead of executing each individual query separately, query augmentation aims at modifying the incoming queries to retrieve more data that is potentially relevant to subsequent requests. In this paper, we propose a novel approach to query augmentation for SPARQL endpoints based on machine learning. Our approach separates the structure of the query from its contents and measures two types of similarity, which are then used to predict the structure and contents of the augmented query. We test the approach on the real-world query logs of the Spanish and English DBpedia and show that our approach yields high-accuracy prediction. We also show that, by caching the results of the predicted augmented queries, we can retrieve data relevant to several subsequent queries at once, achieving a higher cache hit rate than previous approaches.

Download


Paper Citation


in Harvard Style

Rico M., Touma R., Queralt A. and Pérez M. (2018). Machine Learning-based Query Augmentation for SPARQL Endpoints.In Proceedings of the 14th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-758-324-7, pages 57-67. DOI: 10.5220/0006925300570067


in Bibtex Style

@conference{webist18,
author={Mariano Rico and Rizkallah Touma and Anna Queralt and María S. Pérez},
title={Machine Learning-based Query Augmentation for SPARQL Endpoints},
booktitle={Proceedings of the 14th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2018},
pages={57-67},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006925300570067},
isbn={978-989-758-324-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - Machine Learning-based Query Augmentation for SPARQL Endpoints
SN - 978-989-758-324-7
AU - Rico M.
AU - Touma R.
AU - Queralt A.
AU - Pérez M.
PY - 2018
SP - 57
EP - 67
DO - 10.5220/0006925300570067