Design and Implementation of the CloudMdsQL Multistore System

Boyan Kolev, Carlyna Bondiombouy, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jimenez-Peris, Raquel Pau, José Pereira

Abstract

The proliferation of cloud data management infrastructures has made multistore systems a major topic in today's cloud landscape. In this paper, we give an overview of the design of the Cloud Multidatastore Query Language (CloudMdsQL) and the implementation of its query engine. CloudMdsQL is a functional SQL-like language capable of querying multiple heterogeneous data stores (relational, NoSQL, HDFS) within a single query that can contain embedded invocations of each data store's native query interface. The major innovation is that a CloudMdsQL query can exploit the full power of local data stores by allowing some native data store queries (e.g., a breadth-first search query against a graph database) to be called as functions while still being optimized.
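The idea of calling native queries as functions can be illustrated with a minimal Python sketch. This is not CloudMdsQL syntax and not the engine's implementation. All names (`relational_orders`, `graph_bfs`, `hash_join`, `friends_with_large_orders`) and the sample data are hypothetical. The sketch assumes each store exposes a native call that returns rows, that a selection can be pushed into the relational call, and that a common layer joins the results.

```python
from collections import deque

# Hypothetical in-memory stand-ins for two heterogeneous data stores.
ORDERS = [  # "relational" store: (customer, amount)
    ("alice", 30), ("bob", 5), ("carol", 12), ("dave", 50),
]
FRIENDS = {  # "graph" store: adjacency lists
    "alice": ["bob", "carol"], "bob": ["dave"], "carol": [], "dave": [],
}


def relational_orders(min_amount=0):
    """Native relational query; the filter is evaluated inside the store."""
    return [{"customer": c, "amount": a} for c, a in ORDERS if a >= min_amount]


def graph_bfs(start, max_depth):
    """Native graph query: breadth-first search up to max_depth hops."""
    seen, queue, rows = {start}, deque([(start, 0)]), []
    while queue:
        node, depth = queue.popleft()
        if depth > 0:
            rows.append({"person": node, "depth": depth})
        if depth < max_depth:
            for nxt in FRIENDS.get(node, []):
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append((nxt, depth + 1))
    return rows


def hash_join(left, right, left_key, right_key):
    """Common-layer join over the rows returned by the native calls."""
    index = {}
    for row in right:
        index.setdefault(row[right_key], []).append(row)
    return [{**l, **r} for l in left for r in index.get(l[left_key], [])]


def friends_with_large_orders(start, max_depth, min_amount):
    """One 'query' that combines both stores through function calls."""
    reachable = graph_bfs(start, max_depth)   # native graph call
    orders = relational_orders(min_amount)    # selection pushed to the store
    return hash_join(reachable, orders, "person", "customer")
```

For example, `friends_with_large_orders("alice", 2, 10)` runs the breadth-first search in the graph stand-in, filters orders inside the relational stand-in, and joins the two row sets in the common layer. A real multistore engine would additionally choose the join order and decide which operations to push down.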



Paper Citation


in Harvard Style

Kolev B., Bondiombouy C., Levchenko O., Valduriez P., Jimenez-Peris R., Pau R. and Pereira J. (2016). Design and Implementation of the CloudMdsQL Multistore System. In Proceedings of the 6th International Conference on Cloud Computing and Services Science - Volume 1: DataDiversityConvergence, (CLOSER 2016) ISBN 978-989-758-182-3, pages 352-359. DOI: 10.5220/0005923803520359


in Bibtex Style

@conference{datadiversityconvergence16,
author={Boyan Kolev and Carlyna Bondiombouy and Oleksandra Levchenko and Patrick Valduriez and Ricardo Jimenez-Peris and Raquel Pau and José Pereira},
title={Design and Implementation of the CloudMdsQL Multistore System},
booktitle={Proceedings of the 6th International Conference on Cloud Computing and Services Science - Volume 1: DataDiversityConvergence, (CLOSER 2016)},
year={2016},
pages={352-359},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005923803520359},
isbn={978-989-758-182-3},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Conference on Cloud Computing and Services Science - Volume 1: DataDiversityConvergence, (CLOSER 2016)
TI - Design and Implementation of the CloudMdsQL Multistore System
SN - 978-989-758-182-3
AU - Kolev B.
AU - Bondiombouy C.
AU - Levchenko O.
AU - Valduriez P.
AU - Jimenez-Peris R.
AU - Pau R.
AU - Pereira J.
PY - 2016
SP - 352
EP - 359
DO - 10.5220/0005923803520359