Extraction Process of the Logical Schema of a Document-oriented NoSQL Database

Fatma Abdelhedi, Hela Rajhi, Gilles Zurfluh

2022

Abstract

The "schemaless" property, common to most NoSQL systems, means the absence of a data schema when creating a database (DB). This property brings an undeniable flexibility by allowing the schema to evolve during the use of DB. However, the absence of a schema is a major obstacle for developers and decision makers. Indeed, the expression of queries (of SQL type) requires a precise knowledge of this schema. In this paper, we propose an automatic process to extract the logical schema of document-oriented NoSQL DBs. We chose the OrientDB NoSQL system which appeared to be the most suitable for the application in our project, because of its ability to express rich data structures and a diversity of links between data: association, composition and inheritance links. Our solution, based on the MDA architecture, proposes to metamodel a NoSQL DB and its schema. From these metamodels, transformation rules allow to extract the schema of the DB. The implementation of this process on an OrientDB DB allows users to have all the necessary elements (class names, properties, data types and links) for the elaboration of queries. An experimentation of the process was carried out on three test-DBs as well as on two massive industrial DBs.

Download


Paper Citation


in Harvard Style

Abdelhedi F., Rajhi H. and Zurfluh G. (2022). Extraction Process of the Logical Schema of a Document-oriented NoSQL Database. In Proceedings of the 10th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD, ISBN 978-989-758-550-0, pages 61-71. DOI: 10.5220/0010899000003119


in Bibtex Style

@conference{modelsward22,
author={Fatma Abdelhedi and Hela Rajhi and Gilles Zurfluh},
title={Extraction Process of the Logical Schema of a Document-oriented NoSQL Database},
booktitle={Proceedings of the 10th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD,},
year={2022},
pages={61-71},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010899000003119},
isbn={978-989-758-550-0},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 10th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD,
TI - Extraction Process of the Logical Schema of a Document-oriented NoSQL Database
SN - 978-989-758-550-0
AU - Abdelhedi F.
AU - Rajhi H.
AU - Zurfluh G.
PY - 2022
SP - 61
EP - 71
DO - 10.5220/0010899000003119