A Comparative Analysis of JSON Schema Inference Algorithms

Ivan Veinhardt Latták, Pavel Koupil

2022

Abstract

NoSQL databases are becoming increasingly more popular due to their undeniable advantages in the context of storing and processing Big Data, mainly horizontal scalability and minimal requirement to define a schema upfront. In the absence of the explicit schema, however, an implicit schema inherent to the stored data still exists and it needs to be reverse engineered from the data. Once inferred, it is of a great value to the stakeholders and database maintainers. Nevertheless, the problem of schema inference is non-trivial and is still the subject of ongoing research. In this paper we provide a comparative analysis of five recent proposals of schema inference approaches targeting the JSON format. We provide both static and dynamic comparison of the approaches. In the former case we compare various features. In the latter case we involve both functional and performance analysis. Finally, we discuss remaining challenges and open problems.

Download


Paper Citation


in Harvard Style

Veinhardt Latták I. and Koupil P. (2022). A Comparative Analysis of JSON Schema Inference Algorithms. In Proceedings of the 17th International Conference on Evaluation of Novel Approaches to Software Engineering - Volume 1: ENASE, ISBN 978-989-758-568-5, pages 379-386. DOI: 10.5220/0011046000003176


in Bibtex Style

@conference{enase22,
author={Ivan Veinhardt Latták and Pavel Koupil},
title={A Comparative Analysis of JSON Schema Inference Algorithms},
booktitle={Proceedings of the 17th International Conference on Evaluation of Novel Approaches to Software Engineering - Volume 1: ENASE,},
year={2022},
pages={379-386},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011046000003176},
isbn={978-989-758-568-5},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 17th International Conference on Evaluation of Novel Approaches to Software Engineering - Volume 1: ENASE,
TI - A Comparative Analysis of JSON Schema Inference Algorithms
SN - 978-989-758-568-5
AU - Veinhardt Latták I.
AU - Koupil P.
PY - 2022
SP - 379
EP - 386
DO - 10.5220/0011046000003176