An Extensible Framework for Data Reliability Assessment

Óscar Oliveira, Bruno Oliveira

2022

Abstract

Data Warehouse (DW) and Data Lake (DL) systems are mature, widely used technologies for integrating data to support decision-making. They help organizations explore their operational data to gain competitive advantage. However, the amount of data generated by humans over the last 20 years has increased exponentially. As a result, the traditional data quality problems that can compromise the use of analytical systems have become more relevant, given the massive volumes and heterogeneous formats of the data. This paper describes an approach for dealing with data quality. Using a case study, quality metrics are identified to define a reliability indicator, allowing the identification of poor-quality records and of their impact on the data used to support enterprise analytics.


Paper Citation


in Harvard Style

Oliveira Ó. and Oliveira B. (2022). An Extensible Framework for Data Reliability Assessment. In Proceedings of the 24th International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 978-989-758-569-2, pages 77-84. DOI: 10.5220/0010863600003179


in Bibtex Style

@conference{iceis22,
author={Óscar Oliveira and Bruno Oliveira},
title={An Extensible Framework for Data Reliability Assessment},
booktitle={Proceedings of the 24th International Conference on Enterprise Information Systems - Volume 2: ICEIS},
year={2022},
pages={77-84},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010863600003179},
isbn={978-989-758-569-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 24th International Conference on Enterprise Information Systems - Volume 2: ICEIS
TI - An Extensible Framework for Data Reliability Assessment
SN - 978-989-758-569-2
AU - Oliveira Ó.
AU - Oliveira B.
PY - 2022
SP - 77
EP - 84
DO - 10.5220/0010863600003179