loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Paulo Da Silva Carvalho 1 ; Patrik Hitzelberger 1 ; Fatma Bouali 2 and Gilles Venturini 2

Affiliations: 1 Luxembourg Institute of Science and Technology, Luxembourg ; 2 University François Rabelais of Tours, France

Keyword(s): Data Quality, Missing Values, Open Data, CSV.

Related Ontology Subjects/Areas/Topics: Data Engineering ; Data Management and Quality ; Data Modeling and Visualization ; Data Structures and Data Management Algorithms ; Information Quality ; Open Data

Abstract: Nowadays, more and more information is flowing in and is provided on the Web. Large datasets are made available covering many fields and sectors. Open Data (OD) plays an important role in this field. Thanks to the volumes and the variety of the released datasets, OD brings high societal and business potential. In order to realize this potential, the reuse of the datasets (e.g. in internal business processes) becomes primordial. However, if the aim is to reuse OD, it is also necessary to be able of assessing its quality. This paper demonstrates how Information Visualization may help on this task and presents Stacktab chart - a new chart to analyse and assess CSV files in order to understand their structure, identify the location of relevant information and detect possible problems in the datasets.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.205.114.205

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Da Silva Carvalho, P.; Hitzelberger, P.; Bouali, F. and Venturini, G. (2015). A Visual Technique to Assess the Quality of Datasets - Understanding the Structure and Detecting Errors and Missing Values in Open Data CSV Files. In Proceedings of 4th International Conference on Data Management Technologies and Applications - DATA; ISBN 978-989-758-103-8; ISSN 2184-285X, SciTePress, pages 134-141. DOI: 10.5220/0005496601340141

@conference{data15,
author={Paulo {Da Silva Carvalho}. and Patrik Hitzelberger. and Fatma Bouali. and Gilles Venturini.},
title={A Visual Technique to Assess the Quality of Datasets - Understanding the Structure and Detecting Errors and Missing Values in Open Data CSV Files},
booktitle={Proceedings of 4th International Conference on Data Management Technologies and Applications - DATA},
year={2015},
pages={134-141},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005496601340141},
isbn={978-989-758-103-8},
issn={2184-285X},
}

TY - CONF

JO - Proceedings of 4th International Conference on Data Management Technologies and Applications - DATA
TI - A Visual Technique to Assess the Quality of Datasets - Understanding the Structure and Detecting Errors and Missing Values in Open Data CSV Files
SN - 978-989-758-103-8
IS - 2184-285X
AU - Da Silva Carvalho, P.
AU - Hitzelberger, P.
AU - Bouali, F.
AU - Venturini, G.
PY - 2015
SP - 134
EP - 141
DO - 10.5220/0005496601340141
PB - SciTePress