Author: Roy Ruddle

Affiliation: School of Computing and Leeds Institute for Data Analytics, University of Leeds, Leeds, U.K.

Keyword(s): Visualization, Data Quality, Data Science, Empirical Study.

Abstract: Previous work has identified more than 100 distinct characteristics of data quality, most of which are aspects of completeness, accuracy and consistency. Other work has developed new techniques for visualizing data quality, but there is a lack of research into how users visualize data quality issues with existing, well-known techniques. We investigated how 166 participants identified and illustrated data quality issues that occurred in a 54-file, longitudinal collection of open data. The issues that participants identified spanned 27 different characteristics, nine of which do not appear in existing data quality taxonomies. Participants adopted nine visualization and tabular methods to illustrate the issues, using the methods in five ways (quantify; alert; examples; serendipitous discovery; explain). The variety of serendipitous discoveries was noteworthy, as was how rarely participants used visualization to illustrate completeness and consistency, compared with accuracy. We conclude by presenting a 106-item data quality taxonomy that combines seven previous works with our findings.

CC BY-NC-ND 4.0

Paper citation in several formats:
Ruddle, R. (2023). Using Well-Known Techniques to Visualize Characteristics of Data Quality. In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - IVAPP; ISBN 978-989-758-634-7; ISSN 2184-4321, SciTePress, pages 89-100. DOI: 10.5220/0011664300003417

@conference{ivapp23,
author={Roy Ruddle},
title={Using Well-Known Techniques to Visualize Characteristics of Data Quality},
booktitle={Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - IVAPP},
year={2023},
pages={89-100},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011664300003417},
isbn={978-989-758-634-7},
issn={2184-4321},
}

TY - CONF
JO - Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - IVAPP
TI - Using Well-Known Techniques to Visualize Characteristics of Data Quality
SN - 978-989-758-634-7
IS - 2184-4321
AU - Ruddle, R.
PY - 2023
SP - 89
EP - 100
DO - 10.5220/0011664300003417
PB - SciTePress