DataShiftExplorer: Visualizing and Comparing Change in Multidimensional Data for Supervised Learning

Bruno Schneider, Daniel A. Keim, Mennatallah El-Assady

2020

Abstract

In supervised learning, to ensure the model's validity, it is essential to identify dataset shifts, i.e., when the data distribution changes from the one the model encountered at the time of training. To detect such changes, a comparative analysis of the multidimensional data distributions of the training data and new, unseen datasets is required. In this paper, we span the design space of visualizations for multidimensional comparative data analytics. Based on this design space, we present DataShiftExplorer, a technique tailored to identify and analyze the change in multidimensional data distributions. Throughout examples, we show how DataShiftExplorer facilitates the identification and analysis of data changes, supporting supervised learning.

Download


Paper Citation


in Harvard Style

Schneider B., Keim D. and El-Assady M. (2020). DataShiftExplorer: Visualizing and Comparing Change in Multidimensional Data for Supervised Learning. In Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 3: IVAPP; ISBN 978-989-758-402-2, SciTePress, pages 141-148. DOI: 10.5220/0008940801410148


in Bibtex Style

@conference{ivapp20,
author={Bruno Schneider and Daniel A. Keim and Mennatallah El-Assady},
title={DataShiftExplorer: Visualizing and Comparing Change in Multidimensional Data for Supervised Learning},
booktitle={Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 3: IVAPP},
year={2020},
pages={141-148},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008940801410148},
isbn={978-989-758-402-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 3: IVAPP
TI - DataShiftExplorer: Visualizing and Comparing Change in Multidimensional Data for Supervised Learning
SN - 978-989-758-402-2
AU - Schneider B.
AU - Keim D.
AU - El-Assady M.
PY - 2020
SP - 141
EP - 148
DO - 10.5220/0008940801410148
PB - SciTePress