Best Fit Missing Value Imputation (BFMVI) Algorithm for Incomplete Data in the Internet of Things

Benjamin Agbo, Yongrui Qin, Richard Hill

Abstract

The noticeable growth in the adoption of Internet of Things (IoT) technologies, has led to the generation of large amounts of data usually from sensor devices. When dealing with massive amounts of data, it is very common to observe databases with large amounts of missing values. This is a challenge for data miners because various methods for data analysis only work well on complete databases. A popular way to deal with this challenge is to fill-in (impute) missing values using adequate estimation techniques. Unfortunately, a good number of existing methods rely on all the observed values in the entire dataset to estimate missing values, which significantly causes unfavourable effects (low accuracy and high complexity) on imputed results. In this paper, we propose a novel imputation technique based on data clustering and a robust selection of adequate imputation equations for each missing datapoint. We evaluate our proposed method using six University of California Irvine (UCI) datasets, and relevant comparison with five recently proposed imputation methods. The results presented showed that the performance of the proposed imputation method is comparable with the Local Similarity Imputation (LSI) technique in terms of imputation accuracy, but is significantly less complex than all the existing methods identified.

Download


Paper Citation


in Harvard Style

Agbo B., Qin Y. and Hill R. (2020). Best Fit Missing Value Imputation (BFMVI) Algorithm for Incomplete Data in the Internet of Things.In Proceedings of the 5th International Conference on Internet of Things, Big Data and Security - Volume 1: IoTBDS, ISBN 978-989-758-426-8, pages 130-137. DOI: 10.5220/0009578201300137


in Bibtex Style

@conference{iotbds20,
author={Benjamin Agbo and Yongrui Qin and Richard Hill},
title={Best Fit Missing Value Imputation (BFMVI) Algorithm for Incomplete Data in the Internet of Things},
booktitle={Proceedings of the 5th International Conference on Internet of Things, Big Data and Security - Volume 1: IoTBDS,},
year={2020},
pages={130-137},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009578201300137},
isbn={978-989-758-426-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 5th International Conference on Internet of Things, Big Data and Security - Volume 1: IoTBDS,
TI - Best Fit Missing Value Imputation (BFMVI) Algorithm for Incomplete Data in the Internet of Things
SN - 978-989-758-426-8
AU - Agbo B.
AU - Qin Y.
AU - Hill R.
PY - 2020
SP - 130
EP - 137
DO - 10.5220/0009578201300137