Authors: Yingying Wen, Kevin B. Korb and Ann E. Nicholson

Affiliation: Monash University, Australia

ISBN: 978-989-8111-66-1

Keyword(s): Machine learning, Incomplete data, Data generation, Data analysis, Missing data, Data mining, Machine learning evaluation.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence; Computational Intelligence; Data Mining; Databases and Information Systems Integration; Enterprise Information Systems; Evolutionary Computing; Knowledge Discovery and Information Retrieval; Knowledge-Based Systems; Machine Learning; Sensor Networks; Signal Processing; Soft Computing; Symbolic Systems

Abstract: Evaluating the relative performance of machine learners on incomplete data is important because real data is often incomplete: some values are simply not present. DataZapper is a tool for uncreating data. Given a dataset containing joint samples over variables, DataZapper makes a specified percentage of observed values disappear, replacing each with an indication that the measurement failed. Since the causal mechanisms that result in failed measurements may depend in arbitrary ways upon the system under study, it is important to be able to produce incomplete datasets that allow for such arbitrary dependencies. DataZapper is the only tool that allows any kind of dependence, and any degree of dependence, in its generation of missing data. We illustrate its use in a machine learning experiment and offer it to the data mining and machine learning communities.
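The idea in the abstract can be illustrated with a minimal Python sketch. This is not DataZapper's implementation or interface; the names `MISSING`, `zap_uniform`, `zap_dependent` and the `weight` callback are hypothetical and exist only for this example. The first function removes a fixed percentage of values from one variable independently of the data; the second removes the same percentage but lets the chance of removal depend on other values in the same sample.

```python
import random

# Marker for a failed measurement (an assumption of this sketch).
MISSING = None


def zap_uniform(rows, column, fraction, seed=0):
    """Blank out round(fraction * n) values of `column`, chosen uniformly.

    Missingness here does not depend on any value in the data.
    """
    rng = random.Random(seed)
    n_missing = round(fraction * len(rows))
    targets = set(rng.sample(range(len(rows)), n_missing))
    # Return copies so the complete dataset is left untouched.
    return [
        {**row, column: MISSING} if i in targets else dict(row)
        for i, row in enumerate(rows)
    ]


def zap_dependent(rows, column, fraction, weight, seed=0):
    """Blank out the same number of values, favouring rows with high weight.

    `weight(row)` must return a positive number; larger weights make the
    row more likely to lose its value, so missingness depends on the data.
    """
    rng = random.Random(seed)
    n_missing = round(fraction * len(rows))
    # Weighted sampling without replacement (Efraimidis-Spirakis keys):
    # draw u in [0, 1), use u ** (1 / w) as the key, keep the largest keys.
    keys = [(rng.random() ** (1.0 / weight(row)), i) for i, row in enumerate(rows)]
    targets = {i for _, i in sorted(keys, reverse=True)[:n_missing]}
    return [
        {**row, column: MISSING} if i in targets else dict(row)
        for i, row in enumerate(rows)
    ]
```

For example, calling `zap_dependent(rows, "income", 0.2, lambda r: 9.0 if r["age"] >= 70 else 1.0)` on a list of dictionaries with `age` and `income` keys would remove 20% of the `income` values, with older respondents more likely to be affected. Varying the weight function changes both the kind and the degree of dependence, which is the property the abstract emphasises.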

License: CC BY-NC-ND 4.0

Paper citation in several formats:
Wen, Y.; Korb, K. B. and Nicholson, A. E. (2009). DATAZAPPER: GENERATING INCOMPLETE DATASETS. In Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8111-66-1, pages 69-76. DOI: 10.5220/0001660700690076

@conference{icaart09,
author={Yingying Wen and Kevin B. Korb and Ann E. Nicholson},
title={DATAZAPPER: GENERATING INCOMPLETE DATASETS},
booktitle={Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2009},
pages={69-76},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001660700690076},
isbn={978-989-8111-66-1},
}

TY - CONF
JO - Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - DATAZAPPER: GENERATING INCOMPLETE DATASETS
SN - 978-989-8111-66-1
AU - Wen, Y.
AU - Korb, K. B.
AU - Nicholson, A. E.
PY - 2009
SP - 69
EP - 76
DO - 10.5220/0001660700690076
ER -
