DATAZAPPER: GENERATING INCOMPLETE DATASETS

Yingying Wen; Kevin B. Korb; Ann E. Nicholson

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

DATAZAPPER: GENERATING INCOMPLETE DATASETS

Topics: Data Mining; Machine Learning

In Proceedings of the International Conference on Agents and Artificial Intelligence ICAART - Volume 1, 69-76, 2009 , Porto, Portugal

Authors: Yingying Wen ; Kevin B. Korb and Ann E. Nicholson

Affiliation: Monash University, Australia

Keyword(s): Machine learning, Incomplete data, Data generation, Data analysis, Missing data, Data mining, Machine learning evaluation.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Computational Intelligence ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Evolutionary Computing ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Sensor Networks ; Signal Processing ; Soft Computing ; Symbolic Systems

Abstract: Evaluating the relative performance of machine learners on incomplete data is important because one common problem with real data is that the data is often incomplete, which means that some values in the data are not present. DataZapper is a tool for uncreating data: given a dataset containing joint samples over variables, DataZapper will make a specified percentage of observed values disappear, replaced by an indication that the measurement failed. Since the causal mechanisms of measurement that result in failed measurements may depend in arbitrary ways upon the system under study, it is important to be able to produce incomplete data sets which allow for such arbitrary dependencies. DataZapper is the only tool that allows any kind of dependence, and any degree of dependence, in its generation of missing data. We illustrate its use in a machine learning experiment and offer it to the data mining and machine learning communities.

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.108

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Wen, Y., B. Korb, K. and E. Nicholson, A. (2009). DATAZAPPER: GENERATING INCOMPLETE DATASETS. In Proceedings of the International Conference on Agents and Artificial Intelligence - ICAART; ISBN 978-989-8111-66-1; ISSN 2184-433X, SciTePress, pages 69-76. DOI: 10.5220/0001660700690076

@conference{icaart09,
author={Yingying Wen and Kevin {B. Korb} and Ann {E. Nicholson}},
title={ DATAZAPPER: GENERATING INCOMPLETE DATASETS},
booktitle={Proceedings of the International Conference on Agents and Artificial Intelligence - ICAART},
year={2009},
pages={69-76},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001660700690076},
isbn={978-989-8111-66-1},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the International Conference on Agents and Artificial Intelligence - ICAART
TI - DATAZAPPER: GENERATING INCOMPLETE DATASETS
SN - 978-989-8111-66-1
IS - 2184-433X
AU - Wen, Y.
AU - B. Korb, K.
AU - E. Nicholson, A.
PY - 2009
SP - 69
EP - 76
DO - 10.5220/0001660700690076
PB - SciTePress