Authors: Eric Paquet (1), Herna L. Viktor (2) and Hongyu Guo (3)

Affiliations: (1) National Research Council of Canada and University of Ottawa, Canada; (2) University of Ottawa, Canada; (3) National Research Council of Canada, Canada

ISBN: 978-989-8425-79-9

Keyword(s): Data pre-processing, Aggregation, Gaussian distribution, Lévy distribution.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Foundations of Knowledge Discovery in Databases ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Pre-Processing and Post-Processing for Data Mining ; Structured Data Analysis and Statistical Methods ; Symbolic Systems

Abstract: Consider a scenario in which one aims to learn models from data characterized by very large fluctuations that are attributable neither to noise nor to outliers. This may be the case, for instance, when examining supermarket ketchup sales, predicting earthquakes, or conducting financial data analysis. In such situations, the standard central limit theorem does not apply, since the associated Gaussian distribution exponentially suppresses large fluctuations. In this paper, we argue that, in many cases, incorrectly assuming a Gaussian distribution leads to misleading and incorrect data mining results. We illustrate this argument with synthetic data and show some results on stock market data.
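The point about aggregation can be illustrated with a small simulation. The sketch below is not the authors' method or data: it assumes a standard Cauchy distribution as a stand-in heavy-tailed law (Cauchy is one member of the Lévy stable family), and the function names `gaussian_draw`, `cauchy_draw`, `iqr_of_means` and `compare` are hypothetical. It averages blocks of draws and measures the interquartile range (IQR) of the block means. For Gaussian draws the spread of the means shrinks as the block size grows; for Cauchy draws it does not, because the mean of independent standard Cauchy draws is again standard Cauchy.

```python
import math
import random
import statistics


def gaussian_draw(rng):
    """One draw from a standard Gaussian (finite variance)."""
    return rng.gauss(0.0, 1.0)


def cauchy_draw(rng):
    """One draw from a standard Cauchy via inverse-CDF sampling.

    Assumption: Cauchy is used only as a convenient heavy-tailed
    example with infinite variance, not as the paper's model.
    """
    return math.tan(math.pi * (rng.random() - 0.5))


def iqr_of_means(draw, block_size, n_blocks, rng):
    """Aggregate draws into block means and return the IQR of those means."""
    means = [
        statistics.fmean(draw(rng) for _ in range(block_size))
        for _ in range(n_blocks)
    ]
    q1, _, q3 = statistics.quantiles(means, n=4)
    return q3 - q1


def compare(block_sizes=(1, 25, 400), n_blocks=2000, seed=0):
    """Return (block_size, gaussian_iqr, cauchy_iqr) for each block size."""
    rng = random.Random(seed)
    rows = []
    for size in block_sizes:
        gaussian_iqr = iqr_of_means(gaussian_draw, size, n_blocks, rng)
        cauchy_iqr = iqr_of_means(cauchy_draw, size, n_blocks, rng)
        rows.append((size, gaussian_iqr, cauchy_iqr))
    return rows


if __name__ == "__main__":
    for size, gaussian_iqr, cauchy_iqr in compare():
        print(f"block={size:4d}  gaussian IQR={gaussian_iqr:.3f}  "
              f"cauchy IQR={cauchy_iqr:.3f}")
```

The IQR is used instead of the standard deviation because the sample standard deviation of Cauchy data is dominated by a few extreme draws and does not settle as more data arrives.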

CC BY-NC-ND 4.0

Paper citation in several formats:
Paquet, E.; Viktor, H. L. and Guo, H. (2011). TO AGGREGATE OR NOT TO AGGREGATE: THAT IS THE QUESTION. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2011), ISBN 978-989-8425-79-9, pages 346-349. DOI: 10.5220/0003686903540357

@conference{kdir11,
author={Eric Paquet and Herna L. Viktor and Hongyu Guo},
title={TO AGGREGATE OR NOT TO AGGREGATE: THAT IS THE QUESTION},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2011)},
year={2011},
pages={346--349},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003686903540357},
isbn={978-989-8425-79-9},
}

TY  - CONF
JO  - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2011)
TI  - TO AGGREGATE OR NOT TO AGGREGATE: THAT IS THE QUESTION
SN  - 978-989-8425-79-9
AU  - Paquet, E.
AU  - Viktor, H. L.
AU  - Guo, H.
PY  - 2011
SP  - 346
EP  - 349
DO  - 10.5220/0003686903540357
ER  -
