loading
Papers

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Amit Rudra ; Raj Gopalan and Narasimaha Achuthan

Affiliation: Curtin University, Australia

ISBN: 978-989-8565-10-5

Keyword(s): Sampling, Approximate Query Processing, Data Warehousing.

Related Ontology Subjects/Areas/Topics: Data Warehouses and OLAP ; Databases and Information Systems Integration ; Enterprise Information Systems

Abstract: Decision support queries usually involve accessing enormous amount of data requiring significant retrieval time. Faster retrieval of query results can often save precious time for the decision maker. Pre-computation of materialised views and sampling are two ways of achieving significant speed up. However, drawing random samples for queries on range restricted attributes has two problems: small random samples may miss relevant records and drawing larger samples from disk can be inefficient due to the large number of disk accesses required. In this paper, we propose an efficient indexing scheme for quickly drawing relevant samples for data warehouse queries as well as propose the concepts of database and sample relevancy ratios. We describe a method for estimating query results for range restricted queries using this index and experimentally evaluate the scheme using a relatively large real dataset. Further, we compute the confidence intervals for the estimates to investigate whether t he results can be guaranteed to be within the desired level of confidence. Our experiments on data from a retail data warehouse show promising results. We also report the levels of accuracy achieved for various types of aggregate queries and relate them to the database relevancy ratios of the queries. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.210.23.15

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Rudra, A.; Gopalan, R. and Achuthan, N. (2012). An Efficient Sampling Scheme for Approximate Processing of Decision Support Queries.In Proceedings of the 14th International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 978-989-8565-10-5, pages 16-26. DOI: 10.5220/0003995100160026

@conference{iceis12,
author={Amit Rudra. and Raj Gopalan. and Narasimaha Achuthan.},
title={An Efficient Sampling Scheme for Approximate Processing of Decision Support Queries},
booktitle={Proceedings of the 14th International Conference on Enterprise Information Systems - Volume 2: ICEIS,},
year={2012},
pages={16-26},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003995100160026},
isbn={978-989-8565-10-5},
}

TY - CONF

JO - Proceedings of the 14th International Conference on Enterprise Information Systems - Volume 2: ICEIS,
TI - An Efficient Sampling Scheme for Approximate Processing of Decision Support Queries
SN - 978-989-8565-10-5
AU - Rudra, A.
AU - Gopalan, R.
AU - Achuthan, N.
PY - 2012
SP - 16
EP - 26
DO - 10.5220/0003995100160026

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.