loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Bruno Oliveira and Orlando Belo

Affiliation: University of Minho, Portugal

Keyword(s): Data Warehousing Systems, ETL Conceptual Modelling, Task Clustering, ETL Patterns, ETL Skeletons, BPMN Specification Models, and Kettle.

Related Ontology Subjects/Areas/Topics: Biomedical Engineering ; Business Analytics ; Data Engineering ; Data Integrity ; Data Management and Quality ; Data Warehouse Management ; Databases and Data Security ; Enterprise Information Systems ; Health Information Systems ; Information and Systems Security ; Information Quality ; Information Systems Analysis and Specification ; Knowledge Management ; Ontologies and the Semantic Web ; Society, e-Business and e-Government ; Web Information Systems and Technologies

Abstract: Usually, data warehousing populating processes are data-oriented workflows composed by dozens of granular tasks that are responsible for the integration of data coming from different data sources. Specific subset of these tasks can be grouped on a collection together with their relationships in order to form higherlevel constructs. Increasing task granularity allows for the generalization of processes, simplifying their views and providing methods to carry out expertise to new applications. Well-proven practices can be used to describe general solutions that use basic skeletons configured and instantiated according to a set of specific integration requirements. Patterns can be applied to ETL processes aiming to simplify not only a possible conceptual representation but also to reduce the gap that often exists between two design perspectives. In this paper, we demonstrate the feasibility and effectiveness of an ETL pattern-based approach using task clustering, analyzing a real world E TL scenario through the definitions of two commonly used clusters of tasks: a data lookup cluster and a data conciliation and integration cluster. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.238.62.124

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Oliveira, B. and Belo, O. (2015). Task Clustering on ETL Systems - A Pattern-Oriented Approach. In Proceedings of 4th International Conference on Data Management Technologies and Applications - DATA; ISBN 978-989-758-103-8; ISSN 2184-285X, SciTePress, pages 207-214. DOI: 10.5220/0005559302070214

@conference{data15,
author={Bruno Oliveira. and Orlando Belo.},
title={Task Clustering on ETL Systems - A Pattern-Oriented Approach},
booktitle={Proceedings of 4th International Conference on Data Management Technologies and Applications - DATA},
year={2015},
pages={207-214},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005559302070214},
isbn={978-989-758-103-8},
issn={2184-285X},
}

TY - CONF

JO - Proceedings of 4th International Conference on Data Management Technologies and Applications - DATA
TI - Task Clustering on ETL Systems - A Pattern-Oriented Approach
SN - 978-989-758-103-8
IS - 2184-285X
AU - Oliveira, B.
AU - Belo, O.
PY - 2015
SP - 207
EP - 214
DO - 10.5220/0005559302070214
PB - SciTePress