Authors: R. Raminhos (1) and J. Moura-Pires (2)

Affiliations: (1) UNINOVA – Desenvolvimento de Novas Tecnologias, Portugal; (2) CENTRIA/FCT, Portugal

ISBN: 978-972-8865-88-7

Keyword(s): ETD, ETL, IL, Declarative Language, Semi-Structured Text Files.

Related Ontology Subjects/Areas/Topics: Coupling and Integrating Heterogeneous Data Sources ; Databases and Information Systems Integration ; Enterprise Information Systems

Abstract: The World Wide Web is a major source of textual information in a human-readable, semi-structured format, covering multiple domains, some of them highly complex. Traditional ETL approaches, which rely on developing specific source code for each data source and on repeated interactions between domain experts and computer-science experts, are an inadequate solution: time-consuming and error-prone. This paper presents a novel approach to ETL based on its decomposition into two phases: ETD (Extraction, Transformation and Data Delivery) and IL (Integration and Loading). The ETD proposal is supported by a declarative language for expressing ETD statements and a graphical application for interacting with the domain expert. Applying ETD requires mainly domain expertise, while computer-science expertise is concentrated in the IL phase, which links the processed data to target system models, enabling a clearer separation of concerns. This paper presents how ETD has been integrated, tested and validated in a space domain project, currently operational at the European Space Agency for the Galileo Mission.

License: CC BY-NC-ND 4.0

Paper citation in several formats:
Raminhos, R. and Moura-Pires, J. (2007). EXTRACTION AND TRANSFORMATION OF DATA FROM SEMI-STRUCTURED TEXT FILES USING A DECLARATIVE APPROACH. In Proceedings of the Ninth International Conference on Enterprise Information Systems - Volume 3: ICEIS, ISBN 978-972-8865-88-7, pages 199-205. DOI: 10.5220/0002364201990205

@conference{iceis07,
author={R. Raminhos and J. Moura{-}Pires},
title={EXTRACTION AND TRANSFORMATION OF DATA FROM SEMI-STRUCTURED TEXT FILES USING A DECLARATIVE APPROACH},
booktitle={Proceedings of the Ninth International Conference on Enterprise Information Systems - Volume 3: ICEIS},
year={2007},
pages={199--205},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002364201990205},
isbn={978-972-8865-88-7},
}

TY - CONF

JO - Proceedings of the Ninth International Conference on Enterprise Information Systems - Volume 3: ICEIS
TI - EXTRACTION AND TRANSFORMATION OF DATA FROM SEMI-STRUCTURED TEXT FILES USING A DECLARATIVE APPROACH
SN - 978-972-8865-88-7
AU - Raminhos, R.
AU - Moura-Pires, J.
PY - 2007
SP - 199
EP - 205
DO - 10.5220/0002364201990205
ER -
