ONTOLOGICAL WAREHOUSING ON SEMANTICALLY INDEXED DATA - Reusing Semantic Search Engine Ontologies to Develop Multidimensional Schemas

Filippo Sciarrone, Paolo Starace

2009

Abstract

In this article we present a first experimentation of a Business Intelligence solution to dynamically develop multidimensional OLAP schemas through a reuse of ontologies, stored in concept and relations dictionaries and used by semantic indexing engines. The particular aspect of the proposed solution consists in the integration of semantic indexing techniques of non-structured documents, based on ontologies, with dynamic management techniques of unbalanced hierarchies in a Data Warehouse. As a case study, we embedded our solution into a real system, built for the analysis and management of experts’ curricula in an e-government environment. We show how it is possible to automatically build OLAP dimensions, inheriting the hierarchic structure of ontologies, with the goal of using the semantically indexed data to carry out multidimensional OLAP analyses. The first experimental results are encouraging.

References

  1. Critchlow, T., Ganesh, M., and Musick, R. (1998). Automatic generation of warehouse mediators using an ontology engine. In Borgida, A., Chaudhri, V. K., and Staudt, M., editors, KRDB, volume 10 of CEUR Workshop Proceedings, pages 8.1-8.8. CEUR-WS.org.
  2. Golfarelli, M., Maio, D., and Rizzi, S. (1998). Conceptual design of data warehouses from e/r schema. In HICSS 7898: Proceedings of the Thirty-First Annual Hawaii International Conference on System Sciences-Volume 7, page 334, Washington, DC, USA. IEEE Computer Society.
  3. Kimball, R. and Caserta, J. (2004). The Datawarehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming and Delivering Dasta. Wiley.
  4. Kimball, R., Reeves, L., Thornthwaite, W., Ross, M., and Thornwaite, W. (1998). The Data Warehouse Lifecycle Toolkit: Expert Methods for Designing, Developing and Deploying Data Warehouses with CD Rom. John Wiley & Sons, Inc., New York, NY, USA.
  5. Kimball, R. and Ross, M. (2002). The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling (Second Edition). Wiley.
  6. Simitsis, A., Skoutas, D., and Castellanos, M. (2008). Natural language reporting for etl processes. In DOLAP 7808: Proceeding of the ACM 11th international workshop on Data warehousing and OLAP, pages 65-72, New York, NY, USA. ACM.
  7. Skoutas, D. and Simitsis, A. (2006). Designing etl processes using semantic web technologies. In DOLAP 7806: Proceedings of the 9th ACM international workshop on Data warehousing and OLAP, pages 67-74, New York, NY, USA. ACM.
  8. Song, I.-Y., yeol Song, I., Medsker, C., Ewen, E., and Rowen, W. (2001). An analysis of many-to-many relationships between fact and dimension tables in dimensional modeling. In Proc. of the Intl Workshop on Design and Management of Data Warehouses, pages 6-1.
  9. Toivonen, S. and Niemi, T. (2004). Describing Data Sources Semantically for Facilitating Efficient Creation of OLAP Cubes. In Poster Proceedings of the Third Interntional Semantic Web Conference.
Download


Paper Citation


in Harvard Style

Sciarrone F. and Starace P. (2009). ONTOLOGICAL WAREHOUSING ON SEMANTICALLY INDEXED DATA - Reusing Semantic Search Engine Ontologies to Develop Multidimensional Schemas . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009) ISBN 978-989-674-011-5, pages 315-318. DOI: 10.5220/0002307103150318


in Bibtex Style

@conference{kdir09,
author={Filippo Sciarrone and Paolo Starace},
title={ONTOLOGICAL WAREHOUSING ON SEMANTICALLY INDEXED DATA - Reusing Semantic Search Engine Ontologies to Develop Multidimensional Schemas},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)},
year={2009},
pages={315-318},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002307103150318},
isbn={978-989-674-011-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)
TI - ONTOLOGICAL WAREHOUSING ON SEMANTICALLY INDEXED DATA - Reusing Semantic Search Engine Ontologies to Develop Multidimensional Schemas
SN - 978-989-674-011-5
AU - Sciarrone F.
AU - Starace P.
PY - 2009
SP - 315
EP - 318
DO - 10.5220/0002307103150318