Towards a Metrics Suite for Conceptual Models of Datawarehouses

Manuel Serrano, Coral Calero, Sergio Luján, Mario Piattini

2004

Abstract

Most organizations now rely on datawarehouses as one of their principal assets for the efficient management of information. Because datawarehouses have become the principal tool for strategic decision making, it is vital to guarantee the quality of the information stored in them. The quality of the information depends on the quality of its presentation and on the quality of the datawarehouse. The latter includes the quality of the multidimensional model at the conceptual, logical, and physical levels. Over recent years we have proposed and validated several metrics for evaluating the complexity of the multidimensional star model (at the logical level). In this article we present an initial proposal of metrics for the multidimensional model at the conceptual level, together with their theoretical validation.


Paper Citation


in Harvard Style

Serrano M., Calero C., Luján S. and Piattini M. (2004). Towards a Metrics Suite for Conceptual Models of Datawarehouses. In Proceedings of the 1st International Workshop on Software Audits and Metrics - Volume 1: SAM, (ICEIS 2004) ISBN 972-8865-04-X, pages 105-117. DOI: 10.5220/0002675201050117


in Bibtex Style

@conference{sam04,
author={Manuel Serrano and Coral Calero and Sergio Luján and Mario Piattini},
title={Towards a Metrics Suite for Conceptual Models of Datawarehouses},
booktitle={Proceedings of the 1st International Workshop on Software Audits and Metrics - Volume 1: SAM, (ICEIS 2004)},
year={2004},
pages={105-117},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002675201050117},
isbn={972-8865-04-X},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 1st International Workshop on Software Audits and Metrics - Volume 1: SAM, (ICEIS 2004)
TI - Towards a Metrics Suite for Conceptual Models of Datawarehouses
SN - 972-8865-04-X
AU - Serrano M.
AU - Calero C.
AU - Luján S.
AU - Piattini M.
PY - 2004
SP - 105
EP - 117
DO - 10.5220/0002675201050117