A TREE BASED ALGEBRA FRAMEWORK FOR XML DATA SYSTEMS

Ali El bekai, Nick Rossiter

Abstract

This paper introduces a framework in algebra for processing XML data. We develop a simple algebra, called TA (Tree Algebra), for processing storing and manipulating XML data, modelled as trees. We present assumptions of the framework, describe the input and the output of the algebraic operators, and define the syntax of these operators and their semantics in terms of algorithms. Furthermore we define the relational operators and their semantics in terms of algorithms. Examples show that this framework is flexible to capture queries expressed in the domain specific XML query language. As can be seen the input and output of our algebra is a tree that is the input and output are XML document and the XML documents are defined as trees. We also present algorithms for many of the algebra operators; these algorithms show how the algebra operators such as join, union, complement, project, select, expose and vertex work on nodes of the XML tree or element and attributes of an XML document. Detailed examples are given.

References

  1. Beech, D, Malhotra A, & Rys, M, (eds.) A formal data model and algebra for XML, Comm W3C (1999).
  2. Bourret, Ronald. XML and Database (2004) at: http://www.rpbourret.com/xml/XMLAndDatabases.ht m
  3. Christophides, V, Cluet, S & Simeon, J, On wrapping, query languages and efficient XML integration, ACM SIGMOD Conf Management Data, Dallas 141-152, May (2000).
  4. CIDOC Group (July 2002) http://www.willpowerinfo.myby.co.uk/cidoc/cidoc0.ht m
  5. Codd, E F, Relational Completeness of Data Base, Data Base Systems, Prentice Hall 6 65-98. (1972).
  6. Comon, H, Dauchet, M, Gilleron, R, Jacquemand, F, Lugiez, D, Tison, S, & Tommasi, S, Tree Automata Techniques and Applications, at: http://www.grappa.lille3.fr/tata. (1997)
  7. Fenkhauser, M, Simeon, J, & Woder, P, An algebra for XML Query, In Proc. FST TCS, New Delhi, December (2000).
  8. Fernandez, M, Simeon, J, & Wadler, P, A semi-monad for semi-structured data, Int Conf Database Theory 263- 300 (2001).
  9. Galanis, L, Viglas, E, DeWitt, D J, Naughton, J F, & Maier, D, Following the paths of XML Data: An Algebraic Framework for XML Query Evaluation, Tech Rep Univ Wisconsin (2001).
  10. Greenwald, M B, Moore, J T, Pierce, B C, Schmitt, A: Language for Bi-Directional Tree Transformations. Tech Rep MS-CIS-03-08, Dept Comp Inf Sci, Univ Pennsylvania. (Aug 2003).
  11. ICOM, International Guidelines for Museum Object Information (IGMO): CIDOC Information Categories (October (1995) http://www.cidoc.icom.org/guide
  12. McHugh, J, Abiteboul, S, Goldman, R, Quass, D, & Widom, J, Lore: A Database Management System for Semi-structured Data. SIGMOD 3(26) 54-66 (1997).
  13. McHugh, J, & Widom, J, Query optimization for Semistructured data, Tech Rep, Stanford Univ Database Group, August (1998). http://www-db.standford.edu/pub/papers/qo.ps.
  14. Roth, M A, Korth, H F, & Silberschatz, A, Extended algebra and calculus for nested relational databases, ACM TODS 13 389-417 (1988).
  15. Scholl, M H, Theoretical foundations of algebraic optimization utilization unnormalized relation, in: ICDT'86, LNCS 234 409-420. (1986).
  16. W3C, the XML Query Algebra, Working Draft, http://www.w3.org/TR/2001/WD-query-algebra20010215. February (2001).
  17. W3C, XML Schema: Formal Description, Working Draft, September (2001).
  18. W3C, XQuery 1.0: An XML Query Language, Working Draft (23 July 2004) http://www.w3.org/TR/2004/WD-xquery-20040723/
  19. Zhang, X. and Rundensteiner, E A, XML Algebra for the Rainbow System, Tech Rep WPI-CS-TR-02-24. Worcester Polytechnic Inst, July (2002).
  20. Zisman, A, An Overview of XML, Comp Control Eng J 11(4) (2000).
Download


Paper Citation


in Harvard Style

El bekai A. and Rossiter N. (2005). A TREE BASED ALGEBRA FRAMEWORK FOR XML DATA SYSTEMS . In Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 972-8865-19-8, pages 305-312. DOI: 10.5220/0002528403050312


in Bibtex Style

@conference{iceis05,
author={Ali El bekai and Nick Rossiter},
title={A TREE BASED ALGEBRA FRAMEWORK FOR XML DATA SYSTEMS},
booktitle={Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2005},
pages={305-312},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002528403050312},
isbn={972-8865-19-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - A TREE BASED ALGEBRA FRAMEWORK FOR XML DATA SYSTEMS
SN - 972-8865-19-8
AU - El bekai A.
AU - Rossiter N.
PY - 2005
SP - 305
EP - 312
DO - 10.5220/0002528403050312