Antonia Bertolino, Jinghua Gao, Eda Marchetti



With XML and XML Schema widely acknowledged as the de facto standard for data exchange and interoperability between remote applications, the need for checking integrity and adequacy of XML documents, also by means of automated tools, increases. In this perspective, this paper addresses two objectives: we provide a classification and a short overview of the diverse existing approaches for the testing of XML-based documents; then, pushing further the potential of XML for testing purposes, we pursuit the application of traditional testing methods to programs using XML input data. We discuss the use of XML and XML schema as a basis for formalizing and automatizing the testing of applications using such kind of data, with particular reference to recent proposals for specification-based and perturbation-based testing approaches.


  1. Anand, S. and Wilde, E. (2005). Mapping xml instances. Shiba, Japan. Fourteenth International World Wide Web Conference (WWW2005).
  2. Bergamaschi, S., Castano, S., and Vincini, M. (1999). Semantic integration of semistructured and structured data sources, page 5459. ACM SIGMOD Record 28(1).
  3. Berlin, J. and Motro, M. (2001). Autoplex: automated discovery of content for virtual databases. In Proc 9th Int Conf OnCooperative Information Systems (CoopIS), volume 2172, page 108122, Berlin Heidelberg NewYork. Springer. Lecture Notes in Computer Science.
  4. Bertolino, A., Gao, J., Marchetti, E., and Polini, A. (2006). Partition testing from xml schema. under submission.
  5. Bertolino, A. and Marchetti, E. (2004). Software Testing, chapter 5. IEEE Computer Society. In Swebok Pierre Bourque and Robet Depuis ed.
  6. Boobna, U. and de Rougemont, M. (2004). Correctors for xml data. In International XML Database Symposium 2004, pages 97-111, Toronto, Canada.
  7. Boukottaya, A., Vanoirbeek, C., and Paganelli, F. (2004). Abou khaled: Automating xml documents transformations: a conceptual modelling based approach. volume 31 table of contents, pages 81 - 90, Dunedin, New Zealand. Proceedings of the first Asian-Pacific conference on Conceptual modelling.
  8. Castano, S., Antonellis, V. D., and diVemercati, S. D. C. (2001). Global viewing of heterogeneous data sources, page 277297. IEEE Trans Data Knowl Eng 13(2).
  9. Doan, A., Domingos, P., and Halevy, A. (2001). Reconciling schemas of disparate data sources: a machinelearning approach. In Proc ACM SIGMOD Conf, page 509520.
  10. Doan, A., Domingos, P., and Levy, A. (2000). Learning source descriptions for data integration. In ProcWebDBWorkshop, pages 81-92.
  11. DTD (1996). Dtd. http://www.w3.org/TR/2000/CR-SVG20001102/svgdtd.html.
  12. EJBSourceGenerator (2003). http://ejbgen.sourceforge.net/.
  13. Goldman, R. and Widom, J. (1997). Dataguides: enabling query formulation and optimization in semistructured databases. In Proc23th Int Conf On Very Large Data Bases, page 436445.
  14. Hagen, L., Harald, L., and Saskia, B. P. (2004). Text type structure and logical document structure. In Webber, B. and Byron, D. K., editors, ACL 2004 Workshop on Discourse Annotation, pages 49-56, Barcelona, Spain. Association for Computational Linguistics.
  15. JavaXMLBindlets (2003). Javaxmlbindlets. http://wwws.sun.com/software/xml/developers /instancegenerator/index.html.
  16. Jeong, E. and Hsu, C. (2001). Induction of Integrated View for XML Data with Heterogeneous DTDs, pages 151- 158. CIKM 2001.
  17. Lee, S. C. and Offutt, J. (2001). Generating test cases for xml-based web component interactions using mutation analysis. In In Proceedings of the 12th International Symposium on Software Reliability Engineering, pages 200-209, Hong Kong China. IEEE Computer Society Press.
  18. Levy, A. (1999). More on data management for xml. http://www.cs.washington.edu/homes/alon/widom response.html. University of Washington.
  19. Li, J. B. and Miller, J. (2005). Testing the Semantics of W3C XML Schema, pages 443 - 448. COMPSAC 2005.
  20. Li, W. and Clifton, C. (1994). Semanticinte gration in heterogeneous databases using neural networks. In Proc20th Int Conf On Very Large Data Bases, page 112.
  21. Li, W. and Clifton, C. (2000). SemInt: a tool for identifying attribute correspondences in heterogeneous databases using neural network, page 49 84. Data Knowl Eng 33(1).
  22. Li, W., Clifton, C., and Liu, S. (2000). Database integration using neural network: implementation and experiences, page 7396. Knowl Inf Syst 2(1).
  23. Meo, P. D., Quattrone, G., Terracina, G., and Ursino, D. (2003). Almost automatic” and semantic integration of xml schemas at various ”severity” levels. pages 4- 21. CoopIS/DOA/ODBASE.
  24. Meo, P. D., Quattrone, G., Terracina, G., and Ursino, D. (2005). An approach for clustering semantically heterogeneous xml schemas. pages 329 - 346. OTM Conferences (1).
  25. (2003). Software diagnostics & conformance testing division: Web technologies.
  26. http://xw2k.sdct.itl.nist.gov/brady/xml/index.asp.
  27. ObjectModelGenerator (2004). Objectmodelgenerator. http://sourceforge.net/projects/omgen.
  28. Offutt, J. and Xu, W. (2004). Generating test cases for web services using data perturbation workshop on testing, analysis and verification of web services. Boston Mass.
  29. Ostrand, T. and Balcer, M. (1988). The category-partition method for specifying and generating functional tests. Communications of ACM, 31(6).
  30. Rahm, E. and Bernstein, P. A. (2001). Survey of approaches to automatic schema matching. VLDB Journal: Very Large Data Bases, 10:334-350.
  31. Shafazand, M. and Tjoa, A. M. (2002). A Levelized Schema Extraction for XML Document Using User-Defined Graphs, page 434441. Number LNCS 2510. EurAsiaICT 2002.
  32. Shanmugasundaran, J., Tufte, K., He, G., Zhang, C., DeWit, D., and Naughton, J. (1999). Relational databases for querying xml documents: Limitations and opportunities. In Proceedings of the 25th VLDB Conference. (2001). Xml schema quality checker.
  33. SunXMLInstanceGenerator (2003). Sun xml instance generator. http://wwws.sun.com/software/xml/developers /instancegenerator/index.html.
  34. Tian, K. B., Bhowmick, S. S., and Sanjay Kumar, M. (2003). VACXENE: A User-Friendly Visual Synthetic XML Generator. Object-Oriented and EntityRelationship Modelling.
  35. W3C (2005). W3c world wide web consortium. http://www.w3.org.
  36. W3CXML (1996). W3cxml. http://www.w3.org/XML/.
  37. W3CXMLSchema (1998). http://www.w3.org/XML/Schema.
  38. W3CXMLValidator (2001). W3c validator for xml schema. http://www.w3.org/2001/03/webdata/xsv.
  39. Wang, Q., Yu, J., and Wong, K. (2000). Approximate graph schema extraction for semi-structured data. In Proc Extending DataBase Technologies, Lecture Notes in Computer Science, volume 1777, page 302316, Berlin Heidelberg NewYork. Springer.
  40. Widom, J. (1999). Data Management for XML, volume 22(3), page 4452. IEEE Data Engineering Bulletin, Special Issue on XML. Working Document, initial draft appeared April 1999.

Paper Citation

in Harvard Style

Bertolino A., Gao J. and Marchetti E. (2006). XML EVERY-FLAVOR TESTING . In Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-972-8865-46-7, pages 268-273. DOI: 10.5220/0001254602680273

in Bibtex Style

author={Antonia Bertolino and Jinghua Gao and Eda Marchetti},
booktitle={Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},

in EndNote Style

JO - Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
SN - 978-972-8865-46-7
AU - Bertolino A.
AU - Gao J.
AU - Marchetti E.
PY - 2006
SP - 268
EP - 273
DO - 10.5220/0001254602680273