DESIGNING TEMPLATES FOR MINING ASSOCIATION RULES FROM XML DOCUMENTS

F. H. Ismail, H. K. Mohammed, M. A. Ismail, I. EL-Maddah

2007

Abstract

Nowadays, some information is semi-structured. The main characteristic of semi-structured data (XML) is that they have irregular structure. There is no distinction between data and structure. Even though, it is quite common that semi-structured objects representing the same sort of information have similar, though not identical, structure (pattern). Previous work has introduced templates for mining association rules from XML based on prior knowledge about the structure of the XML document. If the users do not have any knowledge about the structure in advance, what would be their clue in writing templates? In this paper, we introduce a new approach for designing association rule templates based on the automatic discovery of frequent structure in the XML document. Frequent structure serves as a schema built over the semi-structured data. This layer guides the user to the useful structure that might yield useful associations rather than choosing any piece of structure at random. The structured layer is displayed from which the user can select templates of interest. Association rules that comply with the specified templates are generated.

References

  1. Agrawal, R. and Srikant, R., 1994. Fast algorithms for mining association rules in large databases. In Proceedings of 20th International Conference on Very Large Data Bases, Santiago, Chile, September 12-15. pages 487-499.
  2. Baraga, D., Campi, A., Ceri, S., Klemettinen, M. and Lanza, P., 2003. Discovering interesting information in XML data with association rules. In Proc. Of the 18th Symposium on Applied Computing, Florida, USA, March, pp.450-454.
  3. Chaudhuri, S., 1998. Data Mining and Database Systems: Where is the Intersection? In Data Engineering Bulletin, 21(1):4-8.
  4. Feng, L. and Dillon, T., Weigand, H. and Chang, E., 2003. An xml-enabled association rule framework. In Proc. Of the 14th Intl. Conf. on Database and Expert Systems Applications, Prague, Czech Republic, September, pp.88-97.
  5. Feng, L. and Dillon, T., 2004. Mining xml-enabled association rules with templates. In Proc. Of the Intl. Workshop on Knowledge Discovery in Inductive Databases, Pisa, Italy, September, pp.61-72.
  6. Feng, L. and Dillon, T., 2005. an XML-enabled data mining query language XML-DMQL. In Int. J. Business and Data Mining, Vol. 1, No. 1, pp. 22-41.
  7. Fu, Y. and Han, J., 1995 a. Meta-rule-guided mining of association rules in relational databases. In Proc. 1st Int'l Workshop on Integration of Knowledge Discovery with Deductive and Object-Oriented Databases, Singapore, Dec, pages 39-46.
  8. Fu, Y. and Han, J., 1995 b. Exploration of the Power of Attribute Oriented Induction in Data Mining. In Advances in Knowledge Discovery and Data Mining.
  9. Han, J. and Kamber, M., 2001. Data Mining: Concepts and Techniques, Morgan Kaufmann Publishers, San Francisco, California, USA.
  10. Maruyama, K. and Uehara, K., 2000. Mining association rules from semi-structured data. In Proc. Of the ICDCS Workshop of Knowledge Discovery and Data Mining in the World-Wide Web, April, Taipei, Taiwan.
  11. Nayak, R., 2005. Discovering Knowledge from XML Documents. In Wong, John, Eds. Encyclopedia of Data Warehousing and Mining, Idea Group Publications.
  12. Pavón, J., Viana, S. and Gómez, S., 2006. Matrix Apriori: Speeding Up the Search for Frequent Patterns. In Proc. Of the 24th IASTED International Mukticonference on Database and Applications, Innsbruck, Austria.
  13. World Wide Web Consortium, 2002. XQuery 1.0: An XML Query Language, April, http://www.w3.org/TR/xquery-operators/.
Download


Paper Citation


in Harvard Style

H. Ismail F., K. Mohammed H., A. Ismail M. and EL-Maddah I. (2007). DESIGNING TEMPLATES FOR MINING ASSOCIATION RULES FROM XML DOCUMENTS . In Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-972-8865-78-8, pages 214-219. DOI: 10.5220/0001276802140219


in Bibtex Style

@conference{webist07,
author={F. H. Ismail and H. K. Mohammed and M. A. Ismail and I. EL-Maddah},
title={DESIGNING TEMPLATES FOR MINING ASSOCIATION RULES FROM XML DOCUMENTS},
booktitle={Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2007},
pages={214-219},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001276802140219},
isbn={978-972-8865-78-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - DESIGNING TEMPLATES FOR MINING ASSOCIATION RULES FROM XML DOCUMENTS
SN - 978-972-8865-78-8
AU - H. Ismail F.
AU - K. Mohammed H.
AU - A. Ismail M.
AU - EL-Maddah I.
PY - 2007
SP - 214
EP - 219
DO - 10.5220/0001276802140219