Maryam Hazman, Samhaa R. El-Beltagy, Ahmed Rafea, Salwa El-Gamal


The World Wide Web is a rich resource of information and knowledge. Within this resource, finding relevant answers to some given question is often a time consuming activity for a user. In the presented work we construct a web mining technique that can extract information from the web and create knowledge from it. The extracted knowledge can be used to respond more intelligently to user requests within the diagnosis domain. Our system has three main phases namely: a categorization phase, an indexing phase, and search a phase. The categorization phase is concerned with extracting important words/phrases from web pages then generating the categories included in them. The indexing phase is concerned with indexing web page sections. While the search phase interacts with the user in order to find relevant answers to their questions. The system was tested using a training web pages set for the categorization phase. Work in the indexing and search phase is still in going.


  1. Borges, J. and Levene, M., 1999. Data mining of user navigation patterns, In Web Usage Analysis and User Profiling, vol. 1836, pp. 92-111.
  2. Chen, H. and Chau, M., 2004. Web Mining: Machine Learning for Web Applications. In the Annual Review of Information Science and Technology, vol. 38, pp. 289-329.
  3. Doherty, P., 2000. Web Mining - The E-Tailer's Holy Grail. In DM Direct.
  4. El-Beltagy, S. R., Rafea, A. and Abdelhamid, Y., 2004. Using Dynamically Acquired Background Knowledge For Information Extraction And Intelligent Search. In M. Mohammadian, (Ed.) Intelligent Agents for Data Mining and Information Retrieval, Idea Group Publishing, Hershey, PA, USA, pp. 196-207.
  5. Guan, T. and Wong, K., 1999. KPS: a Web information mining algorithm. In Proceedings 8th Int. World Wide Web Conf., Canada, pp. 417-429.
  6. Hsu, J., 2002. Web Mining: A Survey of World Wide Web Data Mining Research and Applications. In Decision Sciences Institute Annual Meeting Proceedings, PP. 753-758.
  7. Kosala, R. and Blockeel, H., 2000. Web Mining Research: A Survey. In SIGKDD Explorations, vol. 2, no. 1,pp 1-15.
  8. Liu, B., Chin, Ch. W. and Ng, H. T., 2003. Mining TopicSpecific Concepts and Definitions on the Web, In Proceedings of the twelfth international World Wide Web conference (WWW-2003), Budapest, Hungry, pp. 20-24.
  9. Loh, S., Wives, L. K. and de Oliveira, J. P. M., 2000. Concept-Based Knowledge Discovery. In Texts Extracted from the Web SIGKDD Explorations, vol. 2, no. 1, pp. 29-39.
  10. Madria, S.K., Bhowmick, S.S., Ng, W.K. and Lim, E.P., 1999. Research issues in web data mining. in Proceedings 1st International Conf. On Data Warehousing and Knowledge Discovery Florence Italy, PP. 303-312.
  11. Pal, S., Talwar, V., and Mitra, P., 2002. Web Mining in Soft Computing Framework: Relevance, State of the Art and Future Directions. IEEE Trans. on Neural Networks, 13(5):1163 -1177, 2002.
  12. Rafea, A. and Shaalan, K.,1993. Lexical Analysis of An Inflected Arabic Word Using Exhaustive Search of an Augmented Transition Network, In Software Practice & Experience, vol. 23, no. 6, pp. 567-588.
  13. Scime, A., 2004. Guest Editor's Introduction: Special Issue on Web Content Mining. In Journal of Intelligent Information Systems, vol. 22, no. 3, pp. 211-213.
  14. Xu, J., Huang, Y. and Madey, G., 2003. A Research Support System Framework for Web Data mining Research. In Workshop on Applications, Products and Services of Web-based Support Systems at the Joint International Conference on Web Intelligence (2003 IEEE/WIC) and Intelligent Agent Technology, Halifax, Canada, October 2003, 37-41.
  15. Zaiane, O. R., 1999. Resource and Knowledge Discovery from the Internet and Multimedia Repositories, Ph.D. thesis, Simon Fraser University.

Paper Citation

in Harvard Style

Hazman M., R. El-Beltagy S., Rafea A. and El-Gamal S. (2005). KNOWLEDGE DISCOVERY FROM THE WEB . In Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 972-8865-19-8, pages 303-308. DOI: 10.5220/0002547103030308

in Bibtex Style

author={Maryam Hazman and Samhaa R. El-Beltagy and Ahmed Rafea and Salwa El-Gamal},
booktitle={Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 2: ICEIS,},

in EndNote Style

JO - Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 2: ICEIS,
SN - 972-8865-19-8
AU - Hazman M.
AU - R. El-Beltagy S.
AU - Rafea A.
AU - El-Gamal S.
PY - 2005
SP - 303
EP - 308
DO - 10.5220/0002547103030308