
 
confirm one of them. If a picture is associated 
with any of these sections, it is displayed to obtain 
further confirmation. If the data entered by the user 
are still insufficient to confirm or rule out a 
category, the suspected categories are presented to 
the user, each with a link to its original section for 
reference. 
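The confirm-or-rule-out step described above can be illustrated with a minimal Python sketch. It is not the system's actual implementation: the `Category` record, its `symptoms` field, and the `triage` and `respond` functions are hypothetical names introduced here, and the sketch assumes that each category is characterized by a set of observations the user can affirm or deny. Picture-based confirmation is not modelled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Category:
    name: str
    symptoms: frozenset  # assumption: observations that characterize the category
    section_url: str     # link back to the section the category came from


def triage(categories, observed, absent):
    """Split candidate categories into confirmed and still-suspected lists."""
    confirmed, suspected = [], []
    for cat in categories:
        if cat.symptoms & absent:
            continue  # ruled out: the user denied one of its symptoms
        if cat.symptoms <= observed:
            confirmed.append(cat)  # every symptom was affirmed by the user
        else:
            suspected.append(cat)  # neither confirmed nor ruled out yet
    return confirmed, suspected


def respond(categories, observed, absent):
    """Return confirmed categories, or suspected ones with reference links."""
    confirmed, suspected = triage(categories, observed, absent)
    if confirmed:
        return "confirmed", [(c.name, c.section_url) for c in confirmed]
    # Not enough data to decide: show what is still suspected, with links.
    return "suspected", [(c.name, c.section_url) for c in suspected]
```

Calling `respond` with only partial observations returns every category that has not been ruled out, each paired with the link to its source section, which mirrors the fallback behaviour described in the text.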
9 CONCLUSION 
The objective of our research is to help Web users 
quickly and easily find the answer to a diagnostic 
question in the specific section(s) of a given 
document set. To achieve this goal, we have 
constructed a web mining technique that extracts 
information from the web and creates knowledge 
from it. Our system has been built for the 
agricultural domain: it extracts information from 
related web pages and indexes their diagnostic 
sections. The constructed index is then used to find 
knowledge relevant to a user query. 
Our system has three main phases: the 
categorization phase, the indexing phase, and the 
search phase. The categorization phase has been 
tested on a training set of web pages, which is a 
collection of extension documents. It automatically 
generated 100 main categories, 145 subcategories, 
and 127 sub-subcategory items. These categories 
are used by the indexing component to assign a 
category, where possible, to each section of an 
input web page. The indexing and search phases are 
still under construction. Some problems also remain 
to be solved, such as inheritance from more than one 
category and synonymous words used across the 
content of different web pages. 
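To make the indexing step concrete, the following Python sketch shows one possible way to assign a category to each section and to build an index from categories to sections. It is only an illustration under stated assumptions: the keyword sets per category, the section identifiers, and the functions `tokenize`, `assign_category`, and `build_index` are hypothetical, and simple keyword overlap stands in for whatever matching the indexing component actually uses.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lower-case the text and return its set of alphabetic words."""
    return set(re.findall(r"[a-z]+", text.lower()))


def assign_category(section_text, category_keywords):
    """Return the best-matching category name, or None if nothing matches."""
    words = tokenize(section_text)
    best, best_score = None, 0
    for name, keywords in category_keywords.items():
        score = len(words & keywords)  # assumption: score by keyword overlap
        if score > best_score:
            best, best_score = name, score
    return best


def build_index(sections, category_keywords):
    """Map each category name to the ids of the sections assigned to it."""
    index = defaultdict(list)
    for section_id, text in sections.items():
        category = assign_category(text, category_keywords)
        if category is not None:  # sections with no match stay unindexed
            index[category].append(section_id)
    return dict(index)
```

Because this sketch matches exact words only, two pages that describe the same condition with different vocabulary would be indexed differently, which is an instance of the synonym problem noted above.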
ICEIS 2005 - ARTIFICIAL INTELLIGENCE AND DECISION SUPPORT SYSTEMS