
 
without demanding manual updating on the source 
code. The ArchCollect software current version 
presents a significant improvement to the interaction 
pattern and to the monitored application markup 
language implementation, when compared to the 
previous version (Lima, 2003). 
A direct interaction acquisition mechanism can 
generate fast and accurate n-dimensional 
information cube used as a collaborative filtering 
algorithm input, to establish some patterns based on 
usage, layout, content and performance focus. None 
of the related works collect the element page 
directly, what compromises the accuracy, making 
future preprocessing necessary. 
The acquisition mechanism compatibility to the 
Netscape© browser is not complete. It cannot 
capture the end of the session in Netscape© 
browsers, because these assume that regions b and c 
form a single region. For this browser type only the 
interactions in region b are captured. 
A possible limitation, not only to the ArchCollect 
acquisition mechanism, but to every Web usage 
mining tool that collects data from the user, is the 
risk of the user disabling the monitoring in his/her 
browser. That is valid for tools that use cookies, Java 
applets or any plugin to be installed. 
The collecting mechanism can be extended to 
collect sounds and facial images produced by users, 
and also to collect defined interactions as XML 
elements/metatags from distributed systems.  
Finally, it is necessary to extend the acquisition 
mechanism compatibility, which is so far limited to 
the Internet Explorer© and Netscape© browsers. 
REFERENCES 
 
Etzioni, O., 1999. The world wide web: Quaqmire or gold    
mine.  Communications of the ACM, 39(11):65-68. 
Shahabi, C., Banaei-Kashani, F., Faruque, J., 2001. A 
reliable, eficient, and scalable system for web usage 
data acquisition. WebKDD'01 Workshop, ACM-
SIGKDD 2001, São Francisco, CA. 
Spiliopoulou, M., 2000. Web usage mining for site 
evolution: Making a site better fit its users. Special 
Section of the Communications of ACM on 
“Personalization Technologies with Data Mining”, 
43(8):127-134, August, 2000. 
Perkowitz, M., and Etzioni, O., 2000. Toward adaptive 
Web sites: conceptual framework and case study. 
Artificial Intelligence 118, p.p245-275, 2000. 
Buchner, A.G., and Mulvenna, M.D., 1998. Discovering 
Internet Marketing Intelligence through Online 
Analytical Web Usage Mining. ACM SIGMOD 
Record, ISSN 0163-5808, Vol. 27, No.4, p.p 54-61, 
1998. 
Sarwar, B.M., Karypis, G., Kostan, J.A., and Riedl, R., 
2000. Analysis of Recommender Algorithms for E-
Commerce. ACM E-Commerce’00 Conference. 
October, 2000. 
Srivastava, J.,  Cooley, R., Deshpande, M., Tan, P., 2000. 
Web usage minig : Discovery and applications of 
usage patterns from web data, SIGKDD, January, 
2000. 
Cooley, R., Tan, P., Srivastava, J., 2000. Discovery of 
Interesting Usage Patterns from Web Data. Advances 
in Web Usage Analysis and User Profiling, Lecture 
Notes in Computer Science, Vol. 1836, Springer-
Verlag, 2000. 
Gomory, S., Hoch, R., Lee, J., Poldlaseck, M., Schonberg, 
E., 1999. Ecommerce Intelligence : Measuring, 
Analyzing, and Reporting on Merchandising 
Effectiveness of Online Stores, IBM Watson Research 
Center. 
Wu, K., Yu, P.S., and Ballman, A., 1999. SpeedTracer: A 
web usage mining and analysis tool. IBM Systems 
Journal, 37(1), 1999. 
Zaiane, O.R., Xin, M., Han, J., 1999. Discovering Web 
Access Patterns and Trends by Applying OLAP and 
Data Mining Technology on Web Logs, Proc. of 
Advances in Digital Libraries Conference, 1999. 
Ackerman M. D., et al. 1997. “Learning Probabilistic user 
profiles: Applications to finding interesting Web sites, 
notifying users of relevant changes to the Web pages, 
and locating grant opportunities”. AI Magazine 18(2) 
47-56, 1997. 
Lieberman H., 1995. “ Letizia:  An agent that assists Web 
browsing”. Proceedings of the international joint 
conference on Artificial Intelligence, Montreal, 
August 1995. 
Alladvantage - http://www.alladvantage.com 
NetZero - http://www.netzero.com 
Lee, J., Lee, H.S., Wang, P., 2001. Design and 
Implementation of a Visual Online Product Catalog 
Interface. ICEIS (2) 2001: 1010-1017. 
Ghani, R., Fano, A., 2002. Towards Semantic Data 
Mining: Creating and Using a Knowledge Base of 
Product Semantics, KDD 2002, Edmonton, Canada. 
Menascé, Daniel A. & Almeida, Virgilio A.F., "Capacity 
Planning for WEB Performance - Metrics, Models & 
Methods", Prentice Hall, PTR, 1998. 
Lima J.C., Carneiro T.G.S., Pagliares R.M., et. al, 2003. 
ArchCollect: A set of Components directed towards 
web users' interaction,  ICEIS 2003 Conference, 
Angers - France. 
Lima J.C., Carneiro T.G.S., Esmin, A.A.A., 2003. 
ArchCollect: concepts of na architecture that offers 
services for analyzing and understanding Web users’ 
interactions. Conexão Ciência Magazine– Year I, n° 
01, pp. 39-46 - Formiga – Brazil – 2003. 
Kimball, R., 2002. Data Webhouse Toolkit. Editora 
Campus. 
ICEIS 2004 - SOFTWARE AGENTS AND INTERNET COMPUTING
262