Combined Mining of Web Server Logs and Web Contents for Classifying User Navigation Patterns and Predicting Users' Future Requests

Authors: 

Haibin Liu

Author Addresses: 

Faculty of Computer Science
Dalhousie University
6050 University Ave.
PO Box 15000
Halifax, Nova Scotia, Canada
B3H 4R2

Abstract: 

With the explosive growth of knowledge available on the World Wide Web, it becomes much more difficult for users to access relevant information efficiently and it also presents a challenging task for web designers to organize site contents to meet the needs of users. Automatic classification of user navigation patterns provides a solution to these problems. In this report, we propose a novel approach to classifying user navigation patterns and predicting users' future requests using the N-gram-based user navigation profiles extracted by combined mining of Web server logs and web contents. We have applied the approach to build an experimental system. The performance of the system is evaluated based on both classification and prediction accuracy. Our system achieves the classification accuracy of nearly 70% and the prediction accuracy of about 65% which is exceedes some published results of the state-of-the-art systems. This approach may be used to facilitate better web personalization and website organization.

Tech Report Number: 
CS-2005-14
Report Date: 
July 19, 2005
AttachmentSize
PDF icon CS-2005-14.pdf1.69 MB