Back to Search Start Over

Predicting Web Users' Next Access Based on Log Data

Authors :
Rituparna Sen
Mark Hansen
Source :
Journal of Computational and Graphical Statistics. 12:143-155
Publication Year :
2003
Publisher :
Informa UK Limited, 2003.

Abstract

This article considers models that describe how people browse the Web. We restrict our attention to navigation patterns within a single site, and base our study on standard Web server access logs. Given a visitor's previous activities on the site, we propose models that predict their next page request. If the prediction is reasonably accurate, we might consider “prefetching” the page before the visitor requests it. A more conservative use for such predictions would be to simply update the freshness records in a proxy or network cache, eliminating unnecessary If-Modified-Since requests. Using data from the Web site for the Computing and Mathematical Sciences Research Division of Lucent Technologies (cm.bell-labs.com) we first evaluate the predictive performance of low-order Markov models. We next consider mixtures of first-order Markov models, achieving a kind of clustering of Web pages in the site. This approach is shown to perform well, while significantly reducing the space required to store the model. ...

Details

ISSN :
15372715 and 10618600
Volume :
12
Database :
OpenAIRE
Journal :
Journal of Computational and Graphical Statistics
Accession number :
edsair.doi...........65131c3cb06a8d89febf6ecc753e5e2f