Prefetching and caching documents according to probability ranked need S list
Intelligent method, apparatus and computer program product for automated refreshing of internet web pages
System and method for focussed web crawling
Taxonomy generation for document collections Patent #: 6446061
Abstract
A method for estimating an association between media objects and a seed Web page accessed by a user. The method is employed in the context of a Web space on a network having Web pages and links between those Web pages modeled as a directed graph. Each Web page comprises a set of media objects and has a page author. For each object, a size, a user preference and a page author preference are determined. The network has an available pre-fetch bandwidth. The method calculates a weight for each Web object by applying preference rules, defined by the user preference and the page author preference, to the contents of the set of media objects. Next, a random walk graph is generated, and object gains are calculated by finding a steady state distribution of the random walk graph. The object gain represents an association between the object and the seed Web page.
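The final two steps (building a random walk graph and reading object gains off its steady state distribution) can be illustrated with a minimal sketch. It is not the patented procedure. The abstract does not specify the preference rules, the graph construction, or how object size and pre-fetch bandwidth enter, so the following are assumptions made only for illustration: a weight is the product of user preference and page author preference, the walker restarts at the seed page with a fixed probability, and a walker that reaches an object with no outgoing links returns to the seed. All names (`object_weight`, `steady_state`, `restart`, the example nodes) are hypothetical.

```python
def object_weight(user_pref, author_pref):
    # Assumed preference rule: combine the two preferences by multiplication.
    return user_pref * author_pref


def steady_state(transitions, seed, restart=0.15, iters=500, tol=1e-12):
    """Approximate the steady state distribution by power iteration.

    transitions maps each node to a dict {neighbour: weight}. Every
    neighbour must also appear as a key. The restart-to-seed step is an
    assumption; it keeps the chain aperiodic and anchored at the seed.
    """
    nodes = list(transitions)
    dist = {n: 0.0 for n in nodes}
    dist[seed] = 1.0  # the walker starts on the seed page
    for _ in range(iters):
        nxt = {n: 0.0 for n in nodes}
        for src, mass in dist.items():
            out = transitions[src]
            total = sum(out.values())
            if total == 0:
                nxt[seed] += mass  # no outgoing links: return to the seed
                continue
            nxt[seed] += restart * mass
            for dst, w in out.items():
                nxt[dst] += (1.0 - restart) * mass * w / total
        delta = sum(abs(nxt[n] - dist[n]) for n in nodes)
        dist = nxt
        if delta < tol:
            break
    return dist


# Toy Web space: a seed page with two media objects and a link to page p2,
# which carries the same video object. Preference values are made up.
transitions = {
    "seed": {
        "img": object_weight(3.0, 1.0),
        "video": object_weight(1.0, 1.0),
        "p2": 1.0,
    },
    "p2": {"video": object_weight(2.0, 1.0)},
    "img": {},
    "video": {},
}

gains = steady_state(transitions, seed="seed")
for name in ("img", "video"):
    print(name, round(gains[name], 4))
```

Under these assumptions, an object reachable along heavily weighted paths from the seed accumulates more probability mass, which is the sense in which the gain reflects association with the seed page. A pre-fetcher could then rank objects by gain and fill the available bandwidth in that order, though the abstract does not state that selection rule.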