Patent References
Identifying duplicate documents from search results without comparing document content
Real-time document collection search engine with phrase indexing
6263329
Light weight document matcher
Patent #: 6286000

Inventor

Application
No. 771677 filed on 01/30/2001

US Classes:
709/219, Accessing a remote server
707/3, Query processing (i.e., searching)
707/7, Sorting

Examiners
Primary: Vu, Viet D.

Attorney, Agent or Firm

International Class
G06F 013/00

Abstract
A search engine for searching a corpus improves the relevancy of the results by refining a standard relevancy score based on the interconnectivity of the initially returned set of documents. The search engine obtains an initial set of relevant documents by matching a user's search terms to an index of a corpus. A re-ranking component in the search engine then refines the initially returned document rankings so that documents that are frequently cited in the initial set of relevant documents are preferred over documents that are less frequently cited within the initial set.

Other References