U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Icon_funbox Quotables

"What, sir, would you make a ship sail against the wind and currents by lighting a bonfire under her deck? I pray you, excuse me, I have not the time to listen to such nonsense."

Napoleon Bonaparte ; When told of the Robert Fulton steamboat

Newsletter  PatentStorm News

Make the Most of PatentStorm

See this month's Top Inventors and Most Cited Patents.

Stay on top of the latest patents by subscribing to an RSS feed.

Got questions? Ask a Patent Expert!

Registered users: Manage your profile, comments and alerts.

 

US Patent 6990628 - Method and apparatus for measuring similarity among electronic documents

US Patent Issued on January 24, 2006
Estimated Patent Expiration Date: Icon_subject June 14, 2019Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.
loading...


View Patent Images (PDF)
(Registered users only)

Abstract

A method and apparatus are provided for determining when electronic documents stored in a large collection of documents are similar to one another. A plurality of similarity information is derived from the documents. The similarity information may be based on a variety of factors, including hyperlinks in the documents, text similarity, user click-through information, similarity in the titles of the documents or their location identifiers, and patterns of user viewing. The similarity information is fed to a combination function that synthesizes the various measures of similarity information into combined similarity information. Using the combined similarity information, an objective function is iteratively maximized in order to yield a generalized similarity value that expresses the similarity of particular pairs of documents. In an embodiment, the generalized similarity value is used to determine the proper category, among a taxonomy of categories in an index, cache or search system, into which certain documents belong.

Other References

  • Hermann Kaindl, Stefan Kramer, Luis Miguel Afonso, Combining Structure Search and Content Search for the World-Wide Web, Proceedings of the Seventh ACM Conference on Hypertext and Hypermedia, pp. 217-224 (1998).
  • Ron Weiss, Bienvenido VĂ©lez, Mark A. Sheldon, HyPursuit: A Hierarchical Network Search Engine That Exploits Content-Link Hypertext Clustering, Proceedings of the Seventh ACM Conference on Hypertext, pp. 180-193 (1996).

Inventors

Assignee

Application

No. 09333121 filed on 06/14/1999

US Classes:

715/500, PRESENTATION PROCESSING OF DOCUMENT715/501.1, Hypermedia707/3, Query processing (i.e., searching)707/6Pattern matching access

Field of Search

707/3, Query processing (i.e., searching)707/6Pattern matching access

Examiners

Primary: Bashore, William

Attorney, Agent or Firm

US Patent References

5745893, Process and system for arrangement of documents
Issued on: 04/28/1998
Inventor: Hill, et al.
5799292, Adaptive hypermedia presentation method and system
Issued on: 08/25/1998
Inventor: Hekmatpour
5835905, System for predicting documents relevant to focus documents by spreading activation through network representations of a linked collection of documents
Issued on: 11/10/1998
Inventor: Pirolli, et al.
5895470, System for categorizing documents in a linked collection of documents
Issued on: 04/20/1999
Inventor: Pirolli, et al.
5920859, Hypertext document retrieval system and method
Issued on: 07/06/1999
Inventor: Li
5931907, Software agent for comparing locally accessible keywords with meta-information and having pointers associated with distributed information
Issued on: 08/03/1999
Inventor: Davies, et al.
5960422, System and method for optimized source selection in an information retrieval system
Issued on: 09/28/1999
Inventor: Prasad
6052676, Adaptive hypermedia presentation method and system
Issued on: 04/18/2000
Inventor: Hekmatpour
6112203, Method for ranking documents in a hyperlinked environment using connectivity and selective content analysis
Issued on: 08/29/2000
Inventor: Bharat, et al.
6115718, Method and apparatus for predicting document access in a collection of linked documents featuring link proprabilities and spreading activation
Issued on: 09/05/2000
Inventor: Huberman, et al.
6128606Module for constructing trainable modular network in which each module inputs and outputs data structured as a graph
Issued on: 10/03/2000
Inventor: Bengio, et al.

International Classes

G06F 15/00
G06F 17/00
G06F 17/21

Comments

No comments for this page
 
 
Forgot password?
Register here