U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Method for document retrieval and for word sense disambiguation using neural networks

Patent 5317507 Issued on May 31, 1994. Estimated Expiration Date: Icon_subject May 31, 2011. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.

Patent References

Arrays of machines such as computers
Patent #: 4247892
Issued on: 01/27/1981
Inventor: Lawrence

Multiple-parts-of-speech disambiguating method and apparatus for machine translation system
Patent #: 4661924
Issued on: 04/28/1987
Inventor: Okamoto ,   et al.

Sentence analyzer
Patent #: 4864502
Issued on: 09/05/1989
Inventor: Kucera ,   et al.

Search improvements for electronic spelling machine Patent #: 5113340
Issued on: 05/12/1992
Inventor: McWherter

Inventor

Application

No. 610430 filed on 11/07/1990

US Classes:

715/532Dictionary

Examiners

Primary: Envall, Roy N. Jr.
Assistant: Shingala, Gita D.

Attorney, Agent or Firm

International Class

G06F 015/38

Abstract

A method for storing and searching documents also useful in disambiguating word senses and a method for generating a dictionary of context vectors. The dictionary of context vectors provides a context vector for each word stem in the dictionary. A context vector is a fixed length list of component values corresponding to a list of word-based features, the component values being an approximate measure of the conceptual relationship between the word stem and the word-based feature. Documents are stored by combining the context vectors of the words remaining in the document after uninteresting words are removed. The summary vector obtained by adding all of the context vectors of the remaining words is normalized. The normalized summary vector is stored for each document. The data base of normalized summary vectors is searched using a query vector and identifying the document whose vector is closest to that query vector. The normalized summary vectors of each document can be stored using cluster trees according to a centroid consistent algorithm to accelerate the searching process. Said searching process also gives an efficient way of finding nearest neighbor vectors in high-dimensional spaces.

Other References

  • "Document Retrieval System based on Nearest Neighour Searching" by Lucarella, Dario, 1988, Source: Journal of Information Science
  • Koll, Matthew, B., "WEIRD: An Approach to Concept-Based Information Retrieval", SGIR Forum Vol. 13, No. 4, Spring 1979
  • Wong, S. K. M. et al., "On Modeling of Information Retrieval Concepts in Vector Spaces". ACM Trans. On Database Systems Jun. 1987
  • Ossorio, Peter G., "Classification Space: A Multivariate Procedure For Automatic Document Indexing and Retrieval". Multivariate Behavioral Research, Oct. 1966
  • Computational Models of Cognition and Perception: Parallel Distributed Processing Explorations in the Microstructure of Cognition vol. 2: Psychological and Bioligical Models
  • Cognitive Science 9, 51-74 (1985) Massively Parallel Parsing: A Strongly Interactive Model of Natural Language Interpretation
  • Connectionist Parsing, Garrison W. Wottrell, Department of Computer Science University of Rochester
  • Adaptive Information Retrieval: Using a connectionist representation to retrieve and learn about documents, Richard K. Belew
  • Indexing by Latent Semantic Analysis, Deerwester, Dumais, Furnas, Landauer, Harshman
  • Application of the Interactive Activation Model to Document Retrieval, Bein & Smolensky, University of Colorado at Boulde
PatentsPlus Images
Enhanced PDF formats
loading...
PatentsPlus: add to cart
PatentsPlus: add to cartSearch-enhanced full patent PDF image
$9.95more info
PatentsPlus: add to cart
PatentsPlus: add to cartIntelligent turbocharged patent PDFs with marked up images
$16.95more info
 
Sign InRegister
Username  
Password   
forgot password?