U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Classification of data records by comparison of records to a training database using probability weights

Patent 5251131 Issued on October 5, 1993. Estimated Expiration Date: Icon_subject July 31, 2011. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.

Patent References

Parallel processor
Patent #: 4814973
Issued on: 03/21/1989
Inventor: Hillis

Text search system
Patent #: 4823306
Issued on: 04/18/1989
Inventor: Barbic ,   et al.

Method and system for generating lexicon of cooccurrence relations in natural language Patent #: 4942526
Issued on: 07/17/1990
Inventor: Okajima, et al.

Inventors

Assignee

Application

No. 739111 filed on 07/31/1991

US Classes:

704/9Natural language

Examiners

Primary: Envall, Roy N. Jr.
Assistant: Poinvil, Frantzy

Attorney, Agent or Firm

International Classes

G06F 015/38
G01L 001/06

Abstract

Classification of natural language data wherein the natural language data has an open-ended range of possible values or the data values do not have a relative order. A training database stores training records, wherein each training record includes predictor data fields. Each predictor data field containes a feature, wherein each feature is a natural language term, and a target data field containing a target value representing a classification of the record. Features may also include conjunctions of natural language terms and each feature may also be a member of a category subset of features. The training database stores, for each feature, a probability weight value representing the probability that a record will have the target value contained in the target data field if a feature contained in a corresponding predictor data field occurs in the record. Features are extracted from a new record and each feature from the new record is used to query the training records to determine the probability weights from the training records having matching features. The probability weights are accumulated for each training record to determine a comparison score representing the probability that the training record matches the new record and provide an output indicating the training records most probability matching the new record.

PatentsPlus Images
Enhanced PDF formats
loading...
PatentsPlus: add to cart
PatentsPlus: add to cartSearch-enhanced full patent PDF image
$9.95more info
PatentsPlus: add to cart
PatentsPlus: add to cartIntelligent turbocharged patent PDFs with marked up images
$16.95more info
 
Sign InRegister
Username  
Password   
forgot password?