U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Method and apparatus for partitioning a database upon a timestamp, support values for phrases and generating a history of frequently occurring phrases

Patent 6308172 Issued on October 23, 2001. Estimated Expiration Date: Icon_subject July 6, 2019. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.

Patent References

Method and system for natural language translation
Patent #: 5768603
Issued on: 06/16/1998
Inventor: Brown, et al.

Method and apparatus for data access and update in a shared file environment
Patent #: 5790848
Issued on: 08/04/1998
Inventor: Wlaschin

Visualization of information using graphical representations of context vector based relationships and attributes
Patent #: 5794178
Issued on: 08/11/1998
Inventor: Caid, et al.

Mapping words, phrases using sequential-pattern to find user specific trends in a text database Patent #: 6006223
Issued on: 12/21/1999
Inventor: Agrawal, et al.

Inventors

Application

No. 348595 filed on 07/06/1999

US Classes:

707/5, Query augmenting and refining (e.g., inexact access)704/4, Based on phrase, clause, or idiom704/8, Multilingual or national language support704/9, Natural language707/2, Access augmentation or optimizing707/6, Pattern matching access707/100, DATABASE SCHEMA OR DATA STRUCTURE707/102, Generating database or data structure (e.g., via user interface)707/203, Version management715/511, Version management715/536Multilingual

Examiners

Primary: Choules, Jack M.
Assistant: Channavajjala, Srirama

Attorney, Agent or Firm

International Class

G06F 017/30

Abstract

A method and apparatus for mining text databases, employing sequential pattern phrase identification and shape queries, to discover trends. The method passes over a desired database using a dynamically generated shape query. Documents within the database are selected based on specific classifications and user defined partitions. Once a partition is specified, transaction IDs are assigned to the words in the text documents depending on their placement within each document. The transaction IDs encode both the position of each word within the document as well as representing sentence, paragraph, and section breaks, and are represented in one embodiment as long integers with the sentence boundaries. A maximum and minimum gap between words in the phrases and the minimum support all phrases must meet for the selected time period may be specified. A generalized sequential pattern method is used to generate those phrases in each partition that meet the minimum support threshold. The shape query engine takes the set of phrases for the partition of interest and selects those that match a given shape query. A query may take the form of requesting a trend such as "recent upwards trend", "recent spikes in usage", "downward trends", and "resurgence of usage". Once the phrases matching the shape query are found, they are presented to the user.

PatentsPlus Images
Enhanced PDF formats
loading...
PatentsPlus: add to cart
PatentsPlus: add to cartSearch-enhanced full patent PDF image
$9.95more info
PatentsPlus: add to cart
PatentsPlus: add to cartIntelligent turbocharged patent PDFs with marked up images
$18.95more info
 
Sign InRegister
Username  
Password   
forgot password?