Patent References
System and method for mining generalized association rules in databases
Document information retrieval using global word co-occurrence patterns
Method and apparatus for improved information storage and retrieval system
Compact encoding of multi-lingual translation dictionaries
Method and apparatus for data access and update in a shared file environment
Visualization of information using graphical representations of context vector based relationships and attributes Patent #: 5794178

Application No. 909911 filed on 08/12/1997

US Classes:
707/5, Query augmenting and refining (e.g., inexact access)
707/3, Query processing (i.e., searching)
707/6, Pattern matching access
707/102, Generating database or data structure (e.g., via user interface)
707/203, Version management
715/511, Version management
715/532, Dictionary

Examiners
Primary: Amsbury, Wayne
Assistant: Channavajjala, Srirama

International Class
G06F 017/30

Abstract
A method and apparatus for mining text databases, employing sequential pattern phrase identification and shape queries, to discover trends. The method passes over a desired database using a dynamically generated shape query. Documents within the database are selected based on specific classifications and user-defined partitions. Once a partition is specified, transaction IDs are assigned to the words in the text documents depending on their placement within each document. The transaction IDs encode both the position of each word within the document and the sentence, paragraph, and section breaks; in one embodiment they are represented as long integers that carry the sentence boundaries. A maximum and minimum gap between the words in a phrase may be specified, along with the minimum support that every phrase must meet for the selected time period. A generalized sequential pattern method is used to generate those phrases in each partition that meet the minimum support threshold.
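The transaction-ID encoding and the gap and support constraints described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the `SENTENCE_JUMP` offset, the function names, and the tokenizing regular expressions are all assumptions made for illustration, and only the support-counting step is shown, not the candidate generation of a generalized sequential pattern method.

```python
import re

# Assumed encoding: word positions increase by 1 inside a sentence, and each
# sentence break adds a large jump so that a small max_gap cannot bridge it.
SENTENCE_JUMP = 1000


def assign_transaction_ids(text):
    """Return (word, transaction_id) pairs for one document."""
    pairs = []
    tid = 0
    for sentence in re.split(r"[.!?]+", text):
        words = re.findall(r"[a-z0-9']+", sentence.lower())
        if not words:
            continue
        for word in words:
            tid += 1
            pairs.append((word, tid))
        tid += SENTENCE_JUMP  # encode the sentence boundary in the ID itself
    return pairs


def contains_phrase(pairs, phrase, min_gap=1, max_gap=1):
    """True if the words of `phrase` occur in order, with the ID difference
    between consecutive words lying in [min_gap, max_gap]."""

    def search(prev_tid, idx):
        if idx == len(phrase):
            return True
        for word, tid in pairs:
            if word == phrase[idx] and min_gap <= tid - prev_tid <= max_gap:
                if search(tid, idx + 1):
                    return True
        return False

    return any(word == phrase[0] and search(tid, 1) for word, tid in pairs)


def support(documents, phrase, min_gap=1, max_gap=1):
    """Fraction of documents in the partition that contain the phrase."""
    encoded = [assign_transaction_ids(doc) for doc in documents]
    hits = sum(contains_phrase(p, phrase, min_gap, max_gap) for p in encoded)
    return hits / len(documents)


partition = [
    "Data mining is fun. Mining data is hard.",
    "We like data mining.",
    "Fun is data. Mining is work.",
]
print(support(partition, ("data", "mining")))                # adjacent words only
print(support(partition, ("data", "mining"), max_gap=2000))  # may cross a sentence break
```

With `max_gap=1` the third document does not count, because "data" and "mining" sit on opposite sides of a sentence break and their IDs differ by more than the allowed gap. A phrase would be kept for the partition only if its support reaches the chosen minimum support threshold.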
The shape query engine takes the set of phrases for the partition of interest and selects those that match a given shape query. A query may request a trend such as "recent upwards trend", "recent spikes in usage", "downward trends", or "resurgence of usage". Once the phrases matching the shape query are found, they are presented to the user.
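As a rough illustration of shape matching, the sketch below treats each phrase as a series of support values, one per time partition, and tests that series against three hand-written shapes. The shape definitions, the `window` parameter, the function names, and the sample data are all hypothetical; the abstract does not specify how shapes are defined, so this is one plausible reading rather than the method itself.

```python
def steps(series, tol=0.0):
    """Label each consecutive change in the series as 'up', 'down', or 'flat'."""
    labels = []
    for prev, cur in zip(series, series[1:]):
        diff = cur - prev
        labels.append("up" if diff > tol else "down" if diff < -tol else "flat")
    return labels


def recent_upward_trend(series, window=3):
    """Assumed definition: the last `window` changes are all increases."""
    s = steps(series)
    return len(s) >= window and all(x == "up" for x in s[-window:])


def downward_trend(series):
    """Assumed definition: every change in the series is a decrease."""
    s = steps(series)
    return bool(s) and all(x == "down" for x in s)


def resurgence(series):
    """Assumed definition: an earlier decrease, with an increase at the end."""
    s = steps(series)
    return bool(s) and "down" in s[:-1] and s[-1] == "up"


def match_shape(phrase_supports, shape):
    """Return the phrases whose support history satisfies the shape predicate."""
    return sorted(p for p, series in phrase_supports.items() if shape(series))


# Hypothetical support values per time period, oldest first.
supports = {
    "data mining": [0.1, 0.2, 0.4, 0.7],
    "expert system": [0.6, 0.4, 0.3, 0.1],
    "neural network": [0.5, 0.2, 0.1, 0.4],
}

print(match_shape(supports, recent_upward_trend))
print(match_shape(supports, downward_trend))
print(match_shape(supports, resurgence))
```

Expressing each shape as a predicate over the sequence of up and down steps keeps the matcher independent of the absolute support values, so the same query can be applied to frequent and rare phrases alike.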
Field of Search
DATABASE OR FILE ACCESSING
Access augmentation or optimizing
Query processing (i.e., searching)
Pattern matching access
Sorting
Concurrency (e.g., lock management in shared database)
Privileged access
Distributed or remote access
DATABASE SCHEMA OR DATA STRUCTURE
Version management
Manipulating data structure (e.g., compression, compaction, compilation)
Generating database or data structure (e.g., via user interface)
Coherency (e.g., same view to multiple users)
Market analysis, demand forecasting or surveying
LINGUISTICS
Multilingual or national language support
Natural language
Dictionary building, modification, or prioritization