Computational method for discovering patterns in data sets
Patent 5809499 Issued on September 15, 1998. Estimated Expiration Date: October 18, 2016. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.
707/6, Pattern matching access178/18.01, Position coordinate determination for writing (e.g., writing digitizer pad, stylus, or circuitry)345/156, DISPLAY PERIPHERAL INTERFACE INPUT DEVICE704/1, LINGUISTICS704/200, SPEECH SIGNAL PROCESSING706/11, HAVING PARTICULAR USER INTERFACE706/45, KNOWLEDGE PROCESSING SYSTEM707/3, Query processing (i.e., searching)707/104.1Application of database or data structure (e.g., distributed, multimedia, image)
Automatic discovery of qualitative and quantitative patterns inherent in data sets is accomplished by use of a unified framework which employs adjusted residual analysis in statistics to test the significance of the pattern candidates generated from data sets. This framework consists of a search engine for different order patterns, a mechanism to avoid exhaustive search by eliminating impossible pattern candidates, an attributed hypergraph (AHG) based knowledge representation language and an inference engine which measures the weight of evidence of each pattern for classification and prediction. If a pattern candidate passes the statistical significance test of adjusted residual, it is regarded as a pattern and represented by an attributed hyperedge in AHG. In the task of classification and/or prediction, the weights of evidence are calculated and compared to draw the conclusion.
Other References
Smyth et al., "An information theoretic approach to rule induction from database," IEEE, pp. 301-316, Aug. 1992
Langley et al., "Approaches to machine learning," IEEE, pp. 306-316, Sep. 1984
Michalski et al., "Automated construction of classifications: conceptual clustering versus numerical taxonomy," IEEE, pp. 396-410, Jul. 1983
Michalski et al., "Knowledge acquisition by encoding expert rules versus computer induction from examples: a case study involving soybean pathology," IEEE, pp. 63-87, Jun. 1979
Chan et al., "APACS: A system for Automatited pattern analysis and classification," computational Intelligence, vol. 6, No. 3, pp. 119-131, May, 1989
D.H. Fischer, "Conceptual clustering, learning from examples, and inference", proceedings of the 4th Intenational workshop on machine learning, pp. 38-49, Jan. 1987
Langey et al., "Conceptual clustering as discrimination learning", Proceedings of the Fifth Biennial Conference of the Canadian Society for computational Studies of Intelligence, pp. 95-98, Jan. 1984
Fisher et al., "An empirical comparison of ID3 and Back-propagation", Proceedings of the 11th International joint conference on artificial Intelligence, vol. 1, pp. 788-793, Aug. 1989
S.J. Harberman, "The analysis of residuals in cross-classified tables", Biometrics, vol.29, No. 1-4, pp. 205-220, Jan. 1973
Shelby J. Haberman "The Analysis of reiduals in cross-classified tables", Biometrics, vol. 29, No. 1-4, pp. 205-220, Mar. 1973
Douglas H. Fisher, "Knowledge acquisition via Incremental conceptual clustering", Machine Learning, vol.2, No.2, pp. 139-172, May 1987
J.R. Quinlan, "Induction of Decision Trees", Machine Learning, V.1, No.1, pp. 81-106, Jun. 1986
D.H. Fisher, "A Hierarchical conceptual Clustering Algorithm", Technical report, Dept of information and computer science, University of California, Irvine, Mar. 1984
C. Berge, "Hypergraphes, combinatoires des ensembles finis", Annales de L'IHP--Analyse Non Lineaire, Jan. 1987
Holsheimer et al., "Data mining the search for knowledge in databases", Computer Science/Department of Algorithmics and Architecture, Jan. 199