Patent ReferencesMethod and apparatus for automatic character type classification of European script documents Method and apparatus for automatic language determination of European script documents Method and apparatus for automatic determination of text line, word and character cell spatial features Method and apparatus for automatic language determination of Asian language documents Method for matching text images and documents using character shape codes Method and apparatus for automatic character script determination Method and apparatus for enhanced automatic determination of text line dependent parameters Method and apparatus for highlighting and categorizing documents using coded word tokens Relaxation word recognizer Method and system for natural language translation InventorAssigneeApplicationNo. 858884 filed on 05/19/1997US Classes:382/229, Context analysis or word recognition (e.g., character string)382/173, IMAGE SEGMENTATION382/181, PATTERN RECOGNITION382/224, Classification382/227, With a multilevel classifier382/228, Statistical decision process382/239, Adaptive coding (i.e., changes based upon history, activity, busyness, etc.)715/500, PRESENTATION PROCESSING OF DOCUMENT715/530Edit, composition, or storage controlExaminersPrimary: Boudreau, Leo H.Assistant: Patel, Kiran Attorney, Agent or FirmInternational ClassG06K 009/72AbstractA word shape token-based document classification system prepares a plurality of sets of training data degraded by image quality and selects the optimum training data set by examining scores from a relevance measurement. The system achieves high accuracy from a wide range of image quality. | |