Patent References
Methods and apparatus for evolving a starter set of handwriting prototypes into a user-specific set
Method and apparatus for cursive script recognition
Method and apparatus for automatic character script determination
Detecting function words without converting a scanned document to character codes
Method and apparatus for supplementing significant portions of a document selected without document image decoding with retrieved information
Script identification from images using cluster-based templates
Patent #: 5844991

Inventors

Application
No. 008225 filed on 01/16/1998

US Classes
382/190, Feature extraction
382/177, Segmenting individual characters or words
382/201, Point features (e.g., spatial coordinate descriptors)
382/224, Classification

Examiners
Primary: Mehta, Bhavesh M.

Attorney, Agent or Firm

International Class
G06K 009/46

Abstract
A computer-implemented process identifies an unknown language used to create a document. A set of training documents is defined in a variety of known languages and formed from a variety of text styles. Black and white electronic pixel images are formed of text material forming the training documents and the document in the unknown language. A plurality of line strokes are defined from the black pixels and point features are extracted from the strokes that are effective to characterize each of the languages. Point features from the unknown language are compared with point features from the known languages to identify one of the known languages that best represents the unknown language.

Other References