Patent References
Method and an apparatus for validating the electronic encoding of an ideographic character
Process and apparatus involving pattern recognition
Method for storing Chinese character description information in a character generating apparatus
Method and apparatus for encoding and decoding Chinese characters
Patent #: 5212769

Application
No. 662854 filed on 06/12/1996

US Classes
382/185, Ideographic characters (e.g., Japanese or Chinese)
382/187, On-line recognition of handwritten characters
382/202, Linear stroke analysis (e.g., limited to straight lines)
382/209, Template matching (e.g., specific devices that determine the best match)

Examiners
Primary: Mancuso, Joseph
Assistant: Nguyen, Ha Tran

International Classes
G06K 009/18
G06K 009/00
G06K 009/46
G06K 009/62

Abstract
A hierarchical database system for on-line Chinese character recognition is disclosed. The hierarchical database system comprises: (1) the database of character description; (2) the database of stroke correspondence rules of components; (3) the database of character structures; (4) the database of standard component patterns; and (5) the database of spatial relationships between strokes of components. The database of character description stores the rule code(s) of the constituent component(s) and the character structure for all the characters included in the vocabulary. The database of stroke correspondence rules of components stores stroke correspondence rules for all the components denoted by the above-mentioned rule codes. The database of character structures stores the synthesis rules of character patterns, the decomposition rules of character structures, and the spatial relationships between components, for all the 208 character structures utilized in a preferred embodiment. The database of standard component patterns stores the coordinates of the extreme points of the line segments constituting the standard patterns of components.
The database of spatial relationships between strokes of components stores the spatial relationships between the strokes of each component. When an input script is matched against a template character, the input script is decomposed into its constituent component(s) according to the decomposition rule of that character, described as a rule code (stored in the database of character description). Then, the stroke correspondence rules are retrieved from the database of stroke correspondence rules of components via the rule code(s) of that template character. After the matching is completed at the stroke level, the character distance is computed using the database of spatial relationships between strokes of components and the character pattern, which can be synthesized using the database of standard component patterns and the database of character structures. The present invention also discloses a method of finding the best decomposition rule of components for each character, in which various "geometric features" of strokes and possible decomposition sequences are taken into consideration to derive, for each character, the rule that minimizes the number of erroneously decomposed strokes.
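
The five databases and the lookup path described in the abstract can be pictured with a small Python sketch. This is only an illustration under stated assumptions, not the patented implementation: the class and field names, the rule codes `C_SUN` and `C_MOON`, the structure id `1`, and all coordinates, rules, and relations are hypothetical placeholders chosen for the example. It shows only how a template character's rule codes and structure could act as keys into the other four databases.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]  # a line segment given by its two extreme points


@dataclass
class CharacterEntry:
    """One record of the database of character description."""
    rule_codes: List[str]  # rule code of each constituent component
    structure_id: int      # key into the database of character structures


@dataclass
class HierarchicalDatabase:
    """Hypothetical container for the five databases named in the abstract."""
    characters: Dict[str, CharacterEntry] = field(default_factory=dict)
    stroke_rules: Dict[str, List[str]] = field(default_factory=dict)
    structures: Dict[int, Dict[str, str]] = field(default_factory=dict)
    component_patterns: Dict[str, List[Segment]] = field(default_factory=dict)
    stroke_relations: Dict[str, List[Tuple[int, int, str]]] = field(default_factory=dict)

    def template_for(self, char: str) -> dict:
        """Collect everything needed to match an input script against `char`."""
        entry = self.characters[char]
        components = [
            {
                "rule_code": code,
                "stroke_rules": self.stroke_rules[code],
                "pattern": self.component_patterns[code],
                "relations": self.stroke_relations[code],
            }
            for code in entry.rule_codes
        ]
        return {
            "structure": self.structures[entry.structure_id],
            "components": components,
        }


def build_toy_database() -> HierarchicalDatabase:
    """Fill the databases with placeholder data (not taken from the patent)."""
    db = HierarchicalDatabase()
    # The character 明 consists of the components 日 and 月, side by side.
    db.characters["明"] = CharacterEntry(rule_codes=["C_SUN", "C_MOON"], structure_id=1)
    db.structures[1] = {
        "name": "left-right",
        "decomposition": "split the strokes into a left group and a right group",
    }
    for code in ("C_SUN", "C_MOON"):
        db.stroke_rules[code] = ["placeholder stroke correspondence rule"]
        db.component_patterns[code] = [((0.0, 0.0), (0.0, 1.0)), ((0.0, 0.0), (1.0, 0.0))]
        db.stroke_relations[code] = [(0, 1, "placeholder relation")]
    return db


toy_db = build_toy_database()
template = toy_db.template_for("明")
```

In this sketch a single lookup in the character description fans out into the component-level databases through the rule codes, which mirrors the hierarchy the abstract describes. Stroke matching and the character distance computation are deliberately left out, since the abstract does not specify them in enough detail to reproduce.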
Field of Search
Ideographic characters (e.g., Japanese or Chinese)
On-line recognition of handwritten characters
Point features (e.g., spatial coordinate descriptors)
Template matching (e.g., specific devices that determine the best match)
Comparator
Directional codes and vectors (e.g., Freeman chains, compasslike codes)
Unconstrained handwriting (e.g., cursive)
Linear stroke analysis (e.g., limited to straight lines)
Shape and form analysis
Electronic template
Pyramid, hierarchy, or tree structure
Multilingual or national language support
Dictionary building, modification, or prioritization