Patent References
Multiple-parts-of-speech disambiguating method and apparatus for machine translation system
Language processing dictionary for bidirectionally retrieving morphemic and semantic expressions
Method and apparatus for the electronic storage and retrieval of expressions and linguistic information
Electronic dictionary having means for linking two or more different groups of vocabulary entries in a closed loop
Electronic dictionary
Method for segmenting a text into words
Method for entering text using abbreviated word forms
Method and apparatus for text analysis

Patent #: 4773009
Application No.: 07/106127, filed on 10/07/1987
US Classes: 704/8 (Multilingual or national language support)
Examiners: Primary: Fleming, Michael R.
International Class: G06F 17/28 (20060101)

Abstract
A system for the grammatical annotation of natural language receives natural language text and annotates each word with a set of tags indicative of its possible grammatical or syntactic uses. An empirical probability of collocation function defined on pairs of tags is iteratively extended to a selected set of tag sequences of increasing length so as to select a most probable tag for each word of a sequence of ambiguously-tagged words. For listed pairs of commonly confused words a substitute calculation reveals erroneous use of the wrong word. For words with tags having abnormally low frequency of occurrence, a stored table of reduced probability factors corrects the calculation. Once the text words have been annotated with their most probable tags, the tagged text is parsed by a parser which successively applies phrasal, predicate and clausal analysis to build higher structures from the disambiguated tag strings. A voice/text translator including such a tag annotator resolves sound or spelling ambiguity of words by their differing tags.
A database retrieval system, such as a spelling checker, includes a tag annotator to identify desired data by syntactic features.
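
The tag-selection step in the abstract (a collocation probability defined on pairs of tags, extended to longer tag sequences to pick the most probable tag for each ambiguous word) can be illustrated with a small sketch. The abstract does not give the exact procedure, so the code below substitutes a standard Viterbi-style search over adjacent tag pairs. The tag names, the `COLLOCATION` table, its toy probability values, the `DEFAULT` fallback, and the function name `disambiguate` are all hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Tuple

# Hypothetical collocation probabilities for adjacent tag pairs (toy values,
# not from the patent). "START" marks the beginning of the sentence.
COLLOCATION: Dict[Tuple[str, str], float] = {
    ("START", "DET"): 0.6,
    ("START", "NOUN"): 0.3,
    ("START", "VERB"): 0.1,
    ("DET", "NOUN"): 0.9,
    ("DET", "VERB"): 0.05,
    ("NOUN", "VERB"): 0.6,
    ("NOUN", "NOUN"): 0.3,
    ("VERB", "DET"): 0.5,
    ("VERB", "NOUN"): 0.3,
    ("VERB", "VERB"): 0.05,
}
DEFAULT = 1e-4  # assumed fallback for tag pairs missing from the table


def disambiguate(candidates: List[List[str]]) -> List[str]:
    """Choose one tag per word so that the product of pairwise
    collocation probabilities along the sequence is maximal.

    `candidates[i]` is the non-empty list of possible tags for word i.
    """
    # best[tag] = (score of the best sequence ending in tag, that sequence)
    best: Dict[str, Tuple[float, List[str]]] = {"START": (1.0, [])}
    for tags in candidates:
        extended: Dict[str, Tuple[float, List[str]]] = {}
        for tag in tags:
            # Extend every surviving sequence by this tag and keep the best.
            score, path = max(
                (
                    (prev_score * COLLOCATION.get((prev, tag), DEFAULT), prev_path)
                    for prev, (prev_score, prev_path) in best.items()
                ),
                key=lambda item: item[0],
            )
            extended[tag] = (score, path + [tag])
        best = extended
    return max(best.values(), key=lambda item: item[0])[1]


# "the can rusts": "can" and "rusts" are each ambiguous between NOUN and VERB.
print(disambiguate([["DET"], ["NOUN", "VERB"], ["NOUN", "VERB"]]))
# prints ['DET', 'NOUN', 'VERB']
```

Keeping only the best-scoring sequence per final tag at each step avoids enumerating every combination of tags. The reduced probability factors for rare tags mentioned in the abstract could plausibly enter as an extra multiplier on `score`, though that is an assumption and is not shown here.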