Patent 6092044 Issued on July 18, 2000. Estimated Expiration Date: March 28, 2017. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.
A method of adding a word to a speech recognition vocabulary includes creating a collection of possible phonetic pronunciations from a spelling of the word and using speech recognition to find a pronunciation from the collection that best matches an utterance of the word. The collection is created by comparing the spelling to a rules list of letter strings with associated phonemes. The list is searched for a letter string from the spelling of length greater than one letter. The collection is limited to phonetic pronunciations containing phonemes associated with the letter string of length greater than one. In another method, a net of possible phonetic pronunciations of the word is created from the spelling and speech recognition is used to find the pronunciation from the net that best matches the utterance of the word. The invention also features methods of assigning a pre-filtering class to a word.
Other References
Kita, Kenji et al., "Processing Unknown Words in Continuous Speech Recognition," IEICE Trans., vol. E74, No. 7 (Jul. 1991), pp. 1811-1815
Asadi, et al.; "Automatic Modeling for Adding New Words to a Large-Vocabulary Continuous Speech Recognition System"; ICASSP 91 vol. 1; International Conference; pp. 305-308
Bahl, et al.; "A Maximum Likelihood Approach to Continuous Speech Recognition"; IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-5; No. 2, Mar. 1983
European Search Report dated Apr. 7, 1999
Asadi, Ayman, "Automatic Modeling for Adding New Words to a Large Vocabulary . . . ", ICASSP 91, vol. 1, pp. 305-308, 1991
Bahl, Lalit, "A Maximum LikeLihood Approach to Continuous Speech Recognition", IEEE Transactions on Patern Analysis and Machine Intelligence, vol. PAMI-5, No. 2, pp. 179-190, Mar. 1983
Bahl, L.R., "Automatic High-Resolution Labeling of Speech Waveforms", IBM Technical Disclosure Bulletin, vol. 23, No. 7B, pp. 3466-3467, Dec. 1980
Bahl, L.R., "Automatic Phonetic Baseform Determination", ICASSP 91, vol. 1, pp. 173-176, May 1991
Bahl, L.R., "Adaptation of Large Vocabulary Recognition System" ICASSP-92, vol. 1, pp. I477-480 Mar. 1992
Bahl, L.R., "Automatic Selection of Speech Prototypes " IBM Technical Disclosure Bulletin vol. 24, No. 4, pp. 2042-2043, Sep. 1981
Bahl, L.R., "Interpolation of Estimators Derived From Sparse Data", IBM Technical Disclosure Bulletin vol. 24, No. 4, pp. 2038-2041, Sep. 1981
Das, S.K., "System for Temporal Registration of Quasi-Phonemic Utterance Representations", IBM Technical Disclosure Bulletin, Bol. 23, No. 7A, pp. 3047-3050, Dec. 1980
Haeb-Unbach, R., "Automatic Transcription of Unknown Words in a Speech Recognition System", The 1995 International Conference on Acoustice, Speech, and Signal Processing, vol. 1, pp. 840-843, May 1995
Imai, Toru, "ANew Method for Automatic Generation of Speaker-Dependent Phonological Rules", The 1995 International Conference on Acoustice, Speech, and Signal Processing, vol. 1, pp. 864-867, May 1995
Merialdo B., "Multilevel decoding for Very-Large-Size-Dictionary speech recognition", IBM J. Res. Develop., vol. 32, No. 2, Mar. 1988
Wothke, K., "Morphologically based automatic phonetic transcription", IBM Systems Journal, vol. 32, No. 3, 199