U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Pronunciation generation in speech recognition

Patent 6092044 Issued on July 18, 2000. Estimated Expiration Date: Icon_subject March 28, 2017. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.

Patent References

Continuous speech recognition
Patent #: 4481593
Issued on: 11/06/1984
Inventor: Bahler

Method and apparatus for continuous word string recognition
Patent #: 4489435
Issued on: 12/18/1984
Inventor: Moshier

Speech recognition system
Patent #: 4718094
Issued on: 01/05/1988
Inventor: Bahl ,   et al.

Speech recognition apparatus and method
Patent #: 4783803
Issued on: 11/08/1988
Inventor: Baker ,   et al.

Method for speech analysis and speech recognition
Patent #: 4805218
Issued on: 02/14/1989
Inventor: Bamberg ,   et al.

Method for speech recognition
Patent #: 4805219
Issued on: 02/14/1989
Inventor: Baker ,   et al.

Voice recognition system
Patent #: 4829576
Issued on: 05/09/1989
Inventor: Porter

Automatic generation of simple Markov model stunted baseforms for words in a vocabulary
Patent #: 4833712
Issued on: 05/23/1989
Inventor: Bahl ,   et al.

Method for interactive speech recognition and training
Patent #: 5027406
Issued on: 06/25/1991
Inventor: Roberts, et al.

Method and apparatus for speech recognition based on subsyllable spellings
Patent #: 5208897
Issued on: 05/04/1993
Inventor: Hutchins

More ...

Inventors

Assignee

Application

No. 825141 filed on 03/28/1997

US Classes:

704/254Subportions

Examiners

Primary: Zele, Krista M.
Assistant: Opsasnick, Michael N.

Attorney, Agent or Firm

Foreign Patent References

  • 0 562 138 A1 EP. 09/13/1993

International Class

G10L 015/08

Abstract

A method of adding a word to a speech recognition vocabulary includes creating a collection of possible phonetic pronunciations from a spelling of the word and using speech recognition to find a pronunciation from the collection that best matches an utterance of the word. The collection is created by comparing the spelling to a rules list of letter strings with associated phonemes. The list is searched for a letter string from the spelling of length greater than one letter. The collection is limited to phonetic pronunciations containing phonemes associated with the letter string of length greater than one. In another method, a net of possible phonetic pronunciations of the word is created from the spelling and speech recognition is used to find the pronunciation from the net that best matches the utterance of the word. The invention also features methods of assigning a pre-filtering class to a word.

Other References

  • Kita, Kenji et al., "Processing Unknown Words in Continuous Speech Recognition," IEICE Trans., vol. E74, No. 7 (Jul. 1991), pp. 1811-1815
  • Asadi, et al.; "Automatic Modeling for Adding New Words to a Large-Vocabulary Continuous Speech Recognition System"; ICASSP 91 vol. 1; International Conference; pp. 305-308
  • Bahl, et al.; "A Maximum Likelihood Approach to Continuous Speech Recognition"; IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-5; No. 2, Mar. 1983
  • European Search Report dated Apr. 7, 1999
  • Asadi, Ayman, "Automatic Modeling for Adding New Words to a Large Vocabulary . . . ", ICASSP 91, vol. 1, pp. 305-308, 1991
  • Bahl, Lalit, "A Maximum LikeLihood Approach to Continuous Speech Recognition", IEEE Transactions on Patern Analysis and Machine Intelligence, vol. PAMI-5, No. 2, pp. 179-190, Mar. 1983
  • Bahl, L.R., "Automatic High-Resolution Labeling of Speech Waveforms", IBM Technical Disclosure Bulletin, vol. 23, No. 7B, pp. 3466-3467, Dec. 1980
  • Bahl, L.R., "Automatic Phonetic Baseform Determination", ICASSP 91, vol. 1, pp. 173-176, May 1991
  • Bahl, L.R., "Adaptation of Large Vocabulary Recognition System" ICASSP-92, vol. 1, pp. I477-480 Mar. 1992
  • Bahl, L.R., "Automatic Selection of Speech Prototypes " IBM Technical Disclosure Bulletin vol. 24, No. 4, pp. 2042-2043, Sep. 1981
  • Bahl, L.R., "Interpolation of Estimators Derived From Sparse Data", IBM Technical Disclosure Bulletin vol. 24, No. 4, pp. 2038-2041, Sep. 1981
  • Das, S.K., "System for Temporal Registration of Quasi-Phonemic Utterance Representations", IBM Technical Disclosure Bulletin, Bol. 23, No. 7A, pp. 3047-3050, Dec. 1980
  • Haeb-Unbach, R., "Automatic Transcription of Unknown Words in a Speech Recognition System", The 1995 International Conference on Acoustice, Speech, and Signal Processing, vol. 1, pp. 840-843, May 1995
  • Hunnicutt, Sheri, "Reversible Letter-to-Sound Sound-to-Letter Generation . . . ", Eurospeech '93, vol. 2, pp. 763-766
  • Imai, Toru, "ANew Method for Automatic Generation of Speaker-Dependent Phonological Rules", The 1995 International Conference on Acoustice, Speech, and Signal Processing, vol. 1, pp. 864-867, May 1995
  • Merialdo B., "Multilevel decoding for Very-Large-Size-Dictionary speech recognition", IBM J. Res. Develop., vol. 32, No. 2, Mar. 1988
  • Wothke, K., "Morphologically based automatic phonetic transcription", IBM Systems Journal, vol. 32, No. 3, 199
PatentsPlus Images
Enhanced PDF formats
loading...
PatentsPlus: add to cart
PatentsPlus: add to cartSearch-enhanced full patent PDF image
$9.95more info
PatentsPlus: add to cart
PatentsPlus: add to cartIntelligent turbocharged patent PDFs with marked up images
$18.95more info
 
Sign InRegister
Username  
Password   
forgot password?