U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Quantization using frequency and mean compensated frequency input data for robust speech recognition

Patent 6219642 Issued on April 17, 2001. Estimated Expiration Date: Icon_subject October 5, 2018. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.

Patent References

Speech recognition system using Markov models having independent label output sets
Patent #: 5031217
Issued on: 07/09/1991
Inventor: Nishimura

Variable rate vocoder
Patent #: 5414796
Issued on: 05/09/1995
Inventor: Jacobs, et al.

Vector quantization of a time sequential signal by quantizing an error between subframe and interpolated feature vectors
Patent #: 5583888
Issued on: 12/10/1996
Inventor: Ono

Cepstral correction vector quantizer for speech recognition
Patent #: 5598505
Issued on: 01/28/1997
Inventor: Austin, et al.

Speech recognition system and method which permits a speaker's utterance to be recognized using a hidden markov model with subsequent calculation reduction
Patent #: 5649056
Issued on: 07/15/1997
Inventor: Nitta

Speech recognition using equal division quantization
Patent #: 5970445
Issued on: 10/19/1999
Inventor: Yamamoto, et al.

Speech recognition apparatus
Patent #: 6061652
Issued on: 05/09/2000
Inventor: Tsuboka, et al.

Split matrix quantization with split vector quantization error compensation and selective enhanced processing for robust speech recognition
Patent #: 6067515
Issued on: 05/23/2000
Inventor: Cong, et al.

Matrix quantization with vector quantization error compensation for robust speech recognition Patent #: 6070136
Issued on: 05/30/2000
Inventor: Cong, et al.

Inventors

Application

No. 166648 filed on 10/05/1998

US Classes:

704/256, Markov704/243Creating patterns for matching

Examiners

Primary: Dorvil, Richemond
Assistant: Armstrong, Angela

Attorney, Agent or Firm

International Classes

G10L 015/14
G10L 015/08

Abstract

A speech recognition system utilizes multiple quantizers to process frequency parameters and mean compensated frequency parameters derived from an input signal. The quantizers may be matrix and vector quantizer pairs, and such quantizer pairs may also function as front ends to a second stage speech classifiers such as hidden Markov models (HMMs) and/or utilizes neural network postprocessing to, for example, improve speech recognition performance. Mean compensating the frequency parameters can remove noise frequency components that remain approximately constant during the duration of the input signal. HMM initial state and state transition probabilities derived from common quantizer types and the same input signal may be consolidated to improve recognition system performance and efficiency. Matrix quantization exploits the "evolution" of the speech short-term spectral envelopes as well as frequency domain information, and vector quantization (VQ) primarily operates on frequency domain information. Time domain information may be substantially limited which may introduce error into the matrix quantization, and the VQ may provide error compensation. The matrix and vector quantizers may split spectral subbands to target selected frequencies for enhanced processing and may use fuzzy associations to develop fuzzy observation sequence data. A mixer may provide a variety of input data to the neural network for classification determination. Fuzzy operators may be utilized to reduce quantization error. Multiple codebooks may also be combined to form single respective codebooks for split matrix and split vector quantization to reduce processing resources demand.

Other References

  • Xydeas, C.S. and Cong, L. (1995) "Robust Speech Recognition in a Car Environment"; Presented at DSP95 International Conference om DSP, Jun. 26-28, 1995, Limassol, Cyprus, vol. 1, pp. 84-89.
  • Xydeas, C.S. and Cong, L., (1996) "Robust Speech Recognition using Fuzzy Mtrix Quantization, Neural Networks, and Hidden Markov Models,"Proc. of EUSIPCO-96, vol. 3, pp. 1587-1590.
  • Xydeas, C.S., Cong, L. (1995) Combining Neural Network Classification with Fuzzy Vector Quantization and Hidden Markov Models for Robust Isolated Word Speech Recognition, Proc. 1995.
  • IEEE International Symposium on Information Theory, p. 174.
  • Cong, Lin; "A Study of Robust IWSR Systems"; PhD Thesis submitted to The University of Manchester School of Engineering, Division of Electrical Engineering; Manchester, United Kingdom; pp. 1-209. May 1996
  • Waibel, Alexander; "Neural Network Approaches for Speech Recognition"; Chapter 18 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 555-595
  • Xydeas, C. S. and Cong, L.;"Combining Neural Network Classification with Fuzzy Vector Quantization and Hidden Markov Models for Robust Isolated Word Speech Recognition"; Signal Processing VIII Theories and Applications, vol. III; Proceedings of the IEEE International Symposium on Information Theory, IEEE Press, 1995, p. 174
  • Xydeas, C. S. and Cong, L.; "Robust Speech Recognition in A Car Environment"; Presented at DSP95 International Conference on Digital Signal Processing, Jun. 26-28, 1995, Limassol, Cyprus; vol. 1, pp. 84-89
  • Cong, Lin, Prof. C.S. Xydeas, and Anthony Ferwood; "A Study of Robust Isolated Word Speech Recognition Based on Fuzzy Methods"; Presented at EUSIPCO-94, VII European Signal Processing Conference, Sep. 13-16, 1994; Scotland, UK.; 4 pages
  • Gibson, Jerry D.; "Coding, Transmission, and Storage"; Chapter 14, Speech Signal Processing, of The Electrical Engineering Handbook; Editor-in-Chief Richard C. Dorf; .COPYRGT.1993 by CRC Press, Inc.; pp. 279-314
  • Gersho, Allen and Shihua Wang; "Vector Quantization Techniques in Speech Coding"; Chapter 2 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 49-84
  • Kroon, Peter and Bishnu S. Atal; "Predictive Coding of Speech Using Analysis-by-Synthesis Techniques"; Chapter 5 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 141-164
  • Honda, Masaaki and Yoshinao Shiraki; "Very Low-Bit-Rate Speech Coding"; Chapter 7 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 209-230
  • Schroeter, Juergen and M. Mohan Sondhi; "Speech Coding Based on Physiological Models of Speech Production"; Chapter 8 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 231-26
PatentsPlus Images
Enhanced PDF formats
loading...
PatentsPlus: add to cart
PatentsPlus: add to cartSearch-enhanced full patent PDF image
$9.95more info
PatentsPlus: add to cart
PatentsPlus: add to cartIntelligent turbocharged patent PDFs with marked up images
$18.95more info
 
Sign InRegister
Username  
Password   
forgot password?