Matrix quantization with vector quantization error compensation for robust speech recognition
Patent 6070136 Issued on May 30, 2000. Estimated Expiration Date: October 27, 2017. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.
A speech recognition system utilizes both matrix and vector quantizers as front ends to a second stage speech classifier. Matrix quantization exploits input signal information in both frequency and time domains, and the vector quantizer primarily operates on frequency domain information. However, in some circumstances, time domain information may be substantially limited which may introduce error into the matrix quantization. Information derived from vector quantization may be utilized by a hybrid decision generator to error compensate information derived from matrix quantization. Additionally, fuzz methods of quantization and robust distance measures may be introduced to also enhance speech recognition accuracy. Furthermore, other speech classification stages may be used, such as hidden Markov models which introduce probabilistic processes to further enhance speech recognition accuracy. Multiple codebooks may also be combined to form single respective codebooks for matrix and vector quantization to lessen the demand on processing resources.
Other References
Lin Cong "A Study Of Robust IWSR Systems", May 1996
Lin Cong "Robust Speech Recognition In A Car Environment", Jun. 1995
Lawrence Rabiner and Biing-Hwang Juang, "Fundamentals of Speech Recognition," Prentice Hall PTR (Englewood Cliffs, New Jersey, 1993), pp. 190-195
Cong, Lin; "A Study of Robust IWSR Systems"; PhD Thesis submitted to The University of Manchester School of Engineering, Division of Electrical Engineering; Manchester, United Kingdom; pp. 1-209., May 1996
Waibel, Alexander; "Neural Network Approaches for Speech Recognition"; Chapter 18 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 555-595
Xydeas, C. S. and Cong, L.; "Combining Neural Network Classification with Fuzzy Vector Quantization and Hidden Markov Models for Robust Isolated Word Speech Recognition"; Signal Processing VIII Theories and Applications, vol. III; Proceedings of the IEEE International Symposium on Information Theory, IEEE Press, 1995, p. 174
Xydeas, C. S. and Cong, L.; "Robust Speech Recognition in A Car Environment"; Presented at DSP95 International Conference on Digital Signal Processing, Jun. 26-28, 1995, Limassol, Cyprus; vol. 1, pp. 84-89
Cong, Lin, Prof. C.S. Xydeas, and Anthony Ferwood; "A Study of Robust Isolated Word Speech Recognition Based on Fuzzy Methods"; Presented at EUSIPCO-94, VII European Signal Processing Conferences, Sep. 13-16, 1994; Scotland, UK.; 4 pages
Gibson, Jerry D.; "Coding, Transmission, and Storage"; Chapter 14, Speech Signal Processing, of The Electrical Engineering Handbook; Editor-in-Chief Richard C. Dorf; .COPYRGT.1993 by CRC Press, Inc.; pp. 279-314
Gersho, Allen and Shihua Wang; "Vector Qunatization Techniques in Speech Coding"; Chapter 2 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker Inc.; New York, New York; 1992; pp. 49-84
Kroon, Peter and Bishnu S. Atal; "Predictive Coding of Speech Using Analysis-by-Synthesis Techniques"; Chapter 5 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 141-164
Honda, Masaaki and Yoshinao Shiraki; "Very Low-Bit-Rate Speech Coding"; Chapter 7 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 209-230
Schroeter, Juergen and M. Mohan Sondhi; "Speech Coding Based on Physiological Models of Speech Production"; Chapter 8 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 231-268
Cong, Ling, Xydeas, Costas S. Prof and Ferwood, Anthony F. Combining Fuzzy Vector Quantization and Neural Network Classification for Robust Isolated Word Speech Recognition: Singapore ICCS 1994, pp. 884-887
Xydeas, C.S. Prof. and Cong, Lin "Robust Speech Recognition Using Fuzzy Martix Quantisation, Neural Networks and Hidden Markov Models" Sep. 1996, pp. 1587-1590
Xydeas, C.S. and Lin Cong; "Robust Speech Recognition Using Fuzzy Matrix Quantization and Neural Networks"; Proceedings of International Conference on Communication Technology; Beijing, China--ICCT '96; pp. 432-435; IEEE; New York (May 5-7, 1996)
Parsons, Thomas W.; "Voice and Speech Processing"; McGraw-Hill, Inc., New York, 1987; pp. 170-17