U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Method and apparatus of hierarchically organizing an acoustic model for speech recognition and adaptation of the model to unseen domains

Patent 6324510 Issued on November 27, 2001. Estimated Expiration Date: Icon_subject November 6, 2018. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.

Patent References

Speech recognition method
Patent #: 4803729
Issued on: 02/07/1989
Inventor: Baker

Method for continuous recognition of alphanumeric strings spoken over a telephone network
Patent #: 5303299
Issued on: 04/12/1994
Inventor: Hunt, et al.

Speech analysis method and apparatus
Patent #: 5345535
Issued on: 09/06/1994
Inventor: Doddington

Apparatuses and methods for developing and using models for speech recognition
Patent #: 5715367
Issued on: 02/03/1998
Inventor: Gillick, et al.

Automated speech recognition using a plurality of different multilayer perception structures to model a plurality of distinct phoneme categories
Patent #: 5745649
Issued on: 04/28/1998
Inventor: Lubensky

Acoustic model generating method for speech recognition
Patent #: 5799277
Issued on: 08/25/1998
Inventor: Takami

Low complexity, high accuracy clustering method for speech recognizer
Patent #: 5806030
Issued on: 09/08/1998
Inventor: Junqua

Recognition of sequential data using finite state sequence models organized in a tree structure
Patent #: 5983180
Issued on: 11/09/1999
Inventor: Robinson

Transcription of speech data with segments from acoustically dissimilar environments Patent #: 6067517
Issued on: 05/23/2000
Inventor: Bahl, et al.

Inventors

Assignee

Application

No. 187902 filed on 11/06/1998

US Classes:

704/256, Markov704/232, Neural network704/242, Viterbi trellis704/254Subportions

Examiners

Primary: Dorvil, Richemond
Assistant: Abebe, Daniel

Attorney, Agent or Firm

International Class

G10L 015/14

Abstract

A method of organizing an acoustic model for speech recognition is comprised of the steps of calculating a measure of acoustic dissimilarity of subphonetic units. A clustering technique is recursively applied to the subphonetic units based on the calculated measure of acoustic dissimilarity to automatically generate a hierarchically arranged model. Each application of the clustering technique produces another level of the hierarchy with the levels progressing from the least specific to the most specific. A technique for adapting the structure and size of a trained acoustic model to an unseen domain using only a small amount of adaptation data is also disclosed.

Other References

  • Jurgen Fritsch, Michael Finke, "Acid/HNN: Clustering Hierarchies of Neural Networks for Context--Dependent Connectionist Acoustic Modeling," IEEE International conference on Acoustics, Speech and Signal Processing, Conference 23 (New York, New York), p. 505-508, (1998)
  • J. Fritsch, M. Finke, A. Waibel, "Effective Structural Adaptation of LVCSR Systems to Unsen domains using hierarchical connectionist acoustic models," Proceedings of the International Conference on Spoken Language Processing, p. 2919-2922, (Nov. 30-Dec. 4, 1998)
  • Paul, D.B., "Extensions to Phone-State Decision-Tree Clustering: Single Tree and Tagged Clustering," IEEE Comp. Soc. Press, IEEE International Conference on Acoustic, Speech, and Signal Processing (Los Alamitos, US), p. 1487-1490, ( 1997)
  • H. Franco, "Context-Dependent Connectionist Probability Estimation in a Hybrid Markov Model-Neural Net Speech Recognition System," Computer Speech and Language, vol. 8 (No. 3), (Feb. 22, 1994)
  • J Fritsch, et al., "Context-Dependent Hybrid HME/HMM Speech Recognition Using Polyphone Clustering Decision Trees," Proc. of ICASSP '97
  • D.J. Kershaw, et al., "Context-Dependent Classes in a Hybrid Recurrent Network HMM Speech Recognition System," Tech. Rep. CUED/F-INFENG/TR217, CUED, Cambridge England 1995
  • D.L. Thomson, "Ten Case Studies of the Effect of Field Conditions on Speech Recognition Errors," Proceedings of the IEEE ASRU Workshop, (Feb. 22, 1997)
  • J. Schurmann and W. Doster, "A Decision Theoretic Approach to Hierarchical Classifier Design," Pattern Recognition 17(3), (Feb. 22, 1994)
  • J. Fritsch, "Acid/HNN; A Framework for Hierarchical Connectionist Acoustic Modeling," Proceedsing of IEEE ASRU Workshop, (Feb. 22, 1997)
  • C.J. Leggetter and P.C. Woodland, "Speaker Adaptation of HMMs using Linear Regression," Tech. Rep. CUED/F-INFENG/TR181, CUED, (Feb. 22, 1994)
  • Franco, H., "Context-Dependent Connectionist Probability Estimation in a Hybrid Markov Model-Neural Net Speech Recognition System", Computer Speech and Language, vol. 8, No. 3, Jul. 1994
  • Fritsch, J., et al, "Context-Dependent Hybrid MHE/HMM Speech Recognition Using Polyphone Clustering Decision Trees", Proc. Of ICASS '97, Apr. 21-24, 1997
  • Kershaw, D. J., et al, "Contest-Dependent Classes in a Hybrid Recurrent Network HMM Speech Recognition System", Tech. Rep CUED/F-INFENG/TR217, CUED, Cambridge, England, Jul. 1995
  • Thomson, D. L., "Ten Case Studies of the Effect of Field Conditions on Speech Recognition Errors", Proceedings of the IEEE ASRU Workshop, Dec. 17, 1997
  • Schurmann, J., et al. "A Decision Theoretic Approach to Hierarchical Classifier Design", Pattern Recognition, 17 (3), 1984
  • Fritsch., J., "ACIDHNN; A Framcwork for Hierarchical Connectionist Acoustic Modeling", Proceedings of IEEE ASRU Workshop, Dec. 14-17, 1997
  • Leggetter, C.J., et al, "Speaker Adaptation of HMM's Using Linear Regression", Tech. Rep. CUED/F-INFENG/TR181, CUED, Jun. 199
PatentsPlus Images
Enhanced PDF formats
loading...
PatentsPlus: add to cart
PatentsPlus: add to cartSearch-enhanced full patent PDF image
$9.95more info
PatentsPlus: add to cart
PatentsPlus: add to cartIntelligent turbocharged patent PDFs with marked up images
$18.95more info
 
Sign InRegister
Username  
Password   
forgot password?