U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.


Class 704/256.1 - Hidden Markov Model (HMM) (EPO)


Subclass of Class 704 - Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression
Definition: Subject matter wherein a Markov chain is used in the recognition
No. of patents: 36
Last issue date: 04/17/2012


Number  Title  Issue Date
8160878  Piecewise-based variable-parameter Hidden Markov Models and the training thereof
A speech recognition system uses Gaussian mixture variable-parameter hidden Markov models (VPHMMs) to recognize speech under many different conditions. Each Gaussian mixture component of the VPHMMs is characterized by a mean parameter μ and a variance parameter Σ....
04/17/2012
8041567  Method of speaker adaptation for a hidden markov model based voice recognition system
Commercially available voice recognition systems are generally speaker-dependent, with the voice recognition system first being trained to the voice of the speaker before it can be used. A disadvantage with this method is that modified reference data has to be buffe...
10/18/2011
7873518  Device and method for assessing a quality class of an object to be tested
A device for assessing a quality class of an object to be tested includes a unit for detecting a test signal from the object to be tested. Furthermore, the device for assessing includes a unit for providing a stochastic Markov model including states and transitions ...
01/18/2011
7684988  Testing and tuning of automatic speech recognition systems using synthetic inputs generated from its acoustic models
A system and method of testing and tuning a speech recognition system by providing pronunciations to the speech recognizer. First a text document is provided to the system and converted into a sequence of phonemes representative of the words in the text. The phoneme...
03/23/2010
7650282  Word spotting score normalization
An approach to scoring acoustically-based events, such as hypothesized instances of keywords, in a speech processing system makes use of scores of individual components of the event. Data characterizing an instance of an event are first accepted. This data includes a...
01/19/2010
7472063  Audio-visual feature fusion and support vector machine useful for continuous speech recognition
A speech recognition method includes several embodiments describing application of support vector machine analysis to a mouth region. Lip position can be accurately determined and used in conjunction with synchronous or asynchronous audio data to enhance speech reco...
12/30/2008
7437288  Speech recognition apparatus
A speech recognition apparatus using a probability model that employs a mixed distribution, the apparatus formed by a standard pattern storage means for storing a standard pattern; a recognition means for outputting recognition results corresponding to an input spee...
10/14/2008
7437289  Methods and apparatus for the systematic adaptation of classification systems from sparse adaptation data
Methods and apparatus for the rapid adaptation of classification systems using small amounts of adaptation data. Improvements in classification accuracy are attainable when conditions similar to those present in adaptation are observed. The attendant methods an...
10/14/2008
7424427  Systems and methods for classifying audio into broad phoneme classes
An audio classification system classifies sounds in an audio stream as belonging to one of a relatively small number of classes. The audio classification system includes a signal analysis component [301] and a decoder [302]. The decoder [302] in...
09/09/2008
7353173  System and method for Mandarin Chinese speech recognition using an optimized phone set
The present invention comprises a system and method for implementing a Mandarin Chinese speech recognizer with an optimized phone set, and may include a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary that is implemen...
04/01/2008
7353172  System and method for cantonese speech recognition using an optimized phone set
The present invention comprises a system and method for implementing a Cantonese speech recognizer with an optimized phone set, and may include a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary that is implemented acc...
04/01/2008
7353174  System and method for effectively implementing a Mandarin Chinese speech recognition dictionary
The present invention comprises a system and method for effectively implementing a Mandarin Chinese speech recognition dictionary, and may include a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary that is implemented ...
04/01/2008
7319960  Speech recognition method and system
A speech recognition system uses a phoneme counter to determine the length of a word to be recognized. The result is used to split a lexicon into one or more sub-lexicons containing only words which have the same or similar length to that of the word to be recognize...
01/15/2008
7313269  Unsupervised learning of video structures in videos using hierarchical statistical models to detect events
A method learns a structure of a video, in an unsupervised setting, to detect events in the video consistent with the structure. Sets of features are selected from the video. Based on the selected features, a hierarchical statistical model is updated, and an informa...
12/25/2007
7308443  Techniques for video retrieval based on HMM similarity
A query is received. The query may be an object containing temporal information. A query model including static and temporal components is then determined for the object. A weighting for static and temporal components is also determined. The query model is then comp...
12/11/2007
7308030  Object activity modeling method
An object activity modeling method which can efficiently model complex objects such as a human body is provided. The object activity modeling method includes the steps of (a) obtaining an optical flow vector from a video sequence; (b) obtaining the probability distr...
12/11/2007
7292974  Method for recognizing speech with noise-dependent variance normalization
As the application of a variance normalization (VN) to a speech signal (S) may be advantageous as well as disadvantageous with respect to the recognition rate in a speech recognizing process in dependence of the degree of the signal disturbance it is suggested to ca...
11/06/2007
7283959  Compact easily parseable binary format for a context-free grammar
A computer-loadable data structure is provided that represents a state-and-transition-based description of a speech grammar. The data structure includes first and second transition entries that both represent transitions from a first state. The second transition ent...
10/16/2007
7277851  Automated creation of phonemic variations
A method of generating a phonemic transcription for a word using a computer system is described. In one embodiment, an existing pronunciation generation program is applied to generate an initial transcription. The initial transcription can then be evaluated to ident...
10/02/2007
7254538  Nonlinear mapping for feature extraction in automatic speech recognition
The present invention successfully combines neural-net discriminative feature processing with Gaussian-mixture distribution modeling (GMM). By training one or more neural networks to generate subword probability posteriors, then using transformations of these estima...
08/07/2007
7236931  Systems and methods for automatic acoustic speaker adaptation in computer-assisted transcription systems
The invention is a system and method for automatic acoustic speaker adaptation in an automatic speech recognition assisted transcription system. Partial transcripts of audio files are generated by a transcriptionist. A topic language model is generated from the part...
06/26/2007
7231019  Automatic identification of telephone callers based on voice characteristics
A method and apparatus are provided for identifying a caller of a call from the caller to a recipient. A voice input is received from the caller, and characteristics of the voice input are applied to a plurality of acoustic models, which include a generic acoustic m...
06/12/2007
7219065  Emphasis of short-duration transient speech features
A sound processor including a microphone (1), a pre-amplifier (2), a bank of N parallel filters (3), means for detecting short-duration transitions in the envelope signal of each filter channel, and means for applying gain to the outputs of thes...
05/15/2007
7209881  Preparing acoustic models by sufficient statistics and noise-superimposed speech data
Noise-superimposed speech data is grouped according to acoustic similarity, and sufficient statistics are prepared using the speech data in each of the groups. A group acoustically similar to voice data of a user of the speech recognition is selected, and sufficient...
04/24/2007
7203368  Embedded bayesian network for pattern recognition
A pattern recognition procedure forms a hierarchical statistical model using a hidden Markov model and a coupled hidden Markov model. The hierarchical statistical model supports a parent layer having multiple supernodes and a child layer having multiple nodes associa...
04/10/2007
7191130  Method and system for automatically optimizing recognition configuration parameters for speech recognition systems
The present invention introduces a system and method for automatically optimizing recognition configuration parameters for speech recognition systems. In one embodiment, a method comprises receiving an utterance at a speech recognizer, wherein the speech recognizer ...
03/13/2007
7181399  Recognizing the numeric language in natural spoken dialogue
A system for recognizing connected digits in natural spoken dialogue includes a speech recognition processor that receives unconstrained fluent input speech and produces a string of words that can include a numeric language, and a numeric understanding processor tha...
02/20/2007
7171043  Image recognition using hidden markov models and coupled hidden markov models
An image processing system useful for facial recognition and security identification obtains an array of observation vectors from a facial image to be identified. A Viterbi algorithm is applied to the observation vectors given the parameters of a hierarchical statis...
01/30/2007
7165029  Coupled hidden Markov model for audiovisual speech recognition
A speech recognition method includes use of synchronous or asynchronous audio and a video data to enhance speech recognition probabilities. A two stream coupled hidden Markov model is trained and used to identify speech. At least one stream is derived from audio dat...
01/16/2007
7162641  Weight based background discriminant functions in authentication systems
Methods and apparatus for providing speech-based authentication, including the determination of a target discriminant based on an identity claim and on at least one target voiceprint model relating to a target speaker, of a background discriminant based on the ident...
01/09/2007
7076102  Video monitoring system employing hierarchical hidden markov model (HMM) event learning and classification
A method and apparatus are disclosed for automatically learning and identifying events in image data using hierarchical HMMs to define and detect one or more events. The hierarchical HMMs include multiple paths that encompass variations of the same event. Hierarchic...
07/11/2006
7024350  Compact easily parseable binary format for a context-free grammer
A computer-loadable data structure is provided that represents a state-and-transition-based description of a speech grammar. The data structure includes first and second transition entries that both represent transitions from a first state. The second transition ent...
04/04/2006
6961703  Method for speech processing involving whole-utterance modeling
A speech verification process involves comparison of enrollment and test speech data and an improved method of comparing the data is disclosed, wherein segmented frames of speech are analyzed jointly, rather than independently. The enrollment and test speech are bot...
11/01/2005
6246985  Method and apparatus for automatic segregation and routing of signals of different origins by using prototypes
A method and apparatus is disclosed for automatic segregation of signals of different origin, using models that statistically characterize a wave signal, more particularly including feature vectors consisting of a plurality of parameters extracted from a ...
06/12/2001
5212730  Voice recognition of proper names using text-derived recognition models
A name recognition system (FIG. 1) used to provide access to a database based on the voice recognition of a proper name spoken by a person who may not know the correct pronunciation of the name. During an enrollment phase (10), for each name-text entered ...
05/18/1993
4748670  Apparatus and method for determining a likely word sequence from labels generated by an acoustic processor
Continuous speech recognition is improved by use of a known vocabulary and context probabilities. First, the unknown utterance is analyzed as a sequence of phonemes, then each phoneme labelled to form a string of labels. The shortest label interval which ...
05/31/1988