U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.


Class 704/256.2 - Training of HMM (EPO)


Subclass of Class 704 - Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression
Definition: Subject matter wherein the models include a learning
No. of patents: 51
Last issue date: 06/28/2011


Number  Title  Issue Date
7970614  Continuous adaptation in detection systems via self-tuning from target population subsets
The present invention provides a system and method for treating distortion propagated through a detection system. The system includes a compensation module that compensates for untreated distortions propagating through the detection compensation system, a user model ...
06/28/2011
7818172  Voice recognition method and system based on the contexual modeling of voice units
The method of recognizing speech in an acoustic signal comprises developing acoustic stochastic models of voice units in the form of a set of states of an acoustic signal and using the acoustic models for recognition by a comparison of the signal with predetermined ...
10/19/2010
7805301  Covariance estimation for pattern recognition
A reliable full covariance matrix estimation algorithm for pattern unit's state output distribution in pattern recognition system is discussed. An intermediate hierarchical tree structure is built to relate models for product units. Full covariance matrices of patte...
09/28/2010
7689419  Updating hidden conditional random field model parameters after processing individual training samples
A method and apparatus are provided for training parameters in a hidden conditional random field model for use in speech recognition and phonetic classification. The hidden conditional random field model uses parameterized features that are determined from a segment...
03/30/2010
7680664  Parsimonious modeling by non-uniform kernel allocation
A multi-state pattern recognition model with non-uniform kernel allocation is formed by setting a number of states for a multi-state pattern recognition model and assigning different numbers of kernels to different states. The kernels are then trained using training...
03/16/2010
7672847  Discriminative training of hidden Markov models for continuous speech recognition
Methods are given for improving discriminative training of hidden Markov models for continuous speech recognition. For a mixture component of a hidden Markov model state, a gradient adjustment is calculated of the standard deviation of the mixture component. If the ...
03/02/2010
7660717  Speech recognition system and program thereof
Speech recognition is performed by matching between a characteristic quantity of an inputted speech and a composite HMM obtained by synthesizing a speech HMM (hidden Markov model) and a noise HMM for each speech frame of the inputted speech by use of the composite H...
02/09/2010
7603276  Standard-model generation for speech recognition using a reference model
A standard model creating apparatus which provides a high-precision standard model used for pattern recognition such as speech recognition, character recognition, or image recognition using a probability model based on a hidden Markov model, Bayesian theory, or line...
10/13/2009
7509259  Method of refining statistical pattern recognition models and statistical pattern recognizers
A device (800) performs statistical pattern recognition using model parameters that are refined by optimizing an objective function that includes a term for many items of training data for which recognition errors occur wherein each term depends on a relative...
03/24/2009
7472064  Method and system to scale down a decision tree-based hidden markov model (HMM) for speech recognition
A method and system are provided in which a decision tree-based model (“general model”) is scaled down (“trim-down”) for a given task. The trim-down model can be adapted for the given task using task specific data. The general model can be based on a hidden ...
12/30/2008
7464033  Decoding multiple HMM sets using a single sentence grammar
For a given sentence grammar, speech recognizers are often required to decode M sets of HMMs each of which models a specific acoustic environment. In order to match input acoustic observations to each of the environments, typically recognition search methods require...
12/09/2008
7437288  Speech recognition apparatus
A speech recognition apparatus using a probability model that employs a mixed distribution, the apparatus formed by a standard pattern storage means for storing a standard pattern; a recognition means for outputting recognition results corresponding to an input spee...
10/14/2008
7424427  Systems and methods for classifying audio into broad phoneme classes
An audio classification system classifies sounds in an audio stream as belonging to one of a relatively small number of classes. The audio classification system includes a signal analysis component [301] and a decoder [302]. The decoder [302] in...
09/09/2008
7403896  Speech recognition system and program thereof
Speech recognition is performed by matching between a characteristic quantity of an inputted speech and a composite HMM obtained by synthesizing a speech HMM (hidden Markov model) and a noise HMM for each speech frame of the inputted speech by use of the composite H...
07/22/2008
7353173  System and method for Mandarin Chinese speech recognition using an optimized phone set
The present invention comprises a system and method for implementing a Mandarin Chinese speech recognizer with an optimized phone set, and may include a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary that is implemen...
04/01/2008
7353174  System and method for effectively implementing a Mandarin Chinese speech recognition dictionary
The present invention comprises a system and method for effectively implementing a Mandarin Chinese speech recognition dictionary, and may include a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary that is implemented ...
04/01/2008
7353172  System and method for cantonese speech recognition using an optimized phone set
The present invention comprises a system and method for implementing a Cantonese speech recognizer with an optimized phone set, and may include a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary that is implemented acc...
04/01/2008
7346507  Method and apparatus for training an automated speech recognition-based system
A method and apparatus for building a training set for an automated speech recognition-based system, which determines the statistically optimal number of frequently requested responses to automate in order to achieve a desired automation rate. The invention may be u...
03/18/2008
7319958  Polyphone network method and apparatus
Acoustic phones (preferably drawn 12 from a plurality of spoken languages) are provided 11. A hierarchically-organized polyphone network (20) organizes views of these phones of varying resolution and phone categorization as a function, at least ...
01/15/2008
7313269  Unsupervised learning of video structures in videos using hierarchical statistical models to detect events
A method learns a structure of a video, in an unsupervised setting, to detect events in the video consistent with the structure. Sets of features are selected from the video. Based on the selected features, a hierarchical statistical model is updated, and an informa...
12/25/2007
7310599  Removing noise from feature vectors
A method and computer-readable medium are provided for identifying clean signal feature vectors from noisy signal feature vectors. Aspects of the invention use mixtures of distributions of noise feature vectors and/or channel distortion feature vectors when identify...
12/18/2007
7308030  Object activity modeling method
An object activity modeling method which can efficiently model complex objects such as a human body is provided. The object activity modeling method includes the steps of (a) obtaining an optical flow vector from a video sequence; (b) obtaining the probability distr...
12/11/2007
7305341  Method of reflecting time/language distortion in objective speech quality assessment
Disclosed is an objective speech quality assessment technique that reflects the impact of distortions which can dominate overall speech quality assessment by modeling the impact of such distortions on subjective speech quality assessment, thereby, accounting for lan...
12/04/2007
7295979  Language context dependent data labeling
Bootstrapping of a system from one language to another often works well when the two languages share the similar acoustic space. However, when the new language has sounds that do not occur in the language from which the bootstrapping is to be done, bootstrapping doe...
11/13/2007
7292979  Time ordered indexing of audio data
Methods and apparatuses in which attributes including one or more types of accents and one or more types of human languages from an audio information stream are identified. Each identified attribute from the audio information stream is encoded into a time ordered in...
11/06/2007
7289958  Automatic language independent triphone training using a phonetic table
A method for training acoustic models for a new target language is provided using a phonetic table, which characterizes the phones, used in one or more reference language(s) with respect to their articulatory properties; a phonetic table, which characterizes the pho...
10/30/2007
7286989  Speech-processing system and method
A speech processing system has an arbitrary number of speech recognition modules (Ei, i=1 . . . n) and speech output modules (Aj, j=1 . . . m). The modules provided respectively for a particular type of speech recognition or, respectively, speech output are select...
10/23/2007
7269558  Decoding multiple HMM sets using a single sentence grammar
For a given sentence grammar, speech recognizers are often required to decode M sets of HMMs each of which models a specific acoustic environment. In order to match input acoustic observations to each of the environments, typically recognition search methods require...
09/11/2007
7269555  Unsupervised incremental adaptation using maximum likelihood spectral transformation
In a speech recognition system, a method of transforming speech feature vectors associated with speech data provided to the speech recognition system includes the steps of receiving likelihood of utterance information corresponding to a previous feature vector trans...
09/11/2007
7266494  Method and apparatus for identifying noise environments from noisy signals
A method and apparatus are provided for identifying a noise environment for a frame of an input signal based on at least one feature for that frame. To identify the noise environment, a probability for a noise environment is determined by applying the noisy input fe...
09/04/2007
7260535  Web server controls for web enabled recognition and/or audible prompting for call controls
Web server controls are provided for generating client side markups with recognition and/or audible prompting to enable telephone call controls such as making, transferring and disconnecting telephone calls. ...
08/21/2007
7249198  Method and system for device bootstrapping via server synchronization
A method and system for restoring basic functionality to a portable computer system via a server accessed remotely by telephone. A user of a portable computer system which has lost data and software which was held in volatile memory may connect to a server to restor...
07/24/2007
7231019  Automatic identification of telephone callers based on voice characteristics
A method and apparatus are provided for identifying a caller of a call from the caller to a recipient. A voice input is received from the caller, and characteristics of the voice input are applied to a plurality of acoustic models, which include a generic acoustic m...
06/12/2007
7231349  Method and apparatus for compressing asymmetric clustering language models
A method and data structure are provided for efficiently storing asymmetric clustering models. The models are stored by storing a first level record for a word identifier and two second level records, one for a word identifier and one for a cluster identifier. An in...
06/12/2007
7228277  Mobile communications terminal, voice recognition method for same, and record medium storing program for voice recognition
A voice input section receives voice of the user designating a name etc. and outputs a voice signal to a speech recognition section. The speech recognition section analyzes and recognizes the voice signal and thereby obtains voice data. The voice data is compared wi...
06/05/2007
7225125  Speech recognition system trained with regional speech characteristics
A speech recognition system uses speech recognition models which are specifically trained and optimized for users residing in a particular geographic area or region. The speech models are trained with samples of word variants expected to be used in a natural languag...
05/29/2007
7219055  Speech recognition apparatus and method adapting best transformation function to transform one of the input speech and acoustic model
The present invention relates to a speech recognition apparatus for recognizing speeches of a plurality of users with high accuracy. An adapting unit 12 detects a best transformation function for adapting an input speech to an acoustic model from at least one...
05/15/2007
7209883  Factorial hidden markov model for audiovisual speech recognition
A speech recognition method includes use of synchronous or asynchronous audio and a video data to enhance speech recognition probabilities. A two stream factorial hidden Markov model is trained and used to identify speech. At least one stream is derived from audio d...
04/24/2007
7209881  Preparing acoustic models by sufficient statistics and noise-superimposed speech data
Noise-superimposed speech data is grouped according to acoustic similarity, and sufficient statistics are prepared using the speech data in each of the groups. A group acoustically similar to voice data of a user of the speech recognition is selected, and sufficient...
04/24/2007
7192283  System and method for visual analysis of word frequency and distribution in a text
A main computer processing system accesses a text, counts the number of times each word appears, and arranges the words on the display in a way that makes understanding the text easier. On the display, the user can see which words are used most frequently, and the p...
03/20/2007