U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Icon_funbox Did You Know...

...that after Walter Hunt patented the safety pin in 1849, he sold the rights to it for $400?

Newsletter  PatentStorm News

Make the Most of Our Site

See this month's Top Inventors and Most Cited Patents.

Stay on top of the latest innovations by subscribing to an RSS feed.

Registered users: Manage your profile.

 

Class 704/266 - Specialized model


Subclass of Class 704 - Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression
Definition: Subject matter wherein the component parts are combined
No. of patents: 175
Last issue date: 04/05/2011


1          
NumberTitleIssue Date
7921016Method and device for providing 3D audio work
A method for providing a 3D audio work includes providing a one-ear HRTF filter and a related function synthesizer storing a related function therein, and inputting sound signals into the one-ear HRTF filter. The sound signals are converted into one-ear output sound...
04/05/2011
7890330Voice recording tool for creating database used in text to speech synthesis system
A method records verbal expressions of a person for use in a vehicle navigation system. The vehicle navigation system has a database including a map and text describing street names and points of interest of the map. The method includes the steps of obtaining from t...
02/15/2011
7792673Method of generating a prosodic model for adjusting speech style and apparatus and method of synthesizing conversational speech using the same
An apparatus and method for adjusting the friendliness of a synthesized speech and thus generating synthesized speech of various styles in a speech synthesis system are provided. The method includes the steps of defining at least two friendliness levels; storing rec...
09/07/2010
7630898System and method for preparing a pronunciation dictionary for a text-to-speech voice
Disclosed are various elements of a toolkit used for generating a TTS voice for use in a spoken dialog system. The embodiments in each case may be in the form of the system, a computer-readable medium or a method for generating the TTS voice. One embodiment of the i...
12/08/2009
7519535Frame erasure concealment in voice communications
A voice decoder configured to receive a sequence of frames, each of the frames having voice parameters. The voice decoder includes a speech generator that generates speech from the voice parameters. A frame erasure concealment module is configured to reconstruct the...
04/14/2009
7487093Text structure for voice synthesis, voice synthesis method, voice synthesis apparatus, and computer program thereof
In a voice synthesis apparatus, by bounding a desired range of input text to be output by, e.g., a start tag “” and end tag , a feature of synthetic voice is continuously changed while gra...
02/03/2009
7472066Automatic speech segmentation and verification using segment confidence measures
An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit segmentor segments the recorded speech corpus into N test speech unit s...
12/30/2008
7464034Voice converter for assimilation by frame synthesis with temporal alignment
A voice converting apparatus is constructed for converting an input voice into an output voice according to a target voice. The apparatus includes a storage section, an analyzing section including a characteristic analyzer, a producing section, a synthesizing sectio...
12/09/2008
7415118System and method for distributed gain control
In accordance with an embodiment, the invention provides a spectral enhancement system that includes a plurality of distributed filters, a plurality of energy distribution units, and a weighted-averaging unit. At least one of the distributed filters receives a multi...
08/19/2008
7406417Method for conditioning a database for automatic speech processing
A neural network can be trained for synthesizing or recognizing speech with the aid of a database produced by automatically matching graphemes and phonemes. First, graphemes and phonemes are matched for words which have the same number of graphemes and phonemes. Nex...
07/29/2008
7400651Device and method for interpolating frequency components of signal
A frequency interpolation apparatus is provided which reproduces a signal similar to an original signal by approximately recovering suppressed frequency components, from an input signal having the suppressed frequency components in a specific frequency band of the o...
07/15/2008
7369994Methods and apparatus for rapid acoustic unit selection from a large speech corpus
A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen to minimize a combination of target and concatenation costs for a giv...
05/06/2008
7366500SMS shorthand dictionary service
The present invention provides a lookup service for shorthand terms directly from within an application. A lookup pane is provided to the user from which they can lookup a definition for the shorthand term. The lookup pane provides a consistent user interface for lo...
04/29/2008
7365260Apparatus and method for reproducing voice in synchronism with music piece
Music piece sequence data are composed of a plurality of event data which include performance event data and user event data designed for linking a voice to progression of a music piece. A plurality of voice data files are stored in a memory separately from the musi...
04/29/2008
7353177System and method of providing conversational visual prosody for talking heads
A system and method of controlling the movement of a virtual agent while the agent is listening to a human user during a conversation is disclosed. The method comprises receiving speech data from the user, performing a prosodic analysis of the speech data and contro...
04/01/2008
7349846Information processing apparatus, method, program, and storage medium for inputting a pronunciation symbol
An information processing apparatus for inputting a pronunciation symbol corresponding to an English notation includes pronunciation symbol information holding means for holding pronunciation symbol information indicating a relationship between a predetermined alpha...
03/25/2008
7349852System and method of providing conversational visual prosody for talking heads
A system and method of controlling the movement of a virtual agent while the agent is listening to a human user during a conversation is disclosed. The method comprises receiving speech data from the user, performing a prosodic analysis of the speech data and contro...
03/25/2008
7346507Method and apparatus for training an automated speech recognition-based system
A method and apparatus for building a training set for an automated speech recognition-based system, which determines the statistically optimal number of frequently requested responses to automate in order to achieve a desired automation rate. The invention may be u...
03/18/2008
7328159Interactive speech recognition apparatus and method with conditioned voice prompts
An improved system for an interactive voice recognition system (400) includes a voice prompt generator (401) for generating voice prompt in a first frequency band (501). A speech detector (406) detects presence of speech energy in a secon...
02/05/2008
7328157Domain adaptation for TTS systems
Embodiments of the present invention pertain to adaptation of a corpus-driven general-purpose TTS system to at least one specific domain. The domain adaptation is realized by adding a limited amount of domain-specific speech that provides a maximum impact on improve...
02/05/2008
7319756Audio coding
Coding of an audio signal (x) is provided where the coded bitstream (AS) comprises a parametric representation of the audio signal. One component of the parametric representation comprises tracks of linked sinusoidal components (CS) where subsequently linked compone...
01/15/2008
7313523Method and apparatus for assigning word prominence to new or previous information in speech synthesis
A method and apparatus is provided for generating speech that sounds more natural. In one embodiment, word prominence and latent semantic analysis are used to generate more natural sounding speech. A method for generating speech that sounds more natural may comprise...
12/25/2007
7310599Removing noise from feature vectors
A method and computer-readable medium are provided for identifying clean signal feature vectors from noisy signal feature vectors. Aspects of the invention use mixtures of distributions of noise feature vectors and/or channel distortion feature vectors when identify...
12/18/2007
7308408Providing services for an information processing system using an audio interface
A method and system for providing efficient menu services for an information processing system that uses a telephone or other form of audio user interface. In one embodiment, the menu services provide effective support for novice users by providing a full listing of...
12/11/2007
7308407Method and system for generating natural sounding concatenative synthetic speech
A method for generating synthetic speech can include identifying a recording of conversational speech and creating a transcription of the conversational speech. Using the transcription, rather than a predefined script, the recording can be analyzed and acoustic unit...
12/11/2007
7292980Graphical user interface and method for modifying pronunciations in text-to-speech and speech recognition systems
A method and user interface which allow users to make decisions about how to pronounce words and parts of words based on audio cues and common words with well known pronunciations. Users input or select words for which they want to set or modify pronunciations. To s...
11/06/2007
7280968Synthetically generated speech responses including prosodic characteristics of speech inputs
A method for digitally generating speech with improved prosodic characteristics can include receiving a speech input, determining at least one prosodic characteristic contained within the speech input, and generating a speech output including the prosodic characteri...
10/09/2007
7277856System and method for speech synthesis using a smoothing filter
A speech synthesis system for controlling a discontinuous distortion that occurs at the transition portion between concatenated phonemes which are speech units of a synthesized speech using a smoothing technique, comprising: a discontinuous distortion processing mea...
10/02/2007
7276655Music synthesis system
The invention relates to a music synthesis system for synthesizing a corresponding digital music output according to commands from a music data file. The music data file comprises a plurality of music data units. Each music data unit records related information of t...
10/02/2007
7275034Word-specific acoustic models in a speech recognition system
A speech recognizer has an acoustic model that includes word-specific models, that are specific to candidate words. The candidate words would otherwise be mapped to a series of general phones. A decoder transcribes input speech into words formed by shared phones, wo...
09/25/2007
7271330Rendition style determination apparatus and computer program therefor
There is provided a rendition style selector operable by a user to select a desired rendition style from among a plurality of rendition styles associated with a plurality of portions of tones. In response to selecting operation via the selector, a rendition style is...
09/18/2007
7266497Automatic segmentation in speech synthesis
Systems and methods for automatically segmenting speech inventories. A set of Hidden Markov Models (HMMs) are initialized using bootstrap data. The HMMs are next re-estimated and aligned to produce phone labels. The phone boundaries of the phone labels are then corr...
09/04/2007
7266495Method and system for learning linguistically valid word pronunciations from acoustic data
A computerized pronunciation system is provided for generating pronunciations for words and storing the pronunciations in a pronunciation dictionary. The system includes a word list including at least one word; transcribed acoustic data including at least one wavefo...
09/04/2007
7263488Method and apparatus for identifying prosodic word boundaries
A method and computer-readable medium are provided that identify prosodic word boundaries for a text. If the text is unsegmented, it is first segmented into lexical words. The lexical words are then converted into prosodic words using an annotated lexicon to divide ...
08/28/2007
7260523Sub-band speech coding system
An improved sub-band speech coding system is provided by subdividing signals into a lower an higher subband, downsampling the lower subband before coding and coding the higher subband without downsampling. The decoder includes decoding and upsampling of the lower su...
08/21/2007
7236901Digital broadband frequency measurement
The present invention relates to a device and method that digitally replicates the analog processing that is normally associated with an instantaneous frequency measurement device. Specifically, the present relates to a digital frequency measurement device comprisin...
06/26/2007
7233901Synthesis-based pre-selection of suitable units for concatenative speech
A system and computer-readable medium synthesize speech from text using a triphone unit selection database. The instructions on the computer-readable medium control a computing device to perform the steps: receiving input text, selecting a plurality of N phoneme uni...
06/19/2007
7224721System for direct acquisition of received signals
Signal processing architectures for direct acquisition of spread spectrum signals using long codes. Techniques are described for achieving a high of parallelism, employing code matched filter banks and other hardware sharing. In one embodiment, upper and lower sideb...
05/29/2007
7219061Method for detecting the time sequences of a fundamental frequency of an audio response unit to be synthesized
Predetermined macrosegments of the fundamental frequency are determined by a neural network, and these predefined macrosegments are reproduced by fundamental-frequency sequences stored in a database. The fundamental frequency is generated on the basis of a relativel...
05/15/2007
7191131Electronic document processing apparatus
On receipt of a tagged file, as a tagged document, at step S1, a document processing apparatus at step S2 derives the attribute information for read-out from tags of the tagged file and embeds the attribute information to generate a speech read-out fil...
03/13/2007
1          
 
Sign InRegister
Username  
Password   
forgot password?