Class 704/276 - Pattern display


Subclass of Class 704 - Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression
Definition: Subject matter for providing visual output representing speech (e.g., computer displays of speech data).
No. of patents: 264
Last issue date: 12/28/2010


Number  Title  Issue Date
7860717 System and method for customizing speech recognition input and output
A system and method may be disclosed for facilitating the site-specific customization of automated speech recognition systems by providing a customization client for site-specific individuals to update and modify language model input files and post processor input f...
12/28/2010
7860718 Apparatus and method for speech segment detection and system for speech recognition
Provided are an apparatus and method for speech segment detection, and a system for speech recognition. The apparatus is equipped with a sound receiver and an image receiver and includes: a lip motion signal detector for detecting a motion region from image frames o...
12/28/2010
RE42000 System for synchronization between moving picture and a text-to-speech converter
A method of formatting and normalizing continuous lip motions to events in a moving picture besides text in a Text-To-Speech converter is provided. A synthesized speech is synchronized with a moving picture by using the method wherein the real speech data and the sh...
12/14/2010
7788104 Information processing terminal for notification of emotion
The present invention is to provide an information processing terminal which can use another expression means to indicate undesirable emotions directly transmitted to a party by a method of directly expressing talking person's emotions in real time, so that the whol...
08/31/2010
7729921 Apparatus, method, and program for supporting speech interface design
For design of a speech interface accepting speech control options, speech samples are stored on a computer-readable medium. A similarity calculating unit calculates a certain indication of similarity of first and second sets of ones of the speech samples, the first ...
06/01/2010
7676373 Displaying text of speech in synchronization with the speech
Displays a character string representing content of speech in synchronization with reproduction of the speech. An apparatus includes: a unit for obtaining scenario data representing the speech; a unit for dividing textual data resulting from recognition of the speec...
03/09/2010
7643999 Microphone feedback and control
A system and method for positioning a software User Interface (UI) window on a display screen is provided, wherein the method includes displaying the software UI window on the display screen and identifying at least one suitable location on the display screen respon...
01/05/2010
7567908 Differential dynamic content delivery with text display in dependence upon simultaneous speech
Differential dynamic content delivery including providing a session document for a presentation, wherein the session document includes a session grammar and a session structured document; selecting from the session structured document a classified structural element...
07/28/2009
7505911 Combined speech recognition and sound recording
A handheld device with both large-vocabulary speech recognition and audio recording allows users to switch between at least two of the following three modes: (1) recording audio without corresponding speech recognition; (2) recording with speech recognition; and (3) ...
03/17/2009
7457756 Method of generating time-frequency signal representation preserving phase information
A method of generating a time-frequency representation of a signal that preserves phase information by receiving the signal, calculating a joint time-frequency domain of the signal, estimating instantaneous frequencies of the joint time-frequency domain, modifying e...
11/25/2008
7406409 System and method for recording and reproducing multimedia based on an audio signal
A system and method summarizes multimedia stored in a compressed multimedia file partitioned into a sequence of segments, where the content of the multimedia is, for example, video signals, audio signals, text, and binary data. An associated metadata file includes i...
07/29/2008
7386437 System for providing translated information to a driver of a vehicle
A vehicle mounted translation system for providing language translation to a driver of a vehicle. The translation system may be associated with a vehicle navigation system. The translation system includes a translation device and a storage unit for storing language ...
06/10/2008
7370086 Web-based speech recognition with scripting and semantic objects
The present invention is a system and method for creating and implementing transactional speech applications (SAs) using Web technologies, without reliance on server-side standard or custom services. A transactional speech application may be any application that req...
05/06/2008
7365260 Apparatus and method for reproducing voice in synchronism with music piece
Music piece sequence data are composed of a plurality of event data which include performance event data and user event data designed for linking a voice to progression of a music piece. A plurality of voice data files are stored in a memory separately from the musi...
04/29/2008
7366766 Web-based speech recognition with scripting and semantic objects
The present invention is a system and method for creating and implementing transactional speech applications (SAs) using Web technologies, without reliance on server-side standard or custom services. A transactional speech application may be any application that req...
04/29/2008
7366670 Method and system for aligning natural and synthetic video to speech synthesis
Facial animation in MPEG-4 can be driven by a text stream and a Facial Animation Parameters (FAP) stream. Text input is sent to a TTS converter that drives the mouth shapes of the face. FAPs are sent from an encoder to the face over the communication channel. Disclo...
04/29/2008
7366671 Speech displaying system and method
A speech displaying system and method can display playing progress by waveform and synchronously display text of a speech file using rolling subtitles when playing the speech file. After the speech file is loaded via a loading module, a sentence unit determining mod...
04/29/2008
7356470 Text-to-speech and image generation of multimedia attachments to e-mail
A multi-mail system and method is disclosed in which a sender may convey and a recipient can realize emotional aspects associated with substantive content of a multi-mail message by receiving a message that is more than textual in nature. Voice recognition technolog...
04/08/2008
7353177 System and method of providing conversational visual prosody for talking heads
A system and method of controlling the movement of a virtual agent while the agent is listening to a human user during a conversation is disclosed. The method comprises receiving speech data from the user, performing a prosodic analysis of the speech data and contro...
04/01/2008
7349851 Speech recognition user interface
A speech recognition system having a user interface that provides both visual and auditory feedback to a user is described. In one aspect, a response time in which to receive an audible utterance is initiated. A graphic representing the response time is displayed. A...
03/25/2008
7349852 System and method of providing conversational visual prosody for talking heads
A system and method of controlling the movement of a virtual agent while the agent is listening to a human user during a conversation is disclosed. The method comprises receiving speech data from the user, performing a prosodic analysis of the speech data and contro...
03/25/2008
7343289 System and method for audio/video speaker detection
A system and method for detecting speech utilizing audio and video inputs. In one aspect, the invention collects audio data generated from a microphone device. In another aspect, the invention collects video data and processes the data to determine a mouth location ...
03/11/2008
7340397 Speech recognition optimization tool
A method of optimizing audio input for speech recognition applications can include identifying a source waveform and at least one optimization parameter, wherein the optimization parameter is configured to adjust audio input to a speech recognition application. The ...
03/04/2008
7334183 Domain-specific concatenative audio
One embodiment of the present invention provides a system for generating speech output from a text string. During operation, the system first receives the text string and then examines the text string to locate one or more substrings within the text string that are ...
02/19/2008
7333865 Aligning data streams
The invention aligns two wide-bandwidth, high resolution data streams, in a manner that retains the full bandwidth of the data streams, by using magnitude-only spectrograms as inputs into the cross-correlation and sampling the cross-correlation at a coarse sampling ...
02/19/2008
7324927 Fast feature selection method and system for maximum entropy modeling
A method to select features for maximum entropy modeling in which the gains for all candidate features are determined during an initialization stage and gains for only top-ranked features are determined during each feature selection stage. The candidate features are...
01/29/2008
7324947 Global speech user interface
A global speech user interface (GSUI) comprises an input system to receive a user's spoken command, a feedback system along with a set of feedback overlays to give the user information on the progress of his spoken requests, a set of visual cues on the television sc...
01/29/2008
7321853 Speech recognition apparatus and speech recognition method
The present invention relates to a speech recognition apparatus and a speech recognition method for speech recognition with improved accuracy. A distance calculator 47 determines the distance from a microphone 21 to a user uttering. Data indicating the...
01/22/2008
7321854 Prosody based audio/visual co-analysis for co-verbal gesture recognition
The present method incorporates audio and visual cues from human gesticulation for automatic recognition. The methodology articulates a framework for co-analyzing gestures and prosodic elements of a person's speech. The methodology can be applied to a wide range of ...
01/22/2008
7315820 Text-derived speech animation tool
A text-derived speech animation tool for producing simple, effective animations of digital media content that educate, entertain, and inform viewers by the presentation of speaking digital characters. The invention makes the creation of digital talking characters bo...
01/01/2008
7310602 Navigation apparatus
In this navigation apparatus, when speech recognition of inputted speech is carried out, keywords included in the content of the recognized speech are searched from a dictionary DB, and then these words are displayed as keywords of a POI search. When a correction of...
12/18/2007
7302280 Mobile phone operation based upon context sensing
A mobile device is provided that includes at least one sensor that provides contextual information to the device. When the mobile device receives an incoming message, or notification, the device responds thereto based at least in part upon the contextual information...
11/27/2007
7299188 Method and apparatus for providing an interactive language tutor
A method and apparatus for generating a pronunciation score by receiving a user phrase intended to conform to a reference phrase and processing the user phrase in accordance with at least one of an articulation-scoring engine, a duration scoring engine and an intona...
11/20/2007
7292986 Method and apparatus for displaying speech recognition progress
A graphical user interface provides a graphical volume meter indicating the volume of the user's speech and a speech recognition meter showing the progress of a speech recognizer. The graphical volume meter and recognition meter are both located near each other on t...
11/06/2007
7289102 Method and apparatus using multiple sensors in a device with a display
In a device having a display, at least one sensor signal is generated from a sensor in the device. One or more context values are then generated from the sensor signal. The context values indicate how the device is situated relative to one or more objects. At least ...
10/30/2007
7286991 Computer, display control device, pointer position control method, and program
To provide a pointer position control method and the like for manipulating a pointer more easily. The user moves the pointer P two-dimensionally and performs click and other operations by using only “voice”, that is, by varying the volume and pitch of produced voice with...
10/23/2007
7284232 Automated generation of aliases based on embedded alias information
An apparatus, program product and method incorporate embedded alias information into a document for use in automatically generating aliases (e.g., bookmarks, favorites, shortcuts, etc.) in a computer environment. The embedded alias information may incorporate both a...
10/16/2007
7283964 Method and apparatus for voice controlled devices with improved phrase storage, use, conversion, transfer, and recognition
The embodiments of the invention provide for the storage of speech phrases. Speech phrases are processed by a speaker-independent speech recognition engine of a voice controlled device. This engine returns a speaker-independent representation of the phrase. The spea...
10/16/2007
7280963 Method for learning linguistically valid word pronunciations from acoustic data
A computerized method is provided for generating pronunciations for words and storing the pronunciations in a pronunciation dictionary. The method includes graphing sets of initial pronunciations; thereafter in an ASR subsystem determining a highest-scoring set of i...
10/09/2007
7275032 Telephone call handling center where operators utilize synthesized voices generated or modified to exhibit or omit prescribed speech characteristics
A human operator's voice is artificially varied prior to transmission to a remote caller. In one example, the operator indicates target speech content (e.g., actual speech, pre-prepared text, manually entered text) to a speech processing facility, which enunciates t...
09/25/2007
 