Patent ReferencesSpeech synchronized animation Authoring and use systems for sound synchronized animation Audio-visual work with a series of visual word symbols coordinated with oral word utterances Method and apparatus for cross-modal predictive coding for talking head sequences Simulated natural movement of a computer-generated synthesized talking head Method and system for making an audio-visual work with a series of visual word symbols coordinated with oral word utterances and such audio-visual work Method for generating photo-realistic animated characters Method of associating oral utterances meaningfully with word symbols seriatim in an audio-visual work and apparatus for linear and interactive application Method and apparatus of facial image conversion by interpolation/extrapolation for plurality of facial expression components representing facial image Method and apparatus for synthesizing realistic animations of a human speaking using a computer Patent #: 6097381 InventorsApplicationNo. 223858 filed on 12/31/1998US Classes:434/185, Speech345/473, Animation434/167, Spelling, phonics, word recognition, or sentence formation434/169Electrical component included in teaching meansExaminersPrimary: Martin-Wallace, ValenciaAssistant: Harris, Chanda Attorney, Agent or FirmInternational ClassG09B 019/04AbstractA method and apparatus of converting input text into an audio-visual speech stream resulting in a talking face image enunciating the text. This method of converting input text into an audio-visual speech stream comprises the steps of: recording a visual corpus of a human-subject, building a viseme interpolation database, and synchronizing the talking face image with the text stream. In a preferred embodiment, viseme transitions are automatically calculated using optical flow methods, and morphing techniques are employed to result in smooth viseme transitions. The viseme transitions are concatenated together and synchronized with the phonemes according to the timing information. The audio-visual speech stream is then displayed in real time, thereby displaying a photo-realistic talking face.Other References
| |