U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Method and apparatus for synthetic speech in facial animation

Patent 5878396 Issued on March 2, 1999. Estimated Expiration Date: Icon_subject February 5, 2018. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.

Patent References

Sound-synchronized video system
Patent #: 5608839
Issued on: 03/04/1997
Inventor: Chen

Method and apparatus for producing audio-visual synthetic speech Patent #: 5657426
Issued on: 08/12/1997
Inventor: Waters, et al.

Inventor

Assignee

Application

No. 019514 filed on 02/05/1998

US Classes:

704/276, Pattern display704/235, Speech to image704/270Application

Examiners

Primary: Hudspeth, David
Assistant: Opsasnick, Michael N.

Attorney, Agent or Firm

International Class

G10L 003/00

Abstract

The present invention utilizes a novel approach to facial imaging synchronized with synthetic speech. Mapping viseme images to a diphone requires the same `transitioning` in that the imaging associated with a diphone is not a static image, but rather, a series of images which dynamically depict, with lip, teeth and tongue positioning, the sound transition occurring in the relevant diphone. Each series of lip, teeth, and tongue positioning transitions is referred to herein as a `diseme.` A diseme (like a diphone) thus begins somewhere during one viseme (phone) and ends somewhere during a following viseme (phone). Due to lip, teeth and tongue position imaging commonality, phones are grouped into archiphonic families. A single diseme, which depicts the transition from a phone in one archiphonic family to another phone in a different archiphonic family, can be used for displaying the transition between any phone in the first archiphonic family to any phone in the second archiphonic family. In this way, the approximately 1800 diphones in General American English can be visually depicted by a relatively small number of disemes, again, due to their similarity in lip, teeth, and tongue image positioning. This results in a mapping between synthetic speech and facial imaging which more accurately reflects the speech transitional movements of a realistic speaker image.

PatentsPlus Images
Enhanced PDF formats
loading...
PatentsPlus: add to cart
PatentsPlus: add to cartSearch-enhanced full patent PDF image
$9.95more info
PatentsPlus: add to cart
PatentsPlus: add to cartIntelligent turbocharged patent PDFs with marked up images
$18.95more info
 
Sign InRegister
Username  
Password   
forgot password?