U.S. patents available from 1976 to present.
U.S. patent applications available from 2005 to present.

Method and apparatus for speech synthesis based on prosodic analysis

Patent 5384893 Issued on January 24, 1995. Estimated Expiration Date: Icon_subject September 23, 2012. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.

Patent References

3704345

Method and apparatus for speech synthesizing
Patent #: 4214125
Issued on: 07/22/1980
Inventor: Mozer ,   et al.

Delta modulation method and system for signal compression
Patent #: 4314105
Issued on: 02/02/1982
Inventor: Mozer

Method and apparatus for speech synthesizing
Patent #: 4384170
Issued on: 05/17/1983
Inventor: Mozer ,   et al.

Method and apparatus for time domain compression and synthesis of audible signals
Patent #: 4433434
Issued on: 02/21/1984
Inventor: Mozer

Method and apparatus for time domain compression and synthesis of unvoiced audible signals
Patent #: 4435831
Issued on: 03/06/1984
Inventor: Mozer

Storage element for speech synthesizer
Patent #: 4458110
Issued on: 07/03/1984
Inventor: Mozer

Method and apparatus for converting voice characteristics of synthesized speech
Patent #: 4624012
Issued on: 11/18/1986
Inventor: Lin ,   et al.

Text-to-speech synthesis system
Patent #: 4685135
Issued on: 08/04/1987
Inventor: Lin ,   et al.

Real-time text-to-speech conversion system
Patent #: 4692941
Issued on: 09/08/1987
Inventor: Jacks ,   et al.

More ...

Inventor

Assignee

Application

No. 949208 filed on 09/23/1992

US Classes:

704/267, Time element704/258, Synthesis704/260Image to speech

Examiners

Primary: MacDonald, Allen R.
Assistant: Doerrler, Michelle

Attorney, Agent or Firm

International Class

G10L 009/00

Abstract

A system for synthesizing a speech signal from strings of words, which are themselves strings of characters, includes a memory in which predetermined syntax tags are stored in association with entered words and phonetic transcriptions are stored in association with the syntax tags. A parser accesses the memory and groups the syntax tags of the entered words into phrases according to a first set of predetermined grammatical rules relating the syntax tags to one another. The parser also verifies the conformance of sequences of the phrases to a second set of predetermined grammatical rules relating the phrases to one another. The system retrieves the phonetic transcriptions associated with the syntax tags that were grouped into phrases conforming to the second set of rules, and also translates predetermined strings of characters into words. The system generates strings of phonetic transcriptions and prosody markers corresponding to respective strings of the words, and adds markers for rhythm and stress to the strings, which are then converted into data arrays having prosody information on a diphone-by-diphone basis. Predetermined diphone waveforms are retrieved from memory that correspond to the entered words, and these retrieved waveforms are adjusted based on the prosody information in the arrays. The adjusted diphone waveforms, which may also be adjusted for coarticulation, are then concatenated to form the speech signal. Methods in a digital computer are also disclosed.

Other References

  • D. Klatt, "Software for a Cascade/Parallel Formant Synthesizer", J. Acoust. Soc. of Amer., vol. 67, pp. 971-994 (Mar. 1980)
  • D. Malah, "Time-Domain Algorithms for Harmonic Bandwidth Reduction and Time Scaling of Speech Signals", IEEE Trans. on Acoustic, Speech and Signal Processing, vol. ASSP-27, pp. 121-133 (Apr. 1979)
  • F. Lee, "Time Compression and Expansion of Speech by the Sampling Method", J. Audio Eng'g Soc., vol. 20, pp. 738-742 (Nov. 1972)
  • T. Sakai et al., "On-Line, Real-Time, Multiple-Speech Output System", Proc. Int'l Fed. for Info. Processing Cong. Booklet TA-4 Ljubljana, Yugoslavia (Aug. 1971) pp. 3-7
  • T. Tremain, "The Government Standard Linear Predictive Coding Algorithm: LPC-10", Speech Technology, vol. 1, No. 2, pp. 40-49 (Apr. 1982
PatentsPlus Images
Enhanced PDF formats
loading...
PatentsPlus: add to cart
PatentsPlus: add to cartSearch-enhanced full patent PDF image
$9.95more info
PatentsPlus: add to cart
PatentsPlus: add to cartIntelligent turbocharged patent PDFs with marked up images
$16.95more info
 
Sign InRegister
Username  
Password   
forgot password?