US Patent 5832435 - Methods for controlling the generation of speech from text representing one or more names

US Patent Issued on November 3, 1998
Estimated Patent Expiration Date: January 29, 2017
Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.
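
The page does not spell out the "simple USPTO term provisions" it uses. The estimate shown is consistent with a term of 20 years from the listed filing date (01/29/1997), which is the general rule for U.S. applications filed on or after June 8, 1995. The sketch below illustrates that arithmetic only; the function name, the default term, and the February 29 fallback are assumptions made for illustration, and none of the adjustments listed above are modeled.

```python
from datetime import date


def estimated_expiration(filing: date, term_years: int = 20) -> date:
    """Return the filing date plus term_years (hypothetical helper).

    Assumes a plain filing-date-plus-term rule. Terminal disclaimers,
    term adjustments, and maintenance-fee lapses are not considered.
    """
    try:
        return filing.replace(year=filing.year + term_years)
    except ValueError:
        # A February 29 filing whose target year is not a leap year:
        # fall back to February 28 (an assumption, not a USPTO rule).
        return filing.replace(year=filing.year + term_years, day=28)


# Filing date taken from this record's Application field.
print(estimated_expiration(date(1997, 1, 29)))  # 2017-01-29
```

For this record the result matches the January 29, 2017 estimate displayed above.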

Abstract

Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.

Other References

  • Taylor et al, "An interactive synthetic speech generation system," IEE Colloquium on `systems and applications of man-machine interaction using speech i/o`, p. 6/1-3, Mar. 1991
  • Bachenko et al, "Prosodic phrasing for speech synthesis of written telecommunications by the deaf," IEEE Global Telecommunications Conference. Globecom '91, pp. 1391-5 vol. 2, Dec. 1991
  • Chen et al, "A first study of neural net based generation of prosodic and spectral information for Mandarin text-to-speech," ICASSP-92, pp. 45-8 vol. 2, Mar. 1992
  • Bang et al, "A text-to-speech system for spanish with a frequency domain based prosodic modification algorithm," ICASSP '93, pp. II-183--II-186, Apr. 1993
  • Chen et al, "Word recognition based on the combination of a sequential neural network and the GPDM discriminative training algorithm," Neural Networks for Signal Processing. Proceedings of the 1991 IEEE Workshop, pp. 376-84, Oct. 1991
  • Hwang et al, "Neural-network based F0 text-to-speech synthesizer for Mandarin," IEE Proceedings-Vision, Image, and Signal Processing, vol. 141, iss. 6, pp. 384-90, Dec. 1994
  • Julia Hirschberg and Janet Pierrehumbert, "The Intonational Structuring of Discourse", Association of Computational Linguistics: 1986 (ACL-86) pp. 1-9
  • S.J. Young, F. Fallside, "Synthesis by Rule of Prosodic Features in Word Concatenation Synthesis", Int. Journal Man-Machine Studies, (1980) v12, pp. 241-258
  • A.W.F. Huggins, "Speech Timing and Intelligibility", Attention and Performance VII, Hillsdale, NJ: Erlbaum 1978, pp. 279-297
  • S.J. Young and F. Fallside, "Speech Synthesis from Concept: A Method for Speech Output From Information Systems", J. Acoust. Soc. Am. 66(3), Sep. 1979, pp. 685-695
  • B.G. Greene, J.S. Logan, D.B. Pisoni, "Perception of Synthetic Speech Produced Automatically by Rule: Intelligibility of Eight Text-to-Speech Systems", Behavior Research Methods, Instruments & Computers, v18, 1986, pp. 100-107
  • B.G. Greene, L.M. Manous, D.B. Pisoni, "Perceptual Evaluation of DECtalk: A Final Report on Version 1.8*", Research on Speech Perception Progress Report No. 10, Bloomington, IN. Speech Research Laboratory, Indiana University (1984), pp. 77-127
  • Kim E.A. Silverman, Doctoral Thesis, "The Structure and Processing of Fundamental Frequency Contours", University of Cambridge (UK) 1987
  • J.C. Thomas and M.B. Rosson, "Human Factors Synthetic Speech", Human Computer Interaction--INTERACT '84, North Holland Elsevier Science Publishers (1984) pp. 219-224
  • Y. Sagisaka, "Speech Synthesis From Text", IEEE Communications Magazine, vol. 28, iss 1, Jan. 1990, pp. 35-41
  • E. Fitzpatrick and J. Bachenko, "Parsing for Prosody: What a Text-to-Speech System Needs from Syntax", pp. 188-194, 27-31 Mar. 1989
  • Moulines et al., "A Real-Time French Text-To-Speech System Generating High-Quality Synthetic Speech", ICASSP 90, pp. 309-312, vol. 1, 3-6 Apr. 1990
  • Willemse et al, "Context Free Wild Card Parsing In A Text-To-Speech System", ICASSP 91, pp. 757-760, vol. 2, 14-17 May, 1991
  • James Raymond Davis and Julia Hirschberg, "Assigning Intonational Features in Synthesized Spoken Directions", 26th Annual Meeting of Assoc. Computational Linguistics; 1988, pp. 1-9
  • K. Silverman, S. Basson, S. Levas, "Evaluating Synthesizer Performance: Is Segmental Intelligibility Enough", International Conf. on Spoken Language Processing, 1990
  • J. Allen, M.S. Hunnicutt, D. Klatt, "From Text to Speech: The MIT Talk System", Cambridge University Press, 1987
  • T. Boogaart, K. Silverman, "Evaluating the Overall Comprehensibility of Speech Synthesizers", Proc. Int'l Conference on Spoken Language Processing, 1990
  • K. Silverman, S. Basson, S. Levas, "On Evaluating Synthetic Speech: What Load Does It Place on a Listener's Cognitive Resources", Proc. 3rd Austral. Int'l Conf. Speech Science & Technology, 199

Inventor

Assignee

Application

No. 790578 filed on 01/29/1997

US Classes:

704/260, Image to speech
704/9, Natural language
704/266, Specialized model

Field of Search

704/200, Speech signal processing
704/260, Image to speech
704/258, Synthesis
704/235, Speech to image
704/9, Natural language
704/266, Specialized model
704/277, Translation

Examiners

Primary: Hudspeth, David
Assistant: Storm, Donald L.

Attorney, Agent or Firm

US Patent References

3704345
4470150, Voice synthesizer with automatic pitch and speech rate modulation
Issued on: 09/04/1984
Inventor: Ostrowski
4685135, Text-to-speech synthesis system
Issued on: 08/04/1987
Inventor: Lin, et al.
4689817, Device for generating the audio information of a set of characters
Issued on: 08/25/1987
Inventor: Kroon
4692941, Real-time text-to-speech conversion system
Issued on: 09/08/1987
Inventor: Jacks, et al.
4695962, Speaking apparatus having differing speech modes for word and phrase synthesis
Issued on: 09/22/1987
Inventor: Goudie
4783810, Device for generating the audio information of a set of characters
Issued on: 11/08/1988
Inventor: Kroon
4783811, Method and apparatus for determining syllable boundaries
Issued on: 11/08/1988
Inventor: Fisher, et al.
4829580, Text analysis system with letter sequence recognition and speech stress assignment arrangement
Issued on: 05/09/1989
Inventor: Church
4831654, Apparatus for making and editing dictionary entries in a text to speech conversion system
Issued on: 05/16/1989
Inventor: Dick
4896359, Speech synthesis system by rule using phonemes as systhesis units
Issued on: 01/23/1990
Inventor: Yamamoto, et al.
4907279, Pitch frequency generation system in a speech synthesis system
Issued on: 03/06/1990
Inventor: Higuchi, et al.
4908867, Speech synthesis
Issued on: 03/13/1990
Inventor: Silverman
4912768, Speech encoding process combining written and spoken message codes
Issued on: 03/27/1990
Inventor: Benbassat
4964167, Apparatus for generating synthesized voice from text
Issued on: 10/16/1990
Inventor: Kunizawa, et al.
4979216, Text to speech synthesis system and method using context dependent vowel allophones
Issued on: 12/18/1990
Inventor: Malsheen, et al.
5040218, Name pronounciation by synthesizer
Issued on: 08/13/1991
Inventor: Vitale, et al.
5204905, Text-to-speech synthesizer having formant-rule and speech-parameter synthesis modes
Issued on: 04/20/1993
Inventor: Mitome
5212731, Apparatus for providing sentence-final accents in synthesized american english speech
Issued on: 05/18/1993
Inventor: Zimmermann
5384893, Method and apparatus for speech synthesis based on prosodic analysis
Issued on: 01/24/1995
Inventor: Hutchins
5475796, Pitch pattern generation apparatus
Issued on: 12/12/1995
Inventor: Iwata
5615300, Text-to-speech synthesis with controllable processing time and speech quality
Issued on: 03/25/1997
Inventor: Hara, et al.
5617507, Speech segment coding and pitch control methods for speech synthesis systems
Issued on: 04/01/1997
Inventor: Lee, et al.
5636325, Speech synthesis and analysis of dialects
Issued on: 06/03/1997
Inventor: Farrett
5673362, Speech synthesis system in which a plurality of clients and at least one voice synthesizing server are connected to a local area network
Issued on: 09/30/1997
Inventor: Matsumoto

International Class

G10L 005/02
