WO1991006091A1 - Lpc based speech synthesis with adaptive pitch prefilter - Google Patents

Lpc based speech synthesis with adaptive pitch prefilter Download PDF

Info

Publication number
WO1991006091A1
WO1991006091A1 PCT/US1990/005191 US9005191W WO9106091A1 WO 1991006091 A1 WO1991006091 A1 WO 1991006091A1 US 9005191 W US9005191 W US 9005191W WO 9106091 A1 WO9106091 A1 WO 9106091A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
pitch
excitation signal
speech
excitation
Prior art date
Application number
PCT/US1990/005191
Other languages
French (fr)
Inventor
Ira Alan Gerson
Mark Antoni Jasiuk
Original Assignee
Motorola, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola, Inc. filed Critical Motorola, Inc.
Priority to EP90916987A priority Critical patent/EP0496829B1/en
Priority to DE69033672T priority patent/DE69033672T2/en
Publication of WO1991006091A1 publication Critical patent/WO1991006091A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering

Definitions

  • This invention relates generally to speech synthesis, and more particularly to linear predictive coding based speech synthesis.
  • the synthesis of speech through use of a linear predictive coding (LPC) based platform is known in the art.
  • a prior art radio that embodies such a platform is depicted generally in Fig. 1 by the reference numeral 100.
  • the radio (100) receives a speech coded signal (101) through an appropriate energy transducer (102), such as an antenna.
  • An RF unit (103) converts the received signal (101) to baseband and demodulates the signal to recover the speech coded information.
  • a parameter decoder (105) develops control parameters for various subsequent processes from this information.
  • An excitation source (104) as described above utilizes the parameters provided to it to create an excitation signal and then provides that excitation signal (which excitation signal includes pitch information that has been inserted by a pitch filter) to an LPC filter (106) which in turn provides, at it's output, a synthesized speech signal.
  • This synthesized speech signal is then filtered in an adaptive pitch postfilter (107) and an adaptive spectral postfilter (108), as well as a post emphasis filter (109), to enhance the perception of natural speech and to minimize the impact of various distortions and artifacts introduced in the synthesis process.
  • the enhanced synthesized speech signal is then properly processed in an audio processing unit (111) and rendered audible through an appropriate audio transducer (112).
  • the pitch postfilter (107) serves an important function, in that it provides additional control of the pitch content of the synthesized speech. Without this filter, the resultant synthesized speech product may be rougher and of lower quality. Notwithstanding this important benefit, the pitch postfilter (107) frequently contributes artifacts to the resultant synthesized speech, which artifacts can themselves noticeably disturb the perception of natural speech. Accordingly, a need exists for providing appropriate pitch enhancement filtering in an LPC based speech synthesizing unit that minimizes a concurrent perceptible expression of artifacts in a resultant synthesized speech signal.
  • the pitch enhancement postfilter is moved from a position of processing the LPC filter output to a position where it processes the excitation input to the LPC filter.
  • the subsequent processing of the LPC filter itself functions to minimize the perceptible effect of any artifacts introduced by the pitch enhancement prefilter in the resultant synthesized speech signal.
  • Fig. 1 comprises a block diagram depiction of a prior art embodiment of a radio receiver having LPC based speech synthesis
  • Fig. 2 comprises a block diagram depiction of a radio having LPC 3as ⁇ d speech synthesis in accordance with the invention.
  • a radio embodying the invention includes an antenna (102) for receiving a speech coded signal (101 ).
  • An RF unit (103) processes the received signal to recover the speech coded information.
  • This information is provided to a parameter decoder (105) that develops control parameters for various subsequent processes.
  • An excitation source (104) as described above utilizes the parameters provided to it to create an excitation signal.
  • This resultant excitation signal from the excitation source (104) is provided to a pitch prefilter (201) that functions to filter the pitch information contained in the excitation signal.
  • the resultant filtered signal then passes to the LPC filter (106) which yields a
  • this resultant signal is then further processed in an adaptive spectral postfilter (108) and post emphasis filter (109) to further enhance the quality of the synthesized speech, and is then processed in an audio processing unit (111 ) and rendered audible by an audio transducer (112).

Abstract

A linear predictive coding speech synthetizer wherein an excitation signal that includes pitch information is filtered in a pitch enhancement filter (201) prior to provision of the excitation signal to an LPC filter (106). In one embodiment, the speech synthesizer is configured in a radio.

Description

LPC BASED SPEECH SYNTHESIS WITH ADAPTIVE PITCH
PREFILTER
Technical Field
This invention relates generally to speech synthesis, and more particularly to linear predictive coding based speech synthesis.
Background of the Invention
The synthesis of speech through use of a linear predictive coding (LPC) based platform is known in the art. A prior art radio that embodies such a platform is depicted generally in Fig. 1 by the reference numeral 100. The radio (100) receives a speech coded signal (101) through an appropriate energy transducer (102), such as an antenna. An RF unit (103) converts the received signal (101) to baseband and demodulates the signal to recover the speech coded information. A parameter decoder (105) develops control parameters for various subsequent processes from this information. An excitation source (104) as described above utilizes the parameters provided to it to create an excitation signal and then provides that excitation signal (which excitation signal includes pitch information that has been inserted by a pitch filter) to an LPC filter (106) which in turn provides, at it's output, a synthesized speech signal.
This synthesized speech signal is then filtered in an adaptive pitch postfilter (107) and an adaptive spectral postfilter (108), as well as a post emphasis filter (109), to enhance the perception of natural speech and to minimize the impact of various distortions and artifacts introduced in the synthesis process. The enhanced synthesized speech signal is then properly processed in an audio processing unit (111) and rendered audible through an appropriate audio transducer (112).
Each of the above generally referred to components is well known and understood in the art.
The pitch postfilter (107) serves an important function, in that it provides additional control of the pitch content of the synthesized speech. Without this filter, the resultant synthesized speech product may be rougher and of lower quality. Notwithstanding this important benefit, the pitch postfilter (107) frequently contributes artifacts to the resultant synthesized speech, which artifacts can themselves noticeably disturb the perception of natural speech. Accordingly, a need exists for providing appropriate pitch enhancement filtering in an LPC based speech synthesizing unit that minimizes a concurrent perceptible expression of artifacts in a resultant synthesized speech signal.
Summary of the Invention
This need and others are substantially met through provision of the LPC platform disclosed herein. According to the invention, the pitch enhancement postfilter is moved from a position of processing the LPC filter output to a position where it processes the excitation input to the LPC filter. The subsequent processing of the LPC filter itself functions to minimize the perceptible effect of any artifacts introduced by the pitch enhancement prefilter in the resultant synthesized speech signal.
Brief Description of the Drawinos
Fig. 1 comprises a block diagram depiction of a prior art embodiment of a radio receiver having LPC based speech synthesis; and
Fig. 2 comprises a block diagram depiction of a radio having LPC 3asθd speech synthesis in accordance with the invention.
Best Mode For Carrying Out The Invention
In Fig. 2, a radio embodying the invention includes an antenna (102) for receiving a speech coded signal (101 ). An RF unit (103) processes the received signal to recover the speech coded information. This information is provided to a parameter decoder (105) that develops control parameters for various subsequent processes. An excitation source (104) as described above utilizes the parameters provided to it to create an excitation signal. This resultant excitation signal from the excitation source (104) is provided to a pitch prefilter (201) that functions to filter the pitch information contained in the excitation signal. The resultant filtered signal then passes to the LPC filter (106) which yields a
SUBSTITUTE SHEET synthesized speech signal in accordance with the coded information.
Consistent with known methodology, this resultant signal is then further processed in an adaptive spectral postfilter (108) and post emphasis filter (109) to further enhance the quality of the synthesized speech, and is then processed in an audio processing unit (111 ) and rendered audible by an audio transducer (112).
What is claimed is:
SUBSTITUTE SHEET

Claims

Claims
1. A method of synthesizing speech, comprising:
A) providing an excitation signal that includes pitch information; further characterized by the steps of:
B) filtering the excitation signal in a pitch enhancement filter to provide a pitch filtered " excitation signal;
C) filtering the pitch filtered excitation signal in a speech synthesis filter to provide a synthesized speech signal.
2. A method of synthesizing speech, comprising:
A) receiving a coded information signal;
B) using the coded information signal to provide an excitation signal that includes pitch information; further characterized by the steps of:
C) filtering the excitation signal in a pitch enhancement filter to provide a pitch filtered excitation signal;
D) filtering the pitch filtered excitation signal in a speech synthesis filter to provide a synthesized speech signal.
3. A radio, comprising:
A) RF means for receiving a broadcast signal and for recovering a coded information signal included therewith; B) excitation source means operably coupled to the
RF means for providing an excitation signal that includes pitch information in response to the coded information signal; wherein the radio is further characterized by: C) pitch enhancement filter means operably coupled to the excitation source means for filtering the pitch information in the excitation signal to provide a filtered excitation signal;
D) LPC filter means for receiving the filtered excitation signal and for providing a synthesized speech signal in response thereto.
PCT/US1990/005191 1989-10-17 1990-09-17 Lpc based speech synthesis with adaptive pitch prefilter WO1991006091A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP90916987A EP0496829B1 (en) 1989-10-17 1990-09-17 Lpc based speech synthesis with adaptive pitch prefilter
DE69033672T DE69033672T2 (en) 1989-10-17 1990-09-17 LANGUAGE SYNTHESIS BASED ON THE LPC METHOD WITH ADAPTIVE PITCH PRE-FILTER

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US42287189A 1989-10-17 1989-10-17
US422,871 1989-10-17

Publications (1)

Publication Number Publication Date
WO1991006091A1 true WO1991006091A1 (en) 1991-05-02

Family

ID=23676771

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1990/005191 WO1991006091A1 (en) 1989-10-17 1990-09-17 Lpc based speech synthesis with adaptive pitch prefilter

Country Status (6)

Country Link
EP (1) EP0496829B1 (en)
CN (1) CN1051100A (en)
AU (1) AU644119B2 (en)
CA (1) CA2066568A1 (en)
DE (1) DE69033672T2 (en)
WO (1) WO1991006091A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19629946A1 (en) * 1996-07-25 1998-01-29 Joachim Dipl Ing Mersdorf LPC analysis and synthesis method for basic frequency descriptive functions
US5946651A (en) * 1995-06-16 1999-08-31 Nokia Mobile Phones Speech synthesizer employing post-processing for enhancing the quality of the synthesized speech

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3328080B2 (en) * 1994-11-22 2002-09-24 沖電気工業株式会社 Code-excited linear predictive decoder

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4969192A (en) * 1987-04-06 1990-11-06 Voicecraft, Inc. Vector adaptive predictive coder for speech and audio

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4220819A (en) * 1979-03-30 1980-09-02 Bell Telephone Laboratories, Incorporated Residual excited predictive speech coding system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4969192A (en) * 1987-04-06 1990-11-06 Voicecraft, Inc. Vector adaptive predictive coder for speech and audio

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP0496829A4 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5946651A (en) * 1995-06-16 1999-08-31 Nokia Mobile Phones Speech synthesizer employing post-processing for enhancing the quality of the synthesized speech
US6029128A (en) * 1995-06-16 2000-02-22 Nokia Mobile Phones Ltd. Speech synthesizer
DE19629946A1 (en) * 1996-07-25 1998-01-29 Joachim Dipl Ing Mersdorf LPC analysis and synthesis method for basic frequency descriptive functions

Also Published As

Publication number Publication date
EP0496829A1 (en) 1992-08-05
DE69033672T2 (en) 2001-05-10
CN1051100A (en) 1991-05-01
DE69033672D1 (en) 2001-01-11
AU6725690A (en) 1991-05-16
CA2066568A1 (en) 1991-04-18
AU644119B2 (en) 1993-12-02
EP0496829A4 (en) 1993-08-18
EP0496829B1 (en) 2000-12-06

Similar Documents

Publication Publication Date Title
JP3483891B2 (en) Speech coder
DE69634179T2 (en) Method and apparatus for speech coding and decoding
EP0294020A3 (en) Vector adaptive coding method for speech and audio
EP0770988A3 (en) Speech decoding method and portable terminal apparatus
EP1959435A3 (en) Speech encoder
WO2004084180B1 (en) Voicing index controls for celp speech coding
WO2001029825B1 (en) Variable bit-rate celp coding of speech with phonetic classification
WO1999046764A3 (en) Speech coding
WO2000025298A1 (en) A method and device for adaptive bandwidth pitch search in coding wideband signals
EP0788091A3 (en) Speech encoding and decoding method and apparatus therefor
CA2160749A1 (en) Speech Coding Apparatus, Speech Decoding Apparatus, Speech Coding and Decoding Method and a Phase Amplitude Characteristic Extracting Apparatus for Carrying Out the Method
CN1591575A (en) Method and arrangement for synthesizing speech
AU644119B2 (en) Lpc based speech synthesis with adaptive pitch prefilter
JP3483853B2 (en) Application criteria for speech coding
US5241650A (en) Digital speech decoder having a postfilter with reduced spectral distortion
EP1073039A3 (en) Speech decoder with gain processing
WO2001003125A1 (en) Bi-directional pitch enhancement in speech coding systems
EP0570362B1 (en) Digital speech decoder having a postfilter with reduced spectral distortion
JPH10143195A (en) Post filter
EP1083548A3 (en) Method for gain control of a CELP speech decoder
AU2014211524B2 (en) Apparatus and method for synthesizing an audio signal, decoder, encoder, system and computer program
JPH0876799A (en) Wide band voice signal restoration method
JP2650355B2 (en) Voice analysis and synthesis device
CA2513842A1 (en) Apparatus and method for speech coding
CA2224688A1 (en) Speech coder

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AU CA

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB IT LU NL SE

WWE Wipo information: entry into national phase

Ref document number: 2066568

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 1990916987

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1990916987

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 1990916987

Country of ref document: EP