CA2309921A1 - Method and apparatus for pitch estimation using perception based analysis by synthesis

Info

Publication number
CA2309921A1
CA2309921A1 CA002309921A CA2309921A CA2309921A1 CA 2309921 A1 CA2309921 A1 CA 2309921A1 CA 002309921 A CA002309921 A CA 002309921A CA 2309921 A CA2309921 A CA 2309921A CA 2309921 A1 CA2309921 A1 CA 2309921A1
Authority
CA
Canada
Prior art keywords
pitch
item
synthesis
signal
speech signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002309921A
Other languages
French (fr)
Other versions
CA2309921C (en)
Inventor
Suat Yeldener
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Comsat Corp
Original Assignee
Comsat Corporation
Suat Yeldener
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1997-11-14
Filing date
1998-11-16
Publication date
1999-05-27
Application filed by Comsat Corporation, Suat Yeldener
Publication of CA2309921A1
Application granted
Publication of CA2309921C
Anticipated expiration
Current legal status: Expired - Fee Related

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00: Speech synthesis; Text to speech systems
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90: Pitch determination of speech signals
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08: Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09: Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor

Abstract

The present invention provides a method of pitch estimation which utilizes perception based analysis by synthesis for improved pitch estimation over a variety of input speech conditions. Initially, pitch candidates are generated corresponding to a plurality of sub-ranges within a pitch search range (item 2). Then a residual spectrum is determined for a segment of speech (item 4) and a reference speech signal is generated from the residual spectrum using sinusoidal synthesis (item 8) and linear predictive coding (LPC) synthesis (item 9). A synthetic speech signal is generated for each of the pitch candidates using sinusoidal synthesis (item 12) and LPC synthesis (item 13). Finally, the synthetic speech signal for each pitch candidate is compared with the reference speech signal (item 14), and the optimal pitch estimate is taken as the pitch period of the synthetic speech signal that provides the maximum signal-to-noise ratio.
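The steps in the abstract can be sketched roughly as follows. This is a minimal NumPy illustration under stated assumptions, not the patented implementation. All function names (`pitch_candidates`, `sinusoidal_synthesis`, `lpc_synthesis`, `snr_db`, `estimate_pitch`) are hypothetical. The abstract does not say how the candidate in each sub-range is chosen, so the sketch assumes the lag with the highest normalized correlation. It also assumes the residual is available in the time domain, samples its spectrum at the candidate's harmonics with a direct DFT, uses the LPC-filtered residual as the reference signal, and omits any perceptual weighting.

```python
import numpy as np

def pitch_candidates(x, lo, hi, n_sub):
    """One candidate lag per sub-range of [lo, hi] (assumed rule: the lag
    with the highest normalized correlation inside each sub-range)."""
    edges = np.linspace(lo, hi + 1, n_sub + 1).astype(int)
    cands = []
    for start, stop in zip(edges[:-1], edges[1:]):
        best_lag, best_score = None, -np.inf
        for lag in range(start, stop):
            a, b = x[:-lag], x[lag:]
            score = np.dot(a, b) / np.sqrt(np.dot(a, a) * np.dot(b, b) + 1e-12)
            if score > best_score:
                best_lag, best_score = lag, score
        if best_lag is not None:
            cands.append(best_lag)
    return cands

def sinusoidal_synthesis(residual, period):
    """Sample the residual spectrum at the harmonics of 1/period and
    resynthesize the residual as a sum of those sinusoids."""
    n = np.arange(len(residual))
    k = np.arange(1, (period - 1) // 2 + 1)          # harmonics below Nyquist
    basis = np.exp(2j * np.pi * np.outer(k, n) / period)
    amps = (2.0 / len(residual)) * (basis.conj() @ residual)
    return np.real(amps @ basis)

def lpc_synthesis(excitation, a):
    """All-pole filter 1/A(z), with A(z) = 1 + a[0] z^-1 + a[1] z^-2 + ..."""
    y = np.zeros(len(excitation))
    for i in range(len(excitation)):
        acc = excitation[i]
        for j, aj in enumerate(a, start=1):
            if i - j >= 0:
                acc -= aj * y[i - j]
        y[i] = acc
    return y

def snr_db(reference, synthetic):
    """Signal-to-noise ratio of the synthetic signal against the reference."""
    err = reference - synthetic
    return 10.0 * np.log10(np.dot(reference, reference) / (np.dot(err, err) + 1e-12))

def estimate_pitch(residual, lpc, lo, hi, n_sub):
    """Return (period, snr) for the candidate whose synthetic speech is
    closest to the reference speech in the SNR sense."""
    residual = np.asarray(residual, dtype=float)
    reference = lpc_synthesis(residual, lpc)          # stand-in reference signal
    best_period, best_snr = None, -np.inf
    for period in pitch_candidates(residual, lo, hi, n_sub):
        synthetic = lpc_synthesis(sinusoidal_synthesis(residual, period), lpc)
        score = snr_db(reference, synthetic)
        if score > best_snr:
            best_period, best_snr = period, score
    return best_period, best_snr
```

A call such as `estimate_pitch(residual, [-0.9], 20, 90, 4)` searches lags 20 to 90 in four sub-ranges. Note that a candidate at twice the true period contains every true harmonic and can score just as well, so a practical estimator needs extra logic to resolve such ties; this sketch sidesteps the issue only by limiting the search range.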
Application CA002309921A, priority date 1997-11-14, filing date 1998-11-16: Method and apparatus for pitch estimation using perception based analysis by synthesis. Status: Expired - Fee Related. Publication: CA2309921C (en)

Applications Claiming Priority (3)

Application Number | Priority Date | Filing Date | Title
US08/970,396 | 1997-11-14 | |
US08/970,396, US5999897A (en) | 1997-11-14 | 1997-11-14 | Method and apparatus for pitch estimation using perception based analysis by synthesis
PCT/US1998/023251, WO1999026234A1 (en) | 1997-11-14 | 1998-11-16 | Method and apparatus for pitch estimation using perception based analysis by synthesis

Publications (2)

Publication Number | Publication Date
CA2309921A1 (en) | 1999-05-27
CA2309921C (en) | 2004-06-15

Family

ID=25516886

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CA002309921A, CA2309921C (en), Expired - Fee Related | Method and apparatus for pitch estimation using perception based analysis by synthesis | 1997-11-14 | 1998-11-16

Country Status (8)

Country | Link
US (1) | US5999897A (en)
EP (1) | EP1031141B1 (en)
KR (1) | KR100383377B1 (en)
AU (1) | AU746342B2 (en)
CA (1) | CA2309921C (en)
DE (1) | DE69832195T2 (en)
IL (1) | IL136117A (en)
WO (1) | WO1999026234A1 (en)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CA2252170A1 (en) * | 1998-10-27 | 2000-04-27 | Bruno Bessette | A method and device for high quality coding of wideband speech and audio signals
US6766288B1 (en) | 1998-10-29 | 2004-07-20 | Paul Reed Smith Guitars | Fast find fundamental method
US7194752B1 (en) * | 1999-10-19 | 2007-03-20 | Iceberg Industries, Llc | Method and apparatus for automatically recognizing input audio and/or video streams
WO2001030049A1 (en) * | 1999-10-19 | 2001-04-26 | Fujitsu Limited | Received speech processing unit and received speech reproducing unit
US6480821B2 (en) * | 2001-01-31 | 2002-11-12 | Motorola, Inc. | Methods and apparatus for reducing noise associated with an electrical speech signal
JP3582589B2 (en) * | 2001-03-07 | 2004-10-27 | 日本電気株式会社 | Speech coding apparatus and speech decoding apparatus
AU2001270365A1 (en) * | 2001-06-11 | 2002-12-23 | Ivl Technologies Ltd. | Pitch candidate selection method for multi-channel pitch detectors
KR100446242B1 (en) * | 2002-04-30 | 2004-08-30 | 엘지전자 주식회사 | Apparatus and Method for Estimating Harmonic in Voice-Encoder
US8447592B2 (en) * | 2005-09-13 | 2013-05-21 | Nuance Communications, Inc. | Methods and apparatus for formant-based voice systems
EP1783604A3 (en) * | 2005-11-07 | 2007-10-03 | Slawomir Adam Janczewski | Object-oriented, parallel language, method of programming and multi-processor computer
KR100647336B1 (en) * | 2005-11-08 | 2006-11-23 | 삼성전자주식회사 | Apparatus and method for adaptive time/frequency-based encoding/decoding
KR100735343B1 (en) * | 2006-04-11 | 2007-07-04 | 삼성전자주식회사 | Apparatus and method for extracting pitch information of a speech signal
KR20070115637A (en) * | 2006-06-03 | 2007-12-06 | 삼성전자주식회사 | Method and apparatus for bandwidth extension encoding and decoding
US8935158B2 (en) | 2006-12-13 | 2015-01-13 | Samsung Electronics Co., Ltd. | Apparatus and method for comparing frames using spectral information of audio signal
KR100860830B1 (en) * | 2006-12-13 | 2008-09-30 | 삼성전자주식회사 | Method and apparatus for estimating spectrum information of audio signal
CN101030374B (en) * | 2007-03-26 | 2011-02-16 | 北京中星微电子有限公司 | Method and apparatus for extracting base sound period
CN102016530B (en) * | 2009-02-13 | 2012-11-14 | 华为技术有限公司 | Method and device for pitch period detection
US8924222B2 (en) | 2010-07-30 | 2014-12-30 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for coding of harmonic signals
US9208792B2 (en) | 2010-08-17 | 2015-12-08 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for noise injection
US8862465B2 (en) * | 2010-09-17 | 2014-10-14 | Qualcomm Incorporated | Determining pitch cycle energy and scaling an excitation signal
DE102012000788B4 (en) * | 2012-01-17 | 2013-10-10 | Atlas Elektronik Gmbh | Method and device for processing waterborne sound signals
EP2685448B1 (en) * | 2012-07-12 | 2018-09-05 | Harman Becker Automotive Systems GmbH | Engine sound synthesis
GB201713946D0 (en) * | 2017-06-16 | 2017-10-18 | Cirrus Logic Int Semiconductor Ltd | Earbud speech estimation
US10861484B2 (en) * | 2018-12-10 | 2020-12-08 | Cirrus Logic, Inc. | Methods and systems for speech detection

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
JPH0754440B2 (en) * | 1986-06-09 | 1995-06-07 | 日本電気株式会社 | Speech analysis / synthesis device
NL8701798A (en) * | 1987-07-30 | 1989-02-16 | Philips Nv | METHOD AND APPARATUS FOR DETERMINING THE PROGRESS OF A VOICE PARAMETER, FOR EXAMPLE THE TONE HEIGHT, IN A SPEECH SIGNAL
US4980916A (en) * | 1989-10-26 | 1990-12-25 | General Electric Company | Method for improving speech quality in code excited linear predictive speech coding
US5216747A (en) * | 1990-09-20 | 1993-06-01 | Digital Voice Systems, Inc. | Voiced/unvoiced estimation of an acoustic signal
US5226108A (en) * | 1990-09-20 | 1993-07-06 | Digital Voice Systems, Inc. | Processing a speech signal with estimated pitch
US5327518A (en) * | 1991-08-22 | 1994-07-05 | Georgia Tech Research Corporation | Audio analysis/synthesis system
FI95085C (en) * | 1992-05-11 | 1995-12-11 | Nokia Mobile Phones Ltd | A method for digitally encoding a speech signal and a speech encoder for performing the method
US5734789A (en) * | 1992-06-01 | 1998-03-31 | Hughes Electronics | Voiced, unvoiced or noise modes in a CELP vocoder
JP3343965B2 (en) * | 1992-10-31 | 2002-11-11 | ソニー株式会社 | Voice encoding method and decoding method
FI95086C (en) * | 1992-11-26 | 1995-12-11 | Nokia Mobile Phones Ltd | Method for efficient coding of a speech signal
IT1270438B (en) * | 1993-06-10 | 1997-05-05 | Sip | PROCEDURE AND DEVICE FOR THE DETERMINATION OF THE FUNDAMENTAL TONE PERIOD AND THE CLASSIFICATION OF THE VOICE SIGNAL IN NUMERICAL CODERS OF THE VOICE
JP3475446B2 (en) * | 1993-07-27 | 2003-12-08 | ソニー株式会社 | Encoding method
JP2658816B2 (en) * | 1993-08-26 | 1997-09-30 | 日本電気株式会社 | Speech pitch coding device

Also Published As

Publication number | Publication date
KR100383377B1 (en) | 2003-05-12
EP1031141A1 (en) | 2000-08-30
CA2309921C (en) | 2004-06-15
WO1999026234B1 (en) | 1999-07-01
IL136117A (en) | 2004-07-25
IL136117A0 (en) | 2001-05-20
EP1031141B1 (en) | 2005-11-02
EP1031141A4 (en) | 2002-01-02
DE69832195D1 (en) | 2005-12-08
DE69832195T2 (en) | 2006-08-03
US5999897A (en) | 1999-12-07
AU746342B2 (en) | 2002-04-18
AU1373899A (en) | 1999-06-07
KR20010024639A (en) | 2001-03-26
WO1999026234A1 (en) | 1999-05-27

Similar Documents

Publication | Publication Date | Title
CA2309921A1 (en) | | Method and apparatus for pitch estimation using perception based analysis by synthesis
EP1164578A3 (en) | | Speech decoding method and apparatus
GB2102254A (en) | | A speech analysis-synthesis system
BR9506841A (en) | | Voice codification process
EP0785631A3 (en) | | Perceptual noise shaping in the time domain via LPC prediction in the frequency domain
EP1391879A3 (en) | | Speech coding apparatus, linear prediction coefficient analyzing apparatus and noise reducing apparatus
EP0788091A3 (en) | | Speech encoding and decoding method and apparatus therefor
CA2176665A1 (en) | | Method of adapting the noise masking level in an analysis-by-synthesis speech coder employing a short-term perceptual weighting filter
EP0459363B1 (en) | | Voice signal coding system
CA2014279A1 (en) | | Speech coding apparatus
KR880004426A (en) | | Language encoding processing system and speech synthesis method
WO1999060561A3 (en) | | Split band linear prediction vocoder
CA2455059A1 (en) | | Speech bandwidth extension apparatus and speech bandwidth extension method
DE69126062D1 (en) | | Speech coding and decoding system
FI955345A (en) | | Rough pitch estimation method and apparatus for telephone conversation
CA2189142C (en) | | A multi-pulse analysis speech processing system and method
Eriksson et al. | | Exploiting interframe correlation in spectral quantization: a study of different memory VQ schemes
CA2317435A1 (en) | | Apparatus and method for hybrid excited linear prediction speech encoding
TW353748B (en) | | Speech encoding method and apparatus and pitch detection method and apparatus
EP0724252A3 (en) | | A CELP-type speech encoder having an improved long-term predictor
EP0772185A3 (en) | | Speech decoding method and apparatus
WO1996036041A3 (en) | | Transmission system and method for encoding speech with improved pitch detection
FR2596893B1 (en) | | DEVICE FOR IMPLEMENTING A LEROUX-GUEGUEN ALGORITHM FOR CODING A SIGNAL BY LINEAR PREDICTION
AU3452397A (en) | | Speech synthesis system
Akamine et al. | | CELP coding with an adaptive density pulse excitation model

Legal Events

Date | Code | Title | Description
 | EEER | Examination request |
 | MKLA | Lapsed | Effective date: 2013-11-18