CA2309921A1 - Method and apparatus for pitch estimation using perception based analysis by synthesis
- Publication number
- CA2309921A1
- Authority
- CA
- Canada
- Prior art keywords
- pitch
- item
- synthesis
- signal
- speech signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L13/00—Speech synthesis; Text to speech systems
      - G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
        - G10L25/90—Pitch determination of speech signals
      - G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
        - G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
          - G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
            - G10L19/09—Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
Abstract
The present invention provides a method of pitch estimation which utilizes perception based analysis by synthesis for improved pitch estimation over a variety of input speech conditions. Initially, pitch candidates are generated corresponding to a plurality of sub-ranges within a pitch search range (item 2). Then a residual spectrum is determined for a segment of speech (item 4) and a reference speech signal is generated from the residual spectrum using sinusoidal synthesis (item 8) and linear predictive coding (LPC) synthesis (item 9).
A synthetic speech signal is generated for each of the pitch candidates using sinusoidal synthesis (item 12) and LPC synthesis (item 13). Finally, the synthetic speech signal for each pitch candidate is compared with the reference residual signal (item 14) to determine an optimal pitch estimate based on the pitch period of the synthetic speech signal that provides the maximum signal-to-noise ratio.
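The selection loop described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names (`sinusoidal_synthesis`, `lpc_synthesis`, `snr_db`, `estimate_pitch`), the zero-phase harmonics, the `amplitude_at` callback standing in for the residual spectrum, and the way candidates are supplied (one per sub-range, however they were chosen) are all assumptions made for the example. Any perceptual weighting the full specification may apply is omitted.

```python
import math


def sinusoidal_synthesis(pitch_period, amplitude_at, n_samples):
    """Sum zero-phase harmonics of the fundamental (assumed excitation model).

    pitch_period is in samples; amplitude_at(w) is a hypothetical callback
    returning the spectral magnitude at w radians/sample.
    """
    w0 = 2.0 * math.pi / pitch_period
    n_harmonics = int((pitch_period - 1) / 2)  # harmonics strictly below Nyquist
    return [
        sum(amplitude_at(k * w0) * math.cos(k * w0 * n)
            for k in range(1, n_harmonics + 1))
        for n in range(n_samples)
    ]


def lpc_synthesis(excitation, lpc_coeffs):
    """All-pole filter: y[n] = x[n] - sum_i a[i] * y[n - i], i = 1..order."""
    y = []
    for n, x in enumerate(excitation):
        acc = x
        for i, a in enumerate(lpc_coeffs, start=1):
            if n - i >= 0:
                acc -= a * y[n - i]
        y.append(acc)
    return y


def snr_db(reference, synthetic):
    """Signal-to-noise ratio of synthetic against reference, in dB."""
    signal = sum(r * r for r in reference)
    noise = sum((r - s) ** 2 for r, s in zip(reference, synthetic))
    if noise == 0.0:
        return float("inf")
    return 10.0 * math.log10(signal / noise)


def estimate_pitch(reference, candidates, amplitude_at, lpc_coeffs):
    """Return the candidate period whose synthetic signal maximizes the SNR."""
    def score(period):
        excitation = sinusoidal_synthesis(period, amplitude_at, len(reference))
        return snr_db(reference, lpc_synthesis(excitation, lpc_coeffs))
    return max(candidates, key=score)
```

As a usage illustration, a reference built with `lpc_synthesis(sinusoidal_synthesis(50, lambda w: 1.0, 200), [-0.9])` and scored against candidate periods such as `[20, 35, 50, 80, 120]` with the same flat spectrum and filter selects the candidate that matches the reference period.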
Applications Claiming Priority (3)
Application Number | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|
US08/970,396 | | 1997-11-14 | | |
US08/970,396 | US5999897A (en) | 1997-11-14 | 1997-11-14 | Method and apparatus for pitch estimation using perception based analysis by synthesis |
PCT/US1998/023251 | WO1999026234A1 (en) | 1997-11-14 | 1998-11-16 | Method and apparatus for pitch estimation using perception based analysis by synthesis |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2309921A1 (en) | 1999-05-27 |
CA2309921C (en) | 2004-06-15 |
Family
ID=25516886
Family Applications (1)
Application Number | Status | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
CA002309921A | Expired - Fee Related | CA2309921C (en) | 1997-11-14 | 1998-11-16 | Method and apparatus for pitch estimation using perception based analysis by synthesis |
Country Status (8)
Country | Link |
---|---|
US (1) | US5999897A (en) |
EP (1) | EP1031141B1 (en) |
KR (1) | KR100383377B1 (en) |
AU (1) | AU746342B2 (en) |
CA (1) | CA2309921C (en) |
DE (1) | DE69832195T2 (en) |
IL (1) | IL136117A (en) |
WO (1) | WO1999026234A1 (en) |
Families Citing this family (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2252170A1 (en) * | 1998-10-27 | 2000-04-27 | Bruno Bessette | A method and device for high quality coding of wideband speech and audio signals |
US6766288B1 (en) | 1998-10-29 | 2004-07-20 | Paul Reed Smith Guitars | Fast find fundamental method |
US7194752B1 (en) * | 1999-10-19 | 2007-03-20 | Iceberg Industries, Llc | Method and apparatus for automatically recognizing input audio and/or video streams |
WO2001030049A1 (en) * | 1999-10-19 | 2001-04-26 | Fujitsu Limited | Received speech processing unit and received speech reproducing unit |
US6480821B2 (en) * | 2001-01-31 | 2002-11-12 | Motorola, Inc. | Methods and apparatus for reducing noise associated with an electrical speech signal |
JP3582589B2 (en) * | 2001-03-07 | 2004-10-27 | 日本電気株式会社 | Speech coding apparatus and speech decoding apparatus |
AU2001270365A1 (en) * | 2001-06-11 | 2002-12-23 | Ivl Technologies Ltd. | Pitch candidate selection method for multi-channel pitch detectors |
KR100446242B1 (en) * | 2002-04-30 | 2004-08-30 | 엘지전자 주식회사 | Apparatus and Method for Estimating Hamonic in Voice-Encoder |
US8447592B2 (en) * | 2005-09-13 | 2013-05-21 | Nuance Communications, Inc. | Methods and apparatus for formant-based voice systems |
EP1783604A3 (en) * | 2005-11-07 | 2007-10-03 | Slawomir Adam Janczewski | Object-oriented, parallel language, method of programming and multi-processor computer |
KR100647336B1 (en) * | 2005-11-08 | 2006-11-23 | 삼성전자주식회사 | Apparatus and method for adaptive time/frequency-based encoding/decoding |
KR100735343B1 (en) * | 2006-04-11 | 2007-07-04 | 삼성전자주식회사 | Apparatus and method for extracting pitch information of a speech signal |
KR20070115637A (en) * | 2006-06-03 | 2007-12-06 | 삼성전자주식회사 | Method and apparatus for bandwidth extension encoding and decoding |
US8935158B2 (en) | 2006-12-13 | 2015-01-13 | Samsung Electronics Co., Ltd. | Apparatus and method for comparing frames using spectral information of audio signal |
KR100860830B1 (en) * | 2006-12-13 | 2008-09-30 | 삼성전자주식회사 | Method and apparatus for estimating spectrum information of audio signal |
CN101030374B (en) * | 2007-03-26 | 2011-02-16 | 北京中星微电子有限公司 | Method and apparatus for extracting base sound period |
CN102016530B (en) * | 2009-02-13 | 2012-11-14 | 华为技术有限公司 | Method and device for pitch period detection |
US8924222B2 (en) | 2010-07-30 | 2014-12-30 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for coding of harmonic signals |
US9208792B2 (en) | 2010-08-17 | 2015-12-08 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for noise injection |
US8862465B2 (en) * | 2010-09-17 | 2014-10-14 | Qualcomm Incorporated | Determining pitch cycle energy and scaling an excitation signal |
DE102012000788B4 (en) * | 2012-01-17 | 2013-10-10 | Atlas Elektronik Gmbh | Method and device for processing waterborne sound signals |
EP2685448B1 (en) * | 2012-07-12 | 2018-09-05 | Harman Becker Automotive Systems GmbH | Engine sound synthesis |
GB201713946D0 (en) * | 2017-06-16 | 2017-10-18 | Cirrus Logic Int Semiconductor Ltd | Earbud speech estimation |
US10861484B2 (en) * | 2018-12-10 | 2020-12-08 | Cirrus Logic, Inc. | Methods and systems for speech detection |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0754440B2 (en) * | 1986-06-09 | 1995-06-07 | 日本電気株式会社 | Speech analysis / synthesis device |
NL8701798A (en) * | 1987-07-30 | 1989-02-16 | Philips Nv | METHOD AND APPARATUS FOR DETERMINING THE PROGRESS OF A VOICE PARAMETER, FOR EXAMPLE THE TONE HEIGHT, IN A SPEECH SIGNAL |
US4980916A (en) * | 1989-10-26 | 1990-12-25 | General Electric Company | Method for improving speech quality in code excited linear predictive speech coding |
US5216747A (en) * | 1990-09-20 | 1993-06-01 | Digital Voice Systems, Inc. | Voiced/unvoiced estimation of an acoustic signal |
US5226108A (en) * | 1990-09-20 | 1993-07-06 | Digital Voice Systems, Inc. | Processing a speech signal with estimated pitch |
US5327518A (en) * | 1991-08-22 | 1994-07-05 | Georgia Tech Research Corporation | Audio analysis/synthesis system |
FI95085C (en) * | 1992-05-11 | 1995-12-11 | Nokia Mobile Phones Ltd | A method for digitally encoding a speech signal and a speech encoder for performing the method |
US5734789A (en) * | 1992-06-01 | 1998-03-31 | Hughes Electronics | Voiced, unvoiced or noise modes in a CELP vocoder |
JP3343965B2 (en) * | 1992-10-31 | 2002-11-11 | ソニー株式会社 | Voice encoding method and decoding method |
FI95086C (en) * | 1992-11-26 | 1995-12-11 | Nokia Mobile Phones Ltd | Method for efficient coding of a speech signal |
IT1270438B (en) * | 1993-06-10 | 1997-05-05 | Sip | PROCEDURE AND DEVICE FOR THE DETERMINATION OF THE FUNDAMENTAL TONE PERIOD AND THE CLASSIFICATION OF THE VOICE SIGNAL IN NUMERICAL CODERS OF THE VOICE |
JP3475446B2 (en) * | 1993-07-27 | 2003-12-08 | ソニー株式会社 | Encoding method |
JP2658816B2 (en) * | 1993-08-26 | 1997-09-30 | 日本電気株式会社 | Speech pitch coding device |
Date | Authority | Application | Publication | Status | Active |
---|---|---|---|---|---|
1997-11-14 | US | US08/970,396 | US5999897A | Expired - Lifetime | no |
1998-11-16 | KR | KR10-2000-7005286A | KR100383377B1 | IP Right Cessation | no |
1998-11-16 | DE | DE69832195T | DE69832195T2 | Expired - Lifetime | no |
1998-11-16 | EP | EP98957492A | EP1031141B1 | Expired - Lifetime | no |
1998-11-16 | CA | CA002309921A | CA2309921C | Expired - Fee Related | no |
1998-11-16 | AU | AU13738/99A | AU746342B2 | Ceased | no |
1998-11-16 | WO | PCT/US1998/023251 | WO1999026234A1 | IP Right Grant | yes |
1998-11-16 | IL | IL13611798A | IL136117A | IP Right Cessation | no |
Also Published As
Publication number | Publication date |
---|---|
KR100383377B1 (en) | 2003-05-12 |
EP1031141A1 (en) | 2000-08-30 |
CA2309921C (en) | 2004-06-15 |
WO1999026234B1 (en) | 1999-07-01 |
IL136117A (en) | 2004-07-25 |
IL136117A0 (en) | 2001-05-20 |
EP1031141B1 (en) | 2005-11-02 |
EP1031141A4 (en) | 2002-01-02 |
DE69832195D1 (en) | 2005-12-08 |
DE69832195T2 (en) | 2006-08-03 |
US5999897A (en) | 1999-12-07 |
AU746342B2 (en) | 2002-04-18 |
AU1373899A (en) | 1999-06-07 |
KR20010024639A (en) | 2001-03-26 |
WO1999026234A1 (en) | 1999-05-27 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CA2309921A1 (en) | Method and apparatus for pitch estimation using perception based analysis by synthesis | |
EP1164578A3 (en) | Speech decoding method and apparatus | |
GB2102254A (en) | A speech analysis-synthesis system | |
BR9506841A (en) | Voice coidification process | |
EP0785631A3 (en) | Perceptual noise shaping in the time domain via LPC prediction in the frequency domain | |
EP1391879A3 (en) | Speech coding apparatus, linear prediction coefficient analyzing apparatus and noise reducing apparatus | |
EP0788091A3 (en) | Speech encoding and decoding method and apparatus therefor | |
CA2176665A1 (en) | Method of adapting the noise masking level in an analysis-by-synthesis speech coder employing a short-term perceptual weighting filter | |
EP0459363B1 (en) | Voice signal coding system | |
CA2014279A1 (en) | Speech coding apparatus | |
KR880004426A (en) | Language encoding processing system and speech synthesis method | |
WO1999060561A3 (en) | Split band linear prediction vocoder | |
CA2455059A1 (en) | Speech bandwidth extension apparatus and speech bandwidth extension method | |
DE69126062D1 (en) | Speech coding and decoding system | |
FI955345A (en) | Rough pitch estimation method and apparatus for telephone conversation | |
CA2189142C (en) | A multi-pulse analysis speech processing system and method | |
Eriksson et al. | Exploiting interframe correlation in spectral quantization: a study of different memory VQ schemes | |
CA2317435A1 (en) | Apparatus and method for hybrid excited linear prediction speech encoding | |
TW353748B (en) | Speech encoding method and apparatus and pitch detection method and apparatus | |
EP0724252A3 (en) | A CELP-type speech encoder having an improved long-term predictor | |
EP0772185A3 (en) | Speech decoding method and apparatus | |
WO1996036041A3 (en) | Transmission system and method for encoding speech with improved pitch detection | |
FR2596893B1 (en) | DEVICE FOR IMPLEMENTING A LEROUX-GUEGUEN ALGORITHM FOR CODING A SIGNAL BY LINEAR PREDICTION | |
AU3452397A (en) | Speech synthesis system | |
Akamine et al. | CELP coding with an adaptive density pulse excitation model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | EEER | Examination request | |
 | MKLA | Lapsed | Effective date: 20131118 |