CA2600713A1 - Time warping frames inside the vocoder by modifying the residual - Google Patents

Time warping frames inside the vocoder by modifying the residual Download PDF

Info

Publication number
CA2600713A1
CA2600713A1 CA002600713A CA2600713A CA2600713A1 CA 2600713 A1 CA2600713 A1 CA 2600713A1 CA 002600713 A CA002600713 A CA 002600713A CA 2600713 A CA2600713 A CA 2600713A CA 2600713 A1 CA2600713 A1 CA 2600713A1
Authority
CA
Canada
Prior art keywords
speech
pitch
segments
residual
pitch period
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002600713A
Other languages
French (fr)
Other versions
CA2600713C (en
Inventor
Rohit Kapoor
Serafin Diaz Spindola
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=36575961&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=CA2600713(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Individual filed Critical Individual
Publication of CA2600713A1 publication Critical patent/CA2600713A1/en
Application granted granted Critical
Publication of CA2600713C publication Critical patent/CA2600713C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/01Correction of time axis

Abstract

In one embodiment, the present invention comprises a vocoder having at least one input and at least one output, an encoder comprising a filter having at least one input operably connected to the input of the vocoder and at least one output, a decoder comprising a synthesizer having at least one input operably connected to the at least one output of the encoder, and at least one output operably connected to the at least one output of the vocoder, wherein the encoder comprises a memory and the encoder is adapted to execute instructions stored in the memory comprising classifying speech segments and encoding speech segments, and the decoder comprises a memory and the decoder is adapted to execute instructions stored in the memory comprising time-warping a residual speech signal to an expanded or compressed version of the residual speech signal.

Claims (46)

1. A method communicating speech, comprising:
time-warping a residual speech signal to an expanded or compressed version of said residual speech signal; and synthesizing said time-warped residual speech signal.
2. The method communicating speech according to claim 1, further comprising the steps of:
classifying speech segments; and encoding said speech segments.
3. The method of communicating speech according to claim 2, wherein said step of encoding speech segments comprises using prototype pitch period, code-excited linear prediction, noise-excited linear prediction or 1/8 frame coding.
4. The method of communicating speech according to claim 2, further comprising the steps of:
sending said speech signal through a linear predictive coding filter, whereby short-term correlations in said speech signal are filtered out; and outputting linear predictive coding coefficients and a residual signal.
5. The method of communicating speech according to claim 2, wherein said step of classifying speech segments comprises categorizing speech frames as periodic, slightly periodic or noisy depending on whether the frames represents voiced, unvoiced or transient speech.
6. The method of communicating speech according to claim 2, wherein said encoding is code-excited linear prediction encoding.
7. The method of communicating speech according to claim 2, wherein said encoding is prototype pitch period encoding.
8. The method of communicating speech according to claim 2, wherein said encoding is noise-excited linear prediction encoding.
9. The method according to claim 6, wherein said step of time-warping comprises:
estimating a pitch period; and adding or subtracting at least one of said pitch period after receiving said residual signal.
10. The method according to claim 6, wherein said step of time warping comprises:
estimating pitch delay;
dividing a speech frame into pitch periods, wherein boundaries of said pitch periods are determined using said pitch delay at various points in said speech frame;
overlapping said pitch periods if said residual speech signal is decreased;
and adding said pitch periods if said residual speech signal is increased.
11. The method according to claim 7, wherein said step of time warping comprises the steps of:
estimating at least one pitch period;
interpolating said at least one pitch period;
adding said at least one pitch period when expanding said residual speech signal;
and subtracting said at least one pitch period when compressing said residual speech signal.
12. The method according to claim 8, wherein said step of encoding comprises encoding linear predictive coding information as gains of different parts of a speech segment.
13. The method according to claim 10, wherein said step of overlapping said pitch periods if said speech residual signal is decreased comprises:

segmenting an input sample sequence into blocks of samples;
removing segments of said residual signal at regular time intervals;
merging said removed segments; and replacing said removed segments with a merged segment;
14. The method according to claim 10, wherein said step of estimating pitch delay comprises interpolating between a pitch delay of an end of a last frame and an end of a current frame.
15. The method according to claim 10, wherein said step of adding said pitch periods comprises merging speech segments.
16. The method according to claim 10, wherein said step of adding said pitch periods if said residual speech signal is increased comprises adding an additional pitch period created from a first pitch segment and a second pitch period segment.
17. The method according to claim 12, wherein said gains are encoded for sets of speech samples.
18. The method according to claim 13, wherein said step of merging said removed segments comprises increasing a first pitch period segment's contribution and decreasing a second pitch period segment's contribution.
19. The method according to claim 15, further comprising the step of selecting similar speech segments, wherein said similar speech segments are merged.
20. The method according to claim 15, further comprising the step of correlating speech segments, whereby similar speech segments are selected.
21. The method according to claim 16, wherein said step of adding an additional pitch period created from a first pitch segment and a second pitch period segment comprises adding said first and said second pitch segments such that said first pitch period segment's contribution increases and said second pitch period segment's contribution decreases.
22. The method according to claim 17, further comprising the step of generating a residual signal by generating random values and then applying said gains to said random values.
23. The method according to claim 17, further comprising the step of representing said linear predictive coding information as 10 encoded gain values, wherein each encoded gain value represents 16 samples of speech.
24. A vocoder having at least one input and at least one output, comprising:
an encoder comprising a filter having at least one input operably connected to the input of the vocoder and at least one output; and a decoder comprising a synthesizer having at least one input operably connected to said at least one output of said encoder and at least one output operably connected to said at least one output of the vocoder.
25. The vocoder according to claim 24, wherein said decoder comprises:
a memory, wherein said decoder is adapted to execute software instructions stored in said memory comprising time-warping a residual speech signal to an expanded or compressed version of said residual signal.
26. The vocoder according to claim 24, wherein said encoder comprises:
a memory and said encoder is adapted to execute software instructions stored in said memory comprising classifying speech segments as 1/8 frame, prototype pitch period, code-excited linear prediction or noise-excited linear prediction.
27. The vocoder according to claim 26, wherein said decoder comprises:
a memory and said decoder is adapted to execute software instructions stored in said memory comprising time-warping a residual signal to an expanded or compressed version of said residual speech signal.
28. The vocoder according to claim 27, wherein said filter is a linear predictive coding filter which is adapted to:
filter out short-term correlations in a speech signal; and output linear predictive coding coefficients and a residual signal.
29. The vocoder according to claim 27, wherein said encoder comprises:
a memory and said encoder is adapted to execute software instructions stored in said memory comprising encoding said speech segments using code-excited linear prediction encoding.
30. The vocoder according to claim 27, wherein said encoder comprises:
a memory and said encoder is adapted to execute software instructions stored in said memory comprising encoding said speech segments using prototype pitch period encoding.
31. The vocoder according to claim 27, wherein said encoder comprises:
a memory and said encoder is adapted to execute software instructions stored in said memory comprising encoding said speech segments using noise-excited linear prediction encoding.
32. The vocoder according to claim 29, wherein said time-warping software instruction comprises estimating at least one pitch period; and adding or subtracting said at least one pitch period after receiving said residual signal.
33. The vocoder according to claim 29, wherein said time-warping software instruction comprises estimating pitch delay;
dividing a speech frame into pitch periods, wherein boundaries of said pitch periods are determined using said pitch delay at various points in said speech frame;
overlapping said pitch periods if said residual speech signal is decreased;
and adding said pitch periods if said residual speech signal is increased.
34. The vocoder according to claim 30, wherein said time-warping software instruction comprises estimating at least one pitch period;
interpolating said at least one pitch period;
adding said at least one pitch period when expanding said residual speech signal;
and subtracting said at least one pitch period when compressing said residual speech signal.
35. The vocoder according to claim 31, wherein said encoding said speech segments using noise-excited linear prediction encoding software instruction comprises encoding linear predictive coding information as gains of different parts of a speech segment.
36. The vocoder according to claim 33, wherein said overlapping said pitch periods if said speech residual signal is decreased instruction comprises segmenting an input sample sequence into blocks of samples;
removing segments of said residual signal at regular time intervals;
merging said removed segments; and replacing said removed segments with a merged segment.
37. The vocoder according to claim 33, wherein said estimating pitch delay instruction comprises interpolating between a pitch delay of an end of a last frame and an end of a current frame.
38. The vocoder according to claim 33, wherein said adding said pitch periods instruction comprises merging speech segments.
39. The vocoder according to claim 33, wherein said adding said pitch periods if said speech residual signal is increased instruction comprises adding an additional pitch period created from a first pitch segment and a second pitch period segment.
40. The vocoder according to claim 35, wherein said gains are encoded for sets of speech samples.
41. The vocoder according to claim 36, wherein said merging said removed segments instruction comprises increasing a first pitch period segment's contribution and decreasing a second pitch period segment's contribution.
42. The vocoder according to claim 38, further comprising the step of selecting similar speech segments, wherein said similar speech segments are merged.
43. The vocoder to claim 38, wherein said time-warping instruction further comprises correlating speech segments, whereby similar speech segments are selected.
44. The vocoder according to claim 39, wherein said adding an additional pitch period created from a first pitch segment and a second pitch period segment instruction comprises adding said first and said second pitch segments such that said first pitch period segment's contribution increases and said second pitch period segment's contribution decreases.
45. The vocoder according to claim 40, wherein said time-warping instruction further comprises generating a residual speech signal by generating random values and then applying said gains to said random values.
46. The vocoder according to claim 40, wherein said time-warping instruction further comprises representing said linear predictive coding information as 10 encoded gain values, wherein each encoded gain value represents 16 samples of speech.
CA2600713A 2005-03-11 2006-03-13 Time warping frames inside the vocoder by modifying the residual Active CA2600713C (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US66082405P 2005-03-11 2005-03-11
US60/660,824 2005-03-11
US11/123,467 US8155965B2 (en) 2005-03-11 2005-05-05 Time warping frames inside the vocoder by modifying the residual
US11/123,467 2005-05-05
PCT/US2006/009472 WO2006099529A1 (en) 2005-03-11 2006-03-13 Time warping frames inside the vocoder by modifying the residual

Publications (2)

Publication Number Publication Date
CA2600713A1 true CA2600713A1 (en) 2006-09-21
CA2600713C CA2600713C (en) 2012-05-22

Family

ID=36575961

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2600713A Active CA2600713C (en) 2005-03-11 2006-03-13 Time warping frames inside the vocoder by modifying the residual

Country Status (14)

Country Link
US (1) US8155965B2 (en)
EP (1) EP1856689A1 (en)
JP (1) JP5203923B2 (en)
KR (2) KR100957265B1 (en)
AU (1) AU2006222963C1 (en)
BR (1) BRPI0607624B1 (en)
CA (1) CA2600713C (en)
IL (1) IL185935A (en)
MX (1) MX2007011102A (en)
NO (1) NO20075180L (en)
RU (1) RU2371784C2 (en)
SG (1) SG160380A1 (en)
TW (1) TWI389099B (en)
WO (1) WO2006099529A1 (en)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6691084B2 (en) * 1998-12-21 2004-02-10 Qualcomm Incorporated Multiple mode variable rate speech coding
CA2691959C (en) * 2004-08-30 2013-07-30 Qualcomm Incorporated Method and apparatus for an adaptive de-jitter buffer
US7674096B2 (en) * 2004-09-22 2010-03-09 Sundheim Gregroy S Portable, rotary vane vacuum pump with removable oil reservoir cartridge
US8085678B2 (en) * 2004-10-13 2011-12-27 Qualcomm Incorporated Media (voice) playback (de-jitter) buffer adjustments based on air interface
US8355907B2 (en) * 2005-03-11 2013-01-15 Qualcomm Incorporated Method and apparatus for phase matching frames in vocoders
RU2376657C2 (en) * 2005-04-01 2009-12-20 Квэлкомм Инкорпорейтед Systems, methods and apparatus for highband time warping
TWI317933B (en) * 2005-04-22 2009-12-01 Qualcomm Inc Methods, data storage medium,apparatus of signal processing,and cellular telephone including the same
US8259840B2 (en) * 2005-10-24 2012-09-04 General Motors Llc Data communication via a voice channel of a wireless communication network using discontinuities
US7720677B2 (en) * 2005-11-03 2010-05-18 Coding Technologies Ab Time warped modified transform coding of audio signals
US8239190B2 (en) * 2006-08-22 2012-08-07 Qualcomm Incorporated Time-warping frames of wideband vocoder
US8279889B2 (en) * 2007-01-04 2012-10-02 Qualcomm Incorporated Systems and methods for dimming a first packet associated with a first bit rate to a second packet associated with a second bit rate
US9653088B2 (en) * 2007-06-13 2017-05-16 Qualcomm Incorporated Systems, methods, and apparatus for signal encoding using pitch-regularizing and non-pitch-regularizing coding
US8401865B2 (en) 2007-07-18 2013-03-19 Nokia Corporation Flexible parameter update in audio/speech coded signals
US8768690B2 (en) * 2008-06-20 2014-07-01 Qualcomm Incorporated Coding scheme selection for low-bit-rate applications
US20090319261A1 (en) * 2008-06-20 2009-12-24 Qualcomm Incorporated Coding of transitional speech frames for low-bit-rate applications
US20090319263A1 (en) * 2008-06-20 2009-12-24 Qualcomm Incorporated Coding of transitional speech frames for low-bit-rate applications
EP2144230A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme having cascaded switches
MY154452A (en) 2008-07-11 2015-06-15 Fraunhofer Ges Forschung An apparatus and a method for decoding an encoded audio signal
EP2410522B1 (en) 2008-07-11 2017-10-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio signal encoder, method for encoding an audio signal and computer program
US8798776B2 (en) 2008-09-30 2014-08-05 Dolby International Ab Transcoding of audio metadata
US20100191534A1 (en) * 2009-01-23 2010-07-29 Qualcomm Incorporated Method and apparatus for compression or decompression of digital signals
US8428938B2 (en) * 2009-06-04 2013-04-23 Qualcomm Incorporated Systems and methods for reconstructing an erased speech frame
CA2778240C (en) 2009-10-20 2016-09-06 Fraunhofer Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Multi-mode audio codec and celp coding adapted therefore
WO2011130325A1 (en) 2010-04-12 2011-10-20 Smule, Inc. Continuous score-coded pitch correction and harmony generation techniques for geographically distributed glee club
TWI409802B (en) * 2010-04-14 2013-09-21 Univ Da Yeh Method and apparatus for processing audio feature
CN105304090B (en) 2011-02-14 2019-04-09 弗劳恩霍夫应用研究促进协会 Using the prediction part of alignment by audio-frequency signal coding and decoded apparatus and method
BR112013020588B1 (en) 2011-02-14 2021-07-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. APPARATUS AND METHOD FOR ENCODING A PART OF AN AUDIO SIGNAL USING A TRANSIENT DETECTION AND A QUALITY RESULT
PL2676266T3 (en) 2011-02-14 2015-08-31 Fraunhofer Ges Forschung Linear prediction based coding scheme using spectral domain noise shaping
TWI484479B (en) 2011-02-14 2015-05-11 Fraunhofer Ges Forschung Apparatus and method for error concealment in low-delay unified speech and audio coding
SG192718A1 (en) 2011-02-14 2013-09-30 Fraunhofer Ges Forschung Audio codec using noise synthesis during inactive phases
MY159444A (en) 2011-02-14 2017-01-13 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E V Encoding and decoding of pulse positions of tracks of an audio signal
PT2676267T (en) 2011-02-14 2017-09-26 Fraunhofer Ges Forschung Encoding and decoding of pulse positions of tracks of an audio signal
MX2012013025A (en) * 2011-02-14 2013-01-22 Fraunhofer Ges Forschung Information signal representation using lapped transform.
ES2529025T3 (en) 2011-02-14 2015-02-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for processing a decoded audio signal in a spectral domain
CN103092330B (en) * 2011-10-27 2015-11-25 宏碁股份有限公司 Electronic installation and speech identifying method thereof
TWI584269B (en) * 2012-07-11 2017-05-21 Univ Nat Central Unsupervised language conversion detection method
FR3024582A1 (en) * 2014-07-29 2016-02-05 Orange MANAGING FRAME LOSS IN A FD / LPD TRANSITION CONTEXT
WO2016142002A1 (en) * 2015-03-09 2016-09-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal

Family Cites Families (96)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5643800A (en) 1979-09-19 1981-04-22 Fujitsu Ltd Multilayer printed board
JPS57158247A (en) 1981-03-24 1982-09-30 Tokuyama Soda Co Ltd Flame retardant polyolefin composition
JPS59153346A (en) 1983-02-21 1984-09-01 Nec Corp Voice encoding and decoding device
JPS61156949A (en) 1984-12-27 1986-07-16 Matsushita Electric Ind Co Ltd Packetized voice communication system
BE1000415A7 (en) 1987-03-18 1988-11-22 Bell Telephone Mfg Asynchronous based on time division operating communication.
JPS6429141A (en) 1987-07-24 1989-01-31 Nec Corp Packet exchange system
JP2760810B2 (en) 1988-09-19 1998-06-04 株式会社日立製作所 Voice packet processing method
SE462277B (en) 1988-10-05 1990-05-28 Vme Ind Sweden Ab HYDRAULIC CONTROL SYSTEM
JPH04113744A (en) 1990-09-04 1992-04-15 Fujitsu Ltd Variable speed packet transmission system
AU642540B2 (en) 1990-09-19 1993-10-21 Philips Electronics N.V. Record carrier on which a main data file and a control file have been recorded, method of and device for recording the main data file and the control file, and device for reading the record carrier
JP2846443B2 (en) 1990-10-09 1999-01-13 三菱電機株式会社 Packet assembly and disassembly device
US5283811A (en) 1991-09-03 1994-02-01 General Electric Company Decision feedback equalization for digital cellular radio
US5371853A (en) * 1991-10-28 1994-12-06 University Of Maryland At College Park Method and system for CELP speech coding and codebook for use therewith
US5317604A (en) 1992-12-30 1994-05-31 Gte Government Systems Corporation Isochronous interface method
JP3186315B2 (en) 1993-02-27 2001-07-11 ソニー株式会社 Signal compression device, signal decompression device, signal transmission device, signal reception device, and signal transmission / reception device
US5490479A (en) 1993-05-10 1996-02-13 Shalev; Matti Method and a product resulting from the use of the method for elevating feed storage bins
US5440562A (en) 1993-12-27 1995-08-08 Motorola, Inc. Communication through a channel having a variable propagation delay
WO1996005697A1 (en) 1994-08-12 1996-02-22 Sony Corporation Video signal editing device
NL9401696A (en) 1994-10-14 1996-05-01 Nederland Ptt Buffer readout control from ATM receiver.
US5602959A (en) 1994-12-05 1997-02-11 Motorola, Inc. Method and apparatus for characterization and reconstruction of speech excitation waveforms
US5699478A (en) 1995-03-10 1997-12-16 Lucent Technologies Inc. Frame erasure compensation technique
US5929921A (en) 1995-03-16 1999-07-27 Matsushita Electric Industrial Co., Ltd. Video and audio signal multiplex sending apparatus, receiving apparatus and transmitting apparatus
JP3286110B2 (en) 1995-03-16 2002-05-27 松下電器産業株式会社 Voice packet interpolation device
KR0164827B1 (en) 1995-03-31 1999-03-20 김광호 The program guide signal receiver
JPH09127995A (en) 1995-10-26 1997-05-16 Sony Corp Signal decoding method and signal decoder
US5640388A (en) 1995-12-21 1997-06-17 Scientific-Atlanta, Inc. Method and apparatus for removing jitter and correcting timestamps in a packet stream
JPH09261613A (en) 1996-03-26 1997-10-03 Mitsubishi Electric Corp Data reception/reproducing device
US5940479A (en) 1996-10-01 1999-08-17 Northern Telecom Limited System and method for transmitting aural information between a computer and telephone equipment
JPH10190735A (en) 1996-12-27 1998-07-21 Secom Co Ltd Communication system
US6073092A (en) 1997-06-26 2000-06-06 Telogy Networks, Inc. Method for speech coding based on a code excited linear prediction (CELP) model
US6240386B1 (en) 1998-08-24 2001-05-29 Conexant Systems, Inc. Speech codec employing noise classification for noise compensation
US6259677B1 (en) 1998-09-30 2001-07-10 Cisco Technology, Inc. Clock synchronization and dynamic jitter management for voice over IP and real-time data
US6370125B1 (en) 1998-10-08 2002-04-09 Adtran, Inc. Dynamic delay compensation for packet-based voice network
US6456964B2 (en) * 1998-12-21 2002-09-24 Qualcomm, Incorporated Encoding of periodic speech using prototype waveforms
US6922669B2 (en) 1998-12-29 2005-07-26 Koninklijke Philips Electronics N.V. Knowledge-based strategies applied to N-best lists in automatic speech recognition systems
US7117156B1 (en) 1999-04-19 2006-10-03 At&T Corp. Method and apparatus for performing packet loss or frame erasure concealment
JP4975213B2 (en) 1999-04-19 2012-07-11 エイ・ティ・アンド・ティ・コーポレーション Frame erasing concealment processor
GB9911737D0 (en) * 1999-05-21 1999-07-21 Philips Electronics Nv Audio signal time scale modification
US6785230B1 (en) 1999-05-25 2004-08-31 Matsushita Electric Industrial Co., Ltd. Audio transmission apparatus
JP4218186B2 (en) 1999-05-25 2009-02-04 パナソニック株式会社 Audio transmission device
JP4895418B2 (en) 1999-08-24 2012-03-14 ソニー株式会社 Audio reproduction method and audio reproduction apparatus
WO2001020595A1 (en) 1999-09-14 2001-03-22 Fujitsu Limited Voice encoder/decoder
US6377931B1 (en) 1999-09-28 2002-04-23 Mindspeed Technologies Speech manipulation for continuous speech playback over a packet network
US6859460B1 (en) 1999-10-22 2005-02-22 Cisco Technology, Inc. System and method for providing multimedia jitter buffer adjustment for packet-switched networks
US6665317B1 (en) 1999-10-29 2003-12-16 Array Telecom Corporation Method, system, and computer program product for managing jitter
US6496794B1 (en) 1999-11-22 2002-12-17 Motorola, Inc. Method and apparatus for seamless multi-rate speech coding
US6693921B1 (en) 1999-11-30 2004-02-17 Mindspeed Technologies, Inc. System for use of packet statistics in de-jitter delay adaption in a packet network
US6366880B1 (en) * 1999-11-30 2002-04-02 Motorola, Inc. Method and apparatus for suppressing acoustic background noise in a communication system by equaliztion of pre-and post-comb-filtered subband spectral energies
GB2360178B (en) 2000-03-06 2004-04-14 Mitel Corp Sub-packet insertion for packet loss compensation in Voice Over IP networks
US6813274B1 (en) 2000-03-21 2004-11-02 Cisco Technology, Inc. Network switch and method for data switching using a crossbar switch fabric with output port groups operating concurrently and independently
DE60132080T2 (en) 2000-04-03 2008-12-11 Ericsson Inc., Plano METHOD AND DEVICE FOR EFFICIENT FIELDS IN DATA PACKET COMMUNICATION SYSTEMS
EP1796083B1 (en) 2000-04-24 2009-01-07 Qualcomm Incorporated Method and apparatus for predictively quantizing voiced speech
US6584438B1 (en) * 2000-04-24 2003-06-24 Qualcomm Incorporated Frame erasure compensation method in a variable rate speech coder
SE518941C2 (en) 2000-05-31 2002-12-10 Ericsson Telefon Ab L M Device and method related to communication of speech
US7016970B2 (en) 2000-07-06 2006-03-21 Matsushita Electric Industrial Co., Ltd. System for transmitting stream data from server to client based on buffer and transmission capacities and delay time of the client
US7155518B2 (en) 2001-01-08 2006-12-26 Interactive People Unplugged Ab Extranet workgroup formation across multiple mobile virtual private networks
US20020133334A1 (en) * 2001-02-02 2002-09-19 Geert Coorman Time scale modification of digitally sampled waveforms in the time domain
US20040204935A1 (en) 2001-02-21 2004-10-14 Krishnasamy Anandakumar Adaptive voice playout in VOP
US7212517B2 (en) 2001-04-09 2007-05-01 Lucent Technologies Inc. Method and apparatus for jitter and frame erasure correction in packetized voice communication systems
ATE353503T1 (en) * 2001-04-24 2007-02-15 Nokia Corp METHOD FOR CHANGING THE SIZE OF A CLIMBER BUFFER FOR TIME ALIGNMENT, COMMUNICATIONS SYSTEM, RECEIVER SIDE AND TRANSCODER
US7006511B2 (en) 2001-07-17 2006-02-28 Avaya Technology Corp. Dynamic jitter buffering for voice-over-IP and other packet-based communication systems
US7266127B2 (en) 2002-02-08 2007-09-04 Lucent Technologies Inc. Method and system to compensate for the effects of packet delays on speech quality in a Voice-over IP system
US7079486B2 (en) 2002-02-13 2006-07-18 Agere Systems Inc. Adaptive threshold based jitter buffer management for packetized data
US7158572B2 (en) 2002-02-14 2007-01-02 Tellabs Operations, Inc. Audio enhancement communication techniques
US7126957B1 (en) 2002-03-07 2006-10-24 Utstarcom, Inc. Media flow method for transferring real-time data between asynchronous and synchronous networks
US7263109B2 (en) 2002-03-11 2007-08-28 Conexant, Inc. Clock skew compensation for a jitter buffer
US20030187663A1 (en) 2002-03-28 2003-10-02 Truman Michael Mead Broadband frequency translation for high frequency regeneration
JP3761486B2 (en) 2002-03-29 2006-03-29 Necインフロンティア株式会社 Wireless LAN system, main device and program
US20050228648A1 (en) 2002-04-22 2005-10-13 Ari Heikkinen Method and device for obtaining parameters for parametric speech coding of frames
US7496086B2 (en) 2002-04-30 2009-02-24 Alcatel-Lucent Usa Inc. Techniques for jitter buffer delay management
US7280510B2 (en) 2002-05-21 2007-10-09 Nortel Networks Limited Controlling reverse channel activity in a wireless communications system
AU2002309146A1 (en) 2002-06-14 2003-12-31 Nokia Corporation Enhanced error concealment for spatial audio
US7336678B2 (en) 2002-07-31 2008-02-26 Intel Corporation State-based jitter buffer and method of operation
US8520519B2 (en) 2002-09-20 2013-08-27 Broadcom Corporation External jitter buffer in a packet voice system
JP3796240B2 (en) 2002-09-30 2006-07-12 三洋電機株式会社 Network telephone and voice decoding apparatus
JP4146708B2 (en) 2002-10-31 2008-09-10 京セラ株式会社 COMMUNICATION SYSTEM, RADIO COMMUNICATION TERMINAL, DATA DISTRIBUTION DEVICE, AND COMMUNICATION METHOD
US6996626B1 (en) 2002-12-03 2006-02-07 Crystalvoice Communications Continuous bandwidth assessment and feedback for voice-over-internet-protocol (VoIP) comparing packet's voice duration and arrival rate
KR100517237B1 (en) 2002-12-09 2005-09-27 한국전자통신연구원 Method and apparatus for channel quality estimation and link adaptation in the orthogonal frequency division multiplexing wireless communications systems
US7525918B2 (en) 2003-01-21 2009-04-28 Broadcom Corporation Using RTCP statistics for media system control
US7394833B2 (en) * 2003-02-11 2008-07-01 Nokia Corporation Method and apparatus for reducing synchronization delay in packet switched voice terminals using speech decoder modification
JP2004266724A (en) 2003-03-04 2004-09-24 Matsushita Electric Ind Co Ltd Real time voice buffer control apparatus
JP3825007B2 (en) 2003-03-11 2006-09-20 沖電気工業株式会社 Jitter buffer control method
US7551671B2 (en) 2003-04-16 2009-06-23 General Dynamics Decision Systems, Inc. System and method for transmission of video signals using multiple channels
JP2005057504A (en) 2003-08-05 2005-03-03 Matsushita Electric Ind Co Ltd Data communication apparatus and data communication method
DE60323886D1 (en) 2003-08-15 2008-11-13 Research In Motion Ltd DEVICE AND ASSOCIATED METHOD FOR OBTAINING QUALITY OF SERVICE QUALITY DURING TRANSMISSION IN A RADIO COMMUNICATION SYSTEM
US7596488B2 (en) 2003-09-15 2009-09-29 Microsoft Corporation System and method for real-time jitter control and packet-loss concealment in an audio signal
US7505764B2 (en) 2003-10-28 2009-03-17 Motorola, Inc. Method for retransmitting a speech packet
US7272400B1 (en) 2003-12-19 2007-09-18 Core Mobility, Inc. Load balancing between users of a wireless base station
US7424026B2 (en) 2004-04-28 2008-09-09 Nokia Corporation Method and apparatus providing continuous adaptive control of voice packet buffer at receiver terminal
JP4076981B2 (en) 2004-08-09 2008-04-16 Kddi株式会社 Communication terminal apparatus and buffer control method
US8085678B2 (en) 2004-10-13 2011-12-27 Qualcomm Incorporated Media (voice) playback (de-jitter) buffer adjustments based on air interface
SG124307A1 (en) * 2005-01-20 2006-08-30 St Microelectronics Asia Method and system for lost packet concealment in high quality audio streaming applications
US8102872B2 (en) 2005-02-01 2012-01-24 Qualcomm Incorporated Method for discontinuous transmission and accurate reproduction of background noise information
US20060187970A1 (en) 2005-02-22 2006-08-24 Minkyu Lee Method and apparatus for handling network jitter in a Voice-over IP communications network using a virtual jitter buffer and time scale modification
US8355907B2 (en) 2005-03-11 2013-01-15 Qualcomm Incorporated Method and apparatus for phase matching frames in vocoders
RU2376657C2 (en) * 2005-04-01 2009-12-20 Квэлкомм Инкорпорейтед Systems, methods and apparatus for highband time warping

Also Published As

Publication number Publication date
NO20075180L (en) 2007-10-31
IL185935A0 (en) 2008-01-06
KR20090119936A (en) 2009-11-20
RU2007137643A (en) 2009-04-20
CA2600713C (en) 2012-05-22
EP1856689A1 (en) 2007-11-21
AU2006222963A1 (en) 2006-09-21
US20060206334A1 (en) 2006-09-14
JP2008533529A (en) 2008-08-21
TWI389099B (en) 2013-03-11
RU2371784C2 (en) 2009-10-27
US8155965B2 (en) 2012-04-10
WO2006099529A1 (en) 2006-09-21
KR20070112832A (en) 2007-11-27
BRPI0607624B1 (en) 2019-03-26
TW200638336A (en) 2006-11-01
MX2007011102A (en) 2007-11-22
JP5203923B2 (en) 2013-06-05
IL185935A (en) 2013-09-30
BRPI0607624A2 (en) 2009-09-22
KR100957265B1 (en) 2010-05-12
AU2006222963B2 (en) 2010-04-08
AU2006222963C1 (en) 2010-09-16
KR100956623B1 (en) 2010-05-11
SG160380A1 (en) 2010-04-29

Similar Documents

Publication Publication Date Title
CA2600713A1 (en) Time warping frames inside the vocoder by modifying the residual
EP3493204B1 (en) Method for encoding of integrated speech and audio
KR100647336B1 (en) Apparatus and method for adaptive time/frequency-based encoding/decoding
US6427135B1 (en) Method for encoding speech wherein pitch periods are changed based upon input speech signal
EP2272062B1 (en) An audio signal classifier
KR101058761B1 (en) Time-warping of Frames in Wideband Vocoder
US10706865B2 (en) Apparatus and method for selecting one of a first encoding algorithm and a second encoding algorithm using harmonics reduction
CN103493129B (en) For using Transient detection and quality results by the apparatus and method of the code segment of audio signal
KR102561265B1 (en) Coding mode determination method and apparatus, audio encoding method and apparatus, and audio decoding method and apparatus
EP2096631A1 (en) Audio decoding device and power adjusting method
EP2772914A1 (en) Hybrid sound-signal decoder, hybrid sound-signal encoder, sound-signal decoding method, and sound-signal encoding method
JP6607921B2 (en) Budget determination for LPD / FD transition frame encoding
JP3559485B2 (en) Post-processing method and device for audio signal and recording medium recording program
US20120143602A1 (en) Speech decoder and method for decoding segmented speech frames
KR101847213B1 (en) Method and apparatus for decoding audio signal using shaping function
KR20100006491A (en) Method and apparatus for encoding and decoding silence signal
WO2005045808A1 (en) Harmonic noise weighting in digital speech coders
KR100624545B1 (en) Method for the speech compression and synthesis in TTS system
CN113826161A (en) Method and device for detecting attack in a sound signal to be coded and decoded and for coding and decoding the detected attack

Legal Events

Date Code Title Description
EEER Examination request