CA2600713A1 - Time warping frames inside the vocoder by modifying the residual - Google Patents
Time warping frames inside the vocoder by modifying the residual Download PDFInfo
- Publication number
- CA2600713A1 CA2600713A1 CA002600713A CA2600713A CA2600713A1 CA 2600713 A1 CA2600713 A1 CA 2600713A1 CA 002600713 A CA002600713 A CA 002600713A CA 2600713 A CA2600713 A CA 2600713A CA 2600713 A1 CA2600713 A1 CA 2600713A1
- Authority
- CA
- Canada
- Prior art keywords
- speech
- pitch
- segments
- residual
- pitch period
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/01—Correction of time axis
Abstract
In one embodiment, the present invention comprises a vocoder having at least one input and at least one output, an encoder comprising a filter having at least one input operably connected to the input of the vocoder and at least one output, a decoder comprising a synthesizer having at least one input operably connected to the at least one output of the encoder, and at least one output operably connected to the at least one output of the vocoder, wherein the encoder comprises a memory and the encoder is adapted to execute instructions stored in the memory comprising classifying speech segments and encoding speech segments, and the decoder comprises a memory and the decoder is adapted to execute instructions stored in the memory comprising time-warping a residual speech signal to an expanded or compressed version of the residual speech signal.
Claims (46)
1. A method communicating speech, comprising:
time-warping a residual speech signal to an expanded or compressed version of said residual speech signal; and synthesizing said time-warped residual speech signal.
time-warping a residual speech signal to an expanded or compressed version of said residual speech signal; and synthesizing said time-warped residual speech signal.
2. The method communicating speech according to claim 1, further comprising the steps of:
classifying speech segments; and encoding said speech segments.
classifying speech segments; and encoding said speech segments.
3. The method of communicating speech according to claim 2, wherein said step of encoding speech segments comprises using prototype pitch period, code-excited linear prediction, noise-excited linear prediction or 1/8 frame coding.
4. The method of communicating speech according to claim 2, further comprising the steps of:
sending said speech signal through a linear predictive coding filter, whereby short-term correlations in said speech signal are filtered out; and outputting linear predictive coding coefficients and a residual signal.
sending said speech signal through a linear predictive coding filter, whereby short-term correlations in said speech signal are filtered out; and outputting linear predictive coding coefficients and a residual signal.
5. The method of communicating speech according to claim 2, wherein said step of classifying speech segments comprises categorizing speech frames as periodic, slightly periodic or noisy depending on whether the frames represents voiced, unvoiced or transient speech.
6. The method of communicating speech according to claim 2, wherein said encoding is code-excited linear prediction encoding.
7. The method of communicating speech according to claim 2, wherein said encoding is prototype pitch period encoding.
8. The method of communicating speech according to claim 2, wherein said encoding is noise-excited linear prediction encoding.
9. The method according to claim 6, wherein said step of time-warping comprises:
estimating a pitch period; and adding or subtracting at least one of said pitch period after receiving said residual signal.
estimating a pitch period; and adding or subtracting at least one of said pitch period after receiving said residual signal.
10. The method according to claim 6, wherein said step of time warping comprises:
estimating pitch delay;
dividing a speech frame into pitch periods, wherein boundaries of said pitch periods are determined using said pitch delay at various points in said speech frame;
overlapping said pitch periods if said residual speech signal is decreased;
and adding said pitch periods if said residual speech signal is increased.
estimating pitch delay;
dividing a speech frame into pitch periods, wherein boundaries of said pitch periods are determined using said pitch delay at various points in said speech frame;
overlapping said pitch periods if said residual speech signal is decreased;
and adding said pitch periods if said residual speech signal is increased.
11. The method according to claim 7, wherein said step of time warping comprises the steps of:
estimating at least one pitch period;
interpolating said at least one pitch period;
adding said at least one pitch period when expanding said residual speech signal;
and subtracting said at least one pitch period when compressing said residual speech signal.
estimating at least one pitch period;
interpolating said at least one pitch period;
adding said at least one pitch period when expanding said residual speech signal;
and subtracting said at least one pitch period when compressing said residual speech signal.
12. The method according to claim 8, wherein said step of encoding comprises encoding linear predictive coding information as gains of different parts of a speech segment.
13. The method according to claim 10, wherein said step of overlapping said pitch periods if said speech residual signal is decreased comprises:
segmenting an input sample sequence into blocks of samples;
removing segments of said residual signal at regular time intervals;
merging said removed segments; and replacing said removed segments with a merged segment;
segmenting an input sample sequence into blocks of samples;
removing segments of said residual signal at regular time intervals;
merging said removed segments; and replacing said removed segments with a merged segment;
14. The method according to claim 10, wherein said step of estimating pitch delay comprises interpolating between a pitch delay of an end of a last frame and an end of a current frame.
15. The method according to claim 10, wherein said step of adding said pitch periods comprises merging speech segments.
16. The method according to claim 10, wherein said step of adding said pitch periods if said residual speech signal is increased comprises adding an additional pitch period created from a first pitch segment and a second pitch period segment.
17. The method according to claim 12, wherein said gains are encoded for sets of speech samples.
18. The method according to claim 13, wherein said step of merging said removed segments comprises increasing a first pitch period segment's contribution and decreasing a second pitch period segment's contribution.
19. The method according to claim 15, further comprising the step of selecting similar speech segments, wherein said similar speech segments are merged.
20. The method according to claim 15, further comprising the step of correlating speech segments, whereby similar speech segments are selected.
21. The method according to claim 16, wherein said step of adding an additional pitch period created from a first pitch segment and a second pitch period segment comprises adding said first and said second pitch segments such that said first pitch period segment's contribution increases and said second pitch period segment's contribution decreases.
22. The method according to claim 17, further comprising the step of generating a residual signal by generating random values and then applying said gains to said random values.
23. The method according to claim 17, further comprising the step of representing said linear predictive coding information as 10 encoded gain values, wherein each encoded gain value represents 16 samples of speech.
24. A vocoder having at least one input and at least one output, comprising:
an encoder comprising a filter having at least one input operably connected to the input of the vocoder and at least one output; and a decoder comprising a synthesizer having at least one input operably connected to said at least one output of said encoder and at least one output operably connected to said at least one output of the vocoder.
an encoder comprising a filter having at least one input operably connected to the input of the vocoder and at least one output; and a decoder comprising a synthesizer having at least one input operably connected to said at least one output of said encoder and at least one output operably connected to said at least one output of the vocoder.
25. The vocoder according to claim 24, wherein said decoder comprises:
a memory, wherein said decoder is adapted to execute software instructions stored in said memory comprising time-warping a residual speech signal to an expanded or compressed version of said residual signal.
a memory, wherein said decoder is adapted to execute software instructions stored in said memory comprising time-warping a residual speech signal to an expanded or compressed version of said residual signal.
26. The vocoder according to claim 24, wherein said encoder comprises:
a memory and said encoder is adapted to execute software instructions stored in said memory comprising classifying speech segments as 1/8 frame, prototype pitch period, code-excited linear prediction or noise-excited linear prediction.
a memory and said encoder is adapted to execute software instructions stored in said memory comprising classifying speech segments as 1/8 frame, prototype pitch period, code-excited linear prediction or noise-excited linear prediction.
27. The vocoder according to claim 26, wherein said decoder comprises:
a memory and said decoder is adapted to execute software instructions stored in said memory comprising time-warping a residual signal to an expanded or compressed version of said residual speech signal.
a memory and said decoder is adapted to execute software instructions stored in said memory comprising time-warping a residual signal to an expanded or compressed version of said residual speech signal.
28. The vocoder according to claim 27, wherein said filter is a linear predictive coding filter which is adapted to:
filter out short-term correlations in a speech signal; and output linear predictive coding coefficients and a residual signal.
filter out short-term correlations in a speech signal; and output linear predictive coding coefficients and a residual signal.
29. The vocoder according to claim 27, wherein said encoder comprises:
a memory and said encoder is adapted to execute software instructions stored in said memory comprising encoding said speech segments using code-excited linear prediction encoding.
a memory and said encoder is adapted to execute software instructions stored in said memory comprising encoding said speech segments using code-excited linear prediction encoding.
30. The vocoder according to claim 27, wherein said encoder comprises:
a memory and said encoder is adapted to execute software instructions stored in said memory comprising encoding said speech segments using prototype pitch period encoding.
a memory and said encoder is adapted to execute software instructions stored in said memory comprising encoding said speech segments using prototype pitch period encoding.
31. The vocoder according to claim 27, wherein said encoder comprises:
a memory and said encoder is adapted to execute software instructions stored in said memory comprising encoding said speech segments using noise-excited linear prediction encoding.
a memory and said encoder is adapted to execute software instructions stored in said memory comprising encoding said speech segments using noise-excited linear prediction encoding.
32. The vocoder according to claim 29, wherein said time-warping software instruction comprises estimating at least one pitch period; and adding or subtracting said at least one pitch period after receiving said residual signal.
33. The vocoder according to claim 29, wherein said time-warping software instruction comprises estimating pitch delay;
dividing a speech frame into pitch periods, wherein boundaries of said pitch periods are determined using said pitch delay at various points in said speech frame;
overlapping said pitch periods if said residual speech signal is decreased;
and adding said pitch periods if said residual speech signal is increased.
dividing a speech frame into pitch periods, wherein boundaries of said pitch periods are determined using said pitch delay at various points in said speech frame;
overlapping said pitch periods if said residual speech signal is decreased;
and adding said pitch periods if said residual speech signal is increased.
34. The vocoder according to claim 30, wherein said time-warping software instruction comprises estimating at least one pitch period;
interpolating said at least one pitch period;
adding said at least one pitch period when expanding said residual speech signal;
and subtracting said at least one pitch period when compressing said residual speech signal.
interpolating said at least one pitch period;
adding said at least one pitch period when expanding said residual speech signal;
and subtracting said at least one pitch period when compressing said residual speech signal.
35. The vocoder according to claim 31, wherein said encoding said speech segments using noise-excited linear prediction encoding software instruction comprises encoding linear predictive coding information as gains of different parts of a speech segment.
36. The vocoder according to claim 33, wherein said overlapping said pitch periods if said speech residual signal is decreased instruction comprises segmenting an input sample sequence into blocks of samples;
removing segments of said residual signal at regular time intervals;
merging said removed segments; and replacing said removed segments with a merged segment.
removing segments of said residual signal at regular time intervals;
merging said removed segments; and replacing said removed segments with a merged segment.
37. The vocoder according to claim 33, wherein said estimating pitch delay instruction comprises interpolating between a pitch delay of an end of a last frame and an end of a current frame.
38. The vocoder according to claim 33, wherein said adding said pitch periods instruction comprises merging speech segments.
39. The vocoder according to claim 33, wherein said adding said pitch periods if said speech residual signal is increased instruction comprises adding an additional pitch period created from a first pitch segment and a second pitch period segment.
40. The vocoder according to claim 35, wherein said gains are encoded for sets of speech samples.
41. The vocoder according to claim 36, wherein said merging said removed segments instruction comprises increasing a first pitch period segment's contribution and decreasing a second pitch period segment's contribution.
42. The vocoder according to claim 38, further comprising the step of selecting similar speech segments, wherein said similar speech segments are merged.
43. The vocoder to claim 38, wherein said time-warping instruction further comprises correlating speech segments, whereby similar speech segments are selected.
44. The vocoder according to claim 39, wherein said adding an additional pitch period created from a first pitch segment and a second pitch period segment instruction comprises adding said first and said second pitch segments such that said first pitch period segment's contribution increases and said second pitch period segment's contribution decreases.
45. The vocoder according to claim 40, wherein said time-warping instruction further comprises generating a residual speech signal by generating random values and then applying said gains to said random values.
46. The vocoder according to claim 40, wherein said time-warping instruction further comprises representing said linear predictive coding information as 10 encoded gain values, wherein each encoded gain value represents 16 samples of speech.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US66082405P | 2005-03-11 | 2005-03-11 | |
US60/660,824 | 2005-03-11 | ||
US11/123,467 US8155965B2 (en) | 2005-03-11 | 2005-05-05 | Time warping frames inside the vocoder by modifying the residual |
US11/123,467 | 2005-05-05 | ||
PCT/US2006/009472 WO2006099529A1 (en) | 2005-03-11 | 2006-03-13 | Time warping frames inside the vocoder by modifying the residual |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2600713A1 true CA2600713A1 (en) | 2006-09-21 |
CA2600713C CA2600713C (en) | 2012-05-22 |
Family
ID=36575961
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA2600713A Active CA2600713C (en) | 2005-03-11 | 2006-03-13 | Time warping frames inside the vocoder by modifying the residual |
Country Status (14)
Country | Link |
---|---|
US (1) | US8155965B2 (en) |
EP (1) | EP1856689A1 (en) |
JP (1) | JP5203923B2 (en) |
KR (2) | KR100957265B1 (en) |
AU (1) | AU2006222963C1 (en) |
BR (1) | BRPI0607624B1 (en) |
CA (1) | CA2600713C (en) |
IL (1) | IL185935A (en) |
MX (1) | MX2007011102A (en) |
NO (1) | NO20075180L (en) |
RU (1) | RU2371784C2 (en) |
SG (1) | SG160380A1 (en) |
TW (1) | TWI389099B (en) |
WO (1) | WO2006099529A1 (en) |
Families Citing this family (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6691084B2 (en) * | 1998-12-21 | 2004-02-10 | Qualcomm Incorporated | Multiple mode variable rate speech coding |
CA2691959C (en) * | 2004-08-30 | 2013-07-30 | Qualcomm Incorporated | Method and apparatus for an adaptive de-jitter buffer |
US7674096B2 (en) * | 2004-09-22 | 2010-03-09 | Sundheim Gregroy S | Portable, rotary vane vacuum pump with removable oil reservoir cartridge |
US8085678B2 (en) * | 2004-10-13 | 2011-12-27 | Qualcomm Incorporated | Media (voice) playback (de-jitter) buffer adjustments based on air interface |
US8355907B2 (en) * | 2005-03-11 | 2013-01-15 | Qualcomm Incorporated | Method and apparatus for phase matching frames in vocoders |
RU2376657C2 (en) * | 2005-04-01 | 2009-12-20 | Квэлкомм Инкорпорейтед | Systems, methods and apparatus for highband time warping |
TWI317933B (en) * | 2005-04-22 | 2009-12-01 | Qualcomm Inc | Methods, data storage medium,apparatus of signal processing,and cellular telephone including the same |
US8259840B2 (en) * | 2005-10-24 | 2012-09-04 | General Motors Llc | Data communication via a voice channel of a wireless communication network using discontinuities |
US7720677B2 (en) * | 2005-11-03 | 2010-05-18 | Coding Technologies Ab | Time warped modified transform coding of audio signals |
US8239190B2 (en) * | 2006-08-22 | 2012-08-07 | Qualcomm Incorporated | Time-warping frames of wideband vocoder |
US8279889B2 (en) * | 2007-01-04 | 2012-10-02 | Qualcomm Incorporated | Systems and methods for dimming a first packet associated with a first bit rate to a second packet associated with a second bit rate |
US9653088B2 (en) * | 2007-06-13 | 2017-05-16 | Qualcomm Incorporated | Systems, methods, and apparatus for signal encoding using pitch-regularizing and non-pitch-regularizing coding |
US8401865B2 (en) | 2007-07-18 | 2013-03-19 | Nokia Corporation | Flexible parameter update in audio/speech coded signals |
US8768690B2 (en) * | 2008-06-20 | 2014-07-01 | Qualcomm Incorporated | Coding scheme selection for low-bit-rate applications |
US20090319261A1 (en) * | 2008-06-20 | 2009-12-24 | Qualcomm Incorporated | Coding of transitional speech frames for low-bit-rate applications |
US20090319263A1 (en) * | 2008-06-20 | 2009-12-24 | Qualcomm Incorporated | Coding of transitional speech frames for low-bit-rate applications |
EP2144230A1 (en) * | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low bitrate audio encoding/decoding scheme having cascaded switches |
MY154452A (en) | 2008-07-11 | 2015-06-15 | Fraunhofer Ges Forschung | An apparatus and a method for decoding an encoded audio signal |
EP2410522B1 (en) | 2008-07-11 | 2017-10-04 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio signal encoder, method for encoding an audio signal and computer program |
US8798776B2 (en) | 2008-09-30 | 2014-08-05 | Dolby International Ab | Transcoding of audio metadata |
US20100191534A1 (en) * | 2009-01-23 | 2010-07-29 | Qualcomm Incorporated | Method and apparatus for compression or decompression of digital signals |
US8428938B2 (en) * | 2009-06-04 | 2013-04-23 | Qualcomm Incorporated | Systems and methods for reconstructing an erased speech frame |
CA2778240C (en) | 2009-10-20 | 2016-09-06 | Fraunhofer Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Multi-mode audio codec and celp coding adapted therefore |
WO2011130325A1 (en) | 2010-04-12 | 2011-10-20 | Smule, Inc. | Continuous score-coded pitch correction and harmony generation techniques for geographically distributed glee club |
TWI409802B (en) * | 2010-04-14 | 2013-09-21 | Univ Da Yeh | Method and apparatus for processing audio feature |
CN105304090B (en) | 2011-02-14 | 2019-04-09 | 弗劳恩霍夫应用研究促进协会 | Using the prediction part of alignment by audio-frequency signal coding and decoded apparatus and method |
BR112013020588B1 (en) | 2011-02-14 | 2021-07-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | APPARATUS AND METHOD FOR ENCODING A PART OF AN AUDIO SIGNAL USING A TRANSIENT DETECTION AND A QUALITY RESULT |
PL2676266T3 (en) | 2011-02-14 | 2015-08-31 | Fraunhofer Ges Forschung | Linear prediction based coding scheme using spectral domain noise shaping |
TWI484479B (en) | 2011-02-14 | 2015-05-11 | Fraunhofer Ges Forschung | Apparatus and method for error concealment in low-delay unified speech and audio coding |
SG192718A1 (en) | 2011-02-14 | 2013-09-30 | Fraunhofer Ges Forschung | Audio codec using noise synthesis during inactive phases |
MY159444A (en) | 2011-02-14 | 2017-01-13 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E V | Encoding and decoding of pulse positions of tracks of an audio signal |
PT2676267T (en) | 2011-02-14 | 2017-09-26 | Fraunhofer Ges Forschung | Encoding and decoding of pulse positions of tracks of an audio signal |
MX2012013025A (en) * | 2011-02-14 | 2013-01-22 | Fraunhofer Ges Forschung | Information signal representation using lapped transform. |
ES2529025T3 (en) | 2011-02-14 | 2015-02-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing a decoded audio signal in a spectral domain |
CN103092330B (en) * | 2011-10-27 | 2015-11-25 | 宏碁股份有限公司 | Electronic installation and speech identifying method thereof |
TWI584269B (en) * | 2012-07-11 | 2017-05-21 | Univ Nat Central | Unsupervised language conversion detection method |
FR3024582A1 (en) * | 2014-07-29 | 2016-02-05 | Orange | MANAGING FRAME LOSS IN A FD / LPD TRANSITION CONTEXT |
WO2016142002A1 (en) * | 2015-03-09 | 2016-09-15 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal |
Family Cites Families (96)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5643800A (en) | 1979-09-19 | 1981-04-22 | Fujitsu Ltd | Multilayer printed board |
JPS57158247A (en) | 1981-03-24 | 1982-09-30 | Tokuyama Soda Co Ltd | Flame retardant polyolefin composition |
JPS59153346A (en) | 1983-02-21 | 1984-09-01 | Nec Corp | Voice encoding and decoding device |
JPS61156949A (en) | 1984-12-27 | 1986-07-16 | Matsushita Electric Ind Co Ltd | Packetized voice communication system |
BE1000415A7 (en) | 1987-03-18 | 1988-11-22 | Bell Telephone Mfg | Asynchronous based on time division operating communication. |
JPS6429141A (en) | 1987-07-24 | 1989-01-31 | Nec Corp | Packet exchange system |
JP2760810B2 (en) | 1988-09-19 | 1998-06-04 | 株式会社日立製作所 | Voice packet processing method |
SE462277B (en) | 1988-10-05 | 1990-05-28 | Vme Ind Sweden Ab | HYDRAULIC CONTROL SYSTEM |
JPH04113744A (en) | 1990-09-04 | 1992-04-15 | Fujitsu Ltd | Variable speed packet transmission system |
AU642540B2 (en) | 1990-09-19 | 1993-10-21 | Philips Electronics N.V. | Record carrier on which a main data file and a control file have been recorded, method of and device for recording the main data file and the control file, and device for reading the record carrier |
JP2846443B2 (en) | 1990-10-09 | 1999-01-13 | 三菱電機株式会社 | Packet assembly and disassembly device |
US5283811A (en) | 1991-09-03 | 1994-02-01 | General Electric Company | Decision feedback equalization for digital cellular radio |
US5371853A (en) * | 1991-10-28 | 1994-12-06 | University Of Maryland At College Park | Method and system for CELP speech coding and codebook for use therewith |
US5317604A (en) | 1992-12-30 | 1994-05-31 | Gte Government Systems Corporation | Isochronous interface method |
JP3186315B2 (en) | 1993-02-27 | 2001-07-11 | ソニー株式会社 | Signal compression device, signal decompression device, signal transmission device, signal reception device, and signal transmission / reception device |
US5490479A (en) | 1993-05-10 | 1996-02-13 | Shalev; Matti | Method and a product resulting from the use of the method for elevating feed storage bins |
US5440562A (en) | 1993-12-27 | 1995-08-08 | Motorola, Inc. | Communication through a channel having a variable propagation delay |
WO1996005697A1 (en) | 1994-08-12 | 1996-02-22 | Sony Corporation | Video signal editing device |
NL9401696A (en) | 1994-10-14 | 1996-05-01 | Nederland Ptt | Buffer readout control from ATM receiver. |
US5602959A (en) | 1994-12-05 | 1997-02-11 | Motorola, Inc. | Method and apparatus for characterization and reconstruction of speech excitation waveforms |
US5699478A (en) | 1995-03-10 | 1997-12-16 | Lucent Technologies Inc. | Frame erasure compensation technique |
US5929921A (en) | 1995-03-16 | 1999-07-27 | Matsushita Electric Industrial Co., Ltd. | Video and audio signal multiplex sending apparatus, receiving apparatus and transmitting apparatus |
JP3286110B2 (en) | 1995-03-16 | 2002-05-27 | 松下電器産業株式会社 | Voice packet interpolation device |
KR0164827B1 (en) | 1995-03-31 | 1999-03-20 | 김광호 | The program guide signal receiver |
JPH09127995A (en) | 1995-10-26 | 1997-05-16 | Sony Corp | Signal decoding method and signal decoder |
US5640388A (en) | 1995-12-21 | 1997-06-17 | Scientific-Atlanta, Inc. | Method and apparatus for removing jitter and correcting timestamps in a packet stream |
JPH09261613A (en) | 1996-03-26 | 1997-10-03 | Mitsubishi Electric Corp | Data reception/reproducing device |
US5940479A (en) | 1996-10-01 | 1999-08-17 | Northern Telecom Limited | System and method for transmitting aural information between a computer and telephone equipment |
JPH10190735A (en) | 1996-12-27 | 1998-07-21 | Secom Co Ltd | Communication system |
US6073092A (en) | 1997-06-26 | 2000-06-06 | Telogy Networks, Inc. | Method for speech coding based on a code excited linear prediction (CELP) model |
US6240386B1 (en) | 1998-08-24 | 2001-05-29 | Conexant Systems, Inc. | Speech codec employing noise classification for noise compensation |
US6259677B1 (en) | 1998-09-30 | 2001-07-10 | Cisco Technology, Inc. | Clock synchronization and dynamic jitter management for voice over IP and real-time data |
US6370125B1 (en) | 1998-10-08 | 2002-04-09 | Adtran, Inc. | Dynamic delay compensation for packet-based voice network |
US6456964B2 (en) * | 1998-12-21 | 2002-09-24 | Qualcomm, Incorporated | Encoding of periodic speech using prototype waveforms |
US6922669B2 (en) | 1998-12-29 | 2005-07-26 | Koninklijke Philips Electronics N.V. | Knowledge-based strategies applied to N-best lists in automatic speech recognition systems |
US7117156B1 (en) | 1999-04-19 | 2006-10-03 | At&T Corp. | Method and apparatus for performing packet loss or frame erasure concealment |
JP4975213B2 (en) | 1999-04-19 | 2012-07-11 | エイ・ティ・アンド・ティ・コーポレーション | Frame erasing concealment processor |
GB9911737D0 (en) * | 1999-05-21 | 1999-07-21 | Philips Electronics Nv | Audio signal time scale modification |
US6785230B1 (en) | 1999-05-25 | 2004-08-31 | Matsushita Electric Industrial Co., Ltd. | Audio transmission apparatus |
JP4218186B2 (en) | 1999-05-25 | 2009-02-04 | パナソニック株式会社 | Audio transmission device |
JP4895418B2 (en) | 1999-08-24 | 2012-03-14 | ソニー株式会社 | Audio reproduction method and audio reproduction apparatus |
WO2001020595A1 (en) | 1999-09-14 | 2001-03-22 | Fujitsu Limited | Voice encoder/decoder |
US6377931B1 (en) | 1999-09-28 | 2002-04-23 | Mindspeed Technologies | Speech manipulation for continuous speech playback over a packet network |
US6859460B1 (en) | 1999-10-22 | 2005-02-22 | Cisco Technology, Inc. | System and method for providing multimedia jitter buffer adjustment for packet-switched networks |
US6665317B1 (en) | 1999-10-29 | 2003-12-16 | Array Telecom Corporation | Method, system, and computer program product for managing jitter |
US6496794B1 (en) | 1999-11-22 | 2002-12-17 | Motorola, Inc. | Method and apparatus for seamless multi-rate speech coding |
US6693921B1 (en) | 1999-11-30 | 2004-02-17 | Mindspeed Technologies, Inc. | System for use of packet statistics in de-jitter delay adaption in a packet network |
US6366880B1 (en) * | 1999-11-30 | 2002-04-02 | Motorola, Inc. | Method and apparatus for suppressing acoustic background noise in a communication system by equaliztion of pre-and post-comb-filtered subband spectral energies |
GB2360178B (en) | 2000-03-06 | 2004-04-14 | Mitel Corp | Sub-packet insertion for packet loss compensation in Voice Over IP networks |
US6813274B1 (en) | 2000-03-21 | 2004-11-02 | Cisco Technology, Inc. | Network switch and method for data switching using a crossbar switch fabric with output port groups operating concurrently and independently |
DE60132080T2 (en) | 2000-04-03 | 2008-12-11 | Ericsson Inc., Plano | METHOD AND DEVICE FOR EFFICIENT FIELDS IN DATA PACKET COMMUNICATION SYSTEMS |
EP1796083B1 (en) | 2000-04-24 | 2009-01-07 | Qualcomm Incorporated | Method and apparatus for predictively quantizing voiced speech |
US6584438B1 (en) * | 2000-04-24 | 2003-06-24 | Qualcomm Incorporated | Frame erasure compensation method in a variable rate speech coder |
SE518941C2 (en) | 2000-05-31 | 2002-12-10 | Ericsson Telefon Ab L M | Device and method related to communication of speech |
US7016970B2 (en) | 2000-07-06 | 2006-03-21 | Matsushita Electric Industrial Co., Ltd. | System for transmitting stream data from server to client based on buffer and transmission capacities and delay time of the client |
US7155518B2 (en) | 2001-01-08 | 2006-12-26 | Interactive People Unplugged Ab | Extranet workgroup formation across multiple mobile virtual private networks |
US20020133334A1 (en) * | 2001-02-02 | 2002-09-19 | Geert Coorman | Time scale modification of digitally sampled waveforms in the time domain |
US20040204935A1 (en) | 2001-02-21 | 2004-10-14 | Krishnasamy Anandakumar | Adaptive voice playout in VOP |
US7212517B2 (en) | 2001-04-09 | 2007-05-01 | Lucent Technologies Inc. | Method and apparatus for jitter and frame erasure correction in packetized voice communication systems |
ATE353503T1 (en) * | 2001-04-24 | 2007-02-15 | Nokia Corp | METHOD FOR CHANGING THE SIZE OF A CLIMBER BUFFER FOR TIME ALIGNMENT, COMMUNICATIONS SYSTEM, RECEIVER SIDE AND TRANSCODER |
US7006511B2 (en) | 2001-07-17 | 2006-02-28 | Avaya Technology Corp. | Dynamic jitter buffering for voice-over-IP and other packet-based communication systems |
US7266127B2 (en) | 2002-02-08 | 2007-09-04 | Lucent Technologies Inc. | Method and system to compensate for the effects of packet delays on speech quality in a Voice-over IP system |
US7079486B2 (en) | 2002-02-13 | 2006-07-18 | Agere Systems Inc. | Adaptive threshold based jitter buffer management for packetized data |
US7158572B2 (en) | 2002-02-14 | 2007-01-02 | Tellabs Operations, Inc. | Audio enhancement communication techniques |
US7126957B1 (en) | 2002-03-07 | 2006-10-24 | Utstarcom, Inc. | Media flow method for transferring real-time data between asynchronous and synchronous networks |
US7263109B2 (en) | 2002-03-11 | 2007-08-28 | Conexant, Inc. | Clock skew compensation for a jitter buffer |
US20030187663A1 (en) | 2002-03-28 | 2003-10-02 | Truman Michael Mead | Broadband frequency translation for high frequency regeneration |
JP3761486B2 (en) | 2002-03-29 | 2006-03-29 | Necインフロンティア株式会社 | Wireless LAN system, main device and program |
US20050228648A1 (en) | 2002-04-22 | 2005-10-13 | Ari Heikkinen | Method and device for obtaining parameters for parametric speech coding of frames |
US7496086B2 (en) | 2002-04-30 | 2009-02-24 | Alcatel-Lucent Usa Inc. | Techniques for jitter buffer delay management |
US7280510B2 (en) | 2002-05-21 | 2007-10-09 | Nortel Networks Limited | Controlling reverse channel activity in a wireless communications system |
AU2002309146A1 (en) | 2002-06-14 | 2003-12-31 | Nokia Corporation | Enhanced error concealment for spatial audio |
US7336678B2 (en) | 2002-07-31 | 2008-02-26 | Intel Corporation | State-based jitter buffer and method of operation |
US8520519B2 (en) | 2002-09-20 | 2013-08-27 | Broadcom Corporation | External jitter buffer in a packet voice system |
JP3796240B2 (en) | 2002-09-30 | 2006-07-12 | 三洋電機株式会社 | Network telephone and voice decoding apparatus |
JP4146708B2 (en) | 2002-10-31 | 2008-09-10 | 京セラ株式会社 | COMMUNICATION SYSTEM, RADIO COMMUNICATION TERMINAL, DATA DISTRIBUTION DEVICE, AND COMMUNICATION METHOD |
US6996626B1 (en) | 2002-12-03 | 2006-02-07 | Crystalvoice Communications | Continuous bandwidth assessment and feedback for voice-over-internet-protocol (VoIP) comparing packet's voice duration and arrival rate |
KR100517237B1 (en) | 2002-12-09 | 2005-09-27 | 한국전자통신연구원 | Method and apparatus for channel quality estimation and link adaptation in the orthogonal frequency division multiplexing wireless communications systems |
US7525918B2 (en) | 2003-01-21 | 2009-04-28 | Broadcom Corporation | Using RTCP statistics for media system control |
US7394833B2 (en) * | 2003-02-11 | 2008-07-01 | Nokia Corporation | Method and apparatus for reducing synchronization delay in packet switched voice terminals using speech decoder modification |
JP2004266724A (en) | 2003-03-04 | 2004-09-24 | Matsushita Electric Ind Co Ltd | Real time voice buffer control apparatus |
JP3825007B2 (en) | 2003-03-11 | 2006-09-20 | 沖電気工業株式会社 | Jitter buffer control method |
US7551671B2 (en) | 2003-04-16 | 2009-06-23 | General Dynamics Decision Systems, Inc. | System and method for transmission of video signals using multiple channels |
JP2005057504A (en) | 2003-08-05 | 2005-03-03 | Matsushita Electric Ind Co Ltd | Data communication apparatus and data communication method |
DE60323886D1 (en) | 2003-08-15 | 2008-11-13 | Research In Motion Ltd | DEVICE AND ASSOCIATED METHOD FOR OBTAINING QUALITY OF SERVICE QUALITY DURING TRANSMISSION IN A RADIO COMMUNICATION SYSTEM |
US7596488B2 (en) | 2003-09-15 | 2009-09-29 | Microsoft Corporation | System and method for real-time jitter control and packet-loss concealment in an audio signal |
US7505764B2 (en) | 2003-10-28 | 2009-03-17 | Motorola, Inc. | Method for retransmitting a speech packet |
US7272400B1 (en) | 2003-12-19 | 2007-09-18 | Core Mobility, Inc. | Load balancing between users of a wireless base station |
US7424026B2 (en) | 2004-04-28 | 2008-09-09 | Nokia Corporation | Method and apparatus providing continuous adaptive control of voice packet buffer at receiver terminal |
JP4076981B2 (en) | 2004-08-09 | 2008-04-16 | Kddi株式会社 | Communication terminal apparatus and buffer control method |
US8085678B2 (en) | 2004-10-13 | 2011-12-27 | Qualcomm Incorporated | Media (voice) playback (de-jitter) buffer adjustments based on air interface |
SG124307A1 (en) * | 2005-01-20 | 2006-08-30 | St Microelectronics Asia | Method and system for lost packet concealment in high quality audio streaming applications |
US8102872B2 (en) | 2005-02-01 | 2012-01-24 | Qualcomm Incorporated | Method for discontinuous transmission and accurate reproduction of background noise information |
US20060187970A1 (en) | 2005-02-22 | 2006-08-24 | Minkyu Lee | Method and apparatus for handling network jitter in a Voice-over IP communications network using a virtual jitter buffer and time scale modification |
US8355907B2 (en) | 2005-03-11 | 2013-01-15 | Qualcomm Incorporated | Method and apparatus for phase matching frames in vocoders |
RU2376657C2 (en) * | 2005-04-01 | 2009-12-20 | Квэлкомм Инкорпорейтед | Systems, methods and apparatus for highband time warping |
-
2005
- 2005-05-05 US US11/123,467 patent/US8155965B2/en active Active
-
2006
- 2006-03-10 TW TW095108057A patent/TWI389099B/en active
- 2006-03-13 CA CA2600713A patent/CA2600713C/en active Active
- 2006-03-13 BR BRPI0607624-6A patent/BRPI0607624B1/en active IP Right Grant
- 2006-03-13 MX MX2007011102A patent/MX2007011102A/en active IP Right Grant
- 2006-03-13 EP EP06738524A patent/EP1856689A1/en not_active Withdrawn
- 2006-03-13 KR KR1020097022915A patent/KR100957265B1/en active IP Right Grant
- 2006-03-13 JP JP2008501073A patent/JP5203923B2/en active Active
- 2006-03-13 WO PCT/US2006/009472 patent/WO2006099529A1/en active Application Filing
- 2006-03-13 RU RU2007137643/09A patent/RU2371784C2/en active
- 2006-03-13 SG SG201001616-0A patent/SG160380A1/en unknown
- 2006-03-13 AU AU2006222963A patent/AU2006222963C1/en active Active
- 2006-03-13 KR KR1020077022667A patent/KR100956623B1/en active IP Right Grant
-
2007
- 2007-09-11 IL IL185935A patent/IL185935A/en not_active IP Right Cessation
- 2007-10-10 NO NO20075180A patent/NO20075180L/en not_active Application Discontinuation
Also Published As
Publication number | Publication date |
---|---|
NO20075180L (en) | 2007-10-31 |
IL185935A0 (en) | 2008-01-06 |
KR20090119936A (en) | 2009-11-20 |
RU2007137643A (en) | 2009-04-20 |
CA2600713C (en) | 2012-05-22 |
EP1856689A1 (en) | 2007-11-21 |
AU2006222963A1 (en) | 2006-09-21 |
US20060206334A1 (en) | 2006-09-14 |
JP2008533529A (en) | 2008-08-21 |
TWI389099B (en) | 2013-03-11 |
RU2371784C2 (en) | 2009-10-27 |
US8155965B2 (en) | 2012-04-10 |
WO2006099529A1 (en) | 2006-09-21 |
KR20070112832A (en) | 2007-11-27 |
BRPI0607624B1 (en) | 2019-03-26 |
TW200638336A (en) | 2006-11-01 |
MX2007011102A (en) | 2007-11-22 |
JP5203923B2 (en) | 2013-06-05 |
IL185935A (en) | 2013-09-30 |
BRPI0607624A2 (en) | 2009-09-22 |
KR100957265B1 (en) | 2010-05-12 |
AU2006222963B2 (en) | 2010-04-08 |
AU2006222963C1 (en) | 2010-09-16 |
KR100956623B1 (en) | 2010-05-11 |
SG160380A1 (en) | 2010-04-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2600713A1 (en) | Time warping frames inside the vocoder by modifying the residual | |
EP3493204B1 (en) | Method for encoding of integrated speech and audio | |
KR100647336B1 (en) | Apparatus and method for adaptive time/frequency-based encoding/decoding | |
US6427135B1 (en) | Method for encoding speech wherein pitch periods are changed based upon input speech signal | |
EP2272062B1 (en) | An audio signal classifier | |
KR101058761B1 (en) | Time-warping of Frames in Wideband Vocoder | |
US10706865B2 (en) | Apparatus and method for selecting one of a first encoding algorithm and a second encoding algorithm using harmonics reduction | |
CN103493129B (en) | For using Transient detection and quality results by the apparatus and method of the code segment of audio signal | |
KR102561265B1 (en) | Coding mode determination method and apparatus, audio encoding method and apparatus, and audio decoding method and apparatus | |
EP2096631A1 (en) | Audio decoding device and power adjusting method | |
EP2772914A1 (en) | Hybrid sound-signal decoder, hybrid sound-signal encoder, sound-signal decoding method, and sound-signal encoding method | |
JP6607921B2 (en) | Budget determination for LPD / FD transition frame encoding | |
JP3559485B2 (en) | Post-processing method and device for audio signal and recording medium recording program | |
US20120143602A1 (en) | Speech decoder and method for decoding segmented speech frames | |
KR101847213B1 (en) | Method and apparatus for decoding audio signal using shaping function | |
KR20100006491A (en) | Method and apparatus for encoding and decoding silence signal | |
WO2005045808A1 (en) | Harmonic noise weighting in digital speech coders | |
KR100624545B1 (en) | Method for the speech compression and synthesis in TTS system | |
CN113826161A (en) | Method and device for detecting attack in a sound signal to be coded and decoded and for coding and decoding the detected attack |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request |