CA2206129A1 - Method and apparatus for applying waveform prediction to subbands of a perceptual coding system - Google Patents
Method and apparatus for applying waveform prediction to subbands of a perceptual coding systemInfo
- Publication number
- CA2206129A1 CA2206129A1 CA002206129A CA2206129A CA2206129A1 CA 2206129 A1 CA2206129 A1 CA 2206129A1 CA 002206129 A CA002206129 A CA 002206129A CA 2206129 A CA2206129 A CA 2206129A CA 2206129 A1 CA2206129 A1 CA 2206129A1
- Authority
- CA
- Canada
- Prior art keywords
- signal
- subband
- input
- switch
- coupled
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/66—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
- H04B1/665—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission using psychoacoustic properties of the ear, e.g. masking effect
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/12—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
Abstract
A split-band perceptual coding system utilizes generalized waveform predictive coding in frequency bands to further reduce coded signal information requirements. The order of the predictors are selected to balance requirements for prediction accuracy and rapid response time. Predictive coding may be adaptively inhibited in a band during intervals in which no predictive coding gain is realized.
Claims (16)
1. An encoder comprising:
receiver means (100) for receiving an input signal representing audio information, subband means (200) for generating a plurality of subband signals, each subband signal corresponding to a respective frequency subband of said input signal having a bandwidth commensurate with or less than a corresponding critical band of human perception, processor means (300a,300b) for generating quantized subband information in response to a respective subband signal, wherein said processor means comprises means (312) for generating a first measure signal representing the information capacity requirements of said respective subband signal, means (310) for generating a prediction error signal from the difference between said respective subband signal and a predicted signal generated by predicting said respective subband signal using a waveform predictor (340) of order three or more, means (314) for generating a second measure signal representing the information capacity requirements of said prediction error signal, means (370) for analyzing said first measure signal and said second measure signal, for generating a prediction override signal in response thereto, and for generating said quantized subband information by quantizing said prediction error signal when the information capacity requirement of said respective subband signal is higher than said prediction error signal and by quantizing said respective subband signal otherwise, and formatter means (400) for formatting an encoded signal by assembling quantized subband information and prediction override signals for said frequencysubbands into a form suitable for transmission or storage.
receiver means (100) for receiving an input signal representing audio information, subband means (200) for generating a plurality of subband signals, each subband signal corresponding to a respective frequency subband of said input signal having a bandwidth commensurate with or less than a corresponding critical band of human perception, processor means (300a,300b) for generating quantized subband information in response to a respective subband signal, wherein said processor means comprises means (312) for generating a first measure signal representing the information capacity requirements of said respective subband signal, means (310) for generating a prediction error signal from the difference between said respective subband signal and a predicted signal generated by predicting said respective subband signal using a waveform predictor (340) of order three or more, means (314) for generating a second measure signal representing the information capacity requirements of said prediction error signal, means (370) for analyzing said first measure signal and said second measure signal, for generating a prediction override signal in response thereto, and for generating said quantized subband information by quantizing said prediction error signal when the information capacity requirement of said respective subband signal is higher than said prediction error signal and by quantizing said respective subband signal otherwise, and formatter means (400) for formatting an encoded signal by assembling quantized subband information and prediction override signals for said frequencysubbands into a form suitable for transmission or storage.
2. An encoder according to claim 1 wherein said input signal comprises input signal samples and each of said subband signals comprise one or more transform coefficients, said transform coefficients generated by applying a transform to said input signal.
3. An encoder according to claim 2 wherein said transform coefficients substantially correspond to coefficients produced by applying either an evenly-stacked Time Domain Aliasing Cancellation transform or an oddly-stacked Time Domain Aliasing Cancellation transform.
4. An encoder according to claim 2 or 3 wherein said transform generates a blockof transform coefficients in response to an interval of said input signal samples and said waveform predictor (340) is applied to groups of transform coefficients within a respective block, said waveform predictor having a minimum order of 8, 17 and 33 for blockscomprising 256, 128 and 64 transform coefficients, respectively.
5. An encoder according to any one of claims 1 through 4 wherein said waveform predictor (340) for a respective subband signal has an order substantially equal to three times the bandwidth of said respective subband signal expressed in critical bandwidths.
6. An encoder according to any one of claims 1 through 5 wherein said waveform predictor (340) has an order less than or equal to a quotient of a time intervalcommensurate with the post-masking interval of the human auditory system divided by a time interval between adjacent ones of said input signal samples.
7. A decoder comprising:
deformatter means (700) for receiving an encoded signal representing audio information and obtaining therefrom prediction override signals and quantized subband information for respective frequency subbands of said audio information having bandwidths commensurate with or less than a corresponding critical band of human perception, wherein the prediction override signal of a respective frequency subband indicates whether the quantized subband information for that frequency subband is either quantized prediction errors or quantized subband signals, processor means (800a,800b) for generating a replica subband signal for a respective frequency subband, wherein said processor means comprises means for generating a prediction signal by applying a waveform predictor (840) of order three or more to quantized subband information for said respective frequency subband, means (871a-871d, 872) for controlling said waveform predictor such that said processor means generates said replica subband signal in response to said prediction signal when said respective prediction override signal is false and generates said replica subband signal in response to said quantized subband signal otherwise, and output means (900) for generating a replica of said audio information in response to replica subband signals for said frequency subbands.
deformatter means (700) for receiving an encoded signal representing audio information and obtaining therefrom prediction override signals and quantized subband information for respective frequency subbands of said audio information having bandwidths commensurate with or less than a corresponding critical band of human perception, wherein the prediction override signal of a respective frequency subband indicates whether the quantized subband information for that frequency subband is either quantized prediction errors or quantized subband signals, processor means (800a,800b) for generating a replica subband signal for a respective frequency subband, wherein said processor means comprises means for generating a prediction signal by applying a waveform predictor (840) of order three or more to quantized subband information for said respective frequency subband, means (871a-871d, 872) for controlling said waveform predictor such that said processor means generates said replica subband signal in response to said prediction signal when said respective prediction override signal is false and generates said replica subband signal in response to said quantized subband signal otherwise, and output means (900) for generating a replica of said audio information in response to replica subband signals for said frequency subbands.
8. A decoder according to claim 7 wherein said subband signal comprises transform coefficients, said replica of said audio information generated by applying an inverse transform to said subband signals for said plurality of frequency subbands.
9. A decoder according to claim 8 wherein said inverse transform substantially corresponds to either an evenly-stacked Time Domain Aliasing Cancellation inverse transform or an oddly-stacked Time Domain Aliasing Cancellation inverse transform.
10. An encoder according to claim 8 or 9 wherein subband signals for said plurality of subbands constitute a block of transform coefficients and said waveform predictor (840) has a minimum order of 8, 17 and 33 for blocks comprising 256, 128 and 64 transform coefficients, respectively.
11. A decoder according to any one of claims 7 through 10 wherein said waveform predictor (840) for a respective subband signal has an order substantially equal to three times the bandwidth of said respective subband signal expressed in critical bandwidths.
12. A decoder according to any one of claims 7 through 11 wherein said replica of audio information comprises audio samples and said waveform predictor (840) has an order less than or equal to a quotient of a time interval commensurate with the post-masking interval of the human auditory system divided by a time interval between adjacent ones of said audio samples.
13. An encoder comprising:
an input terminal (100), a plurality of bandpass filters (200) coupled to said input terminal, said bandpass filters having respective center frequencies and respective passband bandwidths commensurate with or narrower than critical bands of the human auditory system, a circuit (300a,300b) coupled to a respective bandpass filter, said circuit comprising a linear prediction filter (340) of order three or more, a comparator (372) having a first comparator input, a second comparator input and a comparator output, said first comparator input coupled to said respective bandpass filter and said second comparator input coupled to said prediction filter, a switch control coupled to said comparator output, a switch (371a) with a first switch input, a second switch input and a switch output, said first switch input coupled to said respective bandpass filter and said second switch input coupled to said prediction filter, wherein said switch output is switchably connected to either said first switch input or said second switch input in response to said switch control, and a quantizer (320) coupled to said switch output, and a multiplexor (400) coupled to said comparator output and said quantizer.
an input terminal (100), a plurality of bandpass filters (200) coupled to said input terminal, said bandpass filters having respective center frequencies and respective passband bandwidths commensurate with or narrower than critical bands of the human auditory system, a circuit (300a,300b) coupled to a respective bandpass filter, said circuit comprising a linear prediction filter (340) of order three or more, a comparator (372) having a first comparator input, a second comparator input and a comparator output, said first comparator input coupled to said respective bandpass filter and said second comparator input coupled to said prediction filter, a switch control coupled to said comparator output, a switch (371a) with a first switch input, a second switch input and a switch output, said first switch input coupled to said respective bandpass filter and said second switch input coupled to said prediction filter, wherein said switch output is switchably connected to either said first switch input or said second switch input in response to said switch control, and a quantizer (320) coupled to said switch output, and a multiplexor (400) coupled to said comparator output and said quantizer.
14. An encoder according to claim 13 wherein said prediction filter (340) comprises a filter tap having a weighting circuit, said weighting circuit coupled to said quantizer (320).
15. A decoder comprising:
an input terminal (600), a demultiplexor (700) having an input and a plurality of demultiplexor outputs, said input of said demultiplexor coupled to said input terminal, a circuit (800a,800b) coupled to a first respective demultiplexor output, said circuit comprising a linear prediction filter (840) of order three or more, a switch control (872) coupled to a second respective demultiplexor output, a switch (871a) with a first switch input, a second switch input and a switch output, said first switch input coupled to said first respective demultiplexor output and said second switch input coupled to said prediction filter, wherein said switch output is switchably connected to either said first switch input or said second switch input in response to said switch control, and a plurality of inverse bandpass filters (900) having respective center frequencies and respective passband bandwidths commensurate with or narrower than critical bands of the human auditory system, a respective one of said plurality of inverse bandpass filters coupled to said switch output.
an input terminal (600), a demultiplexor (700) having an input and a plurality of demultiplexor outputs, said input of said demultiplexor coupled to said input terminal, a circuit (800a,800b) coupled to a first respective demultiplexor output, said circuit comprising a linear prediction filter (840) of order three or more, a switch control (872) coupled to a second respective demultiplexor output, a switch (871a) with a first switch input, a second switch input and a switch output, said first switch input coupled to said first respective demultiplexor output and said second switch input coupled to said prediction filter, wherein said switch output is switchably connected to either said first switch input or said second switch input in response to said switch control, and a plurality of inverse bandpass filters (900) having respective center frequencies and respective passband bandwidths commensurate with or narrower than critical bands of the human auditory system, a respective one of said plurality of inverse bandpass filters coupled to said switch output.
16. A decoder according to claim 15 wherein said prediction filter (840) comprises a filter tap having a weighting circuit, said weighting circuit coupled to said respective one of said plurality of outputs of said demultiplexor.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US36015094A | 1994-12-20 | 1994-12-20 | |
US08/360,150 | 1994-12-20 | ||
PCT/US1995/016748 WO1996019876A1 (en) | 1994-12-20 | 1995-12-20 | Method and apparatus for applying waveform prediction to subbands of a perceptual coding system |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2206129A1 true CA2206129A1 (en) | 1996-06-27 |
CA2206129C CA2206129C (en) | 2009-11-10 |
Family
ID=23416807
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002206129A Expired - Lifetime CA2206129C (en) | 1994-12-20 | 1995-12-20 | Method and apparatus for applying waveform prediction to subbands of a perceptual coding system |
Country Status (10)
Country | Link |
---|---|
US (1) | US5699484A (en) |
EP (1) | EP0799531B1 (en) |
JP (1) | JP4033898B2 (en) |
AT (1) | ATE191107T1 (en) |
AU (1) | AU704693B2 (en) |
CA (1) | CA2206129C (en) |
DE (1) | DE69515907T2 (en) |
DK (1) | DK0799531T3 (en) |
ES (1) | ES2143673T3 (en) |
WO (1) | WO1996019876A1 (en) |
Families Citing this family (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6757913B2 (en) | 1996-07-15 | 2004-06-29 | Gregory D. Knox | Wireless music and data transceiver system |
JP2891193B2 (en) * | 1996-08-16 | 1999-05-17 | 日本電気株式会社 | Wideband speech spectral coefficient quantizer |
US6173317B1 (en) | 1997-03-14 | 2001-01-09 | Microsoft Corporation | Streaming and displaying a video stream with synchronized annotations over a computer network |
US6115420A (en) * | 1997-03-14 | 2000-09-05 | Microsoft Corporation | Digital video signal encoder and encoding method |
FI114248B (en) * | 1997-03-14 | 2004-09-15 | Nokia Corp | Method and apparatus for audio coding and audio decoding |
US6151632A (en) * | 1997-03-14 | 2000-11-21 | Microsoft Corporation | Method and apparatus for distributed transmission of real-time multimedia information |
US5890125A (en) * | 1997-07-16 | 1999-03-30 | Dolby Laboratories Licensing Corporation | Method and apparatus for encoding and decoding multiple audio channels at low bit rates using adaptive selection of encoding method |
US5913191A (en) * | 1997-10-17 | 1999-06-15 | Dolby Laboratories Licensing Corporation | Frame-based audio coding with additional filterbank to suppress aliasing artifacts at frame boundaries |
US6012025A (en) * | 1998-01-28 | 2000-01-04 | Nokia Mobile Phones Limited | Audio coding method and apparatus using backward adaptive prediction |
JP3199020B2 (en) * | 1998-02-27 | 2001-08-13 | 日本電気株式会社 | Audio music signal encoding device and decoding device |
US6266644B1 (en) * | 1998-09-26 | 2001-07-24 | Liquid Audio, Inc. | Audio encoding apparatus and methods |
EP1076297A1 (en) * | 1999-08-09 | 2001-02-14 | Deutsche Thomson-Brandt Gmbh | Method for fast Fourier transformation of audio signals |
US6567781B1 (en) | 1999-12-30 | 2003-05-20 | Quikcat.Com, Inc. | Method and apparatus for compressing audio data using a dynamical system having a multi-state dynamical rule set and associated transform basis function |
US6735561B1 (en) * | 2000-03-29 | 2004-05-11 | At&T Corp. | Effective deployment of temporal noise shaping (TNS) filters |
US7099830B1 (en) * | 2000-03-29 | 2006-08-29 | At&T Corp. | Effective deployment of temporal noise shaping (TNS) filters |
US7110953B1 (en) * | 2000-06-02 | 2006-09-19 | Agere Systems Inc. | Perceptual coding of audio signals using separated irrelevancy reduction and redundancy reduction |
AU2002220233A1 (en) * | 2000-12-01 | 2002-06-11 | Lizardtech, Inc. | Method for lossless encoding of image data by approximating linear transforms and preserving selected properties |
US6647149B2 (en) | 2001-01-03 | 2003-11-11 | Electronics For Imaging, Inc. | Methods and apparatus for securely transmitting and processing digital image data |
US6980933B2 (en) * | 2004-01-27 | 2005-12-27 | Dolby Laboratories Licensing Corporation | Coding techniques using estimated spectral magnitude and phase derived from MDCT coefficients |
JP4661074B2 (en) * | 2004-04-07 | 2011-03-30 | ソニー株式会社 | Information processing system, information processing method, and robot apparatus |
EP1852848A1 (en) * | 2006-05-05 | 2007-11-07 | Deutsche Thomson-Brandt GmbH | Method and apparatus for lossless encoding of a source signal using a lossy encoded data stream and a lossless extension data stream |
DE102006022346B4 (en) * | 2006-05-12 | 2008-02-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Information signal coding |
US8005671B2 (en) * | 2006-12-04 | 2011-08-23 | Qualcomm Incorporated | Systems and methods for dynamic normalization to reduce loss in precision for low-level signals |
WO2008071353A2 (en) * | 2006-12-12 | 2008-06-19 | Fraunhofer-Gesellschaft Zur Förderung Der Angewandten Forschung E.V: | Encoder, decoder and methods for encoding and decoding data segments representing a time-domain data stream |
US8571503B2 (en) * | 2010-03-05 | 2013-10-29 | Mitac International Corp. | Signal receiving methods and devices |
CN102812512B (en) * | 2010-03-23 | 2014-06-25 | Lg电子株式会社 | Method and apparatus for processing an audio signal |
CN103854653B (en) * | 2012-12-06 | 2016-12-28 | 华为技术有限公司 | The method and apparatus of signal decoding |
EP3893240B1 (en) * | 2013-01-08 | 2024-04-24 | Dolby International AB | Model based prediction in a critically sampled filterbank |
JP6906876B2 (en) * | 2017-06-19 | 2021-07-21 | アールティーエックス アー/エス | Audio signal coding and decoding |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3276651D1 (en) * | 1982-11-26 | 1987-07-30 | Ibm | Speech signal coding method and apparatus |
US4751736A (en) * | 1985-01-31 | 1988-06-14 | Communications Satellite Corporation | Variable bit rate speech codec with backward-type prediction and quantization |
GB8803390D0 (en) * | 1988-02-13 | 1988-03-16 | Univ Belfast | Method and apparatus for electrical signal coding |
US5185800A (en) * | 1989-10-13 | 1993-02-09 | Centre National D'etudes Des Telecommunications | Bit allocation device for transformed digital audio broadcasting signals with adaptive quantization based on psychoauditive criterion |
CN1062963C (en) * | 1990-04-12 | 2001-03-07 | 多尔拜实验特许公司 | Adaptive-block-lenght, adaptive-transform, and adaptive-window transform coder, decoder, and encoder/decoder for high-quality audio |
US5274740A (en) * | 1991-01-08 | 1993-12-28 | Dolby Laboratories Licensing Corporation | Decoder for variable number of channel presentation of multidimensional sound fields |
JP3310682B2 (en) * | 1992-01-21 | 2002-08-05 | 日本ビクター株式会社 | Audio signal encoding method and reproduction method |
US5291557A (en) * | 1992-10-13 | 1994-03-01 | Dolby Laboratories Licensing Corporation | Adaptive rematrixing of matrixed audio signals |
US5451954A (en) * | 1993-08-04 | 1995-09-19 | Dolby Laboratories Licensing Corporation | Quantization noise suppression for encoder/decoder system |
-
1995
- 1995-12-20 AU AU46874/96A patent/AU704693B2/en not_active Expired
- 1995-12-20 ES ES95944513T patent/ES2143673T3/en not_active Expired - Lifetime
- 1995-12-20 EP EP95944513A patent/EP0799531B1/en not_active Expired - Lifetime
- 1995-12-20 CA CA002206129A patent/CA2206129C/en not_active Expired - Lifetime
- 1995-12-20 WO PCT/US1995/016748 patent/WO1996019876A1/en active IP Right Grant
- 1995-12-20 JP JP52000596A patent/JP4033898B2/en not_active Expired - Lifetime
- 1995-12-20 AT AT95944513T patent/ATE191107T1/en active
- 1995-12-20 DK DK95944513T patent/DK0799531T3/en active
- 1995-12-20 DE DE69515907T patent/DE69515907T2/en not_active Expired - Lifetime
-
1996
- 1996-04-26 US US08/638,498 patent/US5699484A/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
JP4033898B2 (en) | 2008-01-16 |
DK0799531T3 (en) | 2000-07-10 |
EP0799531B1 (en) | 2000-03-22 |
AU704693B2 (en) | 1999-04-29 |
ES2143673T3 (en) | 2000-05-16 |
ATE191107T1 (en) | 2000-04-15 |
AU4687496A (en) | 1996-07-10 |
US5699484A (en) | 1997-12-16 |
WO1996019876A1 (en) | 1996-06-27 |
DE69515907D1 (en) | 2000-04-27 |
CA2206129C (en) | 2009-11-10 |
DE69515907T2 (en) | 2000-08-17 |
JPH10511243A (en) | 1998-10-27 |
EP0799531A1 (en) | 1997-10-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2206129A1 (en) | Method and apparatus for applying waveform prediction to subbands of a perceptual coding system | |
CA2199070C (en) | Switched filterbank for use in audio signal coding | |
Brandenburg | OCF--A new coding algorithm for high quality sound signals | |
CA2140779C (en) | Method, apparatus and recording medium for coding of separated tone and noise characteristics spectral components of an acoustic signal | |
EP0714173B1 (en) | Method and device for encoding signal, method and device for decoding signal, recording medium, and signal transmitting method | |
JP3186292B2 (en) | High efficiency coding method and apparatus | |
KR100419545B1 (en) | Transmission system using different coding principles | |
WO1995002928A1 (en) | Hybrid adaptive allocation for audio encoder and decoder | |
JPH07336232A (en) | Method and device for coding information, method and device for decoding information and information recording medium | |
US6415251B1 (en) | Subband coder or decoder band-limiting the overlap region between a processed subband and an adjacent non-processed one | |
KR20000022434A (en) | Coding and decoding of audio signals by using intensity stereo and prediction processes | |
US5781586A (en) | Method and apparatus for encoding the information, method and apparatus for decoding the information and information recording medium | |
US5758316A (en) | Methods and apparatus for information encoding and decoding based upon tonal components of plural channels | |
KR100303580B1 (en) | Transmitter, Encoding Device and Transmission Method | |
JP3189401B2 (en) | Audio data encoding method and audio data encoding device | |
CA2208384C (en) | Wideband speech coder | |
EP0734617B1 (en) | Transmission system utilizing different coding principles | |
JPH09135173A (en) | Device and method for encoding, device and method for decoding, device and method for transmission and recording medium | |
JP2575265B2 (en) | Digital audio signal encoding method | |
JPH06224862A (en) | Method and equipment for processing digital audio signal | |
JP2000091921A (en) | Encoding method, decoding method, encoding device, decoding device, device and method for recording digital signal, and method and device for transmitting digital signal | |
JPH03132216A (en) | Encoder for digital data | |
JPH07106977A (en) | Information decoding device | |
EP0573103A1 (en) | Transmitter, receiver and record carrier in a digital transmission system | |
JPH0470800A (en) | Voice information compressing device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKEX | Expiry |
Effective date: 20151221 |