CA2206129A1 - Method and apparatus for applying waveform prediction to subbands of a perceptual coding system

Info

Publication number: CA2206129A1
Application number: CA002206129A
Authority: CA
Inventors: Mark Franklin Davis
Original assignee: Dolby Laboratories Licensing Corporation; Mark Franklin Davis
Current assignee: Dolby Laboratories Licensing Corp
Priority date: 1994-12-20
Filing date: 1995-12-20
Publication date: 1996-06-27
Anticipated expiration: 2015-12-20
Also published as: JP4033898B2; DK0799531T3; EP0799531B1; AU704693B2; ES2143673T3; ATE191107T1; AU4687496A; US5699484A; WO1996019876A1; DE69515907D1; CA2206129C; DE69515907T2; JPH10511243A; EP0799531A1

Abstract

A split-band perceptual coding system utilizes generalized waveform predictive coding in frequency bands to further reduce coded signal information requirements. The orders of the predictors are selected to balance requirements for prediction accuracy and rapid response time. Predictive coding may be adaptively inhibited in a band during intervals in which no predictive coding gain is realized.
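
The balance between prediction accuracy and response time can be illustrated with a small helper. This is only a sketch of the order limits recited in claims 5 and 6 (an order of about three times the subband width in critical bandwidths, capped by the number of samples spanned by the post-masking interval, and never below three). The function name, its parameters, and the 50 ms post-masking figure are assumptions made for illustration and do not come from the patent.

```python
def choose_predictor_order(bandwidth_in_critical_bands, post_masking_s,
                           sample_rate_hz, min_order=3):
    """Pick a predictor order for one subband (hypothetical helper)."""
    # Accuracy target: about three times the subband width in critical bandwidths.
    target = round(3 * bandwidth_in_critical_bands)
    # Response-time ceiling: post-masking interval divided by the sample interval,
    # which equals the number of samples spanned by the post-masking interval.
    ceiling = int(post_masking_s * sample_rate_hz)
    # The claims call for order three or more, so never return less than min_order.
    return max(min_order, min(target, ceiling))


POST_MASKING_S = 0.05  # assumed value, not stated in the source

narrow = choose_predictor_order(1.0, POST_MASKING_S, 48000)
wide = choose_predictor_order(4.0, POST_MASKING_S, 48000)
```

With these assumed inputs the ceiling is far above the accuracy target, so the order is set by subband width alone; the ceiling only matters when the sample interval is long relative to the post-masking interval.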

Claims

1. An encoder comprising:
receiver means (100) for receiving an input signal representing audio information,
subband means (200) for generating a plurality of subband signals, each subband signal corresponding to a respective frequency subband of said input signal having a bandwidth commensurate with or less than a corresponding critical band of human perception,
processor means (300a,300b) for generating quantized subband information in response to a respective subband signal, wherein said processor means comprises
means (312) for generating a first measure signal representing the information capacity requirements of said respective subband signal,
means (310) for generating a prediction error signal from the difference between said respective subband signal and a predicted signal generated by predicting said respective subband signal using a waveform predictor (340) of order three or more,
means (314) for generating a second measure signal representing the information capacity requirements of said prediction error signal,
means (370) for analyzing said first measure signal and said second measure signal, for generating a prediction override signal in response thereto, and for generating said quantized subband information by quantizing said prediction error signal when the information capacity requirement of said respective subband signal is higher than said prediction error signal and by quantizing said respective subband signal otherwise, and
formatter means (400) for formatting an encoded signal by assembling quantized subband information and prediction override signals for said frequency subbands into a form suitable for transmission or storage.

2. An encoder according to claim 1 wherein said input signal comprises input signal samples and each of said subband signals comprises one or more transform coefficients, said transform coefficients generated by applying a transform to said input signal.

3. An encoder according to claim 2 wherein said transform coefficients substantially correspond to coefficients produced by applying either an evenly-stacked Time Domain Aliasing Cancellation transform or an oddly-stacked Time Domain Aliasing Cancellation transform.

4. An encoder according to claim 2 or 3 wherein said transform generates a block of transform coefficients in response to an interval of said input signal samples and said waveform predictor (340) is applied to groups of transform coefficients within a respective block, said waveform predictor having a minimum order of 8, 17 and 33 for blocks comprising 256, 128 and 64 transform coefficients, respectively.

5. An encoder according to any one of claims 1 through 4 wherein said waveform predictor (340) for a respective subband signal has an order substantially equal to three times the bandwidth of said respective subband signal expressed in critical bandwidths.

6. An encoder according to any one of claims 1 through 5 wherein said waveform predictor (340) has an order less than or equal to a quotient of a time interval commensurate with the post-masking interval of the human auditory system divided by a time interval between adjacent ones of said input signal samples.

7. A decoder comprising:
deformatter means (700) for receiving an encoded signal representing audio information and obtaining therefrom prediction override signals and quantized subband information for respective frequency subbands of said audio information having bandwidths commensurate with or less than a corresponding critical band of human perception, wherein the prediction override signal of a respective frequency subband indicates whether the quantized subband information for that frequency subband is either quantized prediction errors or quantized subband signals,
processor means (800a,800b) for generating a replica subband signal for a respective frequency subband, wherein said processor means comprises
means for generating a prediction signal by applying a waveform predictor (840) of order three or more to quantized subband information for said respective frequency subband,
means (871a-871d, 872) for controlling said waveform predictor such that said processor means generates said replica subband signal in response to said prediction signal when said respective prediction override signal is false and generates said replica subband signal in response to said quantized subband signal otherwise, and
output means (900) for generating a replica of said audio information in response to replica subband signals for said frequency subbands.

8. A decoder according to claim 7 wherein said subband signal comprises transform coefficients, said replica of said audio information generated by applying an inverse transform to said subband signals for said plurality of frequency subbands.

9. A decoder according to claim 8 wherein said inverse transform substantially corresponds to either an evenly-stacked Time Domain Aliasing Cancellation inverse transform or an oddly-stacked Time Domain Aliasing Cancellation inverse transform.

10. A decoder according to claim 8 or 9 wherein subband signals for said plurality of subbands constitute a block of transform coefficients and said waveform predictor (840) has a minimum order of 8, 17 and 33 for blocks comprising 256, 128 and 64 transform coefficients, respectively.

11. A decoder according to any one of claims 7 through 10 wherein said waveform predictor (840) for a respective subband signal has an order substantially equal to three times the bandwidth of said respective subband signal expressed in critical bandwidths.

12. A decoder according to any one of claims 7 through 11 wherein said replica of audio information comprises audio samples and said waveform predictor (840) has an order less than or equal to a quotient of a time interval commensurate with the post-masking interval of the human auditory system divided by a time interval between adjacent ones of said audio samples.

13. An encoder comprising:
an input terminal (100),
a plurality of bandpass filters (200) coupled to said input terminal, said bandpass filters having respective center frequencies and respective passband bandwidths commensurate with or narrower than critical bands of the human auditory system,
a circuit (300a,300b) coupled to a respective bandpass filter, said circuit comprising
a linear prediction filter (340) of order three or more,
a comparator (372) having a first comparator input, a second comparator input and a comparator output, said first comparator input coupled to said respective bandpass filter and said second comparator input coupled to said prediction filter,
a switch control coupled to said comparator output,
a switch (371a) with a first switch input, a second switch input and a switch output, said first switch input coupled to said respective bandpass filter and said second switch input coupled to said prediction filter, wherein said switch output is switchably connected to either said first switch input or said second switch input in response to said switch control, and
a quantizer (320) coupled to said switch output, and
a multiplexor (400) coupled to said comparator output and said quantizer.

14. An encoder according to claim 13 wherein said prediction filter (340) comprises a filter tap having a weighting circuit, said weighting circuit coupled to said quantizer (320).

15. A decoder comprising:
an input terminal (600),
a demultiplexor (700) having an input and a plurality of demultiplexor outputs, said input of said demultiplexor coupled to said input terminal,
a circuit (800a,800b) coupled to a first respective demultiplexor output, said circuit comprising
a linear prediction filter (840) of order three or more,
a switch control (872) coupled to a second respective demultiplexor output,
a switch (871a) with a first switch input, a second switch input and a switch output, said first switch input coupled to said first respective demultiplexor output and said second switch input coupled to said prediction filter, wherein said switch output is switchably connected to either said first switch input or said second switch input in response to said switch control, and
a plurality of inverse bandpass filters (900) having respective center frequencies and respective passband bandwidths commensurate with or narrower than critical bands of the human auditory system, a respective one of said plurality of inverse bandpass filters coupled to said switch output.

16. A decoder according to claim 15 wherein said prediction filter (840) comprises a filter tap having a weighting circuit, said weighting circuit coupled to said respective one of said plurality of outputs of said demultiplexor.
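
The prediction override recited in claims 1 and 7 can be sketched for a single subband as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the normalized-LMS predictor, the uniform quantizer with a fixed step, the bit-length measure standing in for "information capacity requirements", the block length, the trial encoding of each block both ways, and every name in the code are hypothetical choices. The predictor adapts only from reconstructed values, loosely in the spirit of claims 14 and 16, so that a decoder can follow the encoder without side information beyond the override flag.

```python
import copy
import math


class Predictor:
    """Backward-adaptive linear predictor (normalized LMS, an assumed choice)."""

    def __init__(self, order=3, mu=0.5):
        self.weights = [0.0] * order
        self.history = [0.0] * order  # most recent reconstructed value first
        self.mu = mu

    def predict(self):
        return sum(w * h for w, h in zip(self.weights, self.history))

    def update(self, predicted, reconstructed):
        # Adapt from reconstructed values only, so the decoder stays in step.
        error = reconstructed - predicted
        energy = sum(h * h for h in self.history) + 1e-3
        gain = self.mu * error / energy
        self.weights = [w + gain * h for w, h in zip(self.weights, self.history)]
        self.history = [reconstructed] + self.history[:-1]


def bits_needed(codes):
    # Assumed stand-in for the information capacity requirement of a block.
    return max(abs(c) for c in codes).bit_length()


def encode_block(block, predictor, step, predictive):
    codes, recon = [], []
    for x in block:
        predicted = predictor.predict()
        base = predicted if predictive else 0.0
        code = round((x - base) / step)  # quantize error or sample
        value = base + code * step       # what the decoder will rebuild
        predictor.update(predicted, value)
        codes.append(code)
        recon.append(value)
    return codes, recon


def decode_block(codes, predictor, step, predictive):
    out = []
    for code in codes:
        predicted = predictor.predict()
        value = (predicted if predictive else 0.0) + code * step
        predictor.update(predicted, value)
        out.append(value)
    return out


def encode_subband(signal, block_len=16, order=3, step=0.01):
    predictor, frames, recon = Predictor(order), [], []
    for start in range(0, len(signal), block_len):
        block = signal[start:start + block_len]
        with_pred = copy.deepcopy(predictor)
        without_pred = copy.deepcopy(predictor)
        p_codes, p_recon = encode_block(block, with_pred, step, True)
        d_codes, d_recon = encode_block(block, without_pred, step, False)
        if bits_needed(d_codes) > bits_needed(p_codes):
            frames.append((False, p_codes))  # override false: prediction errors
            predictor, chosen = with_pred, p_recon
        else:
            frames.append((True, d_codes))   # override true: subband samples
            predictor, chosen = without_pred, d_recon
        recon.extend(chosen)
    return frames, recon


def decode_subband(frames, order=3, step=0.01):
    predictor, out = Predictor(order), []
    for override, codes in frames:
        out.extend(decode_block(codes, predictor, step, predictive=not override))
    return out


signal = [math.sin(0.2 * n) for n in range(256)]
frames, reconstructed = encode_subband(signal)
decoded = decode_subband(frames)
```

Because both sides run the same predictor on the same reconstructed values, `decoded` matches `reconstructed` sample for sample, and each sample differs from the input by at most half a quantizer step. Each entry in `frames` pairs the override flag with the quantized codes, which is roughly the information the formatter of claim 1 would assemble for one subband.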