CA2206129A1 - Method and apparatus for applying waveform prediction to subbands of a perceptual coding system - Google Patents

Method and apparatus for applying waveform prediction to subbands of a perceptual coding system

Info

Publication number
CA2206129A1
CA2206129A1 CA002206129A CA2206129A CA2206129A1 CA 2206129 A1 CA2206129 A1 CA 2206129A1 CA 002206129 A CA002206129 A CA 002206129A CA 2206129 A CA2206129 A CA 2206129A CA 2206129 A1 CA2206129 A1 CA 2206129A1
Authority
CA
Canada
Prior art keywords
signal
subband
input
switch
coupled
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002206129A
Other languages
French (fr)
Other versions
CA2206129C (en
Inventor
Mark Franklin Davis
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby Laboratories Licensing Corp
Original Assignee
Dolby Laboratories Licensing Corporation
Mark Franklin Davis
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby Laboratories Licensing Corporation, Mark Franklin Davis filed Critical Dolby Laboratories Licensing Corporation
Publication of CA2206129A1 publication Critical patent/CA2206129A1/en
Application granted granted Critical
Publication of CA2206129C publication Critical patent/CA2206129C/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B1/00Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
    • H04B1/66Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
    • H04B1/665Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission using psychoacoustic properties of the ear, e.g. masking effect
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/12Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients

Abstract

A split-band perceptual coding system utilizes generalized waveform predictive coding in frequency bands to further reduce coded signal information requirements. The order of the predictors are selected to balance requirements for prediction accuracy and rapid response time. Predictive coding may be adaptively inhibited in a band during intervals in which no predictive coding gain is realized.

Claims (16)

1. An encoder comprising:
receiver means (100) for receiving an input signal representing audio information, subband means (200) for generating a plurality of subband signals, each subband signal corresponding to a respective frequency subband of said input signal having a bandwidth commensurate with or less than a corresponding critical band of human perception, processor means (300a,300b) for generating quantized subband information in response to a respective subband signal, wherein said processor means comprises means (312) for generating a first measure signal representing the information capacity requirements of said respective subband signal, means (310) for generating a prediction error signal from the difference between said respective subband signal and a predicted signal generated by predicting said respective subband signal using a waveform predictor (340) of order three or more, means (314) for generating a second measure signal representing the information capacity requirements of said prediction error signal, means (370) for analyzing said first measure signal and said second measure signal, for generating a prediction override signal in response thereto, and for generating said quantized subband information by quantizing said prediction error signal when the information capacity requirement of said respective subband signal is higher than said prediction error signal and by quantizing said respective subband signal otherwise, and formatter means (400) for formatting an encoded signal by assembling quantized subband information and prediction override signals for said frequencysubbands into a form suitable for transmission or storage.
2. An encoder according to claim 1 wherein said input signal comprises input signal samples and each of said subband signals comprise one or more transform coefficients, said transform coefficients generated by applying a transform to said input signal.
3. An encoder according to claim 2 wherein said transform coefficients substantially correspond to coefficients produced by applying either an evenly-stacked Time Domain Aliasing Cancellation transform or an oddly-stacked Time Domain Aliasing Cancellation transform.
4. An encoder according to claim 2 or 3 wherein said transform generates a blockof transform coefficients in response to an interval of said input signal samples and said waveform predictor (340) is applied to groups of transform coefficients within a respective block, said waveform predictor having a minimum order of 8, 17 and 33 for blockscomprising 256, 128 and 64 transform coefficients, respectively.
5. An encoder according to any one of claims 1 through 4 wherein said waveform predictor (340) for a respective subband signal has an order substantially equal to three times the bandwidth of said respective subband signal expressed in critical bandwidths.
6. An encoder according to any one of claims 1 through 5 wherein said waveform predictor (340) has an order less than or equal to a quotient of a time intervalcommensurate with the post-masking interval of the human auditory system divided by a time interval between adjacent ones of said input signal samples.
7. A decoder comprising:
deformatter means (700) for receiving an encoded signal representing audio information and obtaining therefrom prediction override signals and quantized subband information for respective frequency subbands of said audio information having bandwidths commensurate with or less than a corresponding critical band of human perception, wherein the prediction override signal of a respective frequency subband indicates whether the quantized subband information for that frequency subband is either quantized prediction errors or quantized subband signals, processor means (800a,800b) for generating a replica subband signal for a respective frequency subband, wherein said processor means comprises means for generating a prediction signal by applying a waveform predictor (840) of order three or more to quantized subband information for said respective frequency subband, means (871a-871d, 872) for controlling said waveform predictor such that said processor means generates said replica subband signal in response to said prediction signal when said respective prediction override signal is false and generates said replica subband signal in response to said quantized subband signal otherwise, and output means (900) for generating a replica of said audio information in response to replica subband signals for said frequency subbands.
8. A decoder according to claim 7 wherein said subband signal comprises transform coefficients, said replica of said audio information generated by applying an inverse transform to said subband signals for said plurality of frequency subbands.
9. A decoder according to claim 8 wherein said inverse transform substantially corresponds to either an evenly-stacked Time Domain Aliasing Cancellation inverse transform or an oddly-stacked Time Domain Aliasing Cancellation inverse transform.
10. An encoder according to claim 8 or 9 wherein subband signals for said plurality of subbands constitute a block of transform coefficients and said waveform predictor (840) has a minimum order of 8, 17 and 33 for blocks comprising 256, 128 and 64 transform coefficients, respectively.
11. A decoder according to any one of claims 7 through 10 wherein said waveform predictor (840) for a respective subband signal has an order substantially equal to three times the bandwidth of said respective subband signal expressed in critical bandwidths.
12. A decoder according to any one of claims 7 through 11 wherein said replica of audio information comprises audio samples and said waveform predictor (840) has an order less than or equal to a quotient of a time interval commensurate with the post-masking interval of the human auditory system divided by a time interval between adjacent ones of said audio samples.
13. An encoder comprising:
an input terminal (100), a plurality of bandpass filters (200) coupled to said input terminal, said bandpass filters having respective center frequencies and respective passband bandwidths commensurate with or narrower than critical bands of the human auditory system, a circuit (300a,300b) coupled to a respective bandpass filter, said circuit comprising a linear prediction filter (340) of order three or more, a comparator (372) having a first comparator input, a second comparator input and a comparator output, said first comparator input coupled to said respective bandpass filter and said second comparator input coupled to said prediction filter, a switch control coupled to said comparator output, a switch (371a) with a first switch input, a second switch input and a switch output, said first switch input coupled to said respective bandpass filter and said second switch input coupled to said prediction filter, wherein said switch output is switchably connected to either said first switch input or said second switch input in response to said switch control, and a quantizer (320) coupled to said switch output, and a multiplexor (400) coupled to said comparator output and said quantizer.
14. An encoder according to claim 13 wherein said prediction filter (340) comprises a filter tap having a weighting circuit, said weighting circuit coupled to said quantizer (320).
15. A decoder comprising:
an input terminal (600), a demultiplexor (700) having an input and a plurality of demultiplexor outputs, said input of said demultiplexor coupled to said input terminal, a circuit (800a,800b) coupled to a first respective demultiplexor output, said circuit comprising a linear prediction filter (840) of order three or more, a switch control (872) coupled to a second respective demultiplexor output, a switch (871a) with a first switch input, a second switch input and a switch output, said first switch input coupled to said first respective demultiplexor output and said second switch input coupled to said prediction filter, wherein said switch output is switchably connected to either said first switch input or said second switch input in response to said switch control, and a plurality of inverse bandpass filters (900) having respective center frequencies and respective passband bandwidths commensurate with or narrower than critical bands of the human auditory system, a respective one of said plurality of inverse bandpass filters coupled to said switch output.
16. A decoder according to claim 15 wherein said prediction filter (840) comprises a filter tap having a weighting circuit, said weighting circuit coupled to said respective one of said plurality of outputs of said demultiplexor.
CA002206129A 1994-12-20 1995-12-20 Method and apparatus for applying waveform prediction to subbands of a perceptual coding system Expired - Lifetime CA2206129C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US36015094A 1994-12-20 1994-12-20
US08/360,150 1994-12-20
PCT/US1995/016748 WO1996019876A1 (en) 1994-12-20 1995-12-20 Method and apparatus for applying waveform prediction to subbands of a perceptual coding system

Publications (2)

Publication Number Publication Date
CA2206129A1 true CA2206129A1 (en) 1996-06-27
CA2206129C CA2206129C (en) 2009-11-10

Family

ID=23416807

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002206129A Expired - Lifetime CA2206129C (en) 1994-12-20 1995-12-20 Method and apparatus for applying waveform prediction to subbands of a perceptual coding system

Country Status (10)

Country Link
US (1) US5699484A (en)
EP (1) EP0799531B1 (en)
JP (1) JP4033898B2 (en)
AT (1) ATE191107T1 (en)
AU (1) AU704693B2 (en)
CA (1) CA2206129C (en)
DE (1) DE69515907T2 (en)
DK (1) DK0799531T3 (en)
ES (1) ES2143673T3 (en)
WO (1) WO1996019876A1 (en)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6757913B2 (en) 1996-07-15 2004-06-29 Gregory D. Knox Wireless music and data transceiver system
JP2891193B2 (en) * 1996-08-16 1999-05-17 日本電気株式会社 Wideband speech spectral coefficient quantizer
US6173317B1 (en) 1997-03-14 2001-01-09 Microsoft Corporation Streaming and displaying a video stream with synchronized annotations over a computer network
US6115420A (en) * 1997-03-14 2000-09-05 Microsoft Corporation Digital video signal encoder and encoding method
FI114248B (en) * 1997-03-14 2004-09-15 Nokia Corp Method and apparatus for audio coding and audio decoding
US6151632A (en) * 1997-03-14 2000-11-21 Microsoft Corporation Method and apparatus for distributed transmission of real-time multimedia information
US5890125A (en) * 1997-07-16 1999-03-30 Dolby Laboratories Licensing Corporation Method and apparatus for encoding and decoding multiple audio channels at low bit rates using adaptive selection of encoding method
US5913191A (en) * 1997-10-17 1999-06-15 Dolby Laboratories Licensing Corporation Frame-based audio coding with additional filterbank to suppress aliasing artifacts at frame boundaries
US6012025A (en) * 1998-01-28 2000-01-04 Nokia Mobile Phones Limited Audio coding method and apparatus using backward adaptive prediction
JP3199020B2 (en) * 1998-02-27 2001-08-13 日本電気株式会社 Audio music signal encoding device and decoding device
US6266644B1 (en) * 1998-09-26 2001-07-24 Liquid Audio, Inc. Audio encoding apparatus and methods
EP1076297A1 (en) * 1999-08-09 2001-02-14 Deutsche Thomson-Brandt Gmbh Method for fast Fourier transformation of audio signals
US6567781B1 (en) 1999-12-30 2003-05-20 Quikcat.Com, Inc. Method and apparatus for compressing audio data using a dynamical system having a multi-state dynamical rule set and associated transform basis function
US6735561B1 (en) * 2000-03-29 2004-05-11 At&T Corp. Effective deployment of temporal noise shaping (TNS) filters
US7099830B1 (en) * 2000-03-29 2006-08-29 At&T Corp. Effective deployment of temporal noise shaping (TNS) filters
US7110953B1 (en) * 2000-06-02 2006-09-19 Agere Systems Inc. Perceptual coding of audio signals using separated irrelevancy reduction and redundancy reduction
AU2002220233A1 (en) * 2000-12-01 2002-06-11 Lizardtech, Inc. Method for lossless encoding of image data by approximating linear transforms and preserving selected properties
US6647149B2 (en) 2001-01-03 2003-11-11 Electronics For Imaging, Inc. Methods and apparatus for securely transmitting and processing digital image data
US6980933B2 (en) * 2004-01-27 2005-12-27 Dolby Laboratories Licensing Corporation Coding techniques using estimated spectral magnitude and phase derived from MDCT coefficients
JP4661074B2 (en) * 2004-04-07 2011-03-30 ソニー株式会社 Information processing system, information processing method, and robot apparatus
EP1852848A1 (en) * 2006-05-05 2007-11-07 Deutsche Thomson-Brandt GmbH Method and apparatus for lossless encoding of a source signal using a lossy encoded data stream and a lossless extension data stream
DE102006022346B4 (en) * 2006-05-12 2008-02-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Information signal coding
US8005671B2 (en) * 2006-12-04 2011-08-23 Qualcomm Incorporated Systems and methods for dynamic normalization to reduce loss in precision for low-level signals
WO2008071353A2 (en) * 2006-12-12 2008-06-19 Fraunhofer-Gesellschaft Zur Förderung Der Angewandten Forschung E.V: Encoder, decoder and methods for encoding and decoding data segments representing a time-domain data stream
US8571503B2 (en) * 2010-03-05 2013-10-29 Mitac International Corp. Signal receiving methods and devices
CN102812512B (en) * 2010-03-23 2014-06-25 Lg电子株式会社 Method and apparatus for processing an audio signal
CN103854653B (en) * 2012-12-06 2016-12-28 华为技术有限公司 The method and apparatus of signal decoding
EP3893240B1 (en) * 2013-01-08 2024-04-24 Dolby International AB Model based prediction in a critically sampled filterbank
JP6906876B2 (en) * 2017-06-19 2021-07-21 アールティーエックス アー/エス Audio signal coding and decoding

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3276651D1 (en) * 1982-11-26 1987-07-30 Ibm Speech signal coding method and apparatus
US4751736A (en) * 1985-01-31 1988-06-14 Communications Satellite Corporation Variable bit rate speech codec with backward-type prediction and quantization
GB8803390D0 (en) * 1988-02-13 1988-03-16 Univ Belfast Method and apparatus for electrical signal coding
US5185800A (en) * 1989-10-13 1993-02-09 Centre National D'etudes Des Telecommunications Bit allocation device for transformed digital audio broadcasting signals with adaptive quantization based on psychoauditive criterion
CN1062963C (en) * 1990-04-12 2001-03-07 多尔拜实验特许公司 Adaptive-block-lenght, adaptive-transform, and adaptive-window transform coder, decoder, and encoder/decoder for high-quality audio
US5274740A (en) * 1991-01-08 1993-12-28 Dolby Laboratories Licensing Corporation Decoder for variable number of channel presentation of multidimensional sound fields
JP3310682B2 (en) * 1992-01-21 2002-08-05 日本ビクター株式会社 Audio signal encoding method and reproduction method
US5291557A (en) * 1992-10-13 1994-03-01 Dolby Laboratories Licensing Corporation Adaptive rematrixing of matrixed audio signals
US5451954A (en) * 1993-08-04 1995-09-19 Dolby Laboratories Licensing Corporation Quantization noise suppression for encoder/decoder system

Also Published As

Publication number Publication date
JP4033898B2 (en) 2008-01-16
DK0799531T3 (en) 2000-07-10
EP0799531B1 (en) 2000-03-22
AU704693B2 (en) 1999-04-29
ES2143673T3 (en) 2000-05-16
ATE191107T1 (en) 2000-04-15
AU4687496A (en) 1996-07-10
US5699484A (en) 1997-12-16
WO1996019876A1 (en) 1996-06-27
DE69515907D1 (en) 2000-04-27
CA2206129C (en) 2009-11-10
DE69515907T2 (en) 2000-08-17
JPH10511243A (en) 1998-10-27
EP0799531A1 (en) 1997-10-08

Similar Documents

Publication Publication Date Title
CA2206129A1 (en) Method and apparatus for applying waveform prediction to subbands of a perceptual coding system
CA2199070C (en) Switched filterbank for use in audio signal coding
Brandenburg OCF--A new coding algorithm for high quality sound signals
CA2140779C (en) Method, apparatus and recording medium for coding of separated tone and noise characteristics spectral components of an acoustic signal
EP0714173B1 (en) Method and device for encoding signal, method and device for decoding signal, recording medium, and signal transmitting method
JP3186292B2 (en) High efficiency coding method and apparatus
KR100419545B1 (en) Transmission system using different coding principles
WO1995002928A1 (en) Hybrid adaptive allocation for audio encoder and decoder
JPH07336232A (en) Method and device for coding information, method and device for decoding information and information recording medium
US6415251B1 (en) Subband coder or decoder band-limiting the overlap region between a processed subband and an adjacent non-processed one
KR20000022434A (en) Coding and decoding of audio signals by using intensity stereo and prediction processes
US5781586A (en) Method and apparatus for encoding the information, method and apparatus for decoding the information and information recording medium
US5758316A (en) Methods and apparatus for information encoding and decoding based upon tonal components of plural channels
KR100303580B1 (en) Transmitter, Encoding Device and Transmission Method
JP3189401B2 (en) Audio data encoding method and audio data encoding device
CA2208384C (en) Wideband speech coder
EP0734617B1 (en) Transmission system utilizing different coding principles
JPH09135173A (en) Device and method for encoding, device and method for decoding, device and method for transmission and recording medium
JP2575265B2 (en) Digital audio signal encoding method
JPH06224862A (en) Method and equipment for processing digital audio signal
JP2000091921A (en) Encoding method, decoding method, encoding device, decoding device, device and method for recording digital signal, and method and device for transmitting digital signal
JPH03132216A (en) Encoder for digital data
JPH07106977A (en) Information decoding device
EP0573103A1 (en) Transmitter, receiver and record carrier in a digital transmission system
JPH0470800A (en) Voice information compressing device

Legal Events

Date Code Title Description
EEER Examination request
MKEX Expiry

Effective date: 20151221