Sign in

Perceptual coding of audio signals

 Joseph L. Hall et al
A technique for the masking of quantizing noise in the coding of audio signals is adapted to include geometric interpolation between the thresholds for a tone masking noise and for noise masking a tone, in order to reduce use of bit-rate capability where it is not necessary for transparent or...
Inventors: Joseph L. Hall, II, James D. Johnston
Assignee: AT&T Bell Laboratories

U.S. Classification
395/235; 395/236; 395/231; 381/1

International Classification
G10L 700; G10L 302

View patent at USPTO

Citations

Patent NumberTitleIssue date
3989897Method and apparatus for reducing noise content in audio signalsNov 2, 1976
4349698Audio signal translation with no delay elementsSep 14, 1982
4356349Acoustic image enhancing method and apparatusOct 26, 1982
4803727Transmission systemFeb 7, 1989
4860360Method of evaluating speechAug 22, 1989
4949383Frequency domain speech codingAug 14, 1990
4972484Method of transmitting or storing masked sub-band coded audio signalsNov 20, 1990

Referenced by

Patent NumberTitleIssue date
5491773Encoding system comprising a subband coder for subband coding of a wideband digital signal constituted by first and second signal componentsFeb 13, 1996
5495552Methods of efficiently recording an audio signal in semiconductor memoryFeb 27, 1996
5535300Perceptual coding of audio signals using entropy coding and/or multiple power spectraJul 9, 1996
5539829Subband coded digital transmission system using some composite signalsJul 23, 1996
5579430Digital encoding processNov 26, 1996
5586126Sample amplitude error detection and correction apparatus and method for use with a low information content signalDec 17, 1996
5606618Subband coded digital transmission system using some composite signalsFeb 25, 1997
5613035Apparatus for adaptively encoding input digital audio signals from a plurality of channelsMar 18, 1997
5621851Method of expanding differential PCM data of speech signalsApr 15, 1997
5630010Methods of efficiently recording an audio signal in semiconductor memoryMay 13, 1997
5646961Method for noise weighting filteringJul 8, 1997
5649053Method for encoding audio signalsJul 15, 1997
5664052Method and device for discriminating voiced and unvoiced soundsSep 2, 1997
5682463Perceptual audio compression based on loudness uncertaintyOct 28, 1997
5684921Method and system for identifying a corrupted speech message signalNov 4, 1997
5699479Tonality for perceptual audio compression based on loudness uncertaintyDec 16, 1997
5699382Method for noise weighting filteringDec 16, 1997
5717764Global masking thresholding for use in perceptual codingFeb 10, 1998
5752221Method of efficiently recording an audio signal in semiconductor memoryMay 12, 1998
5761634Method and apparatus for group encoding signalsJun 2, 1998
5774843Methods of efficiently recording an audio signal in semiconductor memoryJun 30, 1998
5781888Perceptual noise shaping in the time domain via LPC prediction in the frequency domain Jul 14, 1998
5812971Enhanced joint stereo coding method using temporal envelope shapingSep 22, 1998
5826227Hiding a source identifier within a signalOct 20, 1998
5845251Method, system and product for modifying the bandwidth of subband encoded audio data Dec 1, 1998
5864801Methods of efficiently recording and reproducing an audio signal in a memory using hierarchical encodingJan 26, 1999
5864813Method, system and product for harmonic enhancement of encoded audio signalsJan 26, 1999
5864820Method, system and product for mixing of encoded audio signalsJan 26, 1999
5870703Adaptive bit allocation of tonal and noise componentsFeb 9, 1999
5873065Two-stage compression and expansion of coupling processed multi-channel sound signals for transmission and recordingFeb 16, 1999
5890126Audio data decompression and interpolation apparatus and methodMar 30, 1999
5946652Methods for non-linearly quantizing and non-linearly dequantizing an information signal using off-center decision levelsAug 31, 1999
5999899Low bit rate audio coder and decoder operating in a transform domain using vector quantizationDec 7, 1999
6058362System and method for masking quantization noise of audio signalsMay 2, 2000
6091773Data compression method and apparatusJul 18, 2000
6122338Audio encoding transmission systemSep 19, 2000
6182034System and method for producing a fixed effort quantization step size with a binary searchJan 30, 2001
6256608System and method for entropy encoding quantized transform coefficients of a signalJul 3, 2001
6301555Adjustable psycho-acoustic parametersOct 9, 2001
6332119Adjustable CODEC with adjustable parametersDec 18, 2001
6438236Audio signal identification using digital labelling signalsAug 20, 2002
6456963Block length decision based on tonality indexSep 24, 2002
6463405Audiophile encoding of digital audio data using 2-bit polarity/magnitude indicator and 8-bit scale factor for each subbandOct 8, 2002
6477496Signal synthesis by decoding subband scale factors from one audio signal and subband samples from different oneNov 5, 2002
6516299Method, system and product for modifying the dynamic range of encoded audio signalsFeb 4, 2003
6542863Fast codebook search method for MPEG audio encodingApr 1, 2003
6601032Fast code length search method for MPEG audio encodingJul 29, 2003
6633841Voice activity detection speech coding to accommodate music signalsOct 14, 2003
6650624Cable modem apparatus and methodNov 18, 2003
6678648Fast loop iteration and bitstream formatting method for MPEG audio encodingJan 13, 2004
6778649Method and apparatus for transmitting coded audio signals through a transmission channel with limited bandwidthAug 17, 2004
6782365Graphic interface system and product for editing encoded audio dataAug 24, 2004
6961314Burst receiver for cable modem systemNov 1, 2005
6975254Methods and devices for coding or decoding an audio signal or bit streamDec 13, 2005
RE39080Rate loop processor for perceptual encoder/decoderApr 25, 2006
7103065Data packet fragmentation in a cable modem systemSep 5, 2006
7103065Data packet fragmentation in a cable modem systemSep 5, 2006
7120123Pre-equalization technique for upstream communication between cable modem and headendOct 10, 2006
7139283Robust techniques for optimal upstream communication between cable modem subscribers and a headendNov 21, 2006
7194757Method and apparatus for push and pull distribution of multimediaMar 20, 2007
7240207Fingerprinting media entities employing fingerprint algorithms and bit-to-bit comparisonsJul 3, 2007
7289883Apparatus and method for patient rounding with a remote controlled robotOct 30, 2007
RE40280Rate loop processor for perceptual encoder/decoderApr 29, 2008
7372824Satellite receiver/router, system, and method of useMay 13, 2008
7412375Speech quality assessment with noise maskingAug 12, 2008

Claims

What is claimed is:

1. A method of processing an ordered time sequence of audio signals partitioned into a set of ordered blocks, each said block having a discrete frequency spectrum comprising a first set of frequency coefficients, the method comprising, for each of said blocks, the steps of:

(a) grouping said first set of frequency coefficients into at least one group, each group comprising at least one frequency coefficient;
(b) generating at least one tonality value, each group having an associated tonality value, said at least one tonality value reflecting the degree to which said time sequence of audio signals comprises tone-like quality;
(c) generating at least one noise masking threshold, each said at least one noise masking threshold being based upon at least a portion of said at least one tonality value; and
(d) quantizing at least one frequency coefficient in said at least one group, said quantizing based upon said at least one noise masking threshold.

2. The method of claim 1 wherein said discrete frequency spectrum further comprises a second set of frequency coefficients, said first set of frequency coefficients in combination with said second set of frequency coefficients representing all frequencies present in each said block.

3. The method of claim 1 wherein each said group in said at least one group comprising more than one frequency coefficient comprises more than one contiguous frequency coefficient.

4. The method of claim 1 further comprising the step of coding said quantized set of groups using Huffman codes.

5. The method of claim 1 wherein each said block is representable by a number of bits, said number of bits having a predetermined range, said quantizing is based on said number of bits.

6. The method of claim 5 wherein said step of quantizing said at least one frequency coefficient in said at least one group comprises quantizing all frequency coefficients in said first set of frequency coefficients and wherein said method further comprising, for each block, the steps of:

(a) generating an amount of bits needed to represent said first set of frequency coefficients in a quantized form;
(b) comparing said amount of bits to said number of bits;
(c) adjusting each said at least one noise masking threshold; and
(d) repeating set (d) of claim 1 and steps (a) through (c) until said amount of bits is within said predetermined range of said number of bits.

7. The method of claim 1 wherein said ordered time sequence of audio signals represents a first channel and a second channel of a stereo signal, the method further comprising, for each of said blocks, the steps of:

(a) generating a first power spectrum, said first power spectrum being representative of said first channel;
(b) generating a second power spectrum, said second power spectrum being representative of said second channel;
(c) adding said first power spectrum to said second power spectrum prior to said determining at least one noise masking threshold; and
(d) applying each of said at least one noise masking threshold to said first channel and said second channel.

8. The method of claim 7 wherein the first channel is L and the second channel is R.

9. The method of claim 7 wherein the first channel is L+R and the second channel represents a difference between L and R.

10. A storage medium manufactured in accordance with a process comprising the steps of:

(a) processing an ordered time sequence of audio signals partitioned into a set of ordered blocks, each said block having a discrete frequency spectrum comprising a first set of frequency coefficients; and
(b) for each of said blocks:
(1) grouping said first set of frequency coefficients into at least one group, each group comprising at least one frequency coefficient;
(2) generating at least one tonality value, each group having an associated tonality value, said at least one tonality value reflecting the degree to which said time sequence of audio signals comprises tone-like quality;
(3) generating at least one noise masking threshold, each said at least one noise masking threshold being based upon at least one tonality value;
(4) quantizing at least one frequency coefficient in said at least one group resulting in a set of quantized frequency coefficients, said quantizing based upon said at least one noise masking threshold;
(5) applying a recording signal to said storage medium, said recording signal comprising signals representing said set of quantized frequency coefficients; and
(6) recording said recording signal onto said storage medium.

11. The method of claim 10 wherein said storage medium is a compact disc.

12. The method of claim 10 wherein said storage medium is a digital storage means.

13. A method of transmitting audio signals, the method comprising the steps of:

(a) processing an ordered time sequence of audio signals partitioned into a set of ordered blocks, each said block having a discrete frequency spectrum comprising a first set of frequency coefficients; and
(b) for each of said blocks:
(1) grouping said first set of frequency coefficients into at least one group, each group comprising at least one frequency coefficient;
(2) generating at least one tonality value, each group having an associated tonality value, said at least one tonality value reflecting the degree to which said time sequence of audio signals comprises tone-like quality;
(3) generating at least one noise masking threshold, each said at least one noise masking threshold being based upon at least one tonality value;
(4) quantizing at least one frequency coefficient in said at least one group resulting in a set of quantized frequency coefficients, said quantizing based upon said at least one noise masking threshold;
(5) generating a transmission signal comprising signals representing said set of quantized frequency coefficients; and
(6) applying said transmission signal to a transmission medium.

14. The method of claim 13 wherein said transmission medium is a broadcast transmission medium.

15. The method of claim 13 wherein said transmission medium is an electrical conducting medium.

16. The method of claim 13 wherein said transmission medium is an optical transmission medium.

17. A method for generating signals representing an ordered time sequence of audio signals partitioned into a set of ordered blocks, each said block having a discrete frequency spectrum comprising a first set of frequency coefficients, the method comprising, for each of said blocks, the steps of:

(a) grouping said first set of frequency coefficients into a plurality of groups, each group in said plurality of groups representing a critical band of frequencies and comprising at least one frequency coefficient;
(b) generating a tonality value for each said group in said plurality of groups of frequency coefficients, said tonality values reflecting the degree to which said time sequence of audio signals comprises tone-like quality;
(c) generating a noise masking threshold for each said group in said plurality of groups of frequency coefficients, each said noise masking threshold being based upon said tonality value for the respective group; and
(d) quantizing each frequency coefficient in said at least one frequency coefficient in each said group, said quantizing being based upon said noise masking threshold associated with said group and a predetermined number of bits.

Drawings