WO2008021185A3 - A method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns - Google Patents

A method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns Download PDF

Info

Publication number
WO2008021185A3
WO2008021185A3 PCT/US2007/017719 US2007017719W WO2008021185A3 WO 2008021185 A3 WO2008021185 A3 WO 2008021185A3 US 2007017719 W US2007017719 W US 2007017719W WO 2008021185 A3 WO2008021185 A3 WO 2008021185A3
Authority
WO
WIPO (PCT)
Prior art keywords
patterns
relevant search
perceptually relevant
audio
efficient
Prior art date
Application number
PCT/US2007/017719
Other languages
French (fr)
Other versions
WO2008021185A2 (en
Inventor
Sean A Ramprashad
Original Assignee
Ntt Docomo Inc
Sean A Ramprashad
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ntt Docomo Inc, Sean A Ramprashad filed Critical Ntt Docomo Inc
Priority to JP2009523846A priority Critical patent/JP2010500819A/en
Publication of WO2008021185A2 publication Critical patent/WO2008021185A2/en
Publication of WO2008021185A3 publication Critical patent/WO2008021185A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/038Vector quantisation, e.g. TwinVQ audio
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation

Abstract

A method and apparatus is disclosed herein for quantizing data using a perceptually relevant search of multiple quantization patterns. In one embodiment, the method comprises performing a perceptually relevant search of multiple quantization patterns in which one of a plurality of prototype patterns and its associated permutation are selected to quantize the target vector, each prototype pattern in the plurality of prototype patterns being capable of directing quantization across the vector; converting the one prototype pattern, the associated permutation and quantization information resulting from both to a plurality of bits by an encoder; and transferring the bits as part of a bit stream.
PCT/US2007/017719 2006-08-11 2007-08-08 A method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns WO2008021185A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2009523846A JP2010500819A (en) 2006-08-11 2007-08-08 A method for quantizing speech and audio by efficient perceptual related retrieval of multiple quantization patterns

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US83716406P 2006-08-11 2006-08-11
US60/837,164 2006-08-11
US11/835,273 2007-08-07
US11/835,273 US7873514B2 (en) 2006-08-11 2007-08-07 Method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns

Publications (2)

Publication Number Publication Date
WO2008021185A2 WO2008021185A2 (en) 2008-02-21
WO2008021185A3 true WO2008021185A3 (en) 2008-04-17

Family

ID=38952080

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2007/017719 WO2008021185A2 (en) 2006-08-11 2007-08-08 A method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns

Country Status (3)

Country Link
US (1) US7873514B2 (en)
JP (1) JP2010500819A (en)
WO (1) WO2008021185A2 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20080047443A (en) * 2005-10-14 2008-05-28 마츠시타 덴끼 산교 가부시키가이샤 Transform coder and transform coding method
CA2701757C (en) * 2007-10-12 2016-11-22 Panasonic Corporation Vector quantization apparatus, vector dequantization apparatus and the methods
US8515767B2 (en) * 2007-11-04 2013-08-20 Qualcomm Incorporated Technique for encoding/decoding of codebook indices for quantized MDCT spectrum in scalable speech and audio codecs
KR101826331B1 (en) * 2010-09-15 2018-03-22 삼성전자주식회사 Apparatus and method for encoding and decoding for high frequency bandwidth extension
EP3023985B1 (en) 2010-12-29 2017-07-05 Samsung Electronics Co., Ltd Methods for audio signal encoding and decoding
US9716901B2 (en) * 2012-05-23 2017-07-25 Google Inc. Quantization with distinct weighting of coherent and incoherent quantization error

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA1333420C (en) * 1988-02-29 1994-12-06 Tokumichi Murakami Vector quantizer
US5210820A (en) * 1990-05-02 1993-05-11 Broadcast Data Systems Limited Partnership Signal recognition system and method
JP3412118B2 (en) * 1997-12-25 2003-06-03 日本電信電話株式会社 Conjugate structure vector quantization method, apparatus therefor, and program recording medium
JP3367931B2 (en) * 2000-03-06 2003-01-20 日本電信電話株式会社 Conjugate structure vector quantization method
US7885809B2 (en) * 2005-04-20 2011-02-08 Ntt Docomo, Inc. Quantization of speech and audio coding parameters using partial information on atypical subsequences
JP4287840B2 (en) * 2005-06-06 2009-07-01 パナソニック株式会社 Encoder

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
KVIVEK K GOYAL ET AL: "On Optimal Permutation Codes", IEEE TRANSACTIONS ON INFORMATION THEORY, IEEE SERVICE CENTER, PISCATAWAY, NJ, US, vol. 47, no. 7, November 2001 (2001-11-01), XP011028083, ISSN: 0018-9448 *
RAMO A: "Improving LSF quantization performance with sorting", SIGNAL PROCESSING, 2004. PROCEEDINGS. ICSP '04. 2004 7TH INTERNATIONAL CONFERENCE ON BEIJING, CHINA AUG. 31 - SEPT 4, 2004, PISCATAWAY, NJ, USA,IEEE, 31 August 2004 (2004-08-31), pages 587 - 589, XP010809692, ISBN: 0-7803-8406-7 *
RAMPRASHAD S A: "Efficient Quantization Of Statistically Normalized Vectors Using Multi-Option Partial-Order Bit-Assignment Schemes", ACOUSTICS, SPEECH AND SIGNAL PROCESSING, 2006. ICASSP 2006 PROCEEDINGS. 2006 IEEE INTERNATIONAL CONFERENCE ON TOULOUSE, FRANCE 14-19 MAY 2006, PISCATAWAY, NJ, USA,IEEE, 14 May 2006 (2006-05-14), pages I - 845, XP010930312, ISBN: 1-4244-0469-X *
SEAN A RAMPRASHAD ED - RADHA KRISHNA GANTI ET AL: "Partial-Order Bit-Allocation Schemes for Lowrate Quantization", SIGNALS, SYSTEMS AND COMPUTERS, 2006. ACSSC '06. FORTIETH ASILOMAR CONFERENCE ON, IEEE, PI, October 2006 (2006-10-01), pages 1059 - 1066, XP031081202, ISBN: 1-4244-0784-2 *
SEAN A RAMPRASHAD: "Sparse Bit-Allocations Based on Partial Ordering Schemes With Application to Speech and Audio Coding", IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, IEEE SERVICE CENTER, NEW YORK, NY, US, vol. 15, no. 1, January 2007 (2007-01-01), pages 57 - 69, XP011145328, ISSN: 1558-7916 *

Also Published As

Publication number Publication date
US7873514B2 (en) 2011-01-18
WO2008021185A2 (en) 2008-02-21
JP2010500819A (en) 2010-01-07
US20080040107A1 (en) 2008-02-14

Similar Documents

Publication Publication Date Title
WO2008021185A3 (en) A method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns
BRPI0802614A2 (en) methods and apparatus for encoding and decoding object-based audio signals
WO2008045950A3 (en) Methods and apparatus for embedding codes in compressed audio data streams
TWI365610B (en) Systems and methods for scalably encoding and decoding data
TW200746052A (en) Apparatus and method for encoding and decoding signal
WO2008014522A3 (en) Data encoding method and apparatus for flash-type signaling
WO2011062424A3 (en) Method and apparatus for transmitting and receiving data in a communication system
ATE490454T1 (en) METHOD FOR SWITCHING RATE AND BANDWIDTH SCALABLE AUDIO DECODING RATE
WO2011019234A3 (en) Method and apparatus for encoding and decoding image by using large transformation unit
WO2005122235A3 (en) Methods and devices for forming nanostructure monolayers and devices including such monolayers
ATE456256T1 (en) DECODER ARCHITECTURE FOR OPTIMIZED ERROR MANAGEMENT IN MULTIMEDIA STREAMS
WO2006126844A8 (en) Method and apparatus for decoding an audio signal
ATE532275T1 (en) METHOD AND SYSTEM FOR SENDING AND RECEIVING DATA STREAMS
SE0701690L (en) Generating a data stream and identifying positions within a data stream
EP3712888A3 (en) Apparatus and method for coding and decoding multi object audio signal with multi channel
TW200703240A (en) Systems, methods, and apparatus for quantization of spectral envelope representation
WO2009061363A3 (en) A scalable video coding method for fast channel change and increased error resilience
PT1826932E (en) Method and apparatus for generating digital audio signatures
HK1096428A1 (en) Method for producing hydrocarbons and oxygen-containing compounds, from biomass
EP2645367A3 (en) Encoding/decoding method for audio signals using adaptive sinusoidal coding and apparatus thereof
WO2008023352A3 (en) Method and apparatus for generating a summary
WO2012134121A3 (en) Method and apparatus for transmitting and receiving control information in a broadcasting/communication system
ATE527653T1 (en) METHOD AND DEVICE FOR ENCODING AND DECODING DIGITAL SIGNALS
MX2009009229A (en) Encoding device and encoding method.
WO2004080020A3 (en) Methods and apparatus for reducing discrete power spectral density components of signals transmitted in wideband communication systems

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07836673

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2009523846

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

NENP Non-entry into the national phase

Ref country code: RU

122 Ep: pct application non-entry in european phase

Ref document number: 07836673

Country of ref document: EP

Kind code of ref document: A2