WO2008021185A3 - A method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns - Google Patents
A method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns Download PDFInfo
- Publication number
- WO2008021185A3 WO2008021185A3 PCT/US2007/017719 US2007017719W WO2008021185A3 WO 2008021185 A3 WO2008021185 A3 WO 2008021185A3 US 2007017719 W US2007017719 W US 2007017719W WO 2008021185 A3 WO2008021185 A3 WO 2008021185A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- patterns
- relevant search
- perceptually relevant
- audio
- efficient
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0004—Design or structure of the codebook
- G10L2019/0005—Multi-stage vector quantisation
Abstract
A method and apparatus is disclosed herein for quantizing data using a perceptually relevant search of multiple quantization patterns. In one embodiment, the method comprises performing a perceptually relevant search of multiple quantization patterns in which one of a plurality of prototype patterns and its associated permutation are selected to quantize the target vector, each prototype pattern in the plurality of prototype patterns being capable of directing quantization across the vector; converting the one prototype pattern, the associated permutation and quantization information resulting from both to a plurality of bits by an encoder; and transferring the bits as part of a bit stream.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2009523846A JP2010500819A (en) | 2006-08-11 | 2007-08-08 | A method for quantizing speech and audio by efficient perceptual related retrieval of multiple quantization patterns |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US83716406P | 2006-08-11 | 2006-08-11 | |
US60/837,164 | 2006-08-11 | ||
US11/835,273 | 2007-08-07 | ||
US11/835,273 US7873514B2 (en) | 2006-08-11 | 2007-08-07 | Method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2008021185A2 WO2008021185A2 (en) | 2008-02-21 |
WO2008021185A3 true WO2008021185A3 (en) | 2008-04-17 |
Family
ID=38952080
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2007/017719 WO2008021185A2 (en) | 2006-08-11 | 2007-08-08 | A method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns |
Country Status (3)
Country | Link |
---|---|
US (1) | US7873514B2 (en) |
JP (1) | JP2010500819A (en) |
WO (1) | WO2008021185A2 (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20080047443A (en) * | 2005-10-14 | 2008-05-28 | 마츠시타 덴끼 산교 가부시키가이샤 | Transform coder and transform coding method |
CA2701757C (en) * | 2007-10-12 | 2016-11-22 | Panasonic Corporation | Vector quantization apparatus, vector dequantization apparatus and the methods |
US8515767B2 (en) * | 2007-11-04 | 2013-08-20 | Qualcomm Incorporated | Technique for encoding/decoding of codebook indices for quantized MDCT spectrum in scalable speech and audio codecs |
KR101826331B1 (en) * | 2010-09-15 | 2018-03-22 | 삼성전자주식회사 | Apparatus and method for encoding and decoding for high frequency bandwidth extension |
EP3023985B1 (en) | 2010-12-29 | 2017-07-05 | Samsung Electronics Co., Ltd | Methods for audio signal encoding and decoding |
US9716901B2 (en) * | 2012-05-23 | 2017-07-25 | Google Inc. | Quantization with distinct weighting of coherent and incoherent quantization error |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA1333420C (en) * | 1988-02-29 | 1994-12-06 | Tokumichi Murakami | Vector quantizer |
US5210820A (en) * | 1990-05-02 | 1993-05-11 | Broadcast Data Systems Limited Partnership | Signal recognition system and method |
JP3412118B2 (en) * | 1997-12-25 | 2003-06-03 | 日本電信電話株式会社 | Conjugate structure vector quantization method, apparatus therefor, and program recording medium |
JP3367931B2 (en) * | 2000-03-06 | 2003-01-20 | 日本電信電話株式会社 | Conjugate structure vector quantization method |
US7885809B2 (en) * | 2005-04-20 | 2011-02-08 | Ntt Docomo, Inc. | Quantization of speech and audio coding parameters using partial information on atypical subsequences |
JP4287840B2 (en) * | 2005-06-06 | 2009-07-01 | パナソニック株式会社 | Encoder |
-
2007
- 2007-08-07 US US11/835,273 patent/US7873514B2/en active Active
- 2007-08-08 JP JP2009523846A patent/JP2010500819A/en active Pending
- 2007-08-08 WO PCT/US2007/017719 patent/WO2008021185A2/en active Application Filing
Non-Patent Citations (5)
Title |
---|
KVIVEK K GOYAL ET AL: "On Optimal Permutation Codes", IEEE TRANSACTIONS ON INFORMATION THEORY, IEEE SERVICE CENTER, PISCATAWAY, NJ, US, vol. 47, no. 7, November 2001 (2001-11-01), XP011028083, ISSN: 0018-9448 * |
RAMO A: "Improving LSF quantization performance with sorting", SIGNAL PROCESSING, 2004. PROCEEDINGS. ICSP '04. 2004 7TH INTERNATIONAL CONFERENCE ON BEIJING, CHINA AUG. 31 - SEPT 4, 2004, PISCATAWAY, NJ, USA,IEEE, 31 August 2004 (2004-08-31), pages 587 - 589, XP010809692, ISBN: 0-7803-8406-7 * |
RAMPRASHAD S A: "Efficient Quantization Of Statistically Normalized Vectors Using Multi-Option Partial-Order Bit-Assignment Schemes", ACOUSTICS, SPEECH AND SIGNAL PROCESSING, 2006. ICASSP 2006 PROCEEDINGS. 2006 IEEE INTERNATIONAL CONFERENCE ON TOULOUSE, FRANCE 14-19 MAY 2006, PISCATAWAY, NJ, USA,IEEE, 14 May 2006 (2006-05-14), pages I - 845, XP010930312, ISBN: 1-4244-0469-X * |
SEAN A RAMPRASHAD ED - RADHA KRISHNA GANTI ET AL: "Partial-Order Bit-Allocation Schemes for Lowrate Quantization", SIGNALS, SYSTEMS AND COMPUTERS, 2006. ACSSC '06. FORTIETH ASILOMAR CONFERENCE ON, IEEE, PI, October 2006 (2006-10-01), pages 1059 - 1066, XP031081202, ISBN: 1-4244-0784-2 * |
SEAN A RAMPRASHAD: "Sparse Bit-Allocations Based on Partial Ordering Schemes With Application to Speech and Audio Coding", IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, IEEE SERVICE CENTER, NEW YORK, NY, US, vol. 15, no. 1, January 2007 (2007-01-01), pages 57 - 69, XP011145328, ISSN: 1558-7916 * |
Also Published As
Publication number | Publication date |
---|---|
US7873514B2 (en) | 2011-01-18 |
WO2008021185A2 (en) | 2008-02-21 |
JP2010500819A (en) | 2010-01-07 |
US20080040107A1 (en) | 2008-02-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2008021185A3 (en) | A method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns | |
BRPI0802614A2 (en) | methods and apparatus for encoding and decoding object-based audio signals | |
WO2008045950A3 (en) | Methods and apparatus for embedding codes in compressed audio data streams | |
TWI365610B (en) | Systems and methods for scalably encoding and decoding data | |
TW200746052A (en) | Apparatus and method for encoding and decoding signal | |
WO2008014522A3 (en) | Data encoding method and apparatus for flash-type signaling | |
WO2011062424A3 (en) | Method and apparatus for transmitting and receiving data in a communication system | |
ATE490454T1 (en) | METHOD FOR SWITCHING RATE AND BANDWIDTH SCALABLE AUDIO DECODING RATE | |
WO2011019234A3 (en) | Method and apparatus for encoding and decoding image by using large transformation unit | |
WO2005122235A3 (en) | Methods and devices for forming nanostructure monolayers and devices including such monolayers | |
ATE456256T1 (en) | DECODER ARCHITECTURE FOR OPTIMIZED ERROR MANAGEMENT IN MULTIMEDIA STREAMS | |
WO2006126844A8 (en) | Method and apparatus for decoding an audio signal | |
ATE532275T1 (en) | METHOD AND SYSTEM FOR SENDING AND RECEIVING DATA STREAMS | |
SE0701690L (en) | Generating a data stream and identifying positions within a data stream | |
EP3712888A3 (en) | Apparatus and method for coding and decoding multi object audio signal with multi channel | |
TW200703240A (en) | Systems, methods, and apparatus for quantization of spectral envelope representation | |
WO2009061363A3 (en) | A scalable video coding method for fast channel change and increased error resilience | |
PT1826932E (en) | Method and apparatus for generating digital audio signatures | |
HK1096428A1 (en) | Method for producing hydrocarbons and oxygen-containing compounds, from biomass | |
EP2645367A3 (en) | Encoding/decoding method for audio signals using adaptive sinusoidal coding and apparatus thereof | |
WO2008023352A3 (en) | Method and apparatus for generating a summary | |
WO2012134121A3 (en) | Method and apparatus for transmitting and receiving control information in a broadcasting/communication system | |
ATE527653T1 (en) | METHOD AND DEVICE FOR ENCODING AND DECODING DIGITAL SIGNALS | |
MX2009009229A (en) | Encoding device and encoding method. | |
WO2004080020A3 (en) | Methods and apparatus for reducing discrete power spectral density components of signals transmitted in wideband communication systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07836673 Country of ref document: EP Kind code of ref document: A2 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2009523846 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
NENP | Non-entry into the national phase |
Ref country code: RU |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 07836673 Country of ref document: EP Kind code of ref document: A2 |