WO2008106036A3 - Speech enhancement in entertainment audio - Google Patents

Speech enhancement in entertainment audio Download PDF

Info

Publication number
WO2008106036A3
WO2008106036A3 PCT/US2008/002238 US2008002238W WO2008106036A3 WO 2008106036 A3 WO2008106036 A3 WO 2008106036A3 US 2008002238 W US2008002238 W US 2008002238W WO 2008106036 A3 WO2008106036 A3 WO 2008106036A3
Authority
WO
WIPO (PCT)
Prior art keywords
audio
methods
speech enhancement
relates
entertainment audio
Prior art date
Application number
PCT/US2008/002238
Other languages
French (fr)
Other versions
WO2008106036A2 (en
Inventor
Hannes Muesch
Original Assignee
Dolby Lab Licensing Corp
Hannes Muesch
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to JP2009551991A priority Critical patent/JP5530720B2/en
Priority to US12/528,323 priority patent/US8195454B2/en
Priority to EP08725831A priority patent/EP2118885B1/en
Priority to BRPI0807703-7A priority patent/BRPI0807703B1/en
Priority to CN2008800099293A priority patent/CN101647059B/en
Priority to ES08725831T priority patent/ES2391228T3/en
Application filed by Dolby Lab Licensing Corp, Hannes Muesch filed Critical Dolby Lab Licensing Corp
Publication of WO2008106036A2 publication Critical patent/WO2008106036A2/en
Publication of WO2008106036A3 publication Critical patent/WO2008106036A3/en
Priority to US13/463,600 priority patent/US8271276B1/en
Priority to US13/571,344 priority patent/US8972250B2/en
Priority to US14/605,003 priority patent/US9368128B2/en
Priority to US14/701,622 priority patent/US9418680B2/en
Priority to US15/207,155 priority patent/US9818433B2/en
Priority to US15/730,908 priority patent/US10418052B2/en
Priority to US16/516,634 priority patent/US10586557B2/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012Comfort noise or silence coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals
    • G10L2025/932Decision in previous or following frames
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals
    • G10L2025/937Signal energy in various frequency bands

Abstract

The invention relates to audio signal processing. More specifically, the invention relates to enhancing entertainment audio, such as television audio, to improve the clarity and intelligibility of speech, such as dialog and narrative audio. The invention relates to methods, apparatus for performing such methods, and to software stored on a computer-readable medium for causing a computer to perform such methods.
PCT/US2008/002238 2007-02-26 2008-02-20 Speech enhancement in entertainment audio WO2008106036A2 (en)

Priority Applications (13)

Application Number Priority Date Filing Date Title
JP2009551991A JP5530720B2 (en) 2007-02-26 2008-02-20 Speech enhancement method, apparatus, and computer-readable recording medium for entertainment audio
US12/528,323 US8195454B2 (en) 2007-02-26 2008-02-20 Speech enhancement in entertainment audio
EP08725831A EP2118885B1 (en) 2007-02-26 2008-02-20 Speech enhancement in entertainment audio
BRPI0807703-7A BRPI0807703B1 (en) 2007-02-26 2008-02-20 METHOD FOR IMPROVING SPEECH IN ENTERTAINMENT AUDIO AND COMPUTER-READABLE NON-TRANSITIONAL MEDIA
CN2008800099293A CN101647059B (en) 2007-02-26 2008-02-20 Speech enhancement in entertainment audio
ES08725831T ES2391228T3 (en) 2007-02-26 2008-02-20 Entertainment audio voice enhancement
US13/463,600 US8271276B1 (en) 2007-02-26 2012-05-03 Enhancement of multichannel audio
US13/571,344 US8972250B2 (en) 2007-02-26 2012-08-10 Enhancement of multichannel audio
US14/605,003 US9368128B2 (en) 2007-02-26 2015-01-26 Enhancement of multichannel audio
US14/701,622 US9418680B2 (en) 2007-02-26 2015-05-01 Voice activity detector for audio signals
US15/207,155 US9818433B2 (en) 2007-02-26 2016-07-11 Voice activity detector for audio signals
US15/730,908 US10418052B2 (en) 2007-02-26 2017-10-12 Voice activity detector for audio signals
US16/516,634 US10586557B2 (en) 2007-02-26 2019-07-19 Voice activity detector for audio signals

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US90339207P 2007-02-26 2007-02-26
US60/903,392 2007-02-26

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US12/528,323 A-371-Of-International US8195454B2 (en) 2007-02-26 2008-02-20 Speech enhancement in entertainment audio
US13/463,600 Continuation US8271276B1 (en) 2007-02-26 2012-05-03 Enhancement of multichannel audio

Publications (2)

Publication Number Publication Date
WO2008106036A2 WO2008106036A2 (en) 2008-09-04
WO2008106036A3 true WO2008106036A3 (en) 2008-11-27

Family

ID=39721787

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/002238 WO2008106036A2 (en) 2007-02-26 2008-02-20 Speech enhancement in entertainment audio

Country Status (8)

Country Link
US (8) US8195454B2 (en)
EP (1) EP2118885B1 (en)
JP (2) JP5530720B2 (en)
CN (1) CN101647059B (en)
BR (1) BRPI0807703B1 (en)
ES (1) ES2391228T3 (en)
RU (1) RU2440627C2 (en)
WO (1) WO2008106036A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102088648A (en) * 2009-12-03 2011-06-08 奥迪康有限公司 Method for dynamic suppression of surrounding acoustic noise when listening to electrical inputs

Families Citing this family (83)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100789084B1 (en) * 2006-11-21 2007-12-26 한양대학교 산학협력단 Speech enhancement method by overweighting gain with nonlinear structure in wavelet packet transform
US8195454B2 (en) 2007-02-26 2012-06-05 Dolby Laboratories Licensing Corporation Speech enhancement in entertainment audio
WO2009086174A1 (en) 2007-12-21 2009-07-09 Srs Labs, Inc. System for adjusting perceived loudness of audio signals
US8639519B2 (en) * 2008-04-09 2014-01-28 Motorola Mobility Llc Method and apparatus for selective signal coding based on core encoder performance
UA101974C2 (en) * 2008-04-18 2013-05-27 Долби Леборетериз Лайсенсинг Корпорейшн Method and apparatus for maintaining speech audibility in multi-channel audio with minimal impact on surround experience
US8712771B2 (en) * 2009-07-02 2014-04-29 Alon Konchitsky Automated difference recognition between speaking sounds and music
DE112009005215T8 (en) * 2009-08-04 2013-01-03 Nokia Corp. Method and apparatus for audio signal classification
US8538042B2 (en) * 2009-08-11 2013-09-17 Dts Llc System for increasing perceived loudness of speakers
EP2486567A1 (en) 2009-10-09 2012-08-15 Dolby Laboratories Licensing Corporation Automatic generation of metadata for audio dominance effects
BR112012008671A2 (en) 2009-10-19 2016-04-19 Ericsson Telefon Ab L M method for detecting voice activity from a received input signal, and, voice activity detector
US9838784B2 (en) 2009-12-02 2017-12-05 Knowles Electronics, Llc Directional audio capture
TWI459828B (en) * 2010-03-08 2014-11-01 Dolby Lab Licensing Corp Method and system for scaling ducking of speech-relevant channels in multi-channel audio
WO2011115944A1 (en) 2010-03-18 2011-09-22 Dolby Laboratories Licensing Corporation Techniques for distortion reducing multi-band compressor with timbre preservation
US8538035B2 (en) 2010-04-29 2013-09-17 Audience, Inc. Multi-microphone robust noise suppression
US8473287B2 (en) 2010-04-19 2013-06-25 Audience, Inc. Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system
JP5834449B2 (en) * 2010-04-22 2015-12-24 富士通株式会社 Utterance state detection device, utterance state detection program, and utterance state detection method
US8781137B1 (en) 2010-04-27 2014-07-15 Audience, Inc. Wind noise detection and suppression
US8447596B2 (en) 2010-07-12 2013-05-21 Audience, Inc. Monaural noise suppression based on computational auditory scene analysis
JP5652642B2 (en) * 2010-08-02 2015-01-14 ソニー株式会社 Data generation apparatus, data generation method, data processing apparatus, and data processing method
KR101726738B1 (en) * 2010-12-01 2017-04-13 삼성전자주식회사 Sound processing apparatus and sound processing method
EP2469741A1 (en) * 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
KR101520212B1 (en) 2011-04-15 2015-05-13 텔레폰악티에볼라겟엘엠에릭슨(펍) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
US8918197B2 (en) 2012-06-13 2014-12-23 Avraham Suhami Audio communication networks
FR2981782B1 (en) * 2011-10-20 2015-12-25 Esii METHOD FOR SENDING AND AUDIO RECOVERY OF AUDIO INFORMATION
JP5565405B2 (en) * 2011-12-21 2014-08-06 ヤマハ株式会社 Sound processing apparatus and sound processing method
US20130253923A1 (en) * 2012-03-21 2013-09-26 Her Majesty The Queen In Right Of Canada, As Represented By The Minister Of Industry Multichannel enhancement system for preserving spatial cues
CN103325386B (en) * 2012-03-23 2016-12-21 杜比实验室特许公司 The method and system controlled for signal transmission
WO2013150340A1 (en) * 2012-04-05 2013-10-10 Nokia Corporation Adaptive audio signal filtering
US9312829B2 (en) 2012-04-12 2016-04-12 Dts Llc System for adjusting loudness of audio signals in real time
US8843367B2 (en) * 2012-05-04 2014-09-23 8758271 Canada Inc. Adaptive equalization system
US9460729B2 (en) 2012-09-21 2016-10-04 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
JP2014106247A (en) * 2012-11-22 2014-06-09 Fujitsu Ltd Signal processing device, signal processing method, and signal processing program
MY193865A (en) * 2013-01-08 2022-10-29 Dolby Int Ab Model based prediction in a critically sampled filterbank
DE13750900T1 (en) 2013-01-08 2016-02-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Improved speech intelligibility for background noise through SII-dependent amplification and compression
CN103079258A (en) * 2013-01-09 2013-05-01 广东欧珀移动通信有限公司 Method for improving speech recognition accuracy and mobile intelligent terminal
US10506067B2 (en) 2013-03-15 2019-12-10 Sonitum Inc. Dynamic personalization of a communication session in heterogeneous environments
US9933990B1 (en) 2013-03-15 2018-04-03 Sonitum Inc. Topological mapping of control parameters
CN104079247B (en) 2013-03-26 2018-02-09 杜比实验室特许公司 Balanced device controller and control method and audio reproducing system
CN104078050A (en) 2013-03-26 2014-10-01 杜比实验室特许公司 Device and method for audio classification and audio processing
CN107093991B (en) 2013-03-26 2020-10-09 杜比实验室特许公司 Loudness normalization method and equipment based on target loudness
CN108365827B (en) 2013-04-29 2021-10-26 杜比实验室特许公司 Band compression with dynamic threshold
TWM487509U (en) * 2013-06-19 2014-10-01 杜比實驗室特許公司 Audio processing apparatus and electrical device
EP3014609B1 (en) * 2013-06-27 2017-09-27 Dolby Laboratories Licensing Corporation Bitstream syntax for spatial voice coding
US9031838B1 (en) 2013-07-15 2015-05-12 Vail Systems, Inc. Method and apparatus for voice clarity and speech intelligibility detection and correction
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
CN103413553B (en) * 2013-08-20 2016-03-09 腾讯科技(深圳)有限公司 Audio coding method, audio-frequency decoding method, coding side, decoding end and system
CN105493182B (en) * 2013-08-28 2020-01-21 杜比实验室特许公司 Hybrid waveform coding and parametric coding speech enhancement
TR201908748T4 (en) * 2013-10-22 2019-07-22 Fraunhofer Ges Forschung Concept for combined dynamic range compression and guided clipping for audio devices.
JP6361271B2 (en) * 2014-05-09 2018-07-25 富士通株式会社 Speech enhancement device, speech enhancement method, and computer program for speech enhancement
CN105336341A (en) 2014-05-26 2016-02-17 杜比实验室特许公司 Method for enhancing intelligibility of voice content in audio signals
US9978388B2 (en) 2014-09-12 2018-05-22 Knowles Electronics, Llc Systems and methods for restoration of speech components
RU2696952C2 (en) * 2014-10-01 2019-08-07 Долби Интернешнл Аб Audio coder and decoder
EP4044180A1 (en) 2014-10-01 2022-08-17 Dolby International AB Decoding an encoded audio signal using drc profiles
US10163453B2 (en) 2014-10-24 2018-12-25 Staton Techiya, Llc Robust voice activity detector system for use with an earphone
CN104409081B (en) * 2014-11-25 2017-12-22 广州酷狗计算机科技有限公司 Audio signal processing method and device
JP6501259B2 (en) * 2015-08-04 2019-04-17 本田技研工業株式会社 Speech processing apparatus and speech processing method
EP3203472A1 (en) * 2016-02-08 2017-08-09 Oticon A/s A monaural speech intelligibility predictor unit
US9820042B1 (en) 2016-05-02 2017-11-14 Knowles Electronics, Llc Stereo separation and directional suppression with omni-directional microphones
RU2620569C1 (en) * 2016-05-17 2017-05-26 Николай Александрович Иванов Method of measuring the convergence of speech
RU2676022C1 (en) * 2016-07-13 2018-12-25 Общество с ограниченной ответственностью "Речевая аппаратура "Унитон" Method of increasing the speech intelligibility
US10362412B2 (en) 2016-12-22 2019-07-23 Oticon A/S Hearing device comprising a dynamic compressive amplification system and a method of operating a hearing device
WO2018152034A1 (en) * 2017-02-14 2018-08-23 Knowles Electronics, Llc Voice activity detector and methods therefor
US11386913B2 (en) 2017-08-01 2022-07-12 Dolby Laboratories Licensing Corporation Audio object classification based on location metadata
WO2019027812A1 (en) 2017-08-01 2019-02-07 Dolby Laboratories Licensing Corporation Audio object classification based on location metadata
EP3477641A1 (en) * 2017-10-26 2019-05-01 Vestel Elektronik Sanayi ve Ticaret A.S. Consumer electronics device and method of operation
WO2020020043A1 (en) * 2018-07-25 2020-01-30 Dolby Laboratories Licensing Corporation Compressor target curve to avoid boosting noise
US11335357B2 (en) * 2018-08-14 2022-05-17 Bose Corporation Playback enhancement in audio systems
CN110875059B (en) * 2018-08-31 2022-08-05 深圳市优必选科技有限公司 Method and device for judging reception end and storage device
US10795638B2 (en) 2018-10-19 2020-10-06 Bose Corporation Conversation assistance audio device personalization
US11164592B1 (en) * 2019-05-09 2021-11-02 Amazon Technologies, Inc. Responsive automatic gain control
US11146607B1 (en) * 2019-05-31 2021-10-12 Dialpad, Inc. Smart noise cancellation
CN114503197B (en) * 2019-08-27 2023-06-13 杜比实验室特许公司 Dialog enhancement using adaptive smoothing
RU2726326C1 (en) * 2019-11-26 2020-07-13 Акционерное общество "ЗАСЛОН" Method of increasing intelligibility of speech by elderly people when receiving sound programs on headphones
WO2021119102A1 (en) * 2019-12-09 2021-06-17 Dolby Laboratories Licensing Corporation Adjusting audio and non-audio features based on noise metrics and speech intelligibility metrics
US20230113561A1 (en) * 2020-03-13 2023-04-13 Immersion Networks, Inc. Loudness equalization system
CN115335901A (en) * 2020-03-27 2022-11-11 杜比实验室特许公司 Automatic leveling of speech content
JP2023530225A (en) 2020-05-29 2023-07-14 フラウンホファー ゲセルシャフト ツール フェールデルンク ダー アンゲヴァンテン フォルシュンク エー.ファオ. Method and apparatus for processing early audio signals
TW202226225A (en) * 2020-10-27 2022-07-01 美商恩倍科微電子股份有限公司 Apparatus and method for improved voice activity detection using zero crossing detection
US11790931B2 (en) 2020-10-27 2023-10-17 Ambiq Micro, Inc. Voice activity detection using zero crossing detection
US11595730B2 (en) * 2021-03-08 2023-02-28 Tencent America LLC Signaling loudness adjustment for an audio scene
CN113113049A (en) * 2021-03-18 2021-07-13 西北工业大学 Voice activity detection method combined with voice enhancement
EP4134954B1 (en) * 2021-08-09 2023-08-02 OPTImic GmbH Method and device for improving an audio signal
KR102628500B1 (en) * 2021-09-29 2024-01-24 주식회사 케이티 Apparatus for face-to-face recording and method for using the same

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4672669A (en) * 1983-06-07 1987-06-09 International Business Machines Corp. Voice activity detection process and means for implementing said process
US6198830B1 (en) * 1997-01-29 2001-03-06 Siemens Audiologische Technik Gmbh Method and circuit for the amplification of input signals of a hearing aid
US20030101050A1 (en) * 2001-11-29 2003-05-29 Microsoft Corporation Real-time speech and music classifier
US20030198357A1 (en) * 2001-08-07 2003-10-23 Todd Schneider Sound intelligibility enhancement using a psychoacoustic model and an oversampled filterbank
US20040190740A1 (en) * 2003-02-26 2004-09-30 Josef Chalupper Method for automatic amplification adjustment in a hearing aid device, as well as a hearing aid device

Family Cites Families (120)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3803357A (en) 1971-06-30 1974-04-09 J Sacks Noise filter
US4661981A (en) 1983-01-03 1987-04-28 Henrickson Larry K Method and means for processing speech
US4628529A (en) 1985-07-01 1986-12-09 Motorola, Inc. Noise suppression system
US4912767A (en) 1988-03-14 1990-03-27 International Business Machines Corporation Distributed noise cancellation system
CN1062963C (en) 1990-04-12 2001-03-07 多尔拜实验特许公司 Adaptive-block-lenght, adaptive-transform, and adaptive-window transform coder, decoder, and encoder/decoder for high-quality audio
ATE138238T1 (en) 1991-01-08 1996-06-15 Dolby Lab Licensing Corp ENCODER/DECODER FOR MULTI-DIMENSIONAL SOUND FIELDS
US5632005A (en) 1991-01-08 1997-05-20 Ray Milton Dolby Encoder/decoder for multidimensional sound fields
DE69232729T2 (en) 1991-05-29 2003-04-24 Pacific Microsonics Inc Improvements in systems to achieve higher frequency resolution
US5388185A (en) 1991-09-30 1995-02-07 U S West Advanced Technologies, Inc. System for adaptive processing of telephone voice signals
US5263091A (en) 1992-03-10 1993-11-16 Waller Jr James K Intelligent automatic threshold circuit
US5251263A (en) 1992-05-22 1993-10-05 Andrea Electronics Corporation Adaptive noise cancellation and speech enhancement system and apparatus therefor
US5734789A (en) 1992-06-01 1998-03-31 Hughes Electronics Voiced, unvoiced or noise modes in a CELP vocoder
US5425106A (en) 1993-06-25 1995-06-13 Hda Entertainment, Inc. Integrated circuit for audio enhancement system
US5400405A (en) 1993-07-02 1995-03-21 Harman Electronics, Inc. Audio image enhancement system
US5471527A (en) 1993-12-02 1995-11-28 Dsc Communications Corporation Voice enhancement system and method
US5539806A (en) 1994-09-23 1996-07-23 At&T Corp. Method for customer selection of telephone sound enhancement
US5623491A (en) 1995-03-21 1997-04-22 Dsc Communications Corporation Device for adapting narrowband voice traffic of a local access network to allow transmission over a broadband asynchronous transfer mode network
US5727119A (en) 1995-03-27 1998-03-10 Dolby Laboratories Licensing Corporation Method and apparatus for efficient implementation of single-sideband filter banks providing accurate measures of spectral magnitude and phase
US5812969A (en) * 1995-04-06 1998-09-22 Adaptec, Inc. Process for balancing the loudness of digitally sampled audio waveforms
US6263307B1 (en) * 1995-04-19 2001-07-17 Texas Instruments Incorporated Adaptive weiner filtering using line spectral frequencies
US5661808A (en) 1995-04-27 1997-08-26 Srs Labs, Inc. Stereo enhancement system
JP3416331B2 (en) 1995-04-28 2003-06-16 松下電器産業株式会社 Audio decoding device
US5774557A (en) 1995-07-24 1998-06-30 Slater; Robert Winston Autotracking microphone squelch for aircraft intercom systems
FI102337B1 (en) * 1995-09-13 1998-11-13 Nokia Mobile Phones Ltd Method and circuit arrangement for processing an audio signal
FI100840B (en) 1995-12-12 1998-02-27 Nokia Mobile Phones Ltd Noise attenuator and method for attenuating background noise from noisy speech and a mobile station
DE19547093A1 (en) * 1995-12-16 1997-06-19 Nokia Deutschland Gmbh Circuit for improvement of noise immunity of audio signal
US5689615A (en) 1996-01-22 1997-11-18 Rockwell International Corporation Usage of voice activity detection for efficient coding of speech
US5884255A (en) * 1996-07-16 1999-03-16 Coherent Communications Systems Corp. Speech detection system employing multiple determinants
US6570991B1 (en) 1996-12-18 2003-05-27 Interval Research Corporation Multi-feature speech/music discrimination system
JPH10257583A (en) * 1997-03-06 1998-09-25 Asahi Chem Ind Co Ltd Voice processing unit and its voice processing method
US5907822A (en) 1997-04-04 1999-05-25 Lincom Corporation Loss tolerant speech decoder for telecommunications
US6208637B1 (en) 1997-04-14 2001-03-27 Next Level Communications, L.L.P. Method and apparatus for the generation of analog telephone signals in digital subscriber line access systems
FR2768547B1 (en) 1997-09-18 1999-11-19 Matra Communication METHOD FOR NOISE REDUCTION OF A DIGITAL SPEAKING SIGNAL
US6169971B1 (en) * 1997-12-03 2001-01-02 Glenayre Electronics, Inc. Method to suppress noise in digital voice processing
US6104994A (en) 1998-01-13 2000-08-15 Conexant Systems, Inc. Method for speech coding under background noise conditions
EP2009785B1 (en) 1998-04-14 2010-09-15 Hearing Enhancement Company, Llc. Method and apparatus for providing end user adjustment capability that accommodates hearing impaired and non-hearing impaired listener preferences
US6122611A (en) 1998-05-11 2000-09-19 Conexant Systems, Inc. Adding noise during LPC coded voice activity periods to improve the quality of coded speech coexisting with background noise
US6453289B1 (en) * 1998-07-24 2002-09-17 Hughes Electronics Corporation Method of noise reduction for speech codecs
US6223154B1 (en) 1998-07-31 2001-04-24 Motorola, Inc. Using vocoded parameters in a staggered average to provide speakerphone operation based on enhanced speech activity thresholds
US6188981B1 (en) 1998-09-18 2001-02-13 Conexant Systems, Inc. Method and apparatus for detecting voice activity in a speech signal
US6061431A (en) 1998-10-09 2000-05-09 Cisco Technology, Inc. Method for hearing loss compensation in telephony systems based on telephone number resolution
US6993480B1 (en) 1998-11-03 2006-01-31 Srs Labs, Inc. Voice intelligibility enhancement system
US6256606B1 (en) 1998-11-30 2001-07-03 Conexant Systems, Inc. Silence description coding for multi-rate speech codecs
US6208618B1 (en) 1998-12-04 2001-03-27 Tellabs Operations, Inc. Method and apparatus for replacing lost PSTN data in a packet network
US6289309B1 (en) 1998-12-16 2001-09-11 Sarnoff Corporation Noise spectrum tracking for speech enhancement
US6922669B2 (en) 1998-12-29 2005-07-26 Koninklijke Philips Electronics N.V. Knowledge-based strategies applied to N-best lists in automatic speech recognition systems
US6246345B1 (en) * 1999-04-16 2001-06-12 Dolby Laboratories Licensing Corporation Using gain-adaptive quantization and non-uniform symbol lengths for improved audio coding
US6618701B2 (en) * 1999-04-19 2003-09-09 Motorola, Inc. Method and system for noise suppression using external voice activity detection
US6633841B1 (en) 1999-07-29 2003-10-14 Mindspeed Technologies, Inc. Voice activity detection speech coding to accommodate music signals
US6910011B1 (en) * 1999-08-16 2005-06-21 Haman Becker Automotive Systems - Wavemakers, Inc. Noisy acoustic signal enhancement
CA2290037A1 (en) * 1999-11-18 2001-05-18 Voiceage Corporation Gain-smoothing amplifier device and method in codecs for wideband speech and audio signals
US6813490B1 (en) * 1999-12-17 2004-11-02 Nokia Corporation Mobile station with audio signal adaptation to hearing characteristics of the user
US6449593B1 (en) 2000-01-13 2002-09-10 Nokia Mobile Phones Ltd. Method and system for tracking human speakers
US6351733B1 (en) 2000-03-02 2002-02-26 Hearing Enhancement Company, Llc Method and apparatus for accommodating primary content audio and secondary content remaining audio capability in the digital audio production process
US7962326B2 (en) 2000-04-20 2011-06-14 Invention Machine Corporation Semantic answering system and method
US20030179888A1 (en) * 2002-03-05 2003-09-25 Burnett Gregory C. Voice activity detection (VAD) devices and methods for use with noise suppression systems
US7246058B2 (en) 2001-05-30 2007-07-17 Aliph, Inc. Detecting voiced and unvoiced speech using both acoustic and nonacoustic sensors
US6898566B1 (en) * 2000-08-16 2005-05-24 Mindspeed Technologies, Inc. Using signal to noise ratio of a speech signal to adjust thresholds for extracting speech parameters for coding the speech signal
US6862567B1 (en) * 2000-08-30 2005-03-01 Mindspeed Technologies, Inc. Noise suppression in the frequency domain by adjusting gain according to voicing parameters
US7020605B2 (en) * 2000-09-15 2006-03-28 Mindspeed Technologies, Inc. Speech coding system with time-domain noise attenuation
US6615169B1 (en) * 2000-10-18 2003-09-02 Nokia Corporation High frequency enhancement layer coding in wideband speech codec
JP2002169599A (en) * 2000-11-30 2002-06-14 Toshiba Corp Noise suppressing method and electronic equipment
US6631139B2 (en) 2001-01-31 2003-10-07 Qualcomm Incorporated Method and apparatus for interoperability between voice transmission systems during speech inactivity
US6694293B2 (en) * 2001-02-13 2004-02-17 Mindspeed Technologies, Inc. Speech coding system with a music classifier
US20030028386A1 (en) 2001-04-02 2003-02-06 Zinser Richard L. Compressed domain universal transcoder
ATE318062T1 (en) 2001-04-18 2006-03-15 Gennum Corp MULTI-CHANNEL HEARING AID WITH TRANSMISSION POSSIBILITIES BETWEEN THE CHANNELS
DE60222445T2 (en) * 2001-08-17 2008-06-12 Broadcom Corp., Irvine METHOD FOR HIDING BIT ERRORS FOR LANGUAGE CODING
US20030046069A1 (en) * 2001-08-28 2003-03-06 Vergin Julien Rivarol Noise reduction system and method
WO2003022003A2 (en) * 2001-09-06 2003-03-13 Koninklijke Philips Electronics N.V. Audio reproducing device
US6937980B2 (en) 2001-10-02 2005-08-30 Telefonaktiebolaget Lm Ericsson (Publ) Speech recognition using microphone antenna array
US7328151B2 (en) 2002-03-22 2008-02-05 Sound Id Audio decoder with dynamic adjustment of signal modification
US7167568B2 (en) 2002-05-02 2007-01-23 Microsoft Corporation Microphone array signal enhancement
US7072477B1 (en) * 2002-07-09 2006-07-04 Apple Computer, Inc. Method and apparatus for automatically normalizing a perceived volume level in a digitally encoded file
CA2492091C (en) * 2002-07-12 2009-04-28 Widex A/S Hearing aid and a method for enhancing speech intelligibility
US7454331B2 (en) 2002-08-30 2008-11-18 Dolby Laboratories Licensing Corporation Controlling loudness of speech in signals that contain speech and other types of audio material
US7283956B2 (en) * 2002-09-18 2007-10-16 Motorola, Inc. Noise suppression
AU2003278013A1 (en) 2002-10-11 2004-05-04 Voiceage Corporation Methods and devices for source controlled variable bit-rate wideband speech coding
US7174022B1 (en) * 2002-11-15 2007-02-06 Fortemedia, Inc. Small array microphone for beam-forming and noise suppression
US7343284B1 (en) * 2003-07-17 2008-03-11 Nortel Networks Limited Method and system for speech processing for enhancement and detection
US7398207B2 (en) * 2003-08-25 2008-07-08 Time Warner Interactive Video Group, Inc. Methods and systems for determining audio loudness levels in programming
US7099821B2 (en) * 2003-09-12 2006-08-29 Softmax, Inc. Separation of target acoustic signals in a multi-transducer arrangement
SG119199A1 (en) * 2003-09-30 2006-02-28 Stmicroelectronics Asia Pacfic Voice activity detector
US7539614B2 (en) * 2003-11-14 2009-05-26 Nxp B.V. System and method for audio signal processing using different gain factors for voiced and unvoiced phonemes
US7483831B2 (en) 2003-11-21 2009-01-27 Articulation Incorporated Methods and apparatus for maximizing speech intelligibility in quiet or noisy backgrounds
CA2454296A1 (en) * 2003-12-29 2005-06-29 Nokia Corporation Method and device for speech enhancement in the presence of background noise
FI118834B (en) 2004-02-23 2008-03-31 Nokia Corp Classification of audio signals
CA2992065C (en) 2004-03-01 2018-11-20 Dolby Laboratories Licensing Corporation Reconstructing audio signals with multiple decorrelation techniques
US7492889B2 (en) 2004-04-23 2009-02-17 Acoustic Technologies, Inc. Noise suppression based on bark band wiener filtering and modified doblinger noise estimate
US7451093B2 (en) 2004-04-29 2008-11-11 Srs Labs, Inc. Systems and methods of remotely enabling sound enhancement techniques
AU2004320207A1 (en) 2004-05-25 2005-12-08 Huonlabs Pty Ltd Audio apparatus and method
US8788265B2 (en) 2004-05-25 2014-07-22 Nokia Solutions And Networks Oy System and method for babble noise detection
US7649988B2 (en) 2004-06-15 2010-01-19 Acoustic Technologies, Inc. Comfort noise generator using modified Doblinger noise estimate
TWI454101B (en) 2004-08-30 2014-09-21 Qualcomm Inc Adaptive de-jitter buffer for packetized data commumications
FI20045315A (en) 2004-08-30 2006-03-01 Nokia Corp Detection of voice activity in an audio signal
CN101015230B (en) 2004-09-06 2012-09-05 皇家飞利浦电子股份有限公司 Audio signal enhancement
US7383179B2 (en) * 2004-09-28 2008-06-03 Clarity Technologies, Inc. Method of cascading noise reduction algorithms to avoid speech distortion
US7949520B2 (en) 2004-10-26 2011-05-24 QNX Software Sytems Co. Adaptive filter pitch extraction
WO2006051451A1 (en) 2004-11-09 2006-05-18 Koninklijke Philips Electronics N.V. Audio coding and decoding
RU2284585C1 (en) 2005-02-10 2006-09-27 Владимир Кириллович Железняк Method for measuring speech intelligibility
US20060224381A1 (en) 2005-04-04 2006-10-05 Nokia Corporation Detecting speech frames belonging to a low energy sequence
WO2006116025A1 (en) 2005-04-22 2006-11-02 Qualcomm Incorporated Systems, methods, and apparatus for gain factor smoothing
US8566086B2 (en) 2005-06-28 2013-10-22 Qnx Software Systems Limited System for adaptive enhancement of speech signals
US20070078645A1 (en) 2005-09-30 2007-04-05 Nokia Corporation Filterbank-based processing of speech signals
EP1640972A1 (en) 2005-12-23 2006-03-29 Phonak AG System and method for separation of a users voice from ambient sound
US20070147635A1 (en) 2005-12-23 2007-06-28 Phonak Ag System and method for separation of a user's voice from ambient sound
US20070198251A1 (en) 2006-02-07 2007-08-23 Jaber Associates, L.L.C. Voice activity detection method and apparatus for voiced/unvoiced decision and pitch estimation in a noisy speech feature extraction
WO2007091956A2 (en) * 2006-02-10 2007-08-16 Telefonaktiebolaget Lm Ericsson (Publ) A voice detector and a method for suppressing sub-bands in a voice detector
ATE527833T1 (en) 2006-05-04 2011-10-15 Lg Electronics Inc IMPROVE STEREO AUDIO SIGNALS WITH REMIXING
US8032370B2 (en) * 2006-05-09 2011-10-04 Nokia Corporation Method, apparatus, system and software product for adaptation of voice activity detection parameters based on the quality of the coding modes
CN100578622C (en) * 2006-05-30 2010-01-06 北京中星微电子有限公司 A kind of adaptive microphone array system and audio signal processing method thereof
US20080071540A1 (en) 2006-09-13 2008-03-20 Honda Motor Co., Ltd. Speech recognition method for robot under motor noise thereof
WO2007082579A2 (en) 2006-12-18 2007-07-26 Phonak Ag Active hearing protection system
US8195454B2 (en) * 2007-02-26 2012-06-05 Dolby Laboratories Licensing Corporation Speech enhancement in entertainment audio
WO2009086174A1 (en) * 2007-12-21 2009-07-09 Srs Labs, Inc. System for adjusting perceived loudness of audio signals
US8175888B2 (en) 2008-12-29 2012-05-08 Motorola Mobility, Inc. Enhanced layered gain factor balancing within a multiple-channel audio coding system
CN102044243B (en) * 2009-10-15 2012-08-29 华为技术有限公司 Method and device for voice activity detection (VAD) and encoder
DK3493205T3 (en) * 2010-12-24 2021-04-19 Huawei Tech Co Ltd METHOD AND DEVICE FOR ADAPTIVE DETECTION OF VOICE ACTIVITY IN AN AUDIO INPUT SIGNAL
CN102801861B (en) * 2012-08-07 2015-08-19 歌尔声学股份有限公司 A kind of sound enhancement method and device being applied to mobile phone
US9472208B2 (en) * 2012-08-31 2016-10-18 Telefonaktiebolaget Lm Ericsson (Publ) Method and device for voice activity detection
US20140126737A1 (en) * 2012-11-05 2014-05-08 Aliphcom, Inc. Noise suppressing multi-microphone headset

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4672669A (en) * 1983-06-07 1987-06-09 International Business Machines Corp. Voice activity detection process and means for implementing said process
US6198830B1 (en) * 1997-01-29 2001-03-06 Siemens Audiologische Technik Gmbh Method and circuit for the amplification of input signals of a hearing aid
US20030198357A1 (en) * 2001-08-07 2003-10-23 Todd Schneider Sound intelligibility enhancement using a psychoacoustic model and an oversampled filterbank
US20030101050A1 (en) * 2001-11-29 2003-05-29 Microsoft Corporation Real-time speech and music classifier
US20040190740A1 (en) * 2003-02-26 2004-09-30 Josef Chalupper Method for automatic amplification adjustment in a hearing aid device, as well as a hearing aid device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
BASBUG F ET AL: "Robust voice activity detection for DTX operation of speech coders", SPEECH CODING PROCEEDINGS, 1999 IEEE WORKSHOP ON PORVOO, FINLAND 20-23 JUNE 1999, PISCATAWAY, NJ, USA,IEEE, US, 20 June 1999 (1999-06-20), pages 58 - 60, XP010345538, ISBN: 978-0-7803-5651-1 *
BERITELLI F ET AL: "Performance Evaluation and Comparison of G.729/AMR/Fuzzy Voice Activity Detectors", IEEE SIGNAL PROCESSING LETTERS, IEEE SERVICE CENTER, PISCATAWAY, NJ, US, vol. 9, no. 3, 1 March 2002 (2002-03-01), XP011067784, ISSN: 1070-9908 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102088648A (en) * 2009-12-03 2011-06-08 奥迪康有限公司 Method for dynamic suppression of surrounding acoustic noise when listening to electrical inputs
CN102088648B (en) * 2009-12-03 2015-04-08 奥迪康有限公司 Acoustic instrument and method for operating acoustic instrument adapted for clients

Also Published As

Publication number Publication date
US9818433B2 (en) 2017-11-14
RU2440627C2 (en) 2012-01-20
US20150142424A1 (en) 2015-05-21
US10586557B2 (en) 2020-03-10
US20120310635A1 (en) 2012-12-06
US20180033453A1 (en) 2018-02-01
US20100121634A1 (en) 2010-05-13
US9368128B2 (en) 2016-06-14
US20150243300A1 (en) 2015-08-27
US8271276B1 (en) 2012-09-18
ES2391228T3 (en) 2012-11-22
CN101647059B (en) 2012-09-05
EP2118885B1 (en) 2012-07-11
RU2009135829A (en) 2011-04-10
BRPI0807703B1 (en) 2020-09-24
CN101647059A (en) 2010-02-10
BRPI0807703A2 (en) 2014-05-27
US9418680B2 (en) 2016-08-16
US10418052B2 (en) 2019-09-17
WO2008106036A2 (en) 2008-09-04
JP2013092792A (en) 2013-05-16
US20190341069A1 (en) 2019-11-07
US8195454B2 (en) 2012-06-05
EP2118885A2 (en) 2009-11-18
US8972250B2 (en) 2015-03-03
US20120221328A1 (en) 2012-08-30
JP5530720B2 (en) 2014-06-25
JP2010519601A (en) 2010-06-03
US20160322068A1 (en) 2016-11-03

Similar Documents

Publication Publication Date Title
WO2008106036A3 (en) Speech enhancement in entertainment audio
EP2151822B8 (en) Apparatus and method for processing an audio signal for speech enhancement using a feature extraction
WO2011130083A3 (en) Camera-assisted noise cancellation and speech recognition
WO2010087614A3 (en) Method for encoding and decoding an audio signal and apparatus for same
EP2323425A4 (en) Method and device for generating, playing audio signals, and processing system for audio signals
EP2594087B8 (en) Electronic apparatus for generating modified wideband audio signals based on two or more wideband microphone signals
TW200738040A (en) Method and apparatus for processing a audio signal
WO2010011377A3 (en) Method and apparatus for maintaining speech audibility in multi-channel audio with minimal impact on surround experience
EP2499582A4 (en) System and method for hybrid processing in a natural language voive services environment
EP4246511A3 (en) Method and apparatus for compressing and decompressing a higher order ambisonics signal representation
WO2007098055A3 (en) Encoding and adaptive, scalable accessing of distributed models
WO2010060076A3 (en) Systems, methods, apparatus, and computer program products for enhanced active noise cancellation
EP3246918C0 (en) Audio decoder, method for decoding an audio signal and computer program
WO2010086194A3 (en) Apparatus, method and computer program for manipulating an audio signal comprising a transient event
WO2010142297A3 (en) Generative audio matching game system
WO2009110738A3 (en) Method and apparatus for processing audio signal
WO2012108680A3 (en) Method and device for bandwidth extension
PL2425427T3 (en) Apparatus for providing one or more adjusted parameters for a provision of an upmix signal representation on the basis of a downmix signal representation, audio signal decoder, audio signal transcoder, method and computer program using an object-related parametric information
TW200731832A (en) Method and apparatus for processing a media signal
EP3803866A4 (en) Method, apparatus and computer-readable media to manage semi-constant (persistent) sound sources in microphone pickup/focus zones
WO2008109504A3 (en) Processing system and method for performing high throughput non-plasma processing
WO2008085703A3 (en) A spectro-temporal varying approach for speech enhancement
WO2010090427A3 (en) Audio signal encoding and decoding method, and apparatus for same
WO2010067976A3 (en) Signal separation method, and communication system and speech recognition system using the signal separation method
EP3107309A4 (en) Dual-microphone earphone and noise reduction processing method for audio signal in call

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880009929.3

Country of ref document: CN

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2939/KOLNP/2009

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 12528323

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2009551991

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2008725831

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2009135829

Country of ref document: RU

ENP Entry into the national phase

Ref document number: PI0807703

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20090826