DE69424172D1 - Verfahren zur erkennung eines gesprochenen wortes in anwesenheit störender sprache - Google Patents

Verfahren zur erkennung eines gesprochenen wortes in anwesenheit störender sprache

Info

Publication number
DE69424172D1
DE69424172D1 DE69424172T DE69424172T DE69424172D1 DE 69424172 D1 DE69424172 D1 DE 69424172D1 DE 69424172 T DE69424172 T DE 69424172T DE 69424172 T DE69424172 T DE 69424172T DE 69424172 D1 DE69424172 D1 DE 69424172D1
Authority
DE
Germany
Prior art keywords
word
residual signal
spoken word
buffer
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69424172T
Other languages
English (en)
Other versions
DE69424172T2 (de
Inventor
B Schalk
Fadi Kaake
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nuance Communications Inc
Original Assignee
VCS Industries Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by VCS Industries Inc filed Critical VCS Industries Inc
Publication of DE69424172D1 publication Critical patent/DE69424172D1/de
Application granted granted Critical
Publication of DE69424172T2 publication Critical patent/DE69424172T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02087Noise filtering the noise being separate speech, e.g. cocktail party
DE69424172T 1993-08-13 1994-08-15 Verfahren zur erkennung eines gesprochenen wortes in anwesenheit störender sprache Expired - Lifetime DE69424172T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/106,072 US5475791A (en) 1993-08-13 1993-08-13 Method for recognizing a spoken word in the presence of interfering speech
PCT/US1994/009353 WO1995005655A2 (en) 1993-08-13 1994-08-15 Method for recognizing a spoken word in the presence of interfering speech

Publications (2)

Publication Number Publication Date
DE69424172D1 true DE69424172D1 (de) 2000-05-31
DE69424172T2 DE69424172T2 (de) 2000-11-23

Family

ID=22309326

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69424172T Expired - Lifetime DE69424172T2 (de) 1993-08-13 1994-08-15 Verfahren zur erkennung eines gesprochenen wortes in anwesenheit störender sprache

Country Status (8)

Country Link
US (1) US5475791A (de)
EP (1) EP0713597B1 (de)
AT (1) ATE192258T1 (de)
AU (1) AU687089B2 (de)
CA (1) CA2169447A1 (de)
DE (1) DE69424172T2 (de)
ES (1) ES2145148T3 (de)
WO (1) WO1995005655A2 (de)

Families Citing this family (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69612480T2 (de) * 1995-02-15 2001-10-11 British Telecomm Detektion von sprechaktivität
DE19533541C1 (de) * 1995-09-11 1997-03-27 Daimler Benz Aerospace Ag Verfahren zur automatischen Steuerung eines oder mehrerer Geräte durch Sprachkommandos oder per Sprachdialog im Echtzeitbetrieb und Vorrichtung zum Ausführen des Verfahrens
JP2921472B2 (ja) * 1996-03-15 1999-07-19 日本電気株式会社 音声および雑音の除去装置、音声認識装置
US5765130A (en) * 1996-05-21 1998-06-09 Applied Language Technologies, Inc. Method and apparatus for facilitating speech barge-in in connection with voice recognition systems
ES2172011T3 (es) * 1996-11-28 2002-09-16 British Telecomm Aparato y procedimiento interactivo.
US5953411A (en) * 1996-12-18 1999-09-14 Intel Corporation Method and apparatus for maintaining audio sample correlation
US5848130A (en) * 1996-12-31 1998-12-08 At&T Corp System and method for enhanced intelligibility of voice messages
US6775264B1 (en) 1997-03-03 2004-08-10 Webley Systems, Inc. Computer, internet and telecommunications based network
JPH10257583A (ja) * 1997-03-06 1998-09-25 Asahi Chem Ind Co Ltd 音声処理装置およびその音声処理方法
GB2325112B (en) 1997-05-06 2002-07-31 Ibm Voice processing system
GB2325110B (en) 1997-05-06 2002-10-16 Ibm Voice processing system
DE19722784C1 (de) * 1997-05-30 1999-01-14 Deutsche Telekom Ag Verfahren und Anordnung für ein sprachgesteuertes Kommunikationsendgerät mit akustischer Bedienerführung
DE69820222T2 (de) * 1997-10-07 2004-09-30 Koninklijke Philips Electronics N.V. Verfahren und vorrichtung zur aktivierung einer sprachgesteuerten funktion in einem mehrplatznetzwerk mittels sowohl sprecherabhängiger als auch sprecherunabhängiger spracherkennung
US6167251A (en) * 1998-10-02 2000-12-26 Telespree Communications Keyless portable cellular phone system having remote voice recognition
US7274928B2 (en) 1998-10-02 2007-09-25 Telespree Communications Portable cellular phone system having automatic initialization
US6665645B1 (en) * 1999-07-28 2003-12-16 Matsushita Electric Industrial Co., Ltd. Speech recognition apparatus for AV equipment
US7117149B1 (en) * 1999-08-30 2006-10-03 Harman Becker Automotive Systems-Wavemakers, Inc. Sound source classification
US6963759B1 (en) 1999-10-05 2005-11-08 Fastmobile, Inc. Speech recognition technique based on local interrupt detection
US6868385B1 (en) 1999-10-05 2005-03-15 Yomobile, Inc. Method and apparatus for the provision of information signals based upon speech recognition
US6937977B2 (en) * 1999-10-05 2005-08-30 Fastmobile, Inc. Method and apparatus for processing an input speech signal during presentation of an output audio signal
GB9928011D0 (en) * 1999-11-27 2000-01-26 Ibm Voice processing system
US6721705B2 (en) 2000-02-04 2004-04-13 Webley Systems, Inc. Robust voice browser system and voice activated device controller
US7516190B2 (en) 2000-02-04 2009-04-07 Parus Holdings, Inc. Personal voice-based information retrieval system
US6744885B1 (en) * 2000-02-24 2004-06-01 Lucent Technologies Inc. ASR talkoff suppressor
WO2001075555A2 (en) * 2000-03-06 2001-10-11 Conita Technologies, Inc. Personal virtual assistant
WO2002015560A2 (en) * 2000-08-12 2002-02-21 Georgia Tech Research Corporation A system and method for capturing an image
US6725193B1 (en) * 2000-09-13 2004-04-20 Telefonaktiebolaget Lm Ericsson Cancellation of loudspeaker words in speech recognition
US20020173333A1 (en) * 2001-05-18 2002-11-21 Buchholz Dale R. Method and apparatus for processing barge-in requests
DE10158583A1 (de) * 2001-11-29 2003-06-12 Philips Intellectual Property Verfahren zum Betrieb eines Barge-In-Dialogsystems
US7328159B2 (en) * 2002-01-15 2008-02-05 Qualcomm Inc. Interactive speech recognition apparatus and method with conditioned voice prompts
US8046581B2 (en) 2002-03-04 2011-10-25 Telespree Communications Method and apparatus for secure immediate wireless access in a telecommunications network
US7197301B2 (en) 2002-03-04 2007-03-27 Telespree Communications Method and apparatus for secure immediate wireless access in a telecommunications network
US20030229491A1 (en) * 2002-06-06 2003-12-11 International Business Machines Corporation Single sound fragment processing
JP3727927B2 (ja) * 2003-02-10 2005-12-21 株式会社東芝 話者照合装置
US20050071158A1 (en) * 2003-09-25 2005-03-31 Vocollect, Inc. Apparatus and method for detecting user speech
US7496387B2 (en) * 2003-09-25 2009-02-24 Vocollect, Inc. Wireless headset for use in speech recognition environment
US20060146652A1 (en) * 2005-01-03 2006-07-06 Sdi Technologies, Inc. Sunset timer
WO2006077626A1 (ja) * 2005-01-18 2006-07-27 Fujitsu Limited 話速変換方法及び話速変換装置
US20070055514A1 (en) * 2005-09-08 2007-03-08 Beattie Valerie L Intelligent tutoring feedback
US8417185B2 (en) 2005-12-16 2013-04-09 Vocollect, Inc. Wireless headset and method for robust voice data communication
US7885419B2 (en) 2006-02-06 2011-02-08 Vocollect, Inc. Headset terminal with speech functionality
US7773767B2 (en) * 2006-02-06 2010-08-10 Vocollect, Inc. Headset terminal with rear stability strap
US8046221B2 (en) * 2007-10-31 2011-10-25 At&T Intellectual Property Ii, L.P. Multi-state barge-in models for spoken dialog systems
EP2107553B1 (de) * 2008-03-31 2011-05-18 Harman Becker Automotive Systems GmbH Verfahren zur Erkennung einer Unterbrechung einer Sprachausgabe
EP2148325B1 (de) * 2008-07-22 2014-10-01 Nuance Communications, Inc. Verfahren zur Bestimmung der Anwesenheit einer gewollten Signalkomponente
USD605629S1 (en) 2008-09-29 2009-12-08 Vocollect, Inc. Headset
US8160287B2 (en) 2009-05-22 2012-04-17 Vocollect, Inc. Headset with adjustable headband
US8438659B2 (en) 2009-11-05 2013-05-07 Vocollect, Inc. Portable computing device and headset interface
US9502050B2 (en) 2012-06-10 2016-11-22 Nuance Communications, Inc. Noise dependent signal processing for in-car communication systems with multiple acoustic zones
CN104704560B (zh) 2012-09-04 2018-06-05 纽昂斯通讯公司 共振峰依赖的语音信号增强
US9613633B2 (en) 2012-10-30 2017-04-04 Nuance Communications, Inc. Speech enhancement
CN109903758B (zh) 2017-12-08 2023-06-23 阿里巴巴集团控股有限公司 音频处理方法、装置及终端设备
CN111048096B (zh) * 2019-12-24 2022-07-26 大众问问(北京)信息科技有限公司 一种语音信号处理方法、装置及终端

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5852695A (ja) * 1981-09-25 1983-03-28 日産自動車株式会社 車両用音声検出装置
US4645883A (en) * 1984-05-09 1987-02-24 Communications Satellite Corporation Double talk and line noise detector for a echo canceller
US4914692A (en) * 1987-12-29 1990-04-03 At&T Bell Laboratories Automatic speech recognition using echo cancellation
US5125024A (en) * 1990-03-28 1992-06-23 At&T Bell Laboratories Voice response unit
US5155760A (en) * 1991-06-26 1992-10-13 At&T Bell Laboratories Voice messaging system with voice activated prompt interrupt

Also Published As

Publication number Publication date
US5475791A (en) 1995-12-12
WO1995005655A2 (en) 1995-02-23
AU687089B2 (en) 1998-02-19
EP0713597A4 (de) 1998-01-28
AU7527394A (en) 1995-03-14
ES2145148T3 (es) 2000-07-01
CA2169447A1 (en) 1995-02-23
WO1995005655A3 (en) 1995-03-23
EP0713597A1 (de) 1996-05-29
DE69424172T2 (de) 2000-11-23
ATE192258T1 (de) 2000-05-15
EP0713597B1 (de) 2000-04-26

Similar Documents

Publication Publication Date Title
DE69424172T2 (de) Verfahren zur erkennung eines gesprochenen wortes in anwesenheit störender sprache
US5369726A (en) Speech recognition circuitry employing nonlinear processing speech element modeling and phoneme estimation
EP0664535A3 (de) Spracherkennungssystem für zusammenhängende Sätze mit grossem Wortschatz sowie Verfahren zur Sprachdarstellung mittels evolutionärer Grammatik als kontextfreie Grammatik.
US6098040A (en) Method and apparatus for providing an improved feature set in speech recognition by performing noise cancellation and background masking
MX9505299A (es) Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion.
DE3372552D1 (en) Speech recognition system
WO1995005655B1 (en) Method for recognizing a spoken word in the presence of interfering speech
FR2522179B1 (fr) Procede et appareil de reconnaissance de paroles permettant de reconnaitre des phonemes particuliers du signal vocal quelle que soit la personne qui parle
ATE314718T1 (de) Srecherangepasste spracherkennung
DE60309142D1 (de) Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells
JPS6413595A (en) Voice recognition circuit using estimate of phoneme
DE3275779D1 (en) Recognition of speech or speech-like sounds
DE59904741D1 (de) Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner
CN107039035A (zh) 一种语音起始点和终止点的检测方法
CA2192397A1 (en) Method and system for performing speech recognition
Mary et al. Autoassociative neural network models for language identification
ATE279003T1 (de) Verfahren und vorrichtung zur integritätsprüfung von benutzeroberflächen sprachgesteuerter geräte
Sudhakar et al. Automatic speech segmentation to improve speech synthesis performance
US5765124A (en) Time-varying feature space preprocessing procedure for telephone based speech recognition
Ali et al. Robust classification of stop consonants using auditory-based speech processing
Hahn et al. An improved speech detection algorithm for isolated Korean utterances
Hanson et al. Speech enhancement with harmonic synthesis
Hoshimi et al. Speaker independent speech recognition method using training speech from a small number of speakers
Mayora-Ibarra et al. Time-domain segmentation and labelling of speech with fuzzy-logic post-correction rules
SU781882A2 (ru) Устройство дл распознавани слов

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8327 Change in the person/name/address of the patent owner

Owner name: SCANSOFT, INC. (N.D.GES.D. STAATES DELAWARE), PEAB

8328 Change in the person/name/address of the agent

Representative=s name: TIEDTKE, BUEHLING, KINNE & PARTNER GBR, 80336 MUENCHEN