DE69424172D1 - Verfahren zur erkennung eines gesprochenen wortes in anwesenheit störender sprache - Google Patents
Verfahren zur erkennung eines gesprochenen wortes in anwesenheit störender spracheInfo
- Publication number
- DE69424172D1 DE69424172D1 DE69424172T DE69424172T DE69424172D1 DE 69424172 D1 DE69424172 D1 DE 69424172D1 DE 69424172 T DE69424172 T DE 69424172T DE 69424172 T DE69424172 T DE 69424172T DE 69424172 D1 DE69424172 D1 DE 69424172D1
- Authority
- DE
- Germany
- Prior art keywords
- word
- residual signal
- spoken word
- buffer
- detection
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02087—Noise filtering the noise being separate speech, e.g. cocktail party
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/106,072 US5475791A (en) | 1993-08-13 | 1993-08-13 | Method for recognizing a spoken word in the presence of interfering speech |
PCT/US1994/009353 WO1995005655A2 (en) | 1993-08-13 | 1994-08-15 | Method for recognizing a spoken word in the presence of interfering speech |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69424172D1 true DE69424172D1 (de) | 2000-05-31 |
DE69424172T2 DE69424172T2 (de) | 2000-11-23 |
Family
ID=22309326
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69424172T Expired - Lifetime DE69424172T2 (de) | 1993-08-13 | 1994-08-15 | Verfahren zur erkennung eines gesprochenen wortes in anwesenheit störender sprache |
Country Status (8)
Country | Link |
---|---|
US (1) | US5475791A (de) |
EP (1) | EP0713597B1 (de) |
AT (1) | ATE192258T1 (de) |
AU (1) | AU687089B2 (de) |
CA (1) | CA2169447A1 (de) |
DE (1) | DE69424172T2 (de) |
ES (1) | ES2145148T3 (de) |
WO (1) | WO1995005655A2 (de) |
Families Citing this family (53)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69612480T2 (de) * | 1995-02-15 | 2001-10-11 | British Telecomm | Detektion von sprechaktivität |
DE19533541C1 (de) * | 1995-09-11 | 1997-03-27 | Daimler Benz Aerospace Ag | Verfahren zur automatischen Steuerung eines oder mehrerer Geräte durch Sprachkommandos oder per Sprachdialog im Echtzeitbetrieb und Vorrichtung zum Ausführen des Verfahrens |
JP2921472B2 (ja) * | 1996-03-15 | 1999-07-19 | 日本電気株式会社 | 音声および雑音の除去装置、音声認識装置 |
US5765130A (en) * | 1996-05-21 | 1998-06-09 | Applied Language Technologies, Inc. | Method and apparatus for facilitating speech barge-in in connection with voice recognition systems |
ES2172011T3 (es) * | 1996-11-28 | 2002-09-16 | British Telecomm | Aparato y procedimiento interactivo. |
US5953411A (en) * | 1996-12-18 | 1999-09-14 | Intel Corporation | Method and apparatus for maintaining audio sample correlation |
US5848130A (en) * | 1996-12-31 | 1998-12-08 | At&T Corp | System and method for enhanced intelligibility of voice messages |
US6775264B1 (en) | 1997-03-03 | 2004-08-10 | Webley Systems, Inc. | Computer, internet and telecommunications based network |
JPH10257583A (ja) * | 1997-03-06 | 1998-09-25 | Asahi Chem Ind Co Ltd | 音声処理装置およびその音声処理方法 |
GB2325112B (en) | 1997-05-06 | 2002-07-31 | Ibm | Voice processing system |
GB2325110B (en) | 1997-05-06 | 2002-10-16 | Ibm | Voice processing system |
DE19722784C1 (de) * | 1997-05-30 | 1999-01-14 | Deutsche Telekom Ag | Verfahren und Anordnung für ein sprachgesteuertes Kommunikationsendgerät mit akustischer Bedienerführung |
DE69820222T2 (de) * | 1997-10-07 | 2004-09-30 | Koninklijke Philips Electronics N.V. | Verfahren und vorrichtung zur aktivierung einer sprachgesteuerten funktion in einem mehrplatznetzwerk mittels sowohl sprecherabhängiger als auch sprecherunabhängiger spracherkennung |
US6167251A (en) * | 1998-10-02 | 2000-12-26 | Telespree Communications | Keyless portable cellular phone system having remote voice recognition |
US7274928B2 (en) | 1998-10-02 | 2007-09-25 | Telespree Communications | Portable cellular phone system having automatic initialization |
US6665645B1 (en) * | 1999-07-28 | 2003-12-16 | Matsushita Electric Industrial Co., Ltd. | Speech recognition apparatus for AV equipment |
US7117149B1 (en) * | 1999-08-30 | 2006-10-03 | Harman Becker Automotive Systems-Wavemakers, Inc. | Sound source classification |
US6963759B1 (en) | 1999-10-05 | 2005-11-08 | Fastmobile, Inc. | Speech recognition technique based on local interrupt detection |
US6868385B1 (en) | 1999-10-05 | 2005-03-15 | Yomobile, Inc. | Method and apparatus for the provision of information signals based upon speech recognition |
US6937977B2 (en) * | 1999-10-05 | 2005-08-30 | Fastmobile, Inc. | Method and apparatus for processing an input speech signal during presentation of an output audio signal |
GB9928011D0 (en) * | 1999-11-27 | 2000-01-26 | Ibm | Voice processing system |
US6721705B2 (en) | 2000-02-04 | 2004-04-13 | Webley Systems, Inc. | Robust voice browser system and voice activated device controller |
US7516190B2 (en) | 2000-02-04 | 2009-04-07 | Parus Holdings, Inc. | Personal voice-based information retrieval system |
US6744885B1 (en) * | 2000-02-24 | 2004-06-01 | Lucent Technologies Inc. | ASR talkoff suppressor |
WO2001075555A2 (en) * | 2000-03-06 | 2001-10-11 | Conita Technologies, Inc. | Personal virtual assistant |
WO2002015560A2 (en) * | 2000-08-12 | 2002-02-21 | Georgia Tech Research Corporation | A system and method for capturing an image |
US6725193B1 (en) * | 2000-09-13 | 2004-04-20 | Telefonaktiebolaget Lm Ericsson | Cancellation of loudspeaker words in speech recognition |
US20020173333A1 (en) * | 2001-05-18 | 2002-11-21 | Buchholz Dale R. | Method and apparatus for processing barge-in requests |
DE10158583A1 (de) * | 2001-11-29 | 2003-06-12 | Philips Intellectual Property | Verfahren zum Betrieb eines Barge-In-Dialogsystems |
US7328159B2 (en) * | 2002-01-15 | 2008-02-05 | Qualcomm Inc. | Interactive speech recognition apparatus and method with conditioned voice prompts |
US8046581B2 (en) | 2002-03-04 | 2011-10-25 | Telespree Communications | Method and apparatus for secure immediate wireless access in a telecommunications network |
US7197301B2 (en) | 2002-03-04 | 2007-03-27 | Telespree Communications | Method and apparatus for secure immediate wireless access in a telecommunications network |
US20030229491A1 (en) * | 2002-06-06 | 2003-12-11 | International Business Machines Corporation | Single sound fragment processing |
JP3727927B2 (ja) * | 2003-02-10 | 2005-12-21 | 株式会社東芝 | 話者照合装置 |
US20050071158A1 (en) * | 2003-09-25 | 2005-03-31 | Vocollect, Inc. | Apparatus and method for detecting user speech |
US7496387B2 (en) * | 2003-09-25 | 2009-02-24 | Vocollect, Inc. | Wireless headset for use in speech recognition environment |
US20060146652A1 (en) * | 2005-01-03 | 2006-07-06 | Sdi Technologies, Inc. | Sunset timer |
WO2006077626A1 (ja) * | 2005-01-18 | 2006-07-27 | Fujitsu Limited | 話速変換方法及び話速変換装置 |
US20070055514A1 (en) * | 2005-09-08 | 2007-03-08 | Beattie Valerie L | Intelligent tutoring feedback |
US8417185B2 (en) | 2005-12-16 | 2013-04-09 | Vocollect, Inc. | Wireless headset and method for robust voice data communication |
US7885419B2 (en) | 2006-02-06 | 2011-02-08 | Vocollect, Inc. | Headset terminal with speech functionality |
US7773767B2 (en) * | 2006-02-06 | 2010-08-10 | Vocollect, Inc. | Headset terminal with rear stability strap |
US8046221B2 (en) * | 2007-10-31 | 2011-10-25 | At&T Intellectual Property Ii, L.P. | Multi-state barge-in models for spoken dialog systems |
EP2107553B1 (de) * | 2008-03-31 | 2011-05-18 | Harman Becker Automotive Systems GmbH | Verfahren zur Erkennung einer Unterbrechung einer Sprachausgabe |
EP2148325B1 (de) * | 2008-07-22 | 2014-10-01 | Nuance Communications, Inc. | Verfahren zur Bestimmung der Anwesenheit einer gewollten Signalkomponente |
USD605629S1 (en) | 2008-09-29 | 2009-12-08 | Vocollect, Inc. | Headset |
US8160287B2 (en) | 2009-05-22 | 2012-04-17 | Vocollect, Inc. | Headset with adjustable headband |
US8438659B2 (en) | 2009-11-05 | 2013-05-07 | Vocollect, Inc. | Portable computing device and headset interface |
US9502050B2 (en) | 2012-06-10 | 2016-11-22 | Nuance Communications, Inc. | Noise dependent signal processing for in-car communication systems with multiple acoustic zones |
CN104704560B (zh) | 2012-09-04 | 2018-06-05 | 纽昂斯通讯公司 | 共振峰依赖的语音信号增强 |
US9613633B2 (en) | 2012-10-30 | 2017-04-04 | Nuance Communications, Inc. | Speech enhancement |
CN109903758B (zh) | 2017-12-08 | 2023-06-23 | 阿里巴巴集团控股有限公司 | 音频处理方法、装置及终端设备 |
CN111048096B (zh) * | 2019-12-24 | 2022-07-26 | 大众问问(北京)信息科技有限公司 | 一种语音信号处理方法、装置及终端 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5852695A (ja) * | 1981-09-25 | 1983-03-28 | 日産自動車株式会社 | 車両用音声検出装置 |
US4645883A (en) * | 1984-05-09 | 1987-02-24 | Communications Satellite Corporation | Double talk and line noise detector for a echo canceller |
US4914692A (en) * | 1987-12-29 | 1990-04-03 | At&T Bell Laboratories | Automatic speech recognition using echo cancellation |
US5125024A (en) * | 1990-03-28 | 1992-06-23 | At&T Bell Laboratories | Voice response unit |
US5155760A (en) * | 1991-06-26 | 1992-10-13 | At&T Bell Laboratories | Voice messaging system with voice activated prompt interrupt |
-
1993
- 1993-08-13 US US08/106,072 patent/US5475791A/en not_active Expired - Lifetime
-
1994
- 1994-08-15 EP EP94925293A patent/EP0713597B1/de not_active Expired - Lifetime
- 1994-08-15 DE DE69424172T patent/DE69424172T2/de not_active Expired - Lifetime
- 1994-08-15 ES ES94925293T patent/ES2145148T3/es not_active Expired - Lifetime
- 1994-08-15 CA CA002169447A patent/CA2169447A1/en not_active Abandoned
- 1994-08-15 WO PCT/US1994/009353 patent/WO1995005655A2/en active IP Right Grant
- 1994-08-15 AT AT94925293T patent/ATE192258T1/de not_active IP Right Cessation
- 1994-08-15 AU AU75273/94A patent/AU687089B2/en not_active Ceased
Also Published As
Publication number | Publication date |
---|---|
US5475791A (en) | 1995-12-12 |
WO1995005655A2 (en) | 1995-02-23 |
AU687089B2 (en) | 1998-02-19 |
EP0713597A4 (de) | 1998-01-28 |
AU7527394A (en) | 1995-03-14 |
ES2145148T3 (es) | 2000-07-01 |
CA2169447A1 (en) | 1995-02-23 |
WO1995005655A3 (en) | 1995-03-23 |
EP0713597A1 (de) | 1996-05-29 |
DE69424172T2 (de) | 2000-11-23 |
ATE192258T1 (de) | 2000-05-15 |
EP0713597B1 (de) | 2000-04-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69424172T2 (de) | Verfahren zur erkennung eines gesprochenen wortes in anwesenheit störender sprache | |
US5369726A (en) | Speech recognition circuitry employing nonlinear processing speech element modeling and phoneme estimation | |
EP0664535A3 (de) | Spracherkennungssystem für zusammenhängende Sätze mit grossem Wortschatz sowie Verfahren zur Sprachdarstellung mittels evolutionärer Grammatik als kontextfreie Grammatik. | |
US6098040A (en) | Method and apparatus for providing an improved feature set in speech recognition by performing noise cancellation and background masking | |
MX9505299A (es) | Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion. | |
DE3372552D1 (en) | Speech recognition system | |
WO1995005655B1 (en) | Method for recognizing a spoken word in the presence of interfering speech | |
FR2522179B1 (fr) | Procede et appareil de reconnaissance de paroles permettant de reconnaitre des phonemes particuliers du signal vocal quelle que soit la personne qui parle | |
ATE314718T1 (de) | Srecherangepasste spracherkennung | |
DE60309142D1 (de) | Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells | |
JPS6413595A (en) | Voice recognition circuit using estimate of phoneme | |
DE3275779D1 (en) | Recognition of speech or speech-like sounds | |
DE59904741D1 (de) | Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner | |
CN107039035A (zh) | 一种语音起始点和终止点的检测方法 | |
CA2192397A1 (en) | Method and system for performing speech recognition | |
Mary et al. | Autoassociative neural network models for language identification | |
ATE279003T1 (de) | Verfahren und vorrichtung zur integritätsprüfung von benutzeroberflächen sprachgesteuerter geräte | |
Sudhakar et al. | Automatic speech segmentation to improve speech synthesis performance | |
US5765124A (en) | Time-varying feature space preprocessing procedure for telephone based speech recognition | |
Ali et al. | Robust classification of stop consonants using auditory-based speech processing | |
Hahn et al. | An improved speech detection algorithm for isolated Korean utterances | |
Hanson et al. | Speech enhancement with harmonic synthesis | |
Hoshimi et al. | Speaker independent speech recognition method using training speech from a small number of speakers | |
Mayora-Ibarra et al. | Time-domain segmentation and labelling of speech with fuzzy-logic post-correction rules | |
SU781882A2 (ru) | Устройство дл распознавани слов |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition | ||
8327 | Change in the person/name/address of the patent owner |
Owner name: SCANSOFT, INC. (N.D.GES.D. STAATES DELAWARE), PEAB |
|
8328 | Change in the person/name/address of the agent |
Representative=s name: TIEDTKE, BUEHLING, KINNE & PARTNER GBR, 80336 MUENCHEN |