CN1317783A - 语音识别系统中确定非目标语言的方法和装置 - Google Patents
语音识别系统中确定非目标语言的方法和装置 Download PDFInfo
- Publication number
- CN1317783A CN1317783A CN01116330.5A CN01116330A CN1317783A CN 1317783 A CN1317783 A CN 1317783A CN 01116330 A CN01116330 A CN 01116330A CN 1317783 A CN1317783 A CN 1317783A
- Authority
- CN
- China
- Prior art keywords
- target language
- scoring
- model
- language
- audio stream
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 33
- 238000006243 chemical reaction Methods 0.000 claims description 16
- 238000012549 training Methods 0.000 claims description 15
- 230000008569 process Effects 0.000 claims description 14
- 230000033764 rhythmic process Effects 0.000 claims description 6
- 230000008859 change Effects 0.000 claims description 3
- 238000013518 transcription Methods 0.000 abstract 1
- 230000035897 transcription Effects 0.000 abstract 1
- 238000005516 engineering process Methods 0.000 description 7
- 238000004364 calculation method Methods 0.000 description 6
- 238000012545 processing Methods 0.000 description 5
- 238000013500 data storage Methods 0.000 description 4
- 239000004744 fabric Substances 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 238000010276 construction Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 230000004308 accommodation Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 244000309464 bull Species 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000007562 laser obscuration time method Methods 0.000 description 1
- 230000002045 lasting effect Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (17)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/544678 | 2000-04-07 | ||
US09/544,678 US6738745B1 (en) | 2000-04-07 | 2000-04-07 | Methods and apparatus for identifying a non-target language in a speech recognition system |
US09/544,678 | 2000-04-07 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1317783A true CN1317783A (zh) | 2001-10-17 |
CN1211779C CN1211779C (zh) | 2005-07-20 |
Family
ID=24173130
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN01116330.5A Expired - Fee Related CN1211779C (zh) | 2000-04-07 | 2001-04-06 | 语音识别系统中确定非目标语言的方法和装置 |
Country Status (3)
Country | Link |
---|---|
US (1) | US6738745B1 (zh) |
CN (1) | CN1211779C (zh) |
DE (1) | DE10111056B4 (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105917405A (zh) * | 2014-01-17 | 2016-08-31 | 微软技术许可有限责任公司 | 外源性大词汇量模型到基于规则的语音识别的合并 |
CN107622768A (zh) * | 2016-07-13 | 2018-01-23 | 谷歌公司 | 音频截剪器 |
US10749989B2 (en) | 2014-04-01 | 2020-08-18 | Microsoft Technology Licensing Llc | Hybrid client/server architecture for parallel processing |
US10885918B2 (en) | 2013-09-19 | 2021-01-05 | Microsoft Technology Licensing, Llc | Speech recognition using phoneme matching |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2002212992A1 (en) * | 2000-09-29 | 2002-04-08 | Lernout And Hauspie Speech Products N.V. | Corpus-based prosody translation system |
US20020077833A1 (en) * | 2000-12-20 | 2002-06-20 | Arons Barry M. | Transcription and reporting system |
US7191116B2 (en) * | 2001-06-19 | 2007-03-13 | Oracle International Corporation | Methods and systems for determining a language of a document |
US7437289B2 (en) * | 2001-08-16 | 2008-10-14 | International Business Machines Corporation | Methods and apparatus for the systematic adaptation of classification systems from sparse adaptation data |
TW517221B (en) * | 2001-08-24 | 2003-01-11 | Ind Tech Res Inst | Voice recognition system |
GB2409087A (en) * | 2003-12-12 | 2005-06-15 | Ibm | Computer generated prompting |
US8036893B2 (en) | 2004-07-22 | 2011-10-11 | Nuance Communications, Inc. | Method and system for identifying and correcting accent-induced speech recognition difficulties |
US7725318B2 (en) * | 2004-07-30 | 2010-05-25 | Nice Systems Inc. | System and method for improving the accuracy of audio searching |
US8924212B1 (en) | 2005-08-26 | 2014-12-30 | At&T Intellectual Property Ii, L.P. | System and method for robust access and entry to large structured data using voice form-filling |
US20070106646A1 (en) * | 2005-11-09 | 2007-05-10 | Bbnt Solutions Llc | User-directed navigation of multimedia search results |
US20070118873A1 (en) * | 2005-11-09 | 2007-05-24 | Bbnt Solutions Llc | Methods and apparatus for merging media content |
US7801910B2 (en) * | 2005-11-09 | 2010-09-21 | Ramp Holdings, Inc. | Method and apparatus for timed tagging of media content |
US9697231B2 (en) * | 2005-11-09 | 2017-07-04 | Cxense Asa | Methods and apparatus for providing virtual media channels based on media search |
US9697230B2 (en) | 2005-11-09 | 2017-07-04 | Cxense Asa | Methods and apparatus for dynamic presentation of advertising, factual, and informational content using enhanced metadata in search-driven media applications |
US20070106685A1 (en) * | 2005-11-09 | 2007-05-10 | Podzinger Corp. | Method and apparatus for updating speech recognition databases and reindexing audio and video content using the same |
US8583416B2 (en) * | 2007-12-27 | 2013-11-12 | Fluential, Llc | Robust information extraction from utterances |
US9436759B2 (en) | 2007-12-27 | 2016-09-06 | Nant Holdings Ip, Llc | Robust information extraction from utterances |
US8312022B2 (en) | 2008-03-21 | 2012-11-13 | Ramp Holdings, Inc. | Search engine optimization |
US7472061B1 (en) * | 2008-03-31 | 2008-12-30 | International Business Machines Corporation | Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations |
US8977645B2 (en) * | 2009-01-16 | 2015-03-10 | Google Inc. | Accessing a search interface in a structured presentation |
US8484218B2 (en) * | 2011-04-21 | 2013-07-09 | Google Inc. | Translating keywords from a source language to a target language |
US9129605B2 (en) * | 2012-03-30 | 2015-09-08 | Src, Inc. | Automated voice and speech labeling |
US9495591B2 (en) * | 2012-04-13 | 2016-11-15 | Qualcomm Incorporated | Object recognition using multi-modal matching scheme |
US9190055B1 (en) * | 2013-03-14 | 2015-11-17 | Amazon Technologies, Inc. | Named entity recognition with personalized models |
US9390708B1 (en) * | 2013-05-28 | 2016-07-12 | Amazon Technologies, Inc. | Low latency and memory efficient keywork spotting |
CN111078937B (zh) * | 2019-12-27 | 2021-08-10 | 北京世纪好未来教育科技有限公司 | 语音信息检索方法、装置、设备和计算机可读存储介质 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6391699A (ja) * | 1986-10-03 | 1988-04-22 | 株式会社リコー | 音声認識方式 |
US5586215A (en) * | 1992-05-26 | 1996-12-17 | Ricoh Corporation | Neural network acoustic and visual speech recognition system |
JP3034773B2 (ja) * | 1994-12-27 | 2000-04-17 | シャープ株式会社 | 電子通訳機 |
CA2160184A1 (en) * | 1994-12-29 | 1996-06-30 | James Lee Hieronymus | Language identification with phonological and lexical models |
US5913185A (en) * | 1996-08-19 | 1999-06-15 | International Business Machines Corporation | Determining a natural language shift in a computer document |
US6122613A (en) * | 1997-01-30 | 2000-09-19 | Dragon Systems, Inc. | Speech recognition using multiple recognizers (selectively) applied to the same input sample |
US6047251A (en) * | 1997-09-15 | 2000-04-04 | Caere Corporation | Automatic language identification system for multilingual optical character recognition |
US6061646A (en) * | 1997-12-18 | 2000-05-09 | International Business Machines Corp. | Kiosk for multiple spoken languages |
US6085160A (en) * | 1998-07-10 | 2000-07-04 | Lernout & Hauspie Speech Products N.V. | Language independent speech recognition |
-
2000
- 2000-04-07 US US09/544,678 patent/US6738745B1/en not_active Expired - Lifetime
-
2001
- 2001-03-08 DE DE10111056A patent/DE10111056B4/de not_active Expired - Fee Related
- 2001-04-06 CN CN01116330.5A patent/CN1211779C/zh not_active Expired - Fee Related
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10885918B2 (en) | 2013-09-19 | 2021-01-05 | Microsoft Technology Licensing, Llc | Speech recognition using phoneme matching |
CN105917405A (zh) * | 2014-01-17 | 2016-08-31 | 微软技术许可有限责任公司 | 外源性大词汇量模型到基于规则的语音识别的合并 |
US10311878B2 (en) | 2014-01-17 | 2019-06-04 | Microsoft Technology Licensing, Llc | Incorporating an exogenous large-vocabulary model into rule-based speech recognition |
CN105917405B (zh) * | 2014-01-17 | 2019-11-05 | 微软技术许可有限责任公司 | 外源性大词汇量模型到基于规则的语音识别的合并 |
US10749989B2 (en) | 2014-04-01 | 2020-08-18 | Microsoft Technology Licensing Llc | Hybrid client/server architecture for parallel processing |
CN107622768A (zh) * | 2016-07-13 | 2018-01-23 | 谷歌公司 | 音频截剪器 |
CN107622768B (zh) * | 2016-07-13 | 2021-09-28 | 谷歌有限责任公司 | 音频截剪器 |
Also Published As
Publication number | Publication date |
---|---|
DE10111056B4 (de) | 2005-11-10 |
CN1211779C (zh) | 2005-07-20 |
DE10111056A1 (de) | 2001-10-18 |
US6738745B1 (en) | 2004-05-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1211779C (zh) | 语音识别系统中确定非目标语言的方法和装置 | |
US7475015B2 (en) | Semantic language modeling and confidence measurement | |
US8793130B2 (en) | Confidence measure generation for speech related searching | |
EP1462950B1 (en) | Method for language modelling | |
US5797123A (en) | Method of key-phase detection and verification for flexible speech understanding | |
CA2508946C (en) | Method and apparatus for natural language call routing using confidence scores | |
EP1922653B1 (en) | Word clustering for input data | |
Kawahara et al. | Flexible speech understanding based on combined key-phrase detection and verification | |
Hazen et al. | A comparison and combination of methods for OOV word detection and word confidence scoring | |
US20020173955A1 (en) | Method of speech recognition by presenting N-best word candidates | |
US20030191625A1 (en) | Method and system for creating a named entity language model | |
US20020087311A1 (en) | Computer-implemented dynamic language model generation method and system | |
US20130289987A1 (en) | Negative Example (Anti-Word) Based Performance Improvement For Speech Recognition | |
Raymond et al. | On the use of finite state transducers for semantic interpretation | |
Kawahara et al. | Key-phrase detection and verification for flexible speech understanding | |
US20050038647A1 (en) | Program product, method and system for detecting reduced speech | |
Gandhe et al. | Using web text to improve keyword spotting in speech | |
Kawahara et al. | Combining key-phrase detection and subword-based verification for flexible speech understanding | |
Rose | Word spotting from continuous speech utterances | |
Decadt et al. | Transcription of out-of-vocabulary words in large vocabulary speech recognition based on phoneme-to-grapheme conversion | |
US20030069730A1 (en) | Meaning token dictionary for automatic speech recognition | |
Imperl et al. | Clustering of triphones using phoneme similarity estimation for the definition of a multilingual set of triphones | |
Raymond et al. | Belief confirmation in spoken dialog systems using confidence measures | |
Bocchieri et al. | The 1994 at&t atis chronus recognizer | |
Kellner | Initial language models for spoken dialogue systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: WEICHA COMMUNICATION CO.,LTD. Free format text: FORMER OWNER: INTERNATIONAL BUSINESS MACHINE CORP. Effective date: 20090731 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20090731 Address after: Massachusetts, USA Patentee after: Nuance Communications Inc. Address before: American New York Patentee before: International Business Machines Corp. |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20050720 Termination date: 20170406 |
|
CF01 | Termination of patent right due to non-payment of annual fee |