DE69427083T2 - Spracherkennungssystem für mehrere sprachen - Google Patents
Spracherkennungssystem für mehrere sprachenInfo
- Publication number
- DE69427083T2 DE69427083T2 DE69427083T DE69427083T DE69427083T2 DE 69427083 T2 DE69427083 T2 DE 69427083T2 DE 69427083 T DE69427083 T DE 69427083T DE 69427083 T DE69427083 T DE 69427083T DE 69427083 T2 DE69427083 T2 DE 69427083T2
- Authority
- DE
- Germany
- Prior art keywords
- voice recognition
- recognition system
- multiple languages
- spectrum
- phones
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0638—Interactive procedures
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US9074793A | 1993-07-13 | 1993-07-13 | |
PCT/US1994/007742 WO1995002879A1 (en) | 1993-07-13 | 1994-07-12 | Multi-language speech recognition system |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69427083D1 DE69427083D1 (de) | 2001-05-17 |
DE69427083T2 true DE69427083T2 (de) | 2001-12-06 |
Family
ID=22224117
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69427083T Expired - Fee Related DE69427083T2 (de) | 1993-07-13 | 1994-07-12 | Spracherkennungssystem für mehrere sprachen |
Country Status (8)
Country | Link |
---|---|
US (1) | US5758023A (de) |
EP (1) | EP0708958B1 (de) |
JP (1) | JPH09500223A (de) |
AT (1) | ATE200590T1 (de) |
AU (1) | AU682380B2 (de) |
CA (1) | CA2167200A1 (de) |
DE (1) | DE69427083T2 (de) |
WO (1) | WO1995002879A1 (de) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7162424B2 (en) | 2001-04-26 | 2007-01-09 | Siemens Aktiengesellschaft | Method and system for defining a sequence of sound modules for synthesis of a speech signal in a tonal language |
US7949517B2 (en) | 2006-12-01 | 2011-05-24 | Deutsche Telekom Ag | Dialogue system with logical evaluation for language identification in speech recognition |
Families Citing this family (74)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5790754A (en) * | 1994-10-21 | 1998-08-04 | Sensory Circuits, Inc. | Speech recognition apparatus for consumer electronic applications |
DE19636739C1 (de) * | 1996-09-10 | 1997-07-03 | Siemens Ag | Verfahren zur Mehrsprachenverwendung eines hidden Markov Lautmodelles in einem Spracherkennungssystem |
EP0920692B1 (de) * | 1996-12-24 | 2003-03-26 | Cellon France SAS | Verfahren zum trainieren eines spracherkennungssystems und ein gerät zum praktizieren des verfahrens, insbesondere eines tragbaren telefons |
US6061646A (en) * | 1997-12-18 | 2000-05-09 | International Business Machines Corp. | Kiosk for multiple spoken languages |
US6085160A (en) * | 1998-07-10 | 2000-07-04 | Lernout & Hauspie Speech Products N.V. | Language independent speech recognition |
WO2000022609A1 (en) * | 1998-10-13 | 2000-04-20 | Telefonaktiebolaget Lm Ericsson (Publ) | Speech recognition and control system and telephone |
US6188984B1 (en) * | 1998-11-17 | 2001-02-13 | Fonix Corporation | Method and system for syllable parsing |
US6377913B1 (en) * | 1999-08-13 | 2002-04-23 | International Business Machines Corporation | Method and system for multi-client access to a dialog system |
JP4292646B2 (ja) | 1999-09-16 | 2009-07-08 | 株式会社デンソー | ユーザインタフェース装置、ナビゲーションシステム、情報処理装置及び記録媒体 |
US6963837B1 (en) * | 1999-10-06 | 2005-11-08 | Multimodal Technologies, Inc. | Attribute-based word modeling |
US9076448B2 (en) | 1999-11-12 | 2015-07-07 | Nuance Communications, Inc. | Distributed real time speech recognition system |
US7725307B2 (en) | 1999-11-12 | 2010-05-25 | Phoenix Solutions, Inc. | Query engine for processing voice based queries including semantic decoding |
US7392185B2 (en) | 1999-11-12 | 2008-06-24 | Phoenix Solutions, Inc. | Speech based learning/training system using semantic decoding |
US7050977B1 (en) | 1999-11-12 | 2006-05-23 | Phoenix Solutions, Inc. | Speech-enabled server for internet website and method |
DE10018134A1 (de) * | 2000-04-12 | 2001-10-18 | Siemens Ag | Verfahren und Vorrichtung zum Bestimmen prosodischer Markierungen |
JP3339579B2 (ja) * | 2000-10-04 | 2002-10-28 | 株式会社鷹山 | 電話装置 |
EP1217610A1 (de) * | 2000-11-28 | 2002-06-26 | Siemens Aktiengesellschaft | Verfahren und System zur multilingualen Spracherkennung |
EP1217609A3 (de) * | 2000-12-22 | 2004-02-25 | Hewlett-Packard Company | Spracherkennung |
US20020095274A1 (en) * | 2001-01-17 | 2002-07-18 | Richards Alfred N. | Pool cover design verifying system |
US7107215B2 (en) * | 2001-04-16 | 2006-09-12 | Sakhr Software Company | Determining a compact model to transcribe the arabic language acoustically in a well defined basic phonetic study |
US20030092423A1 (en) * | 2001-11-09 | 2003-05-15 | Roger Boivin | System and method to allow law enforcement agencies to track and monitor calls made on recyclable/disposable mobile telephones |
US7295982B1 (en) * | 2001-11-19 | 2007-11-13 | At&T Corp. | System and method for automatic verification of the understandability of speech |
US6990445B2 (en) * | 2001-12-17 | 2006-01-24 | Xl8 Systems, Inc. | System and method for speech recognition and transcription |
WO2003060877A1 (de) * | 2002-01-17 | 2003-07-24 | Siemens Aktiengesellschaft | Betriebsverfahren eines automatischen spracherkenners zur sprecherunabhängigen spracherkennung von worten aus verschiedenen sprachen und automatischer spracherkenner |
US7286993B2 (en) * | 2002-01-31 | 2007-10-23 | Product Discovery, Inc. | Holographic speech translation system and method |
US20030208451A1 (en) * | 2002-05-03 | 2003-11-06 | Jim-Shih Liaw | Artificial neural systems with dynamic synapses |
US7010488B2 (en) * | 2002-05-09 | 2006-03-07 | Oregon Health & Science University | System and method for compressing concatenative acoustic inventories for speech synthesis |
US20040030555A1 (en) * | 2002-08-12 | 2004-02-12 | Oregon Health & Science University | System and method for concatenating acoustic contours for speech synthesis |
DE10256935A1 (de) * | 2002-12-05 | 2004-07-01 | Siemens Ag | Auswahl der Benutzersprache an einem rein akustisch gesteuerten Telefon |
KR100486735B1 (ko) * | 2003-02-28 | 2005-05-03 | 삼성전자주식회사 | 최적구획 분류신경망 구성방법과 최적구획 분류신경망을이용한 자동 레이블링방법 및 장치 |
US7321852B2 (en) * | 2003-10-28 | 2008-01-22 | International Business Machines Corporation | System and method for transcribing audio files of various languages |
US8036893B2 (en) * | 2004-07-22 | 2011-10-11 | Nuance Communications, Inc. | Method and system for identifying and correcting accent-induced speech recognition difficulties |
US7406408B1 (en) | 2004-08-24 | 2008-07-29 | The United States Of America As Represented By The Director, National Security Agency | Method of recognizing phones in speech of any language |
US7430503B1 (en) | 2004-08-24 | 2008-09-30 | The United States Of America As Represented By The Director, National Security Agency | Method of combining corpora to achieve consistency in phonetic labeling |
US20060122834A1 (en) * | 2004-12-03 | 2006-06-08 | Bennett Ian M | Emotion detection device & method for use in distributed systems |
US20070038455A1 (en) * | 2005-08-09 | 2007-02-15 | Murzina Marina V | Accent detection and correction system |
US8032372B1 (en) * | 2005-09-13 | 2011-10-04 | Escription, Inc. | Dictation selection |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
US20070138267A1 (en) * | 2005-12-21 | 2007-06-21 | Singer-Harter Debra L | Public terminal-based translator |
US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
US8010358B2 (en) * | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
US7822605B2 (en) * | 2006-10-19 | 2010-10-26 | Nice Systems Ltd. | Method and apparatus for large population speaker identification in telephone interactions |
US20080126093A1 (en) * | 2006-11-28 | 2008-05-29 | Nokia Corporation | Method, Apparatus and Computer Program Product for Providing a Language Based Interactive Multimedia System |
US20100064234A1 (en) * | 2007-03-09 | 2010-03-11 | Ghost, Inc. | System and Method for Browser within a Web Site and Proxy Server |
CN101578659B (zh) * | 2007-05-14 | 2012-01-18 | 松下电器产业株式会社 | 音质转换装置及音质转换方法 |
KR100925479B1 (ko) * | 2007-09-19 | 2009-11-06 | 한국전자통신연구원 | 음성 인식 방법 및 장치 |
US8032384B2 (en) * | 2008-03-14 | 2011-10-04 | Jay S Rylander | Hand held language translation and learning device |
US9418662B2 (en) * | 2009-01-21 | 2016-08-16 | Nokia Technologies Oy | Method, apparatus and computer program product for providing compound models for speech recognition adaptation |
US8442833B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US8788256B2 (en) * | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US8442829B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
WO2011037562A1 (en) * | 2009-09-23 | 2011-03-31 | Nuance Communications, Inc. | Probabilistic representation of acoustic segments |
WO2011150969A1 (en) * | 2010-06-02 | 2011-12-08 | Naxos Finance Sa | Apparatus for image data recording and reproducing, and method thereof |
FI20106048A0 (fi) * | 2010-10-12 | 2010-10-12 | Annu Marttila | Kieliprofiloinnin menetelmä |
US8914242B2 (en) | 2011-07-21 | 2014-12-16 | Thermo Ramsey, Inc. | Signal processing in guided wave cutoff spectroscopy |
US8442825B1 (en) | 2011-08-16 | 2013-05-14 | The United States Of America As Represented By The Director, National Security Agency | Biomimetic voice identifier |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
CN103631802B (zh) * | 2012-08-24 | 2015-05-20 | 腾讯科技(深圳)有限公司 | 歌曲信息检索方法、装置及相应的服务器 |
EP2736042A1 (de) * | 2012-11-23 | 2014-05-28 | Samsung Electronics Co., Ltd | Vorrichtung und Verfahren zur Erstellung eines mehrsprachigen akustischen Modells und computerlesbares Aufzeichnungsmedium für Speicherprogramm zur Ausführung des Verfahrens |
US10510264B2 (en) | 2013-03-21 | 2019-12-17 | Neuron Fuel, Inc. | Systems and methods for customized lesson creation and application |
US9595205B2 (en) | 2012-12-18 | 2017-03-14 | Neuron Fuel, Inc. | Systems and methods for goal-based programming instruction |
US8800113B1 (en) * | 2013-03-15 | 2014-08-12 | Blackstone Medical, Inc. | Rigid modular connector |
US9953630B1 (en) * | 2013-05-31 | 2018-04-24 | Amazon Technologies, Inc. | Language recognition for device settings |
KR102084646B1 (ko) * | 2013-07-04 | 2020-04-14 | 삼성전자주식회사 | 음성 인식 장치 및 음성 인식 방법 |
CN104143328B (zh) * | 2013-08-15 | 2015-11-25 | 腾讯科技(深圳)有限公司 | 一种关键词检测方法和装置 |
US9589564B2 (en) | 2014-02-05 | 2017-03-07 | Google Inc. | Multiple speech locale-specific hotword classifiers for selection of a speech locale |
US9135911B2 (en) * | 2014-02-07 | 2015-09-15 | NexGen Flight LLC | Automated generation of phonemic lexicon for voice activated cockpit management systems |
WO2016039751A1 (en) * | 2014-09-11 | 2016-03-17 | Nuance Communications, Inc. | Method for scoring in an automatic speech recognition system |
US20170011735A1 (en) * | 2015-07-10 | 2017-01-12 | Electronics And Telecommunications Research Institute | Speech recognition system and method |
US10614826B2 (en) | 2017-05-24 | 2020-04-07 | Modulate, Inc. | System and method for voice-to-voice conversion |
CN112364658A (zh) | 2019-07-24 | 2021-02-12 | 阿里巴巴集团控股有限公司 | 翻译以及语音识别方法、装置、设备 |
KR102303785B1 (ko) * | 2019-08-05 | 2021-09-23 | 엘지전자 주식회사 | 로봇의 언어를 설정하는 인공 지능 서버 및 그 방법 |
WO2021030759A1 (en) | 2019-08-14 | 2021-02-18 | Modulate, Inc. | Generation and detection of watermark for real-time voice conversion |
US11551695B1 (en) * | 2020-05-13 | 2023-01-10 | Amazon Technologies, Inc. | Model training system for custom speech-to-text models |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4536844A (en) * | 1983-04-26 | 1985-08-20 | Fairchild Camera And Instrument Corporation | Method and apparatus for simulating aural response information |
US4882757A (en) * | 1986-04-25 | 1989-11-21 | Texas Instruments Incorporated | Speech recognition system |
JP2717652B2 (ja) * | 1986-06-02 | 1998-02-18 | モトローラ・インコーポレーテッド | 連続音声認識システム |
US4852170A (en) * | 1986-12-18 | 1989-07-25 | R & D Associates | Real time computer speech recognition system |
US4905285A (en) * | 1987-04-03 | 1990-02-27 | American Telephone And Telegraph Company, At&T Bell Laboratories | Analysis arrangement based on a model of human neural responses |
US4910784A (en) * | 1987-07-30 | 1990-03-20 | Texas Instruments Incorporated | Low cost speech recognition system and method |
US4984177A (en) * | 1988-02-05 | 1991-01-08 | Advanced Products And Technologies, Inc. | Voice language translator |
JP2764277B2 (ja) * | 1988-09-07 | 1998-06-11 | 株式会社日立製作所 | 音声認識装置 |
US4937870A (en) * | 1988-11-14 | 1990-06-26 | American Telephone And Telegraph Company | Speech recognition arrangement |
US5033087A (en) * | 1989-03-14 | 1991-07-16 | International Business Machines Corp. | Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system |
US5278911A (en) * | 1989-05-18 | 1994-01-11 | Smiths Industries Public Limited Company | Speech recognition using a neural net |
US5293584A (en) * | 1992-05-21 | 1994-03-08 | International Business Machines Corporation | Speech recognition system for natural language translation |
-
1994
- 1994-07-12 AT AT94923413T patent/ATE200590T1/de active
- 1994-07-12 WO PCT/US1994/007742 patent/WO1995002879A1/en active IP Right Grant
- 1994-07-12 AU AU73282/94A patent/AU682380B2/en not_active Ceased
- 1994-07-12 CA CA002167200A patent/CA2167200A1/en not_active Abandoned
- 1994-07-12 JP JP7504646A patent/JPH09500223A/ja not_active Withdrawn
- 1994-07-12 EP EP94923413A patent/EP0708958B1/de not_active Expired - Lifetime
- 1994-07-12 DE DE69427083T patent/DE69427083T2/de not_active Expired - Fee Related
-
1995
- 1995-09-21 US US08/532,867 patent/US5758023A/en not_active Expired - Fee Related
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7162424B2 (en) | 2001-04-26 | 2007-01-09 | Siemens Aktiengesellschaft | Method and system for defining a sequence of sound modules for synthesis of a speech signal in a tonal language |
US7949517B2 (en) | 2006-12-01 | 2011-05-24 | Deutsche Telekom Ag | Dialogue system with logical evaluation for language identification in speech recognition |
Also Published As
Publication number | Publication date |
---|---|
AU7328294A (en) | 1995-02-13 |
EP0708958B1 (de) | 2001-04-11 |
CA2167200A1 (en) | 1995-01-26 |
WO1995002879A1 (en) | 1995-01-26 |
JPH09500223A (ja) | 1997-01-07 |
DE69427083D1 (de) | 2001-05-17 |
EP0708958A4 (de) | 1997-10-15 |
EP0708958A1 (de) | 1996-05-01 |
AU682380B2 (en) | 1997-10-02 |
US5758023A (en) | 1998-05-26 |
ATE200590T1 (de) | 2001-04-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE200590T1 (de) | Spracherkennungssystem für mehrere sprachen | |
Juang et al. | Automatic recognition and understanding of spoken language-a first step toward natural human-machine communication | |
Jelinek et al. | Perplexity—a measure of the difficulty of speech recognition tasks | |
KR20200023456A (ko) | 발언 분류기 | |
US7319959B1 (en) | Multi-source phoneme classification for noise-robust automatic speech recognition | |
TW347619B (en) | A communication system and method using a speaker dependent time-scaling technique a method for time-scale modification of speech using a modified version of the Waveform Similarity based Overlap-Add technique (WSOLA). | |
FR2522179B1 (fr) | Procede et appareil de reconnaissance de paroles permettant de reconnaitre des phonemes particuliers du signal vocal quelle que soit la personne qui parle | |
DE69330427T2 (de) | Spracherkennungssystem für sprachen mit zusammengesetzten wörtern | |
RU2466468C1 (ru) | Система и способ распознавания речи | |
ATE363120T1 (de) | Audio-dialogsystem und sprachgesteuertes browsing-verfahren | |
Hermansky et al. | Perceptual properties of current speech recognition technology | |
US20160210982A1 (en) | Method and Apparatus to Enhance Speech Understanding | |
US10143027B1 (en) | Device selection for routing of communications | |
JPH10504404A (ja) | 音声認識のための方法および装置 | |
JP6599828B2 (ja) | 音処理方法、音処理装置、及びプログラム | |
CN113488026A (zh) | 基于语用信息的语音理解模型生成方法和智能语音交互方法 | |
Price et al. | Combining linguistic with statistical methods in modeling prosody | |
Pols | Flexible, robust, and efficient human speech processing versus present-day speech technology | |
KR20210000802A (ko) | 인공지능 음성 인식 처리 방법 및 시스템 | |
US11172527B2 (en) | Routing of communications to a device | |
Prasangini et al. | Sinhala speech to sinhala unicode text conversion for disaster relief facilitation in sri lanka | |
Berger et al. | Speech Activity Detection for Deaf People: Evaluation on the Developed Smart Solution Prototype | |
Naveena et al. | Extraction of Prosodic Features to Automatically Recognize Tamil Dialects | |
Tomas et al. | Determination of spectral parameters of speech signal by Goertzel algorithm | |
JPS6331798B2 (de) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8339 | Ceased/non-payment of the annual fee |