DE60020660D1 - Kontextabhängige akustische Modelle für die Spracherkennung mit Eigenstimmenanpassung - Google Patents
Kontextabhängige akustische Modelle für die Spracherkennung mit EigenstimmenanpassungInfo
- Publication number
- DE60020660D1 DE60020660D1 DE60020660T DE60020660T DE60020660D1 DE 60020660 D1 DE60020660 D1 DE 60020660D1 DE 60020660 T DE60020660 T DE 60020660T DE 60020660 T DE60020660 T DE 60020660T DE 60020660 D1 DE60020660 D1 DE 60020660D1
- Authority
- DE
- Germany
- Prior art keywords
- voice
- context
- matching
- acoustic models
- dependent acoustic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/450,392 US6571208B1 (en) | 1999-11-29 | 1999-11-29 | Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training |
US450392 | 1999-11-29 |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60020660D1 true DE60020660D1 (de) | 2005-07-14 |
DE60020660T2 DE60020660T2 (de) | 2005-10-06 |
Family
ID=23787898
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE60020660T Expired - Fee Related DE60020660T2 (de) | 1999-11-29 | 2000-11-27 | Kontextabhängige akustische Modelle für die Spracherkennung mit Eigenstimmenanpassung |
Country Status (7)
Country | Link |
---|---|
US (1) | US6571208B1 (de) |
EP (1) | EP1103952B1 (de) |
JP (1) | JP3683177B2 (de) |
CN (1) | CN1298172A (de) |
DE (1) | DE60020660T2 (de) |
ES (1) | ES2243210T3 (de) |
TW (1) | TW493160B (de) |
Families Citing this family (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE10047723A1 (de) * | 2000-09-27 | 2002-04-11 | Philips Corp Intellectual Pty | Verfahren zur Ermittlung eines Eigenraums zur Darstellung einer Mehrzahl von Trainingssprechern |
DE10047724A1 (de) * | 2000-09-27 | 2002-04-11 | Philips Corp Intellectual Pty | Verfahren zur Ermittlung eines Eigenraumes zur Darstellung einer Mehrzahl von Trainingssprechern |
JP2002150614A (ja) * | 2000-11-10 | 2002-05-24 | Pioneer Electronic Corp | 光ディスク |
ATE297588T1 (de) * | 2000-11-14 | 2005-06-15 | Ibm | Anpassung des phonetischen kontextes zur verbesserung der spracherkennung |
US6970820B2 (en) * | 2001-02-26 | 2005-11-29 | Matsushita Electric Industrial Co., Ltd. | Voice personalization of speech synthesizer |
US6895376B2 (en) * | 2001-05-04 | 2005-05-17 | Matsushita Electric Industrial Co., Ltd. | Eigenvoice re-estimation technique of acoustic models for speech recognition, speaker identification and speaker verification |
US7085722B2 (en) * | 2001-05-14 | 2006-08-01 | Sony Computer Entertainment America Inc. | System and method for menu-driven voice control of characters in a game environment |
US20040024585A1 (en) * | 2002-07-03 | 2004-02-05 | Amit Srivastava | Linguistic segmentation of speech |
US20040006628A1 (en) * | 2002-07-03 | 2004-01-08 | Scott Shepard | Systems and methods for providing real-time alerting |
US7788096B2 (en) | 2002-09-03 | 2010-08-31 | Microsoft Corporation | Method and apparatus for generating decision tree questions for speech processing |
US7752045B2 (en) * | 2002-10-07 | 2010-07-06 | Carnegie Mellon University | Systems and methods for comparing speech elements |
US20040083090A1 (en) * | 2002-10-17 | 2004-04-29 | Daniel Kiecza | Manager for integrating language technology components |
US7165026B2 (en) * | 2003-03-31 | 2007-01-16 | Microsoft Corporation | Method of noise estimation using incremental bayes learning |
US7499857B2 (en) * | 2003-05-15 | 2009-03-03 | Microsoft Corporation | Adaptation of compressed acoustic models |
US8133115B2 (en) | 2003-10-22 | 2012-03-13 | Sony Computer Entertainment America Llc | System and method for recording and displaying a graphical path in a video game |
KR20050063986A (ko) * | 2003-12-23 | 2005-06-29 | 한국전자통신연구원 | 고유음성 계수를 이용한 화자종속 음성인식 시스템 및 방법 |
TWI264702B (en) * | 2004-05-03 | 2006-10-21 | Acer Inc | Method for constructing acoustic model |
US20060071933A1 (en) | 2004-10-06 | 2006-04-06 | Sony Computer Entertainment Inc. | Application binary interface for multi-pass shaders |
US7636126B2 (en) | 2005-06-22 | 2009-12-22 | Sony Computer Entertainment Inc. | Delay matching in audio/video systems |
US7880746B2 (en) | 2006-05-04 | 2011-02-01 | Sony Computer Entertainment Inc. | Bandwidth management through lighting control of a user environment via a display device |
US7965859B2 (en) | 2006-05-04 | 2011-06-21 | Sony Computer Entertainment Inc. | Lighting control of a user environment via a display device |
WO2007131530A1 (en) * | 2006-05-16 | 2007-11-22 | Loquendo S.P.A. | Intersession variability compensation for automatic extraction of information from voice |
US20090030676A1 (en) * | 2007-07-26 | 2009-01-29 | Creative Technology Ltd | Method of deriving a compressed acoustic model for speech recognition |
US9126116B2 (en) | 2007-09-05 | 2015-09-08 | Sony Computer Entertainment America Llc | Ranking of user-generated game play advice |
US9108108B2 (en) | 2007-09-05 | 2015-08-18 | Sony Computer Entertainment America Llc | Real-time, contextual display of ranked, user-generated game play advice |
JP2010152081A (ja) * | 2008-12-25 | 2010-07-08 | Toshiba Corp | 話者適応装置及びそのプログラム |
GB2478314B (en) | 2010-03-02 | 2012-09-12 | Toshiba Res Europ Ltd | A speech processor, a speech processing method and a method of training a speech processor |
US10786736B2 (en) | 2010-05-11 | 2020-09-29 | Sony Interactive Entertainment LLC | Placement of user information in a game space |
US20120109649A1 (en) * | 2010-11-01 | 2012-05-03 | General Motors Llc | Speech dialect classification for automatic speech recognition |
US9342817B2 (en) | 2011-07-07 | 2016-05-17 | Sony Interactive Entertainment LLC | Auto-creating groups for sharing photos |
US9833707B2 (en) | 2012-10-29 | 2017-12-05 | Sony Interactive Entertainment Inc. | Ambient light control and calibration via a console |
CN104572631B (zh) * | 2014-12-03 | 2018-04-13 | 北京捷通华声语音技术有限公司 | 一种语言模型的训练方法及系统 |
US10360357B2 (en) | 2017-01-10 | 2019-07-23 | International Business Machines Corporation | Personal identification using action sequences detected by sensors |
US10561942B2 (en) | 2017-05-15 | 2020-02-18 | Sony Interactive Entertainment America Llc | Metronome for competitive gaming headset |
US10128914B1 (en) | 2017-09-06 | 2018-11-13 | Sony Interactive Entertainment LLC | Smart tags with multiple interactions |
US11698927B2 (en) | 2018-05-16 | 2023-07-11 | Sony Interactive Entertainment LLC | Contextual digital media processing systems and methods |
US11410642B2 (en) * | 2019-08-16 | 2022-08-09 | Soundhound, Inc. | Method and system using phoneme embedding |
Family Cites Families (42)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4903035A (en) | 1983-12-20 | 1990-02-20 | Bsh Electronics, Ltd. | Electrical signal separating device having isolating and matching circuitry |
US4718088A (en) | 1984-03-27 | 1988-01-05 | Exxon Research And Engineering Company | Speech recognition training method |
JPS62231993A (ja) | 1986-03-25 | 1987-10-12 | インタ−ナシヨナル ビジネス マシ−ンズ コ−ポレ−シヨン | 音声認識方法 |
US4817156A (en) | 1987-08-10 | 1989-03-28 | International Business Machines Corporation | Rapidly training a speech recognizer to a subsequent speaker given training data of a reference speaker |
JPH01102599A (ja) | 1987-10-12 | 1989-04-20 | Internatl Business Mach Corp <Ibm> | 音声認識方法 |
JP2733955B2 (ja) | 1988-05-18 | 1998-03-30 | 日本電気株式会社 | 適応型音声認識装置 |
US5127055A (en) | 1988-12-30 | 1992-06-30 | Kurzweil Applied Intelligence, Inc. | Speech recognition apparatus & method having dynamic reference pattern adaptation |
JPH0636156B2 (ja) | 1989-03-13 | 1994-05-11 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声認識装置 |
DE3931638A1 (de) | 1989-09-22 | 1991-04-04 | Standard Elektrik Lorenz Ag | Verfahren zur sprecheradaptiven erkennung von sprache |
JP3014177B2 (ja) | 1991-08-08 | 2000-02-28 | 富士通株式会社 | 話者適応音声認識装置 |
US5280562A (en) | 1991-10-03 | 1994-01-18 | International Business Machines Corporation | Speech coding apparatus with single-dimension acoustic prototypes for a speech recognizer |
ES2128390T3 (es) | 1992-03-02 | 1999-05-16 | At & T Corp | Metodo de adiestramiento y dispositivo para reconocimiento de voz. |
US5233681A (en) | 1992-04-24 | 1993-08-03 | International Business Machines Corporation | Context-dependent speech recognizer using estimated next word context |
US5293584A (en) | 1992-05-21 | 1994-03-08 | International Business Machines Corporation | Speech recognition system for natural language translation |
US5473728A (en) | 1993-02-24 | 1995-12-05 | The United States Of America As Represented By The Secretary Of The Navy | Training of homoscedastic hidden Markov models for automatic speech recognition |
JPH075892A (ja) | 1993-04-29 | 1995-01-10 | Matsushita Electric Ind Co Ltd | 音声認識方法 |
US5664059A (en) | 1993-04-29 | 1997-09-02 | Panasonic Technologies, Inc. | Self-learning speaker adaptation based on spectral variation source decomposition |
US5522011A (en) | 1993-09-27 | 1996-05-28 | International Business Machines Corporation | Speech coding apparatus and method using classification rules |
AU7802194A (en) | 1993-09-30 | 1995-04-18 | Apple Computer, Inc. | Continuous reference adaptation in a pattern recognition system |
JP2692581B2 (ja) | 1994-06-07 | 1997-12-17 | 日本電気株式会社 | 音響カテゴリ平均値計算装置及び適応化装置 |
US5793891A (en) | 1994-07-07 | 1998-08-11 | Nippon Telegraph And Telephone Corporation | Adaptive training method for pattern recognition |
US5825978A (en) | 1994-07-18 | 1998-10-20 | Sri International | Method and apparatus for speech recognition using optimized partial mixture tying of HMM state functions |
US5737723A (en) | 1994-08-29 | 1998-04-07 | Lucent Technologies Inc. | Confusable word detection in speech recognition |
US5715468A (en) * | 1994-09-30 | 1998-02-03 | Budzinski; Robert Lucius | Memory system for storing and retrieving experience and knowledge with natural language |
US5864810A (en) | 1995-01-20 | 1999-01-26 | Sri International | Method and apparatus for speech recognition adapted to an individual speaker |
JP3453456B2 (ja) | 1995-06-19 | 2003-10-06 | キヤノン株式会社 | 状態共有モデルの設計方法及び装置ならびにその状態共有モデルを用いた音声認識方法および装置 |
US5842163A (en) | 1995-06-21 | 1998-11-24 | Sri International | Method and apparatus for computing likelihood and hypothesizing keyword appearance in speech |
US5806029A (en) | 1995-09-15 | 1998-09-08 | At&T Corp | Signal conditioned minimum error rate training for continuous speech recognition |
JP2871561B2 (ja) | 1995-11-30 | 1999-03-17 | 株式会社エイ・ティ・アール音声翻訳通信研究所 | 不特定話者モデル生成装置及び音声認識装置 |
US5787394A (en) | 1995-12-13 | 1998-07-28 | International Business Machines Corporation | State-dependent speaker clustering for speaker adaptation |
US5778342A (en) | 1996-02-01 | 1998-07-07 | Dspc Israel Ltd. | Pattern recognition system and method |
US5895447A (en) | 1996-02-02 | 1999-04-20 | International Business Machines Corporation | Speech recognition using thresholded speaker class model selection or model adaptation |
JP3302266B2 (ja) | 1996-07-23 | 2002-07-15 | 沖電気工業株式会社 | ヒドン・マルコフ・モデルの学習方法 |
US6163769A (en) * | 1997-10-02 | 2000-12-19 | Microsoft Corporation | Text-to-speech using clustered context-dependent phoneme-based units |
US6230131B1 (en) * | 1998-04-29 | 2001-05-08 | Matsushita Electric Industrial Co., Ltd. | Method for generating spelling-to-pronunciation decision tree |
US6029132A (en) * | 1998-04-30 | 2000-02-22 | Matsushita Electric Industrial Co. | Method for letter-to-sound in text-to-speech synthesis |
US6016471A (en) * | 1998-04-29 | 2000-01-18 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word |
US6343267B1 (en) * | 1998-04-30 | 2002-01-29 | Matsushita Electric Industrial Co., Ltd. | Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques |
US6263309B1 (en) * | 1998-04-30 | 2001-07-17 | Matsushita Electric Industrial Co., Ltd. | Maximum likelihood method for finding an adapted speaker model in eigenvoice space |
TW436758B (en) * | 1998-04-30 | 2001-05-28 | Matsushita Electric Ind Co Ltd | Speaker and environment adaptation based on eigenvoices including maximum likelihood method |
US6233553B1 (en) * | 1998-09-04 | 2001-05-15 | Matsushita Electric Industrial Co., Ltd. | Method and system for automatically determining phonetic transcriptions associated with spelled words |
US6324512B1 (en) * | 1999-08-26 | 2001-11-27 | Matsushita Electric Industrial Co., Ltd. | System and method for allowing family members to access TV contents and program media recorder over telephone or internet |
-
1999
- 1999-11-29 US US09/450,392 patent/US6571208B1/en not_active Expired - Lifetime
-
2000
- 2000-11-27 DE DE60020660T patent/DE60020660T2/de not_active Expired - Fee Related
- 2000-11-27 EP EP00310492A patent/EP1103952B1/de not_active Expired - Lifetime
- 2000-11-27 ES ES00310492T patent/ES2243210T3/es not_active Expired - Lifetime
- 2000-11-29 JP JP2000363363A patent/JP3683177B2/ja not_active Expired - Fee Related
- 2000-11-29 CN CN00134269A patent/CN1298172A/zh active Pending
-
2001
- 2001-02-01 TW TW089125231A patent/TW493160B/zh not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
CN1298172A (zh) | 2001-06-06 |
EP1103952A2 (de) | 2001-05-30 |
EP1103952A3 (de) | 2002-04-03 |
JP2001195084A (ja) | 2001-07-19 |
EP1103952B1 (de) | 2005-06-08 |
TW493160B (en) | 2002-07-01 |
DE60020660T2 (de) | 2005-10-06 |
JP3683177B2 (ja) | 2005-08-17 |
ES2243210T3 (es) | 2005-12-01 |
US6571208B1 (en) | 2003-05-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60020660D1 (de) | Kontextabhängige akustische Modelle für die Spracherkennung mit Eigenstimmenanpassung | |
DE60115738D1 (de) | Sprachmodelle für die Spracherkennung | |
DE69827988D1 (de) | Sprachmodelle für die Spracherkennung | |
FI19992351A (fi) | Puheentunnistus | |
DE10191732T1 (de) | Selektive Sprecheradaption für ein fahrzeuggebundenes Spracherkennungssystem | |
DE602004021716D1 (de) | Spracherkennungssystem | |
DE602004002230D1 (de) | Spracherkennungssystem für ein Mobilgerät | |
DE60005807D1 (de) | Mikrofon für ein hörgerät | |
DE69829235D1 (de) | Registrierung für die Spracherkennung | |
DE69925479D1 (de) | Dynamisch konfigurierbares akustisches modell für spracherkennungssysteme | |
DE69919842D1 (de) | Sprachmodell basierend auf der spracherkennungshistorie | |
DE03793861T8 (de) | Aufhängung für die schwingspule einer lautsprecherantriebseinheit | |
DE60325127D1 (de) | Kommunikationssystem für Hörbehinderte mit einer Sprache/text-umwandlungseinheit | |
DE69831114D1 (de) | Integration mehrfacher Modelle für die Spracherkennung in verschiedenen Umgebungen | |
DE60018886D1 (de) | Adaptive Wavelet-Extraktion für die Spracherkennung | |
DE69933623D1 (de) | Spracherkennung | |
DE69819951D1 (de) | Spracherkenner mit Rauschadaptierung | |
DE60109105D1 (de) | Hierarchisierte Wörterbücher für die Spracherkennung | |
DE60126882D1 (de) | Hierarchisierte Wörterbücher für die Spracherkennung | |
DE60323362D1 (de) | Spracherkennungseinrichtung | |
DE60305568D1 (de) | Schlüsselworterkennung in einem Sprachsignal | |
DE60204374D1 (de) | Spracherkennungsvorrichtung | |
DE60336102D1 (de) | Automatische Segmentierung in Sprachsynthese | |
DE60205971D1 (de) | Verbindungsstruktur für Kunststoffteile | |
HK1069664A1 (en) | Voice matching system for audio transducers |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition | ||
8339 | Ceased/non-payment of the annual fee |