DE60020660D1 - Kontextabhängige akustische Modelle für die Spracherkennung mit Eigenstimmenanpassung - Google Patents

Kontextabhängige akustische Modelle für die Spracherkennung mit Eigenstimmenanpassung

Info

Publication number
DE60020660D1
DE60020660D1 DE60020660T DE60020660T DE60020660D1 DE 60020660 D1 DE60020660 D1 DE 60020660D1 DE 60020660 T DE60020660 T DE 60020660T DE 60020660 T DE60020660 T DE 60020660T DE 60020660 D1 DE60020660 D1 DE 60020660D1
Authority
DE
Germany
Prior art keywords
voice
context
matching
acoustic models
dependent acoustic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
DE60020660T
Other languages
English (en)
Other versions
DE60020660T2 (de
Inventor
Roland Kuhn
Matteo Contolini
Jean Claude Junqua
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Application granted granted Critical
Publication of DE60020660D1 publication Critical patent/DE60020660D1/de
Publication of DE60020660T2 publication Critical patent/DE60020660T2/de
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
DE60020660T 1999-11-29 2000-11-27 Kontextabhängige akustische Modelle für die Spracherkennung mit Eigenstimmenanpassung Expired - Fee Related DE60020660T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/450,392 US6571208B1 (en) 1999-11-29 1999-11-29 Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training
US450392 1999-11-29

Publications (2)

Publication Number Publication Date
DE60020660D1 true DE60020660D1 (de) 2005-07-14
DE60020660T2 DE60020660T2 (de) 2005-10-06

Family

ID=23787898

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60020660T Expired - Fee Related DE60020660T2 (de) 1999-11-29 2000-11-27 Kontextabhängige akustische Modelle für die Spracherkennung mit Eigenstimmenanpassung

Country Status (7)

Country Link
US (1) US6571208B1 (de)
EP (1) EP1103952B1 (de)
JP (1) JP3683177B2 (de)
CN (1) CN1298172A (de)
DE (1) DE60020660T2 (de)
ES (1) ES2243210T3 (de)
TW (1) TW493160B (de)

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE10047723A1 (de) * 2000-09-27 2002-04-11 Philips Corp Intellectual Pty Verfahren zur Ermittlung eines Eigenraums zur Darstellung einer Mehrzahl von Trainingssprechern
DE10047724A1 (de) * 2000-09-27 2002-04-11 Philips Corp Intellectual Pty Verfahren zur Ermittlung eines Eigenraumes zur Darstellung einer Mehrzahl von Trainingssprechern
JP2002150614A (ja) * 2000-11-10 2002-05-24 Pioneer Electronic Corp 光ディスク
ATE297588T1 (de) * 2000-11-14 2005-06-15 Ibm Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
US6970820B2 (en) * 2001-02-26 2005-11-29 Matsushita Electric Industrial Co., Ltd. Voice personalization of speech synthesizer
US6895376B2 (en) * 2001-05-04 2005-05-17 Matsushita Electric Industrial Co., Ltd. Eigenvoice re-estimation technique of acoustic models for speech recognition, speaker identification and speaker verification
US7085722B2 (en) * 2001-05-14 2006-08-01 Sony Computer Entertainment America Inc. System and method for menu-driven voice control of characters in a game environment
US20040024585A1 (en) * 2002-07-03 2004-02-05 Amit Srivastava Linguistic segmentation of speech
US20040006628A1 (en) * 2002-07-03 2004-01-08 Scott Shepard Systems and methods for providing real-time alerting
US7788096B2 (en) 2002-09-03 2010-08-31 Microsoft Corporation Method and apparatus for generating decision tree questions for speech processing
US7752045B2 (en) * 2002-10-07 2010-07-06 Carnegie Mellon University Systems and methods for comparing speech elements
US20040083090A1 (en) * 2002-10-17 2004-04-29 Daniel Kiecza Manager for integrating language technology components
US7165026B2 (en) * 2003-03-31 2007-01-16 Microsoft Corporation Method of noise estimation using incremental bayes learning
US7499857B2 (en) * 2003-05-15 2009-03-03 Microsoft Corporation Adaptation of compressed acoustic models
US8133115B2 (en) 2003-10-22 2012-03-13 Sony Computer Entertainment America Llc System and method for recording and displaying a graphical path in a video game
KR20050063986A (ko) * 2003-12-23 2005-06-29 한국전자통신연구원 고유음성 계수를 이용한 화자종속 음성인식 시스템 및 방법
TWI264702B (en) * 2004-05-03 2006-10-21 Acer Inc Method for constructing acoustic model
US20060071933A1 (en) 2004-10-06 2006-04-06 Sony Computer Entertainment Inc. Application binary interface for multi-pass shaders
US7636126B2 (en) 2005-06-22 2009-12-22 Sony Computer Entertainment Inc. Delay matching in audio/video systems
US7880746B2 (en) 2006-05-04 2011-02-01 Sony Computer Entertainment Inc. Bandwidth management through lighting control of a user environment via a display device
US7965859B2 (en) 2006-05-04 2011-06-21 Sony Computer Entertainment Inc. Lighting control of a user environment via a display device
WO2007131530A1 (en) * 2006-05-16 2007-11-22 Loquendo S.P.A. Intersession variability compensation for automatic extraction of information from voice
US20090030676A1 (en) * 2007-07-26 2009-01-29 Creative Technology Ltd Method of deriving a compressed acoustic model for speech recognition
US9126116B2 (en) 2007-09-05 2015-09-08 Sony Computer Entertainment America Llc Ranking of user-generated game play advice
US9108108B2 (en) 2007-09-05 2015-08-18 Sony Computer Entertainment America Llc Real-time, contextual display of ranked, user-generated game play advice
JP2010152081A (ja) * 2008-12-25 2010-07-08 Toshiba Corp 話者適応装置及びそのプログラム
GB2478314B (en) 2010-03-02 2012-09-12 Toshiba Res Europ Ltd A speech processor, a speech processing method and a method of training a speech processor
US10786736B2 (en) 2010-05-11 2020-09-29 Sony Interactive Entertainment LLC Placement of user information in a game space
US20120109649A1 (en) * 2010-11-01 2012-05-03 General Motors Llc Speech dialect classification for automatic speech recognition
US9342817B2 (en) 2011-07-07 2016-05-17 Sony Interactive Entertainment LLC Auto-creating groups for sharing photos
US9833707B2 (en) 2012-10-29 2017-12-05 Sony Interactive Entertainment Inc. Ambient light control and calibration via a console
CN104572631B (zh) * 2014-12-03 2018-04-13 北京捷通华声语音技术有限公司 一种语言模型的训练方法及系统
US10360357B2 (en) 2017-01-10 2019-07-23 International Business Machines Corporation Personal identification using action sequences detected by sensors
US10561942B2 (en) 2017-05-15 2020-02-18 Sony Interactive Entertainment America Llc Metronome for competitive gaming headset
US10128914B1 (en) 2017-09-06 2018-11-13 Sony Interactive Entertainment LLC Smart tags with multiple interactions
US11698927B2 (en) 2018-05-16 2023-07-11 Sony Interactive Entertainment LLC Contextual digital media processing systems and methods
US11410642B2 (en) * 2019-08-16 2022-08-09 Soundhound, Inc. Method and system using phoneme embedding

Family Cites Families (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4903035A (en) 1983-12-20 1990-02-20 Bsh Electronics, Ltd. Electrical signal separating device having isolating and matching circuitry
US4718088A (en) 1984-03-27 1988-01-05 Exxon Research And Engineering Company Speech recognition training method
JPS62231993A (ja) 1986-03-25 1987-10-12 インタ−ナシヨナル ビジネス マシ−ンズ コ−ポレ−シヨン 音声認識方法
US4817156A (en) 1987-08-10 1989-03-28 International Business Machines Corporation Rapidly training a speech recognizer to a subsequent speaker given training data of a reference speaker
JPH01102599A (ja) 1987-10-12 1989-04-20 Internatl Business Mach Corp <Ibm> 音声認識方法
JP2733955B2 (ja) 1988-05-18 1998-03-30 日本電気株式会社 適応型音声認識装置
US5127055A (en) 1988-12-30 1992-06-30 Kurzweil Applied Intelligence, Inc. Speech recognition apparatus & method having dynamic reference pattern adaptation
JPH0636156B2 (ja) 1989-03-13 1994-05-11 インターナショナル・ビジネス・マシーンズ・コーポレーション 音声認識装置
DE3931638A1 (de) 1989-09-22 1991-04-04 Standard Elektrik Lorenz Ag Verfahren zur sprecheradaptiven erkennung von sprache
JP3014177B2 (ja) 1991-08-08 2000-02-28 富士通株式会社 話者適応音声認識装置
US5280562A (en) 1991-10-03 1994-01-18 International Business Machines Corporation Speech coding apparatus with single-dimension acoustic prototypes for a speech recognizer
ES2128390T3 (es) 1992-03-02 1999-05-16 At & T Corp Metodo de adiestramiento y dispositivo para reconocimiento de voz.
US5233681A (en) 1992-04-24 1993-08-03 International Business Machines Corporation Context-dependent speech recognizer using estimated next word context
US5293584A (en) 1992-05-21 1994-03-08 International Business Machines Corporation Speech recognition system for natural language translation
US5473728A (en) 1993-02-24 1995-12-05 The United States Of America As Represented By The Secretary Of The Navy Training of homoscedastic hidden Markov models for automatic speech recognition
JPH075892A (ja) 1993-04-29 1995-01-10 Matsushita Electric Ind Co Ltd 音声認識方法
US5664059A (en) 1993-04-29 1997-09-02 Panasonic Technologies, Inc. Self-learning speaker adaptation based on spectral variation source decomposition
US5522011A (en) 1993-09-27 1996-05-28 International Business Machines Corporation Speech coding apparatus and method using classification rules
AU7802194A (en) 1993-09-30 1995-04-18 Apple Computer, Inc. Continuous reference adaptation in a pattern recognition system
JP2692581B2 (ja) 1994-06-07 1997-12-17 日本電気株式会社 音響カテゴリ平均値計算装置及び適応化装置
US5793891A (en) 1994-07-07 1998-08-11 Nippon Telegraph And Telephone Corporation Adaptive training method for pattern recognition
US5825978A (en) 1994-07-18 1998-10-20 Sri International Method and apparatus for speech recognition using optimized partial mixture tying of HMM state functions
US5737723A (en) 1994-08-29 1998-04-07 Lucent Technologies Inc. Confusable word detection in speech recognition
US5715468A (en) * 1994-09-30 1998-02-03 Budzinski; Robert Lucius Memory system for storing and retrieving experience and knowledge with natural language
US5864810A (en) 1995-01-20 1999-01-26 Sri International Method and apparatus for speech recognition adapted to an individual speaker
JP3453456B2 (ja) 1995-06-19 2003-10-06 キヤノン株式会社 状態共有モデルの設計方法及び装置ならびにその状態共有モデルを用いた音声認識方法および装置
US5842163A (en) 1995-06-21 1998-11-24 Sri International Method and apparatus for computing likelihood and hypothesizing keyword appearance in speech
US5806029A (en) 1995-09-15 1998-09-08 At&T Corp Signal conditioned minimum error rate training for continuous speech recognition
JP2871561B2 (ja) 1995-11-30 1999-03-17 株式会社エイ・ティ・アール音声翻訳通信研究所 不特定話者モデル生成装置及び音声認識装置
US5787394A (en) 1995-12-13 1998-07-28 International Business Machines Corporation State-dependent speaker clustering for speaker adaptation
US5778342A (en) 1996-02-01 1998-07-07 Dspc Israel Ltd. Pattern recognition system and method
US5895447A (en) 1996-02-02 1999-04-20 International Business Machines Corporation Speech recognition using thresholded speaker class model selection or model adaptation
JP3302266B2 (ja) 1996-07-23 2002-07-15 沖電気工業株式会社 ヒドン・マルコフ・モデルの学習方法
US6163769A (en) * 1997-10-02 2000-12-19 Microsoft Corporation Text-to-speech using clustered context-dependent phoneme-based units
US6230131B1 (en) * 1998-04-29 2001-05-08 Matsushita Electric Industrial Co., Ltd. Method for generating spelling-to-pronunciation decision tree
US6029132A (en) * 1998-04-30 2000-02-22 Matsushita Electric Industrial Co. Method for letter-to-sound in text-to-speech synthesis
US6016471A (en) * 1998-04-29 2000-01-18 Matsushita Electric Industrial Co., Ltd. Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word
US6343267B1 (en) * 1998-04-30 2002-01-29 Matsushita Electric Industrial Co., Ltd. Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques
US6263309B1 (en) * 1998-04-30 2001-07-17 Matsushita Electric Industrial Co., Ltd. Maximum likelihood method for finding an adapted speaker model in eigenvoice space
TW436758B (en) * 1998-04-30 2001-05-28 Matsushita Electric Ind Co Ltd Speaker and environment adaptation based on eigenvoices including maximum likelihood method
US6233553B1 (en) * 1998-09-04 2001-05-15 Matsushita Electric Industrial Co., Ltd. Method and system for automatically determining phonetic transcriptions associated with spelled words
US6324512B1 (en) * 1999-08-26 2001-11-27 Matsushita Electric Industrial Co., Ltd. System and method for allowing family members to access TV contents and program media recorder over telephone or internet

Also Published As

Publication number Publication date
CN1298172A (zh) 2001-06-06
EP1103952A2 (de) 2001-05-30
EP1103952A3 (de) 2002-04-03
JP2001195084A (ja) 2001-07-19
EP1103952B1 (de) 2005-06-08
TW493160B (en) 2002-07-01
DE60020660T2 (de) 2005-10-06
JP3683177B2 (ja) 2005-08-17
ES2243210T3 (es) 2005-12-01
US6571208B1 (en) 2003-05-27

Similar Documents

Publication Publication Date Title
DE60020660D1 (de) Kontextabhängige akustische Modelle für die Spracherkennung mit Eigenstimmenanpassung
DE60115738D1 (de) Sprachmodelle für die Spracherkennung
DE69827988D1 (de) Sprachmodelle für die Spracherkennung
FI19992351A (fi) Puheentunnistus
DE10191732T1 (de) Selektive Sprecheradaption für ein fahrzeuggebundenes Spracherkennungssystem
DE602004021716D1 (de) Spracherkennungssystem
DE602004002230D1 (de) Spracherkennungssystem für ein Mobilgerät
DE60005807D1 (de) Mikrofon für ein hörgerät
DE69829235D1 (de) Registrierung für die Spracherkennung
DE69925479D1 (de) Dynamisch konfigurierbares akustisches modell für spracherkennungssysteme
DE69919842D1 (de) Sprachmodell basierend auf der spracherkennungshistorie
DE03793861T8 (de) Aufhängung für die schwingspule einer lautsprecherantriebseinheit
DE60325127D1 (de) Kommunikationssystem für Hörbehinderte mit einer Sprache/text-umwandlungseinheit
DE69831114D1 (de) Integration mehrfacher Modelle für die Spracherkennung in verschiedenen Umgebungen
DE60018886D1 (de) Adaptive Wavelet-Extraktion für die Spracherkennung
DE69933623D1 (de) Spracherkennung
DE69819951D1 (de) Spracherkenner mit Rauschadaptierung
DE60109105D1 (de) Hierarchisierte Wörterbücher für die Spracherkennung
DE60126882D1 (de) Hierarchisierte Wörterbücher für die Spracherkennung
DE60323362D1 (de) Spracherkennungseinrichtung
DE60305568D1 (de) Schlüsselworterkennung in einem Sprachsignal
DE60204374D1 (de) Spracherkennungsvorrichtung
DE60336102D1 (de) Automatische Segmentierung in Sprachsynthese
DE60205971D1 (de) Verbindungsstruktur für Kunststoffteile
HK1069664A1 (en) Voice matching system for audio transducers

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8339 Ceased/non-payment of the annual fee