DE69635655D1 - Sprecherangepasste Spracherkennung - Google Patents

Sprecherangepasste Spracherkennung

Publication number: DE69635655D1
Authority: DE; Germany
Prior art keywords: speaker; sprecherangepasste; models; language identification; transformation
Prior art date: 1995-01-20
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Expired - Lifetime

Application number

DE69635655T

Other versions

DE69635655T2 (de)

Inventor

Vassilios Digalakis

Leonardo Neumeyer

Dimitry Rtischev

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

SRI International Inc

Original Assignee

SRI International Inc

Stanford Research Institute

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1995-01-20

Filing date

1996-01-19

Publication date

2006-02-02

1996-01-19 Application filed by SRI International Inc, Stanford Research Institute

2006-02-02 Publication of DE69635655D1

2006-09-14 Application granted

2006-09-14 Publication of DE69635655T2

2016-01-20 Anticipated expiration

Status: Expired - Lifetime

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit

Landscapes

Engineering & Computer Science (AREA)
Physics & Mathematics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Computational Linguistics (AREA)
Artificial Intelligence (AREA)
Probability & Statistics with Applications (AREA)
Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Machine Translation (AREA)
Devices For Executing Special Programs (AREA)

Applications Claiming Priority (3)

Application Number	Priority Date	Filing Date	Title
US375908		1982-05-07
US08/375,908 US5864810A (en)	1995-01-20	1995-01-20	Method and apparatus for speech recognition adapted to an individual speaker
PCT/US1996/000762 WO1996022514A2 (en)	1995-01-20	1996-01-19	Method and apparatus for speech recognition adapted to an individual speaker

Publications (2)

Publication Number	Publication Date
DE69635655D1 (de)	2006-02-02
DE69635655T2 (de)	2006-09-14

Family

ID=23482858

Family Applications (1)

Application Number	Priority Date	Filing Date	Title
DE69635655T Expired - Lifetime DE69635655T2 (de)	1995-01-20	1996-01-19	Sprecherangepasste Spracherkennung

Country Status (8)

Country	Link
US (1)	US5864810A (de)
EP (1)	EP0804721B1 (de)
JP (1)	JP4217275B2 (de)
AT (1)	ATE314718T1 (de)
CA (1)	CA2210887C (de)
DE (1)	DE69635655T2 (de)
ES (1)	ES2252752T3 (de)
WO (1)	WO1996022514A2 (de)

Families Citing this family (137)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US6081660A (en) *	1995-12-01	2000-06-27	The Australian National University	Method for forming a cohort for use in identification of an individual
WO1998011534A1 (de) *	1996-09-10	1998-03-19	Siemens Aktiengesellschaft	Verfahren zur anpassung eines hidden-markov-lautmodelles in einem spracherkennungssystem
US6151575A (en) *	1996-10-28	2000-11-21	Dragon Systems, Inc.	Rapid adaptation of speech models
US6128587A (en) *	1997-01-14	2000-10-03	The Regents Of The University Of California	Method and apparatus using Bayesian subfamily identification for sequence analysis
JP3886024B2 (ja) *	1997-11-19	2007-02-28	富士通株式会社	音声認識装置及びそれを用いた情報処理装置
US6807537B1 (en) *	1997-12-04	2004-10-19	Microsoft Corporation	Mixtures of Bayesian networks
US6073096A (en) *	1998-02-04	2000-06-06	International Business Machines Corporation	Speaker adaptation system and method based on class-specific pre-clustering training speakers
US6148284A (en)	1998-02-23	2000-11-14	At&T Corporation	Method and apparatus for automatic speech recognition using Markov processes on curves
JP3412496B2 (ja) *	1998-02-25	2003-06-03	三菱電機株式会社	話者適応化装置と音声認識装置
US6343267B1 (en) *	1998-04-30	2002-01-29	Matsushita Electric Industrial Co., Ltd.	Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques
US6327565B1 (en) *	1998-04-30	2001-12-04	Matsushita Electric Industrial Co., Ltd.	Speaker and environment adaptation based on eigenvoices
US6263309B1 (en) *	1998-04-30	2001-07-17	Matsushita Electric Industrial Co., Ltd.	Maximum likelihood method for finding an adapted speaker model in eigenvoice space
EP0953971A1 (de) *	1998-05-01	1999-11-03	Entropic Cambridge Research Laboratory Ltd.	System und Verfahren zur Spracherkennung
WO1999059136A1 (en) *	1998-05-08	1999-11-18	T-Netix, Inc.	Channel estimation system and method for use in automatic speaker verification systems
JP3156668B2 (ja) *	1998-06-19	2001-04-16	日本電気株式会社	音声認識装置
US6269334B1 (en) *	1998-06-25	2001-07-31	International Business Machines Corporation	Nongaussian density estimation for the classification of acoustic feature vectors in speech recognition
US6269335B1 (en)	1998-08-14	2001-07-31	International Business Machines Corporation	Apparatus and methods for identifying homophones among words in a speech recognition system
US6185530B1 (en)	1998-08-14	2001-02-06	International Business Machines Corporation	Apparatus and methods for identifying potential acoustic confusibility among words in a speech recognition system
US6192337B1 (en) *	1998-08-14	2001-02-20	International Business Machines Corporation	Apparatus and methods for rejecting confusible words during training associated with a speech recognition system
US6725195B2 (en) *	1998-08-25	2004-04-20	Sri International	Method and apparatus for probabilistic recognition using small number of state clusters
US6256607B1 (en) *	1998-09-08	2001-07-03	Sri International	Method and apparatus for automatic recognition using features encoded with product-space vector quantization
US7873477B1 (en)	2001-08-21	2011-01-18	Codexis Mayflower Holdings, Llc	Method and system using systematically varied data libraries
US8457903B1 (en)	1999-01-19	2013-06-04	Codexis Mayflower Holdings, Llc	Method and/or apparatus for determining codons
US7702464B1 (en)	2001-08-21	2010-04-20	Maxygen, Inc.	Method and apparatus for codon determining
EP1022725B1 (de) *	1999-01-20	2005-04-06	Sony International (Europe) GmbH	Auswahl akustischer Modelle mittels Sprecherverifizierung
US6205426B1 (en) *	1999-01-25	2001-03-20	Matsushita Electric Industrial Co., Ltd.	Unsupervised speech model adaptation using reliable information among N-best strings
US6684186B2 (en) *	1999-01-26	2004-01-27	International Business Machines Corporation	Speaker recognition using a hierarchical speaker model tree
EP1159737B9 (de) *	1999-03-11	2004-11-03	BRITISH TELECOMMUNICATIONS public limited company	Sprecher-erkennung
US6463413B1 (en)	1999-04-20	2002-10-08	Matsushita Electrical Industrial Co., Ltd.	Speech recognition training for small hardware devices
DE19944325A1 (de) *	1999-09-15	2001-03-22	Thomson Brandt Gmbh	Verfahren und Vorrichtung zur Spracherkennung
KR100307623B1 (ko) *	1999-10-21	2001-11-02	윤종용	엠.에이.피 화자 적응 조건에서 파라미터의 분별적 추정 방법 및 장치 및 이를 각각 포함한 음성 인식 방법 및 장치
US6571208B1 (en)	1999-11-29	2003-05-27	Matsushita Electric Industrial Co., Ltd.	Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training
US6526379B1 (en)	1999-11-29	2003-02-25	Matsushita Electric Industrial Co., Ltd.	Discriminative clustering methods for automatic speech recognition
US6466908B1 (en) *	2000-01-14	2002-10-15	The United States Of America As Represented By The Secretary Of The Navy	System and method for training a class-specific hidden Markov model using a modified Baum-Welch algorithm
US6539351B1 (en) *	2000-02-04	2003-03-25	International Business Machines Corporation	High dimensional acoustic modeling via mixtures of compound gaussians with linear transforms
GB0004097D0 (en) *	2000-02-22	2000-04-12	Ibm	Management of speech technology modules in an interactive voice response system
US6789062B1 (en) *	2000-02-25	2004-09-07	Speechworks International, Inc.	Automatically retraining a speech recognition system
US6470314B1 (en) *	2000-04-06	2002-10-22	International Business Machines Corporation	Method and apparatus for rapid adapt via cumulative distribution function matching for continuous speech
US6587824B1 (en) *	2000-05-04	2003-07-01	Visteon Global Technologies, Inc.	Selective speaker adaptation for an in-vehicle speech recognition system
US7047196B2 (en)	2000-06-08	2006-05-16	Agiletv Corporation	System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery
US6751590B1 (en) *	2000-06-13	2004-06-15	International Business Machines Corporation	Method and apparatus for performing pattern-specific maximum likelihood transformations for speaker recognition
US7216077B1 (en) *	2000-09-26	2007-05-08	International Business Machines Corporation	Lattice-based unsupervised maximum likelihood linear regression for speaker adaptation
DE10047718A1 (de) *	2000-09-27	2002-04-18	Philips Corp Intellectual Pty	Verfahren zur Spracherkennung
DE10047723A1 (de) *	2000-09-27	2002-04-11	Philips Corp Intellectual Pty	Verfahren zur Ermittlung eines Eigenraums zur Darstellung einer Mehrzahl von Trainingssprechern
US7454341B1 (en) *	2000-09-30	2008-11-18	Intel Corporation	Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system
EP1197949B1 (de) *	2000-10-10	2004-01-07	Sony International (Europe) GmbH	Vermeidung von Online-Sprecherüberanpassung bei der Spracherkennung
US7003465B2 (en) *	2000-10-12	2006-02-21	Matsushita Electric Industrial Co., Ltd.	Method for speech recognition, apparatus for the same, and voice controller
US7457750B2 (en) *	2000-10-13	2008-11-25	At&T Corp.	Systems and methods for dynamic re-configurable speech recognition
US7451085B2 (en)	2000-10-13	2008-11-11	At&T Intellectual Property Ii, L.P.	System and method for providing a compensated speech recognition model for speech recognition
US7024359B2 (en) *	2001-01-31	2006-04-04	Qualcomm Incorporated	Distributed voice recognition system using acoustic feature vector modification
US8095370B2 (en) *	2001-02-16	2012-01-10	Agiletv Corporation	Dual compression voice recordation non-repudiation system
US6895376B2 (en) *	2001-05-04	2005-05-17	Matsushita Electric Industrial Co., Ltd.	Eigenvoice re-estimation technique of acoustic models for speech recognition, speaker identification and speaker verification
WO2003034281A1 (en) *	2001-10-19	2003-04-24	Intel Zao	Method and apparatus to provide a hierarchical index for a language model data structure
US7209881B2 (en) *	2001-12-20	2007-04-24	Matsushita Electric Industrial Co., Ltd.	Preparing acoustic models by sufficient statistics and noise-superimposed speech data
US7013275B2 (en) *	2001-12-28	2006-03-14	Sri International	Method and apparatus for providing a dynamic speech-driven control and remote service access system
US6687672B2 (en)	2002-03-15	2004-02-03	Matsushita Electric Industrial Co., Ltd.	Methods and apparatus for blind channel estimation based upon speech correlation structure
US7016849B2 (en) *	2002-03-25	2006-03-21	Sri International	Method and apparatus for providing speech-driven routing between spoken language applications
US20030212761A1 (en) *	2002-05-10	2003-11-13	Microsoft Corporation	Process kernel
US7716047B2 (en) *	2002-10-16	2010-05-11	Sony Corporation	System and method for an automatic set-up of speech recognition engines
US7523034B2 (en) *	2002-12-13	2009-04-21	International Business Machines Corporation	Adaptation of Compound Gaussian Mixture models
US7676366B2 (en) *	2003-01-13	2010-03-09	Art Advanced Recognition Technologies Inc.	Adaptation of symbols
US7340396B2 (en) *	2003-02-18	2008-03-04	Motorola, Inc.	Method and apparatus for providing a speaker adapted speech recognition model set
US7499857B2 (en) *	2003-05-15	2009-03-03	Microsoft Corporation	Adaptation of compressed acoustic models
EP1639579A1 (de) *	2003-07-01	2006-03-29	France Telecom	Verfahren und system zur sprachanalyse zur kompakten darstellung von sprechern
US7480615B2 (en) *	2004-01-20	2009-01-20	Microsoft Corporation	Method of speech recognition using multimodal variational inference with switching state space models
KR100612840B1 (ko) *	2004-02-18	2006-08-18	삼성전자주식회사	모델 변이 기반의 화자 클러스터링 방법, 화자 적응 방법및 이들을 이용한 음성 인식 장치
WO2006051180A1 (fr) *	2004-11-08	2006-05-18	France Telecom	Procede de construction distribuee d'un modele de reconnaissance vocale , dispositif, serveur et programmes d'ordinateur pour mettre en œuvre un tel procede
CA2594929A1 (en) *	2005-01-14	2006-07-20	Tremor Media Llc	Dynamic advertisement system and method
US7885817B2 (en)	2005-03-08	2011-02-08	Microsoft Corporation	Easy generation and automatic training of spoken dialog systems using text-to-speech
US20060206333A1 (en) *	2005-03-08	2006-09-14	Microsoft Corporation	Speaker-dependent dialog adaptation
US7707131B2 (en) *	2005-03-08	2010-04-27	Microsoft Corporation	Thompson strategy based online reinforcement learning system for action selection
US7734471B2 (en) *	2005-03-08	2010-06-08	Microsoft Corporation	Online learning for dialog systems
US20070033044A1 (en) *	2005-08-03	2007-02-08	Texas Instruments, Incorporated	System and method for creating generalized tied-mixture hidden Markov models for automatic speech recognition
US20090220926A1 (en) *	2005-09-20	2009-09-03	Gadi Rechlis	System and Method for Correcting Speech
JP2009521736A (ja) *	2005-11-07	2009-06-04	スキャンスカウト，インコーポレイテッド	リッチメディアと共に広告をレンダリングするための技術
US20070129943A1 (en) *	2005-12-06	2007-06-07	Microsoft Corporation	Speech recognition using adaptation and prior knowledge
US7539616B2 (en) *	2006-02-20	2009-05-26	Microsoft Corporation	Speaker authentication using adapted background models
US8170868B2 (en) *	2006-03-14	2012-05-01	Microsoft Corporation	Extracting lexical features for classifying native and non-native language usage style
KR100815115B1 (ko) *	2006-03-31	2008-03-20	광주과학기술원	타 언어권 화자 음성에 대한 음성 인식시스템의 성능향상을 위한 발음 특성에 기반한 음향모델 변환 방법 및이를 이용한 장치
US7877255B2 (en) *	2006-03-31	2011-01-25	Voice Signal Technologies, Inc.	Speech recognition using channel verification
US8214213B1 (en) *	2006-04-27	2012-07-03	At&T Intellectual Property Ii, L.P.	Speech recognition based on pronunciation modeling
JP5088701B2 (ja) *	2006-05-31	2012-12-05	日本電気株式会社	言語モデル学習システム、言語モデル学習方法、および言語モデル学習用プログラム
US20080004876A1 (en) *	2006-06-30	2008-01-03	Chuang He	Non-enrolled continuous dictation
US7689417B2 (en) *	2006-09-04	2010-03-30	Fortemedia, Inc.	Method, system and apparatus for improved voice recognition
US20080109391A1 (en) *	2006-11-07	2008-05-08	Scanscout, Inc.	Classifying content based on mood
WO2008137616A1 (en) *	2007-05-04	2008-11-13	Nuance Communications, Inc.	Multi-class constrained maximum likelihood linear regression
US20090006085A1 (en) *	2007-06-29	2009-01-01	Microsoft Corporation	Automated call classification and prioritization
US8577996B2 (en) *	2007-09-18	2013-11-05	Tremor Video, Inc.	Method and apparatus for tracing users of online video web sites
US8549550B2 (en)	2008-09-17	2013-10-01	Tubemogul, Inc.	Method and apparatus for passively monitoring online video viewing and viewer behavior
US8775416B2 (en) *	2008-01-09	2014-07-08	Yahoo!Inc.	Adapting a context-independent relevance function for identifying relevant search results
CN101281746A (zh) *	2008-03-17	2008-10-08	黎自奋	一个百分之百辨认率的国语单音与句子辨认方法
US20090259552A1 (en) *	2008-04-11	2009-10-15	Tremor Media, Inc.	System and method for providing advertisements from multiple ad servers using a failover mechanism
EP2161718B1 (de) *	2008-09-03	2011-08-31	Harman Becker Automotive Systems GmbH	Spracherkennung
US8645135B2 (en) *	2008-09-12	2014-02-04	Rosetta Stone, Ltd.	Method for creating a speech model
US8145488B2 (en) *	2008-09-16	2012-03-27	Microsoft Corporation	Parameter clustering and sharing for variable-parameter hidden markov models
US9612995B2 (en)	2008-09-17	2017-04-04	Adobe Systems Incorporated	Video viewer targeting based on preference similarity
US8155961B2 (en) *	2008-12-09	2012-04-10	Nokia Corporation	Adaptation of automatic speech recognition acoustic models
US9418662B2 (en) *	2009-01-21	2016-08-16	Nokia Technologies Oy	Method, apparatus and computer program product for providing compound models for speech recognition adaptation
EP2216775B1 (de) *	2009-02-05	2012-11-21	Nuance Communications, Inc.	Sprechererkennung
US9026444B2 (en)	2009-09-16	2015-05-05	At&T Intellectual Property I, L.P.	System and method for personalization of acoustic models for automatic speech recognition
US20110093783A1 (en) *	2009-10-16	2011-04-21	Charles Parra	Method and system for linking media components
CA2781299A1 (en) *	2009-11-20	2012-05-03	Tadashi Yonezaki	Methods and apparatus for optimizing advertisement allocation
WO2011071484A1 (en)	2009-12-08	2011-06-16	Nuance Communications, Inc.	Guest speaker robust adapted speech recognition
GB2480084B (en) *	2010-05-05	2012-08-08	Toshiba Res Europ Ltd	A speech processing system and method
US8725506B2 (en) *	2010-06-30	2014-05-13	Intel Corporation	Speech audio processing
US8924453B2 (en) *	2011-12-19	2014-12-30	Spansion Llc	Arithmetic logic unit architecture
US9324323B1 (en)	2012-01-13	2016-04-26	Google Inc.	Speech recognition using topic-specific language models
US8965763B1 (en)	2012-02-02	2015-02-24	Google Inc.	Discriminative language modeling for automatic speech recognition with a weak acoustic model and distributed training
US8543398B1 (en)	2012-02-29	2013-09-24	Google Inc.	Training an automatic speech recognition system using compressed word frequencies
US8775177B1 (en) *	2012-03-08	2014-07-08	Google Inc.	Speech recognition process
US8838448B2 (en) *	2012-04-05	2014-09-16	Nuance Communications, Inc.	Forced/predictable adaptation for speech recognition
US8374865B1 (en)	2012-04-26	2013-02-12	Google Inc.	Sampling training data for an automatic speech recognition system based on a benchmark classification distribution
US9406299B2 (en) *	2012-05-08	2016-08-02	Nuance Communications, Inc.	Differential acoustic model representation and linear transform-based adaptation for efficient user profile update techniques in automatic speech recognition
TWI466101B (zh) *	2012-05-18	2014-12-21	Asustek Comp Inc	語音識別方法及系統
US8571859B1 (en) *	2012-05-31	2013-10-29	Google Inc.	Multi-stage speaker adaptation
US8805684B1 (en) *	2012-05-31	2014-08-12	Google Inc.	Distributed speaker adaptation
US8880398B1 (en)	2012-07-13	2014-11-04	Google Inc.	Localized speech recognition with offload
US9946699B1 (en) *	2012-08-29	2018-04-17	Intuit Inc.	Location-based speech recognition for preparation of electronic tax return
US9123333B2 (en)	2012-09-12	2015-09-01	Google Inc.	Minimum bayesian risk methods for automatic speech recognition
US9564125B2 (en) *	2012-11-13	2017-02-07	GM Global Technology Operations LLC	Methods and systems for adapting a speech system based on user characteristics
JP6316208B2 (ja)	2012-12-18	2018-04-25	インターナショナル・ビジネス・マシーンズ・コーポレーションＩｎｔｅｒｎａｔｉｏｎａｌＢｕｓｉｎｅｓｓＭａｃｈｉｎｅｓＣｏｒｐｏｒａｔｉｏｎ	特定の話者の音声を加工するための方法、並びに、その電子装置システム及び電子装置用プログラム
US9406298B2 (en) *	2013-02-07	2016-08-02	Nuance Communications, Inc.	Method and apparatus for efficient i-vector extraction
US20140222423A1 (en) *	2013-02-07	2014-08-07	Nuance Communications, Inc.	Method and Apparatus for Efficient I-Vector Extraction
US9865266B2 (en) *	2013-02-25	2018-01-09	Nuance Communications, Inc.	Method and apparatus for automated speaker parameters adaptation in a deployed speaker verification system
EP2797078B1 (de) *	2013-04-26	2016-10-12	Agnitio S.L.	Schätzung der Zuverlässigkeit bei der Sprechererkennung
US9258425B2 (en)	2013-05-22	2016-02-09	Nuance Communications, Inc.	Method and system for speaker verification
CN108135485B (zh)	2015-10-08	2021-08-17	科蒂奥医疗公司	通过语音分析评估肺部病症
CN107564513B (zh) *	2016-06-30	2020-09-08	阿里巴巴集团控股有限公司	语音识别方法及装置
US10847177B2 (en)	2018-10-11	2020-11-24	Cordio Medical Ltd.	Estimating lung volume by speech analysis
US10803875B2 (en)	2019-02-08	2020-10-13	Nec Corporation	Speaker recognition system and method of using the same
US11011188B2 (en)	2019-03-12	2021-05-18	Cordio Medical Ltd.	Diagnostic techniques based on speech-sample alignment
US11024327B2 (en)	2019-03-12	2021-06-01	Cordio Medical Ltd.	Diagnostic techniques based on speech models
KR20210078143A (ko) *	2019-12-18	2021-06-28	엘지전자 주식회사	신규 도메인의 간투어 검출 모델 생성 방법 및 장치
US11484211B2 (en)	2020-03-03	2022-11-01	Cordio Medical Ltd.	Diagnosis of medical conditions using voice recordings and auscultation
US10841424B1 (en)	2020-05-14	2020-11-17	Bank Of America Corporation	Call monitoring and feedback reporting using machine learning
US11417342B2 (en) *	2020-06-29	2022-08-16	Cordio Medical Ltd.	Synthesizing patient-specific speech models
CN112599121B (zh) *	2020-12-03	2023-06-20	天津大学	基于辅助数据正则化的说话人自适应方法

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
JPS62231993A (ja) *	1986-03-25	1987-10-12	インタ−ナシヨナル　ビジネス　マシ−ンズ　コ−ポレ−シヨン	音声認識方法
JPS62232000A (ja) *	1986-03-25	1987-10-12	インタ−ナシヨナル・ビジネス・マシ−ンズ・コ−ポレ−シヨン	音声認識装置
US4817156A (en) *	1987-08-10	1989-03-28	International Business Machines Corporation	Rapidly training a speech recognizer to a subsequent speaker given training data of a reference speaker
JPH01102599A (ja) *	1987-10-12	1989-04-20	Internatl Business Mach Corp <Ibm>	音声認識方法
JPH0636156B2 (ja) *	1989-03-13	1994-05-11	インターナショナル・ビジネス・マシーンズ・コーポレーション	音声認識装置
US4977598A (en) *	1989-04-13	1990-12-11	Texas Instruments Incorporated	Efficient pruning algorithm for hidden markov model speech recognition
US5075896A (en) *	1989-10-25	1991-12-24	Xerox Corporation	Character and phoneme recognition based on probability clustering
US5450523A (en) *	1990-11-15	1995-09-12	Matsushita Electric Industrial Co., Ltd.	Training module for estimating mixture Gaussian densities for speech unit models in speech recognition systems
EP0515709A1 (de) *	1991-05-27	1992-12-02	International Business Machines Corporation	Verfahren und Einrichtung zur Darstellung von Segmenteinheiten zur Text-Sprache-Umsetzung
US5199077A (en) *	1991-09-19	1993-03-30	Xerox Corporation	Wordspotting for voice editing and indexing

1995
- 1995-01-20 US: application US08/375,908, publication US5864810A (en), not active, Expired - Lifetime
1996
- 1996-01-19 AT: application AT96904480T, publication ATE314718T1 (de), not active, IP Right Cessation
- 1996-01-19 DE: application DE69635655T, publication DE69635655T2 (de), not active, Expired - Lifetime
- 1996-01-19 JP: application JP52240696A, publication JP4217275B2 (ja), not active, Expired - Fee Related
- 1996-01-19 EP: application EP96904480A, publication EP0804721B1 (de), not active, Expired - Lifetime
- 1996-01-19 WO: application PCT/US1996/000762, publication WO1996022514A2 (en), active, IP Right Grant
- 1996-01-19 ES: application ES96904480T, publication ES2252752T3 (es), not active, Expired - Lifetime
- 1996-01-19 CA: application CA002210887A, publication CA2210887C (en), not active, Expired - Lifetime

Also Published As

Publication number	Publication date
WO1996022514A3 (en)	1996-09-26
EP0804721B1 (de)	2005-12-28
ATE314718T1 (de)	2006-01-15
ES2252752T3 (es)	2006-05-16
CA2210887C (en)	2009-03-31
EP0804721A2 (de)	1997-11-05
US5864810A (en)	1999-01-26
DE69635655T2 (de)	2006-09-14
JP4217275B2 (ja)	2009-01-28
WO1996022514A2 (en)	1996-07-25
CA2210887A1 (en)	1996-07-25
JPH10512686A (ja)	1998-12-02

Legal Events

Date	Code	Title	Description
2007-01-18	8364	No opposition during term of opposition

Publication	Publication Date	Title
DE69635655D1 (de)	2006-02-02	Sprecherangepasste Spracherkennung
TW347619B (en)	1998-12-11	A communication system and method using a speaker dependent time-scaling technique a method for time-scale modification of speech using a modified version of the Waveform Similarity based Overlap-Add technique (WSOLA).
SG128406A1 (en)	2007-01-30	Character recognizing and translating system and voice recognizing and translating system
EP1022722A3 (de)	2000-08-16	Sprecheradaptation auf der Basis von Stimm-Eigenvektoren
ATE203119T1 (de)	2001-07-15	Spracherkennungssystem für sprachen mit zusammengesetzten wörtern
EP0758781A3 (de)	1998-04-29	Verifizierung einer Sprachäusserung für die Erkennung einer Folge von Wörtern mittels wortbezogenem Training zur Minimierung des Verifizierungsfehlers
AU2001250579A1 (en)	2001-10-15	Discriminatively trained mixture models in continuous speech recognition
EP1054388A3 (de)	2001-11-14	Verfahren und Vorrichtung zur Bestimmung des Zustands von sprachgesteuerten Geräten
DE3275779D1 (en)	1987-04-23	Recognition of speech or speech-like sounds
ATE265083T1 (de)	2004-05-15	Verfahren und vorrichtung zum unterscheidenden training von akustischen modellen in einem spracherkennungssystem
FR2522179B1 (fr)	1986-05-02	Procede et appareil de reconnaissance de paroles permettant de reconnaitre des phonemes particuliers du signal vocal quelle que soit la personne qui parle
WO1996023298A3 (en)	1996-12-19	System and method for generating and using context dependent sub-syllable models to recognize a tonal language
AU640164B2 (en)	1993-08-19	Method of speech recognition
ATE297588T1 (de)	2005-06-15	Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
EP0852374A3 (de)	1998-11-18	Verfahren und System zur sprecherunabhängigen Erkennung von benutzerdefinierten Sätzen
WO1996000962A3 (en)	1996-02-22	Method and device for adapting a speech recognition equipment for dialectal variations in a language
SG97998A1 (en)	2003-08-20	Method and apparatus for mandarin chinese speech recognition by using initial/final phoneme similarity vector
EP0949606A3 (de)	2000-10-11	Verfahren und Vorrichtung zur Spracherkennung unter Verwendung von phonetischen Transkriptionen
DE60308904D1 (de)	2006-11-16	Verfahren und system zur markierung eines tonsignals mit metadaten
SE9303623D0 (sv)	1993-11-03	Metod och anordning vid automatisk extrahering av prosodisk information
DE3673857D1 (de)	1990-10-11	Auf einem erworbenen wissensgut basierte einrichtung und verfahren zur automatischen spracherkennung.
EP1010170A4 (de)	2008-08-20	Verfahren und system zur automatischen textunabhängigen bewertung der aussprache für den sprachunterricht
EP1316944A3 (de)	2006-06-07	System und Verfahren zur Tonsignalerkennung, und diese anwendende System und Verfahren zur Dialogsteuerung
DE60219030D1 (de)	2007-05-03	Verfahren zur mehrsprachigen Spracherkennung
SE9601811L (sv)	1997-11-03	Metod och system för tal-till-tal-omvandling med extrahering av prosodiinformation