DE69427083T2 - Spracherkennungssystem für mehrere sprachen - Google Patents

Spracherkennungssystem für mehrere sprachen

Info

Publication number: DE69427083T2
Authority: DE; Germany
Prior art keywords: voice recognition; recognition system; multiple languages; spectrum; phones
Prior art date: 1993-07-13
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Expired - Fee Related

Application number

DE69427083T

Other languages

English (en)

Other versions

DE69427083D1 (de

Inventor

Theodore Austin Bordeaux

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Individual

Original Assignee

Individual

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1993-07-13

Filing date

1994-07-12

Publication date

2001-12-06

1994-07-12 Application filed by Individual filed Critical Individual

2001-05-17 Application granted granted Critical

2001-05-17 Publication of DE69427083D1 publication Critical patent/DE69427083D1/de

2001-12-06 Publication of DE69427083T2 publication Critical patent/DE69427083T2/de

2014-07-13 Anticipated expiration legal-status Critical

Status Expired - Fee Related legal-status Critical Current

Links

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0638—Interactive procedures
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band

DE69427083T 1993-07-13 1994-07-12 Spracherkennungssystem für mehrere sprachen Expired - Fee Related DE69427083T2 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
US9074793A	1993-07-13	1993-07-13
PCT/US1994/007742 WO1995002879A1 (en)	1993-07-13	1994-07-12	Multi-language speech recognition system

Publications (2)

Publication Number	Publication Date
DE69427083D1 DE69427083D1 (de)	2001-05-17
DE69427083T2 true DE69427083T2 (de)	2001-12-06

Family

ID=22224117

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
DE69427083T Expired - Fee Related DE69427083T2 (de)	1993-07-13	1994-07-12	Spracherkennungssystem für mehrere sprachen

Country Status (8)

Country	Link
US (1)	US5758023A (de)
EP (1)	EP0708958B1 (de)
JP (1)	JPH09500223A (de)
AT (1)	ATE200590T1 (de)
AU (1)	AU682380B2 (de)
CA (1)	CA2167200A1 (de)
DE (1)	DE69427083T2 (de)
WO (1)	WO1995002879A1 (de)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US7162424B2 (en)	2001-04-26	2007-01-09	Siemens Aktiengesellschaft	Method and system for defining a sequence of sound modules for synthesis of a speech signal in a tonal language
US7949517B2 (en)	2006-12-01	2011-05-24	Deutsche Telekom Ag	Dialogue system with logical evaluation for language identification in speech recognition

Families Citing this family (74)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5790754A (en) *	1994-10-21	1998-08-04	Sensory Circuits, Inc.	Speech recognition apparatus for consumer electronic applications
DE19636739C1 (de) *	1996-09-10	1997-07-03	Siemens Ag	Verfahren zur Mehrsprachenverwendung eines hidden Markov Lautmodelles in einem Spracherkennungssystem
EP0920692B1 (de) *	1996-12-24	2003-03-26	Cellon France SAS	Verfahren zum trainieren eines spracherkennungssystems und ein gerät zum praktizieren des verfahrens, insbesondere eines tragbaren telefons
US6061646A (en) *	1997-12-18	2000-05-09	International Business Machines Corp.	Kiosk for multiple spoken languages
US6085160A (en) *	1998-07-10	2000-07-04	Lernout & Hauspie Speech Products N.V.	Language independent speech recognition
WO2000022609A1 (en) *	1998-10-13	2000-04-20	Telefonaktiebolaget Lm Ericsson (Publ)	Speech recognition and control system and telephone
US6188984B1 (en) *	1998-11-17	2001-02-13	Fonix Corporation	Method and system for syllable parsing
US6377913B1 (en) *	1999-08-13	2002-04-23	International Business Machines Corporation	Method and system for multi-client access to a dialog system
JP4292646B2 (ja)	1999-09-16	2009-07-08	株式会社デンソー	ユーザインタフェース装置、ナビゲーションシステム、情報処理装置及び記録媒体
US6963837B1 (en) *	1999-10-06	2005-11-08	Multimodal Technologies, Inc.	Attribute-based word modeling
US9076448B2 (en)	1999-11-12	2015-07-07	Nuance Communications, Inc.	Distributed real time speech recognition system
US7725307B2 (en)	1999-11-12	2010-05-25	Phoenix Solutions, Inc.	Query engine for processing voice based queries including semantic decoding
US7392185B2 (en)	1999-11-12	2008-06-24	Phoenix Solutions, Inc.	Speech based learning/training system using semantic decoding
US7050977B1 (en)	1999-11-12	2006-05-23	Phoenix Solutions, Inc.	Speech-enabled server for internet website and method
DE10018134A1 (de) *	2000-04-12	2001-10-18	Siemens Ag	Verfahren und Vorrichtung zum Bestimmen prosodischer Markierungen
JP3339579B2 (ja) *	2000-10-04	2002-10-28	株式会社鷹山	電話装置
EP1217610A1 (de) *	2000-11-28	2002-06-26	Siemens Aktiengesellschaft	Verfahren und System zur multilingualen Spracherkennung
EP1217609A3 (de) *	2000-12-22	2004-02-25	Hewlett-Packard Company	Spracherkennung
US20020095274A1 (en) *	2001-01-17	2002-07-18	Richards Alfred N.	Pool cover design verifying system
US7107215B2 (en) *	2001-04-16	2006-09-12	Sakhr Software Company	Determining a compact model to transcribe the arabic language acoustically in a well defined basic phonetic study
US20030092423A1 (en) *	2001-11-09	2003-05-15	Roger Boivin	System and method to allow law enforcement agencies to track and monitor calls made on recyclable/disposable mobile telephones
US7295982B1 (en) *	2001-11-19	2007-11-13	At&T Corp.	System and method for automatic verification of the understandability of speech
US6990445B2 (en) *	2001-12-17	2006-01-24	Xl8 Systems, Inc.	System and method for speech recognition and transcription
WO2003060877A1 (de) *	2002-01-17	2003-07-24	Siemens Aktiengesellschaft	Betriebsverfahren eines automatischen spracherkenners zur sprecherunabhängigen spracherkennung von worten aus verschiedenen sprachen und automatischer spracherkenner
US7286993B2 (en) *	2002-01-31	2007-10-23	Product Discovery, Inc.	Holographic speech translation system and method
US20030208451A1 (en) *	2002-05-03	2003-11-06	Jim-Shih Liaw	Artificial neural systems with dynamic synapses
US7010488B2 (en) *	2002-05-09	2006-03-07	Oregon Health & Science University	System and method for compressing concatenative acoustic inventories for speech synthesis
US20040030555A1 (en) *	2002-08-12	2004-02-12	Oregon Health & Science University	System and method for concatenating acoustic contours for speech synthesis
DE10256935A1 (de) *	2002-12-05	2004-07-01	Siemens Ag	Auswahl der Benutzersprache an einem rein akustisch gesteuerten Telefon
KR100486735B1 (ko) *	2003-02-28	2005-05-03	삼성전자주식회사	최적구획 분류신경망 구성방법과 최적구획 분류신경망을이용한 자동 레이블링방법 및 장치
US7321852B2 (en) *	2003-10-28	2008-01-22	International Business Machines Corporation	System and method for transcribing audio files of various languages
US8036893B2 (en) *	2004-07-22	2011-10-11	Nuance Communications, Inc.	Method and system for identifying and correcting accent-induced speech recognition difficulties
US7406408B1 (en)	2004-08-24	2008-07-29	The United States Of America As Represented By The Director, National Security Agency	Method of recognizing phones in speech of any language
US7430503B1 (en)	2004-08-24	2008-09-30	The United States Of America As Represented By The Director, National Security Agency	Method of combining corpora to achieve consistency in phonetic labeling
US20060122834A1 (en) *	2004-12-03	2006-06-08	Bennett Ian M	Emotion detection device & method for use in distributed systems
US20070038455A1 (en) *	2005-08-09	2007-02-15	Murzina Marina V	Accent detection and correction system
US8032372B1 (en) *	2005-09-13	2011-10-04	Escription, Inc.	Dictation selection
US7970613B2 (en)	2005-11-12	2011-06-28	Sony Computer Entertainment Inc.	Method and system for Gaussian probability data bit reduction and computation
US20070138267A1 (en) *	2005-12-21	2007-06-21	Singer-Harter Debra L	Public terminal-based translator
US7778831B2 (en) *	2006-02-21	2010-08-17	Sony Computer Entertainment Inc.	Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
US8010358B2 (en) *	2006-02-21	2011-08-30	Sony Computer Entertainment Inc.	Voice recognition with parallel gender and age normalization
US7822605B2 (en) *	2006-10-19	2010-10-26	Nice Systems Ltd.	Method and apparatus for large population speaker identification in telephone interactions
US20080126093A1 (en) *	2006-11-28	2008-05-29	Nokia Corporation	Method, Apparatus and Computer Program Product for Providing a Language Based Interactive Multimedia System
US20100064234A1 (en) *	2007-03-09	2010-03-11	Ghost, Inc.	System and Method for Browser within a Web Site and Proxy Server
CN101578659B (zh) *	2007-05-14	2012-01-18	松下电器产业株式会社	音质转换装置及音质转换方法
KR100925479B1 (ko) *	2007-09-19	2009-11-06	한국전자통신연구원	음성 인식 방법 및 장치
US8032384B2 (en) *	2008-03-14	2011-10-04	Jay S Rylander	Hand held language translation and learning device
US9418662B2 (en) *	2009-01-21	2016-08-16	Nokia Technologies Oy	Method, apparatus and computer program product for providing compound models for speech recognition adaptation
US8442833B2 (en) *	2009-02-17	2013-05-14	Sony Computer Entertainment Inc.	Speech processing with source location estimation using signals from two or more microphones
US8788256B2 (en) *	2009-02-17	2014-07-22	Sony Computer Entertainment Inc.	Multiple language voice recognition
US8442829B2 (en) *	2009-02-17	2013-05-14	Sony Computer Entertainment Inc.	Automatic computation streaming partition for voice recognition on multiple processors with limited memory
WO2011037562A1 (en) *	2009-09-23	2011-03-31	Nuance Communications, Inc.	Probabilistic representation of acoustic segments
WO2011150969A1 (en) *	2010-06-02	2011-12-08	Naxos Finance Sa	Apparatus for image data recording and reproducing, and method thereof
FI20106048A0 (fi) *	2010-10-12	2010-10-12	Annu Marttila	Kieliprofiloinnin menetelmä
US8914242B2 (en)	2011-07-21	2014-12-16	Thermo Ramsey, Inc.	Signal processing in guided wave cutoff spectroscopy
US8442825B1 (en)	2011-08-16	2013-05-14	The United States Of America As Represented By The Director, National Security Agency	Biomimetic voice identifier
US9153235B2 (en)	2012-04-09	2015-10-06	Sony Computer Entertainment Inc.	Text dependent speaker recognition with long-term feature based on functional data analysis
CN103631802B (zh) *	2012-08-24	2015-05-20	腾讯科技（深圳）有限公司	歌曲信息检索方法、装置及相应的服务器
EP2736042A1 (de) *	2012-11-23	2014-05-28	Samsung Electronics Co., Ltd	Vorrichtung und Verfahren zur Erstellung eines mehrsprachigen akustischen Modells und computerlesbares Aufzeichnungsmedium für Speicherprogramm zur Ausführung des Verfahrens
US10510264B2 (en)	2013-03-21	2019-12-17	Neuron Fuel, Inc.	Systems and methods for customized lesson creation and application
US9595205B2 (en)	2012-12-18	2017-03-14	Neuron Fuel, Inc.	Systems and methods for goal-based programming instruction
US8800113B1 (en) *	2013-03-15	2014-08-12	Blackstone Medical, Inc.	Rigid modular connector
US9953630B1 (en) *	2013-05-31	2018-04-24	Amazon Technologies, Inc.	Language recognition for device settings
KR102084646B1 (ko) *	2013-07-04	2020-04-14	삼성전자주식회사	음성 인식 장치 및 음성 인식 방법
CN104143328B (zh) *	2013-08-15	2015-11-25	腾讯科技（深圳）有限公司	一种关键词检测方法和装置
US9589564B2 (en)	2014-02-05	2017-03-07	Google Inc.	Multiple speech locale-specific hotword classifiers for selection of a speech locale
US9135911B2 (en) *	2014-02-07	2015-09-15	NexGen Flight LLC	Automated generation of phonemic lexicon for voice activated cockpit management systems
WO2016039751A1 (en) *	2014-09-11	2016-03-17	Nuance Communications, Inc.	Method for scoring in an automatic speech recognition system
US20170011735A1 (en) *	2015-07-10	2017-01-12	Electronics And Telecommunications Research Institute	Speech recognition system and method
US10614826B2 (en)	2017-05-24	2020-04-07	Modulate, Inc.	System and method for voice-to-voice conversion
CN112364658A (zh)	2019-07-24	2021-02-12	阿里巴巴集团控股有限公司	翻译以及语音识别方法、装置、设备
KR102303785B1 (ko) *	2019-08-05	2021-09-23	엘지전자 주식회사	로봇의 언어를 설정하는 인공 지능 서버 및 그 방법
WO2021030759A1 (en)	2019-08-14	2021-02-18	Modulate, Inc.	Generation and detection of watermark for real-time voice conversion
US11551695B1 (en) *	2020-05-13	2023-01-10	Amazon Technologies, Inc.	Model training system for custom speech-to-text models

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US4536844A (en) *	1983-04-26	1985-08-20	Fairchild Camera And Instrument Corporation	Method and apparatus for simulating aural response information
US4882757A (en) *	1986-04-25	1989-11-21	Texas Instruments Incorporated	Speech recognition system
JP2717652B2 (ja) *	1986-06-02	1998-02-18	モトローラ・インコーポレーテッド	連続音声認識システム
US4852170A (en) *	1986-12-18	1989-07-25	R & D Associates	Real time computer speech recognition system
US4905285A (en) *	1987-04-03	1990-02-27	American Telephone And Telegraph Company, At&T Bell Laboratories	Analysis arrangement based on a model of human neural responses
US4910784A (en) *	1987-07-30	1990-03-20	Texas Instruments Incorporated	Low cost speech recognition system and method
US4984177A (en) *	1988-02-05	1991-01-08	Advanced Products And Technologies, Inc.	Voice language translator
JP2764277B2 (ja) *	1988-09-07	1998-06-11	株式会社日立製作所	音声認識装置
US4937870A (en) *	1988-11-14	1990-06-26	American Telephone And Telegraph Company	Speech recognition arrangement
US5033087A (en) *	1989-03-14	1991-07-16	International Business Machines Corp.	Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system
US5278911A (en) *	1989-05-18	1994-01-11	Smiths Industries Public Limited Company	Speech recognition using a neural net
US5293584A (en) *	1992-05-21	1994-03-08	International Business Machines Corporation	Speech recognition system for natural language translation

1994
- 1994-07-12 AT AT94923413T patent/ATE200590T1/de active
- 1994-07-12 WO PCT/US1994/007742 patent/WO1995002879A1/en active IP Right Grant
- 1994-07-12 AU AU73282/94A patent/AU682380B2/en not_active Ceased
- 1994-07-12 CA CA002167200A patent/CA2167200A1/en not_active Abandoned
- 1994-07-12 JP JP7504646A patent/JPH09500223A/ja not_active Withdrawn
- 1994-07-12 EP EP94923413A patent/EP0708958B1/de not_active Expired - Lifetime
- 1994-07-12 DE DE69427083T patent/DE69427083T2/de not_active Expired - Fee Related
1995
- 1995-09-21 US US08/532,867 patent/US5758023A/en not_active Expired - Fee Related

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US7162424B2 (en)	2001-04-26	2007-01-09	Siemens Aktiengesellschaft	Method and system for defining a sequence of sound modules for synthesis of a speech signal in a tonal language
US7949517B2 (en)	2006-12-01	2011-05-24	Deutsche Telekom Ag	Dialogue system with logical evaluation for language identification in speech recognition

Also Published As

Publication number	Publication date
AU7328294A (en)	1995-02-13
EP0708958B1 (de)	2001-04-11
CA2167200A1 (en)	1995-01-26
WO1995002879A1 (en)	1995-01-26
JPH09500223A (ja)	1997-01-07
DE69427083D1 (de)	2001-05-17
EP0708958A4 (de)	1997-10-15
EP0708958A1 (de)	1996-05-01
AU682380B2 (en)	1997-10-02
US5758023A (en)	1998-05-26
ATE200590T1 (de)	2001-04-15

Legal Events

Date	Code	Title	Description
2004-06-09	8339	Ceased/non-payment of the annual fee

Publication	Publication Date	Title
ATE200590T1 (de)	2001-04-15	Spracherkennungssystem für mehrere sprachen
Juang et al.	2000	Automatic recognition and understanding of spoken language-a first step toward natural human-machine communication
Jelinek et al.	1977	Perplexity—a measure of the difficulty of speech recognition tasks
KR20200023456A (ko)	2020-03-04	발언 분류기
US7319959B1 (en)	2008-01-15	Multi-source phoneme classification for noise-robust automatic speech recognition
TW347619B (en)	1998-12-11	A communication system and method using a speaker dependent time-scaling technique a method for time-scale modification of speech using a modified version of the Waveform Similarity based Overlap-Add technique (WSOLA).
FR2522179B1 (fr)	1986-05-02	Procede et appareil de reconnaissance de paroles permettant de reconnaitre des phonemes particuliers du signal vocal quelle que soit la personne qui parle
DE69330427T2 (de)	2002-05-23	Spracherkennungssystem für sprachen mit zusammengesetzten wörtern
RU2466468C1 (ru)	2012-11-10	Система и способ распознавания речи
ATE363120T1 (de)	2007-06-15	Audio-dialogsystem und sprachgesteuertes browsing-verfahren
Hermansky et al.	2013	Perceptual properties of current speech recognition technology
US20160210982A1 (en)	2016-07-21	Method and Apparatus to Enhance Speech Understanding
US10143027B1 (en)	2018-11-27	Device selection for routing of communications
JPH10504404A (ja)	1998-04-28	音声認識のための方法および装置
JP6599828B2 (ja)	2019-10-30	音処理方法、音処理装置、及びプログラム
CN113488026A (zh)	2021-10-08	基于语用信息的语音理解模型生成方法和智能语音交互方法
Price et al.	2014	Combining linguistic with statistical methods in modeling prosody
Pols	1999	Flexible, robust, and efficient human speech processing versus present-day speech technology
KR20210000802A (ko)	2021-01-06	인공지능 음성 인식 처리 방법 및 시스템
US11172527B2 (en)	2021-11-09	Routing of communications to a device
Prasangini et al.	2018	Sinhala speech to sinhala unicode text conversion for disaster relief facilitation in sri lanka
Berger et al.	2020	Speech Activity Detection for Deaf People: Evaluation on the Developed Smart Solution Prototype
Naveena et al.	2017	Extraction of Prosodic Features to Automatically Recognize Tamil Dialects
Tomas et al.	2011	Determination of spectral parameters of speech signal by Goertzel algorithm
JPS6331798B2 (de)	1988-06-27

DE69427083T2 - Spracherkennungssystem für mehrere sprachen - Google Patents

Info

Links

Classifications

Applications Claiming Priority (2)

Publications (2)

Family

ID=22224117

Family Applications (1)

Country Status (8)

Cited By (2)

Families Citing this family (74)

Family Cites Families (12)

Cited By (2)

Also Published As

Similar Documents

Legal Events