DE69427083T2 - Spracherkennungssystem für mehrere sprachen - Google Patents

Spracherkennungssystem für mehrere sprachen

Info

Publication number
DE69427083T2
DE69427083T2 DE69427083T DE69427083T DE69427083T2 DE 69427083 T2 DE69427083 T2 DE 69427083T2 DE 69427083 T DE69427083 T DE 69427083T DE 69427083 T DE69427083 T DE 69427083T DE 69427083 T2 DE69427083 T2 DE 69427083T2
Authority
DE
Germany
Prior art keywords
voice recognition
recognition system
multiple languages
spectrum
phones
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
DE69427083T
Other languages
English (en)
Other versions
DE69427083D1 (de
Inventor
Theodore Austin Bordeaux
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Application granted granted Critical
Publication of DE69427083D1 publication Critical patent/DE69427083D1/de
Publication of DE69427083T2 publication Critical patent/DE69427083T2/de
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/025Phonemes, fenemes or fenones being the recognition units
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0638Interactive procedures
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
DE69427083T 1993-07-13 1994-07-12 Spracherkennungssystem für mehrere sprachen Expired - Fee Related DE69427083T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US9074793A 1993-07-13 1993-07-13
PCT/US1994/007742 WO1995002879A1 (en) 1993-07-13 1994-07-12 Multi-language speech recognition system

Publications (2)

Publication Number Publication Date
DE69427083D1 DE69427083D1 (de) 2001-05-17
DE69427083T2 true DE69427083T2 (de) 2001-12-06

Family

ID=22224117

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69427083T Expired - Fee Related DE69427083T2 (de) 1993-07-13 1994-07-12 Spracherkennungssystem für mehrere sprachen

Country Status (8)

Country Link
US (1) US5758023A (de)
EP (1) EP0708958B1 (de)
JP (1) JPH09500223A (de)
AT (1) ATE200590T1 (de)
AU (1) AU682380B2 (de)
CA (1) CA2167200A1 (de)
DE (1) DE69427083T2 (de)
WO (1) WO1995002879A1 (de)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7162424B2 (en) 2001-04-26 2007-01-09 Siemens Aktiengesellschaft Method and system for defining a sequence of sound modules for synthesis of a speech signal in a tonal language
US7949517B2 (en) 2006-12-01 2011-05-24 Deutsche Telekom Ag Dialogue system with logical evaluation for language identification in speech recognition

Families Citing this family (74)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5790754A (en) * 1994-10-21 1998-08-04 Sensory Circuits, Inc. Speech recognition apparatus for consumer electronic applications
DE19636739C1 (de) * 1996-09-10 1997-07-03 Siemens Ag Verfahren zur Mehrsprachenverwendung eines hidden Markov Lautmodelles in einem Spracherkennungssystem
EP0920692B1 (de) * 1996-12-24 2003-03-26 Cellon France SAS Verfahren zum trainieren eines spracherkennungssystems und ein gerät zum praktizieren des verfahrens, insbesondere eines tragbaren telefons
US6061646A (en) * 1997-12-18 2000-05-09 International Business Machines Corp. Kiosk for multiple spoken languages
US6085160A (en) * 1998-07-10 2000-07-04 Lernout & Hauspie Speech Products N.V. Language independent speech recognition
WO2000022609A1 (en) * 1998-10-13 2000-04-20 Telefonaktiebolaget Lm Ericsson (Publ) Speech recognition and control system and telephone
US6188984B1 (en) * 1998-11-17 2001-02-13 Fonix Corporation Method and system for syllable parsing
US6377913B1 (en) * 1999-08-13 2002-04-23 International Business Machines Corporation Method and system for multi-client access to a dialog system
JP4292646B2 (ja) 1999-09-16 2009-07-08 株式会社デンソー ユーザインタフェース装置、ナビゲーションシステム、情報処理装置及び記録媒体
US6963837B1 (en) * 1999-10-06 2005-11-08 Multimodal Technologies, Inc. Attribute-based word modeling
US9076448B2 (en) 1999-11-12 2015-07-07 Nuance Communications, Inc. Distributed real time speech recognition system
US7725307B2 (en) 1999-11-12 2010-05-25 Phoenix Solutions, Inc. Query engine for processing voice based queries including semantic decoding
US7392185B2 (en) 1999-11-12 2008-06-24 Phoenix Solutions, Inc. Speech based learning/training system using semantic decoding
US7050977B1 (en) 1999-11-12 2006-05-23 Phoenix Solutions, Inc. Speech-enabled server for internet website and method
DE10018134A1 (de) * 2000-04-12 2001-10-18 Siemens Ag Verfahren und Vorrichtung zum Bestimmen prosodischer Markierungen
JP3339579B2 (ja) * 2000-10-04 2002-10-28 株式会社鷹山 電話装置
EP1217610A1 (de) * 2000-11-28 2002-06-26 Siemens Aktiengesellschaft Verfahren und System zur multilingualen Spracherkennung
EP1217609A3 (de) * 2000-12-22 2004-02-25 Hewlett-Packard Company Spracherkennung
US20020095274A1 (en) * 2001-01-17 2002-07-18 Richards Alfred N. Pool cover design verifying system
US7107215B2 (en) * 2001-04-16 2006-09-12 Sakhr Software Company Determining a compact model to transcribe the arabic language acoustically in a well defined basic phonetic study
US20030092423A1 (en) * 2001-11-09 2003-05-15 Roger Boivin System and method to allow law enforcement agencies to track and monitor calls made on recyclable/disposable mobile telephones
US7295982B1 (en) * 2001-11-19 2007-11-13 At&T Corp. System and method for automatic verification of the understandability of speech
US6990445B2 (en) * 2001-12-17 2006-01-24 Xl8 Systems, Inc. System and method for speech recognition and transcription
WO2003060877A1 (de) * 2002-01-17 2003-07-24 Siemens Aktiengesellschaft Betriebsverfahren eines automatischen spracherkenners zur sprecherunabhängigen spracherkennung von worten aus verschiedenen sprachen und automatischer spracherkenner
US7286993B2 (en) * 2002-01-31 2007-10-23 Product Discovery, Inc. Holographic speech translation system and method
US20030208451A1 (en) * 2002-05-03 2003-11-06 Jim-Shih Liaw Artificial neural systems with dynamic synapses
US7010488B2 (en) * 2002-05-09 2006-03-07 Oregon Health & Science University System and method for compressing concatenative acoustic inventories for speech synthesis
US20040030555A1 (en) * 2002-08-12 2004-02-12 Oregon Health & Science University System and method for concatenating acoustic contours for speech synthesis
DE10256935A1 (de) * 2002-12-05 2004-07-01 Siemens Ag Auswahl der Benutzersprache an einem rein akustisch gesteuerten Telefon
KR100486735B1 (ko) * 2003-02-28 2005-05-03 삼성전자주식회사 최적구획 분류신경망 구성방법과 최적구획 분류신경망을이용한 자동 레이블링방법 및 장치
US7321852B2 (en) * 2003-10-28 2008-01-22 International Business Machines Corporation System and method for transcribing audio files of various languages
US8036893B2 (en) * 2004-07-22 2011-10-11 Nuance Communications, Inc. Method and system for identifying and correcting accent-induced speech recognition difficulties
US7406408B1 (en) 2004-08-24 2008-07-29 The United States Of America As Represented By The Director, National Security Agency Method of recognizing phones in speech of any language
US7430503B1 (en) 2004-08-24 2008-09-30 The United States Of America As Represented By The Director, National Security Agency Method of combining corpora to achieve consistency in phonetic labeling
US20060122834A1 (en) * 2004-12-03 2006-06-08 Bennett Ian M Emotion detection device & method for use in distributed systems
US20070038455A1 (en) * 2005-08-09 2007-02-15 Murzina Marina V Accent detection and correction system
US8032372B1 (en) * 2005-09-13 2011-10-04 Escription, Inc. Dictation selection
US7970613B2 (en) 2005-11-12 2011-06-28 Sony Computer Entertainment Inc. Method and system for Gaussian probability data bit reduction and computation
US20070138267A1 (en) * 2005-12-21 2007-06-21 Singer-Harter Debra L Public terminal-based translator
US7778831B2 (en) * 2006-02-21 2010-08-17 Sony Computer Entertainment Inc. Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
US8010358B2 (en) * 2006-02-21 2011-08-30 Sony Computer Entertainment Inc. Voice recognition with parallel gender and age normalization
US7822605B2 (en) * 2006-10-19 2010-10-26 Nice Systems Ltd. Method and apparatus for large population speaker identification in telephone interactions
US20080126093A1 (en) * 2006-11-28 2008-05-29 Nokia Corporation Method, Apparatus and Computer Program Product for Providing a Language Based Interactive Multimedia System
US20100064234A1 (en) * 2007-03-09 2010-03-11 Ghost, Inc. System and Method for Browser within a Web Site and Proxy Server
CN101578659B (zh) * 2007-05-14 2012-01-18 松下电器产业株式会社 音质转换装置及音质转换方法
KR100925479B1 (ko) * 2007-09-19 2009-11-06 한국전자통신연구원 음성 인식 방법 및 장치
US8032384B2 (en) * 2008-03-14 2011-10-04 Jay S Rylander Hand held language translation and learning device
US9418662B2 (en) * 2009-01-21 2016-08-16 Nokia Technologies Oy Method, apparatus and computer program product for providing compound models for speech recognition adaptation
US8442833B2 (en) * 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Speech processing with source location estimation using signals from two or more microphones
US8788256B2 (en) * 2009-02-17 2014-07-22 Sony Computer Entertainment Inc. Multiple language voice recognition
US8442829B2 (en) * 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Automatic computation streaming partition for voice recognition on multiple processors with limited memory
WO2011037562A1 (en) * 2009-09-23 2011-03-31 Nuance Communications, Inc. Probabilistic representation of acoustic segments
WO2011150969A1 (en) * 2010-06-02 2011-12-08 Naxos Finance Sa Apparatus for image data recording and reproducing, and method thereof
FI20106048A0 (fi) * 2010-10-12 2010-10-12 Annu Marttila Kieliprofiloinnin menetelmä
US8914242B2 (en) 2011-07-21 2014-12-16 Thermo Ramsey, Inc. Signal processing in guided wave cutoff spectroscopy
US8442825B1 (en) 2011-08-16 2013-05-14 The United States Of America As Represented By The Director, National Security Agency Biomimetic voice identifier
US9153235B2 (en) 2012-04-09 2015-10-06 Sony Computer Entertainment Inc. Text dependent speaker recognition with long-term feature based on functional data analysis
CN103631802B (zh) * 2012-08-24 2015-05-20 腾讯科技(深圳)有限公司 歌曲信息检索方法、装置及相应的服务器
EP2736042A1 (de) * 2012-11-23 2014-05-28 Samsung Electronics Co., Ltd Vorrichtung und Verfahren zur Erstellung eines mehrsprachigen akustischen Modells und computerlesbares Aufzeichnungsmedium für Speicherprogramm zur Ausführung des Verfahrens
US10510264B2 (en) 2013-03-21 2019-12-17 Neuron Fuel, Inc. Systems and methods for customized lesson creation and application
US9595205B2 (en) 2012-12-18 2017-03-14 Neuron Fuel, Inc. Systems and methods for goal-based programming instruction
US8800113B1 (en) * 2013-03-15 2014-08-12 Blackstone Medical, Inc. Rigid modular connector
US9953630B1 (en) * 2013-05-31 2018-04-24 Amazon Technologies, Inc. Language recognition for device settings
KR102084646B1 (ko) * 2013-07-04 2020-04-14 삼성전자주식회사 음성 인식 장치 및 음성 인식 방법
CN104143328B (zh) * 2013-08-15 2015-11-25 腾讯科技(深圳)有限公司 一种关键词检测方法和装置
US9589564B2 (en) 2014-02-05 2017-03-07 Google Inc. Multiple speech locale-specific hotword classifiers for selection of a speech locale
US9135911B2 (en) * 2014-02-07 2015-09-15 NexGen Flight LLC Automated generation of phonemic lexicon for voice activated cockpit management systems
WO2016039751A1 (en) * 2014-09-11 2016-03-17 Nuance Communications, Inc. Method for scoring in an automatic speech recognition system
US20170011735A1 (en) * 2015-07-10 2017-01-12 Electronics And Telecommunications Research Institute Speech recognition system and method
US10614826B2 (en) 2017-05-24 2020-04-07 Modulate, Inc. System and method for voice-to-voice conversion
CN112364658A (zh) 2019-07-24 2021-02-12 阿里巴巴集团控股有限公司 翻译以及语音识别方法、装置、设备
KR102303785B1 (ko) * 2019-08-05 2021-09-23 엘지전자 주식회사 로봇의 언어를 설정하는 인공 지능 서버 및 그 방법
WO2021030759A1 (en) 2019-08-14 2021-02-18 Modulate, Inc. Generation and detection of watermark for real-time voice conversion
US11551695B1 (en) * 2020-05-13 2023-01-10 Amazon Technologies, Inc. Model training system for custom speech-to-text models

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4536844A (en) * 1983-04-26 1985-08-20 Fairchild Camera And Instrument Corporation Method and apparatus for simulating aural response information
US4882757A (en) * 1986-04-25 1989-11-21 Texas Instruments Incorporated Speech recognition system
JP2717652B2 (ja) * 1986-06-02 1998-02-18 モトローラ・インコーポレーテッド 連続音声認識システム
US4852170A (en) * 1986-12-18 1989-07-25 R & D Associates Real time computer speech recognition system
US4905285A (en) * 1987-04-03 1990-02-27 American Telephone And Telegraph Company, At&T Bell Laboratories Analysis arrangement based on a model of human neural responses
US4910784A (en) * 1987-07-30 1990-03-20 Texas Instruments Incorporated Low cost speech recognition system and method
US4984177A (en) * 1988-02-05 1991-01-08 Advanced Products And Technologies, Inc. Voice language translator
JP2764277B2 (ja) * 1988-09-07 1998-06-11 株式会社日立製作所 音声認識装置
US4937870A (en) * 1988-11-14 1990-06-26 American Telephone And Telegraph Company Speech recognition arrangement
US5033087A (en) * 1989-03-14 1991-07-16 International Business Machines Corp. Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system
US5278911A (en) * 1989-05-18 1994-01-11 Smiths Industries Public Limited Company Speech recognition using a neural net
US5293584A (en) * 1992-05-21 1994-03-08 International Business Machines Corporation Speech recognition system for natural language translation

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7162424B2 (en) 2001-04-26 2007-01-09 Siemens Aktiengesellschaft Method and system for defining a sequence of sound modules for synthesis of a speech signal in a tonal language
US7949517B2 (en) 2006-12-01 2011-05-24 Deutsche Telekom Ag Dialogue system with logical evaluation for language identification in speech recognition

Also Published As

Publication number Publication date
AU7328294A (en) 1995-02-13
EP0708958B1 (de) 2001-04-11
CA2167200A1 (en) 1995-01-26
WO1995002879A1 (en) 1995-01-26
JPH09500223A (ja) 1997-01-07
DE69427083D1 (de) 2001-05-17
EP0708958A4 (de) 1997-10-15
EP0708958A1 (de) 1996-05-01
AU682380B2 (en) 1997-10-02
US5758023A (en) 1998-05-26
ATE200590T1 (de) 2001-04-15

Similar Documents

Publication Publication Date Title
ATE200590T1 (de) Spracherkennungssystem für mehrere sprachen
Juang et al. Automatic recognition and understanding of spoken language-a first step toward natural human-machine communication
Jelinek et al. Perplexity—a measure of the difficulty of speech recognition tasks
KR20200023456A (ko) 발언 분류기
US7319959B1 (en) Multi-source phoneme classification for noise-robust automatic speech recognition
TW347619B (en) A communication system and method using a speaker dependent time-scaling technique a method for time-scale modification of speech using a modified version of the Waveform Similarity based Overlap-Add technique (WSOLA).
FR2522179B1 (fr) Procede et appareil de reconnaissance de paroles permettant de reconnaitre des phonemes particuliers du signal vocal quelle que soit la personne qui parle
DE69330427T2 (de) Spracherkennungssystem für sprachen mit zusammengesetzten wörtern
RU2466468C1 (ru) Система и способ распознавания речи
ATE363120T1 (de) Audio-dialogsystem und sprachgesteuertes browsing-verfahren
Hermansky et al. Perceptual properties of current speech recognition technology
US20160210982A1 (en) Method and Apparatus to Enhance Speech Understanding
US10143027B1 (en) Device selection for routing of communications
JPH10504404A (ja) 音声認識のための方法および装置
JP6599828B2 (ja) 音処理方法、音処理装置、及びプログラム
CN113488026A (zh) 基于语用信息的语音理解模型生成方法和智能语音交互方法
Price et al. Combining linguistic with statistical methods in modeling prosody
Pols Flexible, robust, and efficient human speech processing versus present-day speech technology
KR20210000802A (ko) 인공지능 음성 인식 처리 방법 및 시스템
US11172527B2 (en) Routing of communications to a device
Prasangini et al. Sinhala speech to sinhala unicode text conversion for disaster relief facilitation in sri lanka
Berger et al. Speech Activity Detection for Deaf People: Evaluation on the Developed Smart Solution Prototype
Naveena et al. Extraction of Prosodic Features to Automatically Recognize Tamil Dialects
Tomas et al. Determination of spectral parameters of speech signal by Goertzel algorithm
JPS6331798B2 (de)

Legal Events

Date Code Title Description
8339 Ceased/non-payment of the annual fee