DE602004002312D1 - Method and apparatus for formant tracking using a residual model

Method and apparatus for formant tracking using a residual model

Info

Publication number
DE602004002312D1
Authority
DE
Germany
Prior art keywords
formants
identified
residual signal
signal model
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE602004002312T
Other languages
English (en)
Other versions
DE602004002312T2 (de)
Inventor
Issam Bazzi
Li Deng
Alejandro Acero
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp
Publication of DE602004002312D1
Application granted
Publication of DE602004002312T2
Anticipated expiration
Legal status: Expired - Lifetime (current)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
DE602004002312T 2003-04-01 2004-04-01 Method and apparatus for formant tracking using a residual model Expired - Lifetime DE602004002312T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US404411 1999-09-23
US10/404,411 US7424423B2 (en) 2003-04-01 2003-04-01 Method and apparatus for formant tracking using a residual model

Publications (2)

Publication Number Publication Date
DE602004002312D1 (de) 2006-10-26
DE602004002312T2 (de) 2006-12-28

Family

ID=32850595

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602004002312T Expired - Lifetime DE602004002312T2 (de) 2003-04-01 2004-04-01 Method and apparatus for formant tracking using a residual model

Country Status (7)

Country Link
US (1) US7424423B2 (de)
EP (1) EP1465153B1 (de)
JP (1) JP4718789B2 (de)
KR (1) KR101026632B1 (de)
CN (1) CN100562926C (de)
AT (1) ATE339756T1 (de)
DE (1) DE602004002312T2 (de)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7475011B2 (en) * 2004-08-25 2009-01-06 Microsoft Corporation Greedy algorithm for identifying values for vocal tract resonance vectors
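KR100634526B1 (ko) * 2004-11-24 2006-10-16 Samsung Electronics Co., Ltd. Formant tracking apparatus and method
KR100717625B1 (ko) 2006-02-10 2007-05-15 Samsung Electronics Co., Ltd. Method and apparatus for estimating formant frequency in speech recognition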
US8010356B2 (en) 2006-02-17 2011-08-30 Microsoft Corporation Parameter learning in a hidden trajectory model
US7877255B2 (en) * 2006-03-31 2011-01-25 Voice Signal Technologies, Inc. Speech recognition using channel verification
DE602006008158D1 (de) * 2006-09-29 2009-09-10 Honda Res Inst Europe Gmbh Joint estimation of formant trajectories using Bayesian techniques and adaptive segmentation
CN101067929B (zh) * 2007-06-05 2011-04-20 Nanjing University Method for extracting speech formant tracks using formant enhancement
EP2232700B1 (de) 2007-12-21 2014-08-13 Dts Llc System for adjusting the perceived loudness of audio signals
US8538042B2 (en) 2009-08-11 2013-09-17 Dts Llc System for increasing perceived loudness of speakers
US8204742B2 (en) 2009-09-14 2012-06-19 Srs Labs, Inc. System for processing an audio signal to enhance speech intelligibility
US20120078625A1 (en) * 2010-09-23 2012-03-29 Waveform Communications, Llc Waveform analysis of speech
US20140207456A1 (en) * 2010-09-23 2014-07-24 Waveform Communications, Llc Waveform analysis of speech
KR102060208B1 (ko) 2011-07-29 2019-12-27 DTS LLC Adaptive voice intelligibility processor
US9312829B2 (en) 2012-04-12 2016-04-12 Dts Llc System for adjusting loudness of audio signals in real time
US9728200B2 (en) 2013-01-29 2017-08-08 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for adaptive formant sharpening in linear prediction coding
US9520141B2 (en) * 2013-02-28 2016-12-13 Google Inc. Keyboard typing detection and suppression
US9805714B2 (en) * 2016-03-22 2017-10-31 Asustek Computer Inc. Directional keyword verification method applicable to electronic device and electronic device using the same

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3649765A (en) * 1969-10-29 1972-03-14 Bell Telephone Labor Inc Speech analyzer-synthesizer system employing improved formant extractor
JPH0785200B2 (ja) * 1986-11-13 1995-09-13 NEC Corporation Method for creating spectral standard patterns
US5799276A (en) * 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US5729694A (en) * 1996-02-06 1998-03-17 The Regents Of The University Of California Speech coding, reconstruction and recognition using acoustics and electromagnetic waves
US6064958A (en) 1996-09-20 2000-05-16 Nippon Telegraph And Telephone Corporation Pattern recognition scheme using probabilistic models based on mixtures distribution of discrete distribution
US5815090A (en) * 1996-10-31 1998-09-29 University Of Florida Research Foundation, Inc. Remote monitoring system for detecting termites
JP2986792B2 (ja) * 1998-03-16 1999-12-06 ATR Interpreting Telecommunications Research Laboratories Speaker normalization processing apparatus and speech recognition apparatus
US6980952B1 (en) * 1998-08-15 2005-12-27 Texas Instruments Incorporated Source normalization training for HMM modeling of speech
US6502066B2 (en) 1998-11-24 2002-12-31 Microsoft Corporation System for generating formant tracks by modifying formants synthesized from speech units
US20010044719A1 (en) * 1999-07-02 2001-11-22 Mitsubishi Electric Research Laboratories, Inc. Method and system for recognizing, indexing, and searching acoustic signals
US6910007B2 (en) * 2000-05-31 2005-06-21 At&T Corp Stochastic modeling of spectral adjustment for high quality pitch modification
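JP2002133411A (ja) * 2000-08-17 2002-05-10 Canon Inc Information processing method, information processing apparatus, and program
JP2002278592A (ja) * 2001-03-21 2002-09-27 Fujitsu Ltd Data matching program, data matching method, and data matching apparatus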
US6931374B2 (en) 2003-04-01 2005-08-16 Microsoft Corporation Method of speech recognition using variational inference with switching state space models

Also Published As

Publication number Publication date
EP1465153B1 (de) 2006-09-13
US7424423B2 (en) 2008-09-09
EP1465153A3 (de) 2005-01-19
US20040199382A1 (en) 2004-10-07
CN1534596A (zh) 2004-10-06
CN100562926C (zh) 2009-11-25
DE602004002312T2 (de) 2006-12-28
JP4718789B2 (ja) 2011-07-06
JP2004310091A (ja) 2004-11-04
KR101026632B1 (ko) 2011-04-04
ATE339756T1 (de) 2006-10-15
EP1465153A2 (de) 2004-10-06
KR20040088364A (ko) 2004-10-16

Similar Documents

Publication Publication Date Title
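DE602004002312D1 (de) Method and apparatus for formant tracking using a residual model
CN102779508B (zh) Speech library generation device and method, and speech synthesis system and method
CN109949783B (zh) Song synthesis method and system
CN101661675B (zh) Error self-aware tone pronunciation learning method and system
CN105206258A (zh) Acoustic model generation method and apparatus, and speech synthesis method and apparatus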
TW200601263A (en) Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
DE59010131D1 (de) Method for speaker-adaptive speech recognition
DE60310785D1 (de) Method and apparatus for translating spoken language
DE60309142D1 (de) Apparatus for determining parameters of a Gaussian mixture model (GMM) or a GMM-based hidden Markov model
CN106531185A (zh) Speech evaluation method and system based on speech similarity
JP2000105596A5 (de)
RU2009119491A (ru) Method and apparatus for coding transition frames in speech signals
DE602004007786D1 (de) Method and apparatus for quantizing the gain factor in a variable bit rate wideband speech coder
DE60124551D1 (de) Method and apparatus for generating reference patterns for a speaker-independent speech recognition system
CN105206264B (zh) Speech synthesis method and apparatus
ATE259532T1 (de) Method and apparatus for searching an excitation codebook in a CELP coder
JP2016539355A5 (de)
Tamburini Automatic prosodic prominence detection in speech using acoustic features: an unsupervised system.
DE69937854D1 (de) Method and apparatus for speech recognition using phonetic transcriptions
Yarra et al. Automatic detection of syllable stress using sonority based prominence features for pronunciation evaluation
DE602004004572D1 (de) Tracking vocal tract resonances using a target-guided constraint
DE60027012D1 (de) Method and apparatus for interleaving the quantization methods for the spectral frequency lines in a speech coder
Qin et al. A comparison of acoustic features for articulatory inversion
Hillenbrand et al. Perception of sinewave vowels
Mary et al. Evaluation of mimicked speech using prosodic features

Legal Events

Date Code Title Description
8364 No opposition during term of opposition