DE602004002312D1 - Verfahren und Vorrichtung zur Bestimmung von Formanten unter Benutzung eines Restsignalmodells - Google Patents

Verfahren und Vorrichtung zur Bestimmung von Formanten unter Benutzung eines Restsignalmodells

Info

Publication number: DE602004002312D1
Authority: DE; Germany
Prior art keywords: formants; identified; residual signal; signal model; model
Prior art date: 2003-04-01
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Expired - Lifetime

Application number

DE602004002312T

Other languages

English (en)

Other versions

DE602004002312T2 (de

Inventor

Issam Bazzi

Li Deng

Alejandro Acero

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Microsoft Corp

Original Assignee

Microsoft Corp

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2003-04-01

Filing date

2004-04-01

Publication date

2006-10-26

2004-04-01 Application filed by Microsoft Corp filed Critical Microsoft Corp

2006-10-26 Publication of DE602004002312D1 publication Critical patent/DE602004002312D1/de

2006-12-28 Application granted granted Critical

2006-12-28 Publication of DE602004002312T2 publication Critical patent/DE602004002312T2/de

2024-04-02 Anticipated expiration legal-status Critical

Status Expired - Lifetime legal-status Critical Current

Links

238000000034 method Methods 0.000 title abstract 2
239000013598 vector Substances 0.000 abstract 2
238000013507 mapping Methods 0.000 abstract 1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information

DE602004002312T 2003-04-01 2004-04-01 Verfahren und Vorrichtung zur Bestimmung von Formanten unter Benutzung eines Restsignalmodells Expired - Lifetime DE602004002312T2 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
US404411		1999-09-23
US10/404,411 US7424423B2 (en)	2003-04-01	2003-04-01	Method and apparatus for formant tracking using a residual model

Publications (2)

Publication Number	Publication Date
DE602004002312D1 true DE602004002312D1 (de)	2006-10-26
DE602004002312T2 DE602004002312T2 (de)	2006-12-28

Family

ID=32850595

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
DE602004002312T Expired - Lifetime DE602004002312T2 (de)	2003-04-01	2004-04-01	Verfahren und Vorrichtung zur Bestimmung von Formanten unter Benutzung eines Restsignalmodells

Country Status (7)

Country	Link
US (1)	US7424423B2 (de)
EP (1)	EP1465153B1 (de)
JP (1)	JP4718789B2 (de)
KR (1)	KR101026632B1 (de)
CN (1)	CN100562926C (de)
AT (1)	ATE339756T1 (de)
DE (1)	DE602004002312T2 (de)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US7475011B2 (en) *	2004-08-25	2009-01-06	Microsoft Corporation	Greedy algorithm for identifying values for vocal tract resonance vectors
KR100634526B1 (ko) *	2004-11-24	2006-10-16	삼성전자주식회사	포만트 트래킹 장치 및 방법
KR100717625B1 (ko)	2006-02-10	2007-05-15	삼성전자주식회사	음성 인식에서의 포먼트 주파수 추정 방법 및 장치
US8010356B2 (en)	2006-02-17	2011-08-30	Microsoft Corporation	Parameter learning in a hidden trajectory model
US7877255B2 (en) *	2006-03-31	2011-01-25	Voice Signal Technologies, Inc.	Speech recognition using channel verification
DE602006008158D1 (de) *	2006-09-29	2009-09-10	Honda Res Inst Europe Gmbh	Gemeinsame Schätzung von Formant-Trajektorien mittels Bayesischer Techniken und adaptiver Segmentierung
CN101067929B (zh) *	2007-06-05	2011-04-20	南京大学	使用共振峰增强提取话音共振峰轨迹的方法
EP2232700B1 (de)	2007-12-21	2014-08-13	Dts Llc	System zur einstellung der wahrgenommenen lautstärke von tonsignalen
US8538042B2 (en)	2009-08-11	2013-09-17	Dts Llc	System for increasing perceived loudness of speakers
US8204742B2 (en)	2009-09-14	2012-06-19	Srs Labs, Inc.	System for processing an audio signal to enhance speech intelligibility
US20120078625A1 (en) *	2010-09-23	2012-03-29	Waveform Communications, Llc	Waveform analysis of speech
US20140207456A1 (en) *	2010-09-23	2014-07-24	Waveform Communications, Llc	Waveform analysis of speech
KR102060208B1 (ko)	2011-07-29	2019-12-27	디티에스 엘엘씨	적응적 음성 명료도 처리기
US9312829B2 (en)	2012-04-12	2016-04-12	Dts Llc	System for adjusting loudness of audio signals in real time
US9728200B2 (en)	2013-01-29	2017-08-08	Qualcomm Incorporated	Systems, methods, apparatus, and computer-readable media for adaptive formant sharpening in linear prediction coding
US9520141B2 (en) *	2013-02-28	2016-12-13	Google Inc.	Keyboard typing detection and suppression
US9805714B2 (en) *	2016-03-22	2017-10-31	Asustek Computer Inc.	Directional keyword verification method applicable to electronic device and electronic device using the same

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US3649765A (en) *	1969-10-29	1972-03-14	Bell Telephone Labor Inc	Speech analyzer-synthesizer system employing improved formant extractor
JPH0785200B2 (ja) *	1986-11-13	1995-09-13	日本電気株式会社	スペクトル標準パタンの作成方法
US5799276A (en) *	1995-11-07	1998-08-25	Accent Incorporated	Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US5729694A (en) *	1996-02-06	1998-03-17	The Regents Of The University Of California	Speech coding, reconstruction and recognition using acoustics and electromagnetic waves
US6064958A (en)	1996-09-20	2000-05-16	Nippon Telegraph And Telephone Corporation	Pattern recognition scheme using probabilistic models based on mixtures distribution of discrete distribution
US5815090A (en) *	1996-10-31	1998-09-29	University Of Florida Research Foundation, Inc.	Remote monitoring system for detecting termites
JP2986792B2 (ja) *	1998-03-16	1999-12-06	株式会社エイ・ティ・アール音声翻訳通信研究所	話者正規化処理装置及び音声認識装置
US6980952B1 (en) *	1998-08-15	2005-12-27	Texas Instruments Incorporated	Source normalization training for HMM modeling of speech
US6502066B2 (en)	1998-11-24	2002-12-31	Microsoft Corporation	System for generating formant tracks by modifying formants synthesized from speech units
US20010044719A1 (en) *	1999-07-02	2001-11-22	Mitsubishi Electric Research Laboratories, Inc.	Method and system for recognizing, indexing, and searching acoustic signals
US6910007B2 (en) *	2000-05-31	2005-06-21	At&T Corp	Stochastic modeling of spectral adjustment for high quality pitch modification
JP2002133411A (ja) *	2000-08-17	2002-05-10	Canon Inc	情報処理方法、情報処理装置及びプログラム
JP2002278592A (ja) *	2001-03-21	2002-09-27	Fujitsu Ltd	データ照合プログラム、データ照合方法およびデータ照合装置
US6931374B2 (en)	2003-04-01	2005-08-16	Microsoft Corporation	Method of speech recognition using variational inference with switching state space models

2003
- 2003-04-01 US US10/404,411 patent/US7424423B2/en not_active Expired - Fee Related
2004
- 2004-03-31 KR KR1020040022158A patent/KR101026632B1/ko active IP Right Grant
- 2004-03-31 JP JP2004108213A patent/JP4718789B2/ja not_active Expired - Lifetime
- 2004-04-01 DE DE602004002312T patent/DE602004002312T2/de not_active Expired - Lifetime
- 2004-04-01 EP EP04007986A patent/EP1465153B1/de not_active Expired - Lifetime
- 2004-04-01 AT AT04007986T patent/ATE339756T1/de not_active IP Right Cessation
- 2004-04-01 CN CNB2004100342429A patent/CN100562926C/zh not_active Expired - Fee Related

Also Published As

Publication number	Publication date
EP1465153B1 (de)	2006-09-13
US7424423B2 (en)	2008-09-09
EP1465153A3 (de)	2005-01-19
US20040199382A1 (en)	2004-10-07
CN1534596A (zh)	2004-10-06
CN100562926C (zh)	2009-11-25
DE602004002312T2 (de)	2006-12-28
JP4718789B2 (ja)	2011-07-06
JP2004310091A (ja)	2004-11-04
KR101026632B1 (ko)	2011-04-04
ATE339756T1 (de)	2006-10-15
EP1465153A2 (de)	2004-10-06
KR20040088364A (ko)	2004-10-16

Similar Documents

Publication	Publication Date	Title
DE602004002312D1 (de)	2006-10-26	Verfahren und Vorrichtung zur Bestimmung von Formanten unter Benutzung eines Restsignalmodells
CN102779508B (zh)	2016-11-09	语音库生成设备及其方法、语音合成系统及其方法
CN109949783B (zh)	2021-01-29	歌曲合成方法及系统
CN101661675B (zh)	2012-01-11	一种错误自感知的声调发音学习方法和系统
CN105206258A (zh)	2015-12-30	声学模型的生成方法和装置及语音合成方法和装置
TW200601263A (en)	2006-01-01	Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
DE59010131D1 (de)	1996-03-28	Verfahren zur sprecheradaptiven Erkennung von Sprache
DE60310785D1 (de)	2007-02-15	Verfahren und Vorrichtung zur Übersetzung von gesprochener Sprache
DE60309142D1 (de)	2006-11-30	Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells
CN106531185A (zh)	2017-03-22	基于语音相似度的语音评测方法及系统
JP2000105596A5 (de)	2006-09-07
RU2009119491A (ru)	2010-11-27	Способ и устройство кодирования кадров перехода в речевых сигналах
DE602004007786D1 (de)	2007-09-06	Verfahren und vorrichtung zur quantisierung des verstärkungsfaktors in einem breitbandsprachkodierer mit variabler bitrate
DE60124551D1 (de)	2006-12-28	Verfahren und vorrichtung zur erzeugung der referenzmuster für ein sprecherunabhängiges spracherkennungssystem
CN105206264B (zh)	2017-06-27	语音合成方法和装置
ATE259532T1 (de)	2004-02-15	Verfahren und vorrichtung zum durchsuchen eines erregungskodebuches bei einem celp-kodierer
JP2016539355A5 (de)	2017-07-13
Tamburini	2003	Automatic prosodic prominence detection in speech using acoustic features: an unsupervised system.
DE69937854D1 (de)	2008-02-14	Verfahren und Vorrichtung zur Spracherkennung unter Verwendung von phonetischen Transkriptionen
Yarra et al.	2017	Automatic detection of syllable stress using sonority based prominence features for pronunciation evaluation
DE602004004572D1 (de)	2007-03-22	Verfolgen von Vokaltraktresonanzen unter Verwendung einer zielgeführten Einschränkung
DE60027012D1 (de)	2006-05-18	Verfahren und vorrichtung zur verschachtelung der quantisierungsverfahren der spektralen frequenzlinien in einem sprachkodierer
Qin et al.	2007	A comparison of acoustic features for articulatory inversion
Hillenbrand et al.	2011	Perception of sinewave vowels
Mary et al.	2013	Evaluation of mimicked speech using prosodic features

Legal Events

Date	Code	Title	Description
2007-10-04	8364	No opposition during term of opposition