DE602004002312D1 - Verfahren und Vorrichtung zur Bestimmung von Formanten unter Benutzung eines Restsignalmodells - Google Patents
Verfahren und Vorrichtung zur Bestimmung von Formanten unter Benutzung eines RestsignalmodellsInfo
- Publication number
- DE602004002312D1 DE602004002312D1 DE602004002312T DE602004002312T DE602004002312D1 DE 602004002312 D1 DE602004002312 D1 DE 602004002312D1 DE 602004002312 T DE602004002312 T DE 602004002312T DE 602004002312 T DE602004002312 T DE 602004002312T DE 602004002312 D1 DE602004002312 D1 DE 602004002312D1
- Authority
- DE
- Germany
- Prior art keywords
- formants
- identified
- residual signal
- signal model
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000000034 method Methods 0.000 title abstract 2
- 239000013598 vector Substances 0.000 abstract 2
- 238000013507 mapping Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US404411 | 1999-09-23 | ||
US10/404,411 US7424423B2 (en) | 2003-04-01 | 2003-04-01 | Method and apparatus for formant tracking using a residual model |
Publications (2)
Publication Number | Publication Date |
---|---|
DE602004002312D1 true DE602004002312D1 (de) | 2006-10-26 |
DE602004002312T2 DE602004002312T2 (de) | 2006-12-28 |
Family
ID=32850595
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE602004002312T Expired - Lifetime DE602004002312T2 (de) | 2003-04-01 | 2004-04-01 | Verfahren und Vorrichtung zur Bestimmung von Formanten unter Benutzung eines Restsignalmodells |
Country Status (7)
Country | Link |
---|---|
US (1) | US7424423B2 (de) |
EP (1) | EP1465153B1 (de) |
JP (1) | JP4718789B2 (de) |
KR (1) | KR101026632B1 (de) |
CN (1) | CN100562926C (de) |
AT (1) | ATE339756T1 (de) |
DE (1) | DE602004002312T2 (de) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7475011B2 (en) * | 2004-08-25 | 2009-01-06 | Microsoft Corporation | Greedy algorithm for identifying values for vocal tract resonance vectors |
KR100634526B1 (ko) * | 2004-11-24 | 2006-10-16 | 삼성전자주식회사 | 포만트 트래킹 장치 및 방법 |
KR100717625B1 (ko) | 2006-02-10 | 2007-05-15 | 삼성전자주식회사 | 음성 인식에서의 포먼트 주파수 추정 방법 및 장치 |
US8010356B2 (en) | 2006-02-17 | 2011-08-30 | Microsoft Corporation | Parameter learning in a hidden trajectory model |
US7877255B2 (en) * | 2006-03-31 | 2011-01-25 | Voice Signal Technologies, Inc. | Speech recognition using channel verification |
DE602006008158D1 (de) * | 2006-09-29 | 2009-09-10 | Honda Res Inst Europe Gmbh | Gemeinsame Schätzung von Formant-Trajektorien mittels Bayesischer Techniken und adaptiver Segmentierung |
CN101067929B (zh) * | 2007-06-05 | 2011-04-20 | 南京大学 | 使用共振峰增强提取话音共振峰轨迹的方法 |
EP2232700B1 (de) | 2007-12-21 | 2014-08-13 | Dts Llc | System zur einstellung der wahrgenommenen lautstärke von tonsignalen |
US8538042B2 (en) | 2009-08-11 | 2013-09-17 | Dts Llc | System for increasing perceived loudness of speakers |
US8204742B2 (en) | 2009-09-14 | 2012-06-19 | Srs Labs, Inc. | System for processing an audio signal to enhance speech intelligibility |
US20120078625A1 (en) * | 2010-09-23 | 2012-03-29 | Waveform Communications, Llc | Waveform analysis of speech |
US20140207456A1 (en) * | 2010-09-23 | 2014-07-24 | Waveform Communications, Llc | Waveform analysis of speech |
KR102060208B1 (ko) | 2011-07-29 | 2019-12-27 | 디티에스 엘엘씨 | 적응적 음성 명료도 처리기 |
US9312829B2 (en) | 2012-04-12 | 2016-04-12 | Dts Llc | System for adjusting loudness of audio signals in real time |
US9728200B2 (en) | 2013-01-29 | 2017-08-08 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for adaptive formant sharpening in linear prediction coding |
US9520141B2 (en) * | 2013-02-28 | 2016-12-13 | Google Inc. | Keyboard typing detection and suppression |
US9805714B2 (en) * | 2016-03-22 | 2017-10-31 | Asustek Computer Inc. | Directional keyword verification method applicable to electronic device and electronic device using the same |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3649765A (en) * | 1969-10-29 | 1972-03-14 | Bell Telephone Labor Inc | Speech analyzer-synthesizer system employing improved formant extractor |
JPH0785200B2 (ja) * | 1986-11-13 | 1995-09-13 | 日本電気株式会社 | スペクトル標準パタンの作成方法 |
US5799276A (en) * | 1995-11-07 | 1998-08-25 | Accent Incorporated | Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals |
US5729694A (en) * | 1996-02-06 | 1998-03-17 | The Regents Of The University Of California | Speech coding, reconstruction and recognition using acoustics and electromagnetic waves |
US6064958A (en) | 1996-09-20 | 2000-05-16 | Nippon Telegraph And Telephone Corporation | Pattern recognition scheme using probabilistic models based on mixtures distribution of discrete distribution |
US5815090A (en) * | 1996-10-31 | 1998-09-29 | University Of Florida Research Foundation, Inc. | Remote monitoring system for detecting termites |
JP2986792B2 (ja) * | 1998-03-16 | 1999-12-06 | 株式会社エイ・ティ・アール音声翻訳通信研究所 | 話者正規化処理装置及び音声認識装置 |
US6980952B1 (en) * | 1998-08-15 | 2005-12-27 | Texas Instruments Incorporated | Source normalization training for HMM modeling of speech |
US6502066B2 (en) | 1998-11-24 | 2002-12-31 | Microsoft Corporation | System for generating formant tracks by modifying formants synthesized from speech units |
US20010044719A1 (en) * | 1999-07-02 | 2001-11-22 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for recognizing, indexing, and searching acoustic signals |
US6910007B2 (en) * | 2000-05-31 | 2005-06-21 | At&T Corp | Stochastic modeling of spectral adjustment for high quality pitch modification |
JP2002133411A (ja) * | 2000-08-17 | 2002-05-10 | Canon Inc | 情報処理方法、情報処理装置及びプログラム |
JP2002278592A (ja) * | 2001-03-21 | 2002-09-27 | Fujitsu Ltd | データ照合プログラム、データ照合方法およびデータ照合装置 |
US6931374B2 (en) | 2003-04-01 | 2005-08-16 | Microsoft Corporation | Method of speech recognition using variational inference with switching state space models |
-
2003
- 2003-04-01 US US10/404,411 patent/US7424423B2/en not_active Expired - Fee Related
-
2004
- 2004-03-31 KR KR1020040022158A patent/KR101026632B1/ko active IP Right Grant
- 2004-03-31 JP JP2004108213A patent/JP4718789B2/ja not_active Expired - Lifetime
- 2004-04-01 DE DE602004002312T patent/DE602004002312T2/de not_active Expired - Lifetime
- 2004-04-01 EP EP04007986A patent/EP1465153B1/de not_active Expired - Lifetime
- 2004-04-01 AT AT04007986T patent/ATE339756T1/de not_active IP Right Cessation
- 2004-04-01 CN CNB2004100342429A patent/CN100562926C/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
EP1465153B1 (de) | 2006-09-13 |
US7424423B2 (en) | 2008-09-09 |
EP1465153A3 (de) | 2005-01-19 |
US20040199382A1 (en) | 2004-10-07 |
CN1534596A (zh) | 2004-10-06 |
CN100562926C (zh) | 2009-11-25 |
DE602004002312T2 (de) | 2006-12-28 |
JP4718789B2 (ja) | 2011-07-06 |
JP2004310091A (ja) | 2004-11-04 |
KR101026632B1 (ko) | 2011-04-04 |
ATE339756T1 (de) | 2006-10-15 |
EP1465153A2 (de) | 2004-10-06 |
KR20040088364A (ko) | 2004-10-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE602004002312D1 (de) | Verfahren und Vorrichtung zur Bestimmung von Formanten unter Benutzung eines Restsignalmodells | |
CN102779508B (zh) | 语音库生成设备及其方法、语音合成系统及其方法 | |
CN109949783B (zh) | 歌曲合成方法及系统 | |
CN101661675B (zh) | 一种错误自感知的声调发音学习方法和系统 | |
CN105206258A (zh) | 声学模型的生成方法和装置及语音合成方法和装置 | |
TW200601263A (en) | Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition | |
DE59010131D1 (de) | Verfahren zur sprecheradaptiven Erkennung von Sprache | |
DE60310785D1 (de) | Verfahren und Vorrichtung zur Übersetzung von gesprochener Sprache | |
DE60309142D1 (de) | Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells | |
CN106531185A (zh) | 基于语音相似度的语音评测方法及系统 | |
JP2000105596A5 (de) | ||
RU2009119491A (ru) | Способ и устройство кодирования кадров перехода в речевых сигналах | |
DE602004007786D1 (de) | Verfahren und vorrichtung zur quantisierung des verstärkungsfaktors in einem breitbandsprachkodierer mit variabler bitrate | |
DE60124551D1 (de) | Verfahren und vorrichtung zur erzeugung der referenzmuster für ein sprecherunabhängiges spracherkennungssystem | |
CN105206264B (zh) | 语音合成方法和装置 | |
ATE259532T1 (de) | Verfahren und vorrichtung zum durchsuchen eines erregungskodebuches bei einem celp-kodierer | |
JP2016539355A5 (de) | ||
Tamburini | Automatic prosodic prominence detection in speech using acoustic features: an unsupervised system. | |
DE69937854D1 (de) | Verfahren und Vorrichtung zur Spracherkennung unter Verwendung von phonetischen Transkriptionen | |
Yarra et al. | Automatic detection of syllable stress using sonority based prominence features for pronunciation evaluation | |
DE602004004572D1 (de) | Verfolgen von Vokaltraktresonanzen unter Verwendung einer zielgeführten Einschränkung | |
DE60027012D1 (de) | Verfahren und vorrichtung zur verschachtelung der quantisierungsverfahren der spektralen frequenzlinien in einem sprachkodierer | |
Qin et al. | A comparison of acoustic features for articulatory inversion | |
Hillenbrand et al. | Perception of sinewave vowels | |
Mary et al. | Evaluation of mimicked speech using prosodic features |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |