DE602005025955D1 - Darstellung eines "deleted interpolation" N-gram Sprachmodells in ARPA Standardformat - Google Patents

Darstellung eines "deleted interpolation" N-gram Sprachmodells in ARPA Standardformat

Info

Publication number
DE602005025955D1
DE602005025955D1 DE602005025955T DE602005025955T DE602005025955D1 DE 602005025955 D1 DE602005025955 D1 DE 602005025955D1 DE 602005025955 T DE602005025955 T DE 602005025955T DE 602005025955 T DE602005025955 T DE 602005025955T DE 602005025955 D1 DE602005025955 D1 DE 602005025955D1
Authority
DE
Germany
Prior art keywords
language model
representation
standard format
deleted interpolation
gram language
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
DE602005025955T
Other languages
English (en)
Inventor
Alejandro Acero
Ciprian Chelba
Milind Mahajan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of DE602005025955D1 publication Critical patent/DE602005025955D1/de
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197Probabilistic grammars, e.g. word n-grams
DE602005025955T 2004-03-26 2005-03-22 Darstellung eines "deleted interpolation" N-gram Sprachmodells in ARPA Standardformat Active DE602005025955D1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/810,254 US7406416B2 (en) 2004-03-26 2004-03-26 Representation of a deleted interpolation N-gram language model in ARPA standard format

Publications (1)

Publication Number Publication Date
DE602005025955D1 true DE602005025955D1 (de) 2011-03-03

Family

ID=34862105

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602005025955T Active DE602005025955D1 (de) 2004-03-26 2005-03-22 Darstellung eines "deleted interpolation" N-gram Sprachmodells in ARPA Standardformat

Country Status (7)

Country Link
US (1) US7406416B2 (de)
EP (1) EP1580667B1 (de)
JP (1) JP4974470B2 (de)
KR (1) KR101120773B1 (de)
CN (1) CN100535890C (de)
AT (1) ATE496342T1 (de)
DE (1) DE602005025955D1 (de)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8700404B1 (en) * 2005-08-27 2014-04-15 At&T Intellectual Property Ii, L.P. System and method for using semantic and syntactic graphs for utterance classification
US20070078653A1 (en) * 2005-10-03 2007-04-05 Nokia Corporation Language model compression
US20080282154A1 (en) * 2006-09-11 2008-11-13 Nurmi Mikko A Method and apparatus for improved text input
US7774197B1 (en) 2006-09-27 2010-08-10 Raytheon Bbn Technologies Corp. Modular approach to building large language models
US8332207B2 (en) * 2007-03-26 2012-12-11 Google Inc. Large language models in machine translation
WO2010051654A1 (en) * 2008-11-05 2010-05-14 Google Inc. Custom language models
US8798983B2 (en) * 2009-03-30 2014-08-05 Microsoft Corporation Adaptation for statistical language model
US8655647B2 (en) * 2010-03-11 2014-02-18 Microsoft Corporation N-gram selection for practical-sized language models
US9367526B1 (en) * 2011-07-26 2016-06-14 Nuance Communications, Inc. Word classing for language modeling
CN102982024B (zh) * 2011-09-02 2016-03-23 北京百度网讯科技有限公司 一种搜索需求识别方法及装置
CN102509549B (zh) * 2011-09-28 2013-08-14 盛乐信息技术(上海)有限公司 语言模型训练方法及系统
US9224386B1 (en) 2012-06-22 2015-12-29 Amazon Technologies, Inc. Discriminative language model training using a confusion matrix
US9292487B1 (en) * 2012-08-16 2016-03-22 Amazon Technologies, Inc. Discriminative language model pruning
US20150088511A1 (en) * 2013-09-24 2015-03-26 Verizon Patent And Licensing Inc. Named-entity based speech recognition
KR101509727B1 (ko) * 2013-10-02 2015-04-07 주식회사 시스트란인터내셔널 자율학습 정렬 기반의 정렬 코퍼스 생성 장치 및 그 방법과, 정렬 코퍼스를 사용한 파괴 표현 형태소 분석 장치 및 그 형태소 분석 방법
US9400783B2 (en) * 2013-11-26 2016-07-26 Xerox Corporation Procedure for building a max-ARPA table in order to compute optimistic back-offs in a language model
US10311046B2 (en) * 2016-09-12 2019-06-04 Conduent Business Services, Llc System and method for pruning a set of symbol-based sequences by relaxing an independence assumption of the sequences

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US1940720A (en) * 1931-03-16 1933-12-26 Madsen Jens A Windfeld Water softener
US4096017A (en) * 1977-02-18 1978-06-20 H. C. Price Co. Method and article for forming field joints on pipe coated with thermoplastic material
US4111017A (en) * 1977-06-21 1978-09-05 The United States Of America As Represented By The United States Department Of Energy Manually operated coded switch
US5258909A (en) * 1989-08-31 1993-11-02 International Business Machines Corporation Method and apparatus for "wrong word" spelling error detection and correction
US5199464A (en) * 1989-12-28 1993-04-06 Interprovincial Pipe Line, Inc. Pipeline repair sleeve assembly having heat sink groove
US5267345A (en) * 1992-02-10 1993-11-30 International Business Machines Corporation Speech recognition apparatus which predicts word classes from context and words from word classes
IT1254723B (it) * 1992-03-18 1995-10-09 Snam Spa Procedimento perfezionato per gli interventi di riparazione di danni localizzati alle condotte mediante applicazione di corazze con una guaina protettiva interposta
EP0602296A1 (de) * 1992-12-17 1994-06-22 International Business Machines Corporation Adaptives Verfahren zur Erzeugung gebietsabhängiger Modelle für intelligente Systeme
US5467425A (en) * 1993-02-26 1995-11-14 International Business Machines Corporation Building scalable N-gram language models using maximum likelihood maximum entropy N-gram models
JP2886121B2 (ja) * 1995-11-10 1999-04-26 株式会社エイ・ティ・アール音声翻訳通信研究所 統計的言語モデル生成装置及び音声認識装置
US5937384A (en) 1996-05-01 1999-08-10 Microsoft Corporation Method and system for speech recognition using continuous density hidden Markov models
US5722463A (en) * 1996-11-25 1998-03-03 Petro-Line Upgrading Services Ltd. External pipe reinforcing sleeve
CA2192620C (en) * 1996-12-11 2000-08-29 Gerald Henderson Pipe repair assembly
US6188976B1 (en) * 1998-10-23 2001-02-13 International Business Machines Corporation Apparatus and method for building domain-specific language models
JP2000250583A (ja) * 1999-03-02 2000-09-14 Atr Interpreting Telecommunications Res Lab 統計的言語モデル生成装置及び音声認識装置
JP2000356997A (ja) 1999-06-15 2000-12-26 Atr Interpreting Telecommunications Res Lab 統計的言語モデル生成装置及び音声認識装置
JP2001142881A (ja) 1999-11-16 2001-05-25 Nippon Telegr & Teleph Corp <Ntt> 統計的言語モデル及びそれを用いた確率計算法

Also Published As

Publication number Publication date
EP1580667A3 (de) 2007-10-10
CN1673997A (zh) 2005-09-28
EP1580667B1 (de) 2011-01-19
KR20060044753A (ko) 2006-05-16
US20050216265A1 (en) 2005-09-29
KR101120773B1 (ko) 2012-03-23
CN100535890C (zh) 2009-09-02
JP4974470B2 (ja) 2012-07-11
EP1580667A2 (de) 2005-09-28
ATE496342T1 (de) 2011-02-15
US7406416B2 (en) 2008-07-29
JP2005293580A (ja) 2005-10-20

Similar Documents

Publication Publication Date Title
DE602005025955D1 (de) Darstellung eines &#34;deleted interpolation&#34; N-gram Sprachmodells in ARPA Standardformat
EP2189925A3 (de) Datenbankobfuskationssystem und -verfahren
ATE266521T1 (de) Wechselbehälter
WO2006010737A3 (en) Methods, apparatus and software for validating entries made on a form
ATE439665T1 (de) Verfahren zur personalisierung eines dienstes
WO2004064660A3 (en) Dental tool guides
EP1752884A4 (de) Schnelles hochgenaues matrix-singulärwertzerlegungsverfahren, programm und einrichtung
EP1582998A3 (de) Anpassung eines Sprachmodells unter Nutzung von semantischer Überwachung
DE602004021760D1 (de) Navigationsverfahren zur Darstellung eines beweglichen Fensters, Betrachtungsgerät zur Umsetzung des Verfahrens
TWI350459B (en) Computerized system, method and program product for managing an enterprise storage system
EP1653444A3 (de) Vorrichtung und Verfahren zur Umwandlung von Text zu Sprache
DE602005018542D1 (de) Verfahren zur herstellung einer flaschenartigen dose
DE60132994D1 (de) Verfahren zur herstellung eines leistungs-mosfets
DE602005019848D1 (de) Verfahren zur herstellung von tad- getrocknetem ti
WO2008080542A3 (de) Kombinationsprodukt zur bekämpfung von parasiten an tieren
GB2426612C (en) Method and apparatus for generating configuration.
GB0606235D0 (en) Apparatus and method for model adaptation for spoken language understanding
ATE380811T1 (de) Verfahren zur herstellung von 1-(2s,3s)-2- benzhydril-n-(5-tert.-butyl-2- methoxybenzyl)chinuklidin-3-amin
ATE381402T1 (de) Stopfenstange zur zufuhr von gas in eine metallschmelze
EP1719564A4 (de) Hydrogeformtes teil, hydroformverfahren und für das hydroformverfahren verwendetes formwerkzeug
DE60138255D1 (de) Verfahren zur Herstellung eines integrierten einstellbaren Kondensators
NL1027565A1 (nl) Toestel en werkwijze voor stabiliseren van een versterkte spanning, toestel en werkwijze voor opwekken van een versterkte spanning.
DE602005003750D1 (de) Laser zur Photoablation mit regulierbarer Puls-Emissionsfrequenz.
PL1750932T3 (pl) Urządzenie i sposób do produkcji pustych opakowań
EA200601397A1 (ru) Карбоксипептидаза для созревания сыра