WO2009044275A8 - Systems and methods for character correction in communication devices - Google Patents

Systems and methods for character correction in communication devices

Publication number
WO2009044275A8
Authority
WO
WIPO (PCT)
Prior art keywords
word
candidates
editing distance
engine
selector
Prior art date
Application number
PCT/IB2008/002737
Other languages
French (fr)
Other versions
WO2009044275A3 (en)
WO2009044275A2 (en)
Inventor
Weigen Qui
Samuel Yin Lun Pun
Original Assignee
Zi Corp Canada Inc
Weigen Qui
Samuel Yin Lun Pun
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zi Corp Canada Inc, Weigen Qui, Samuel Yin Lun Pun
Priority to EP08807230A (EP2206058A4)
Publication of WO2009044275A2
Publication of WO2009044275A8
Publication of WO2009044275A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/232 Orthographic correction, e.g. spell checking or vowelisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/274 Converting codes to words; Guess-ahead of partial word inputs
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/26 Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262 Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • G06V30/268 Lexical context
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Abstract

A system and method for character error correction is provided. The system includes an interface, a word prediction engine, a statistical engine, an editing distance calculator, and a selector. A string of characters is entered into a mobile device via the interface. The word prediction engine then generates word candidates similar to the inputted word using fuzzy logic and user preferences generated from past behavior. The statistical engine then generates variable error costs determined by the probability of erroneously inputting any given character. The editing distance calculator then determines the editing distance between the inputted word and each of the word candidates by grid comparison using the variable error costs. The selector chooses one or more preferred candidates from the word candidates using the editing distances.
PCT/IB2008/002737 2007-10-04 2008-09-25 Systems and methods for character correction in communication devices WO2009044275A2 (en)
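
The editing-distance and selection steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the confusion table, the cost formula (one minus an assumed confusion probability), the fixed insertion and deletion costs, and the function names `substitution_cost`, `editing_distance`, and `select_candidates` are all assumptions made for illustration. The grid is a standard dynamic-programming table for weighted edit distance.

```python
from typing import Dict, List, Tuple

# Hypothetical confusion probabilities: chance that the first key was typed
# when the second key was intended. The values are illustrative only.
CONFUSION: Dict[Tuple[str, str], float] = {
    ("q", "w"): 0.30, ("w", "q"): 0.30,
    ("e", "r"): 0.25, ("r", "e"): 0.25,
    ("i", "o"): 0.25, ("o", "i"): 0.25,
}


def substitution_cost(typed: str, intended: str) -> float:
    """Variable error cost: likelier confusions are cheaper (assumed formula)."""
    if typed == intended:
        return 0.0
    return 1.0 - CONFUSION.get((typed, intended), 0.0)


def editing_distance(typed: str, candidate: str,
                     insert_cost: float = 1.0,
                     delete_cost: float = 1.0) -> float:
    """Fill a (len(typed)+1) x (len(candidate)+1) grid of cumulative costs."""
    rows, cols = len(typed) + 1, len(candidate) + 1
    grid = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows):          # delete every typed character
        grid[i][0] = grid[i - 1][0] + delete_cost
    for j in range(1, cols):          # insert every candidate character
        grid[0][j] = grid[0][j - 1] + insert_cost
    for i in range(1, rows):
        for j in range(1, cols):
            grid[i][j] = min(
                grid[i - 1][j] + delete_cost,
                grid[i][j - 1] + insert_cost,
                grid[i - 1][j - 1]
                + substitution_cost(typed[i - 1], candidate[j - 1]),
            )
    return grid[-1][-1]


def select_candidates(typed: str, candidates: List[str],
                      limit: int = 3) -> List[str]:
    """Selector: keep the candidates with the smallest editing distance."""
    return sorted(candidates, key=lambda w: editing_distance(typed, w))[:limit]


if __name__ == "__main__":
    print(editing_distance("wird", "word"))   # 0.75 under the table above
    print(select_candidates("wird", ["word", "wind", "ward"], limit=2))
```

With this table, typing "i" for "o" costs 0.75 rather than 1.0, so "word" ranks ahead of "wind" and "ward" for the input "wird". A real system would derive the costs from measured error statistics rather than a hand-written table.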

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP08807230A EP2206058A4 (en) 2007-10-04 2008-09-25 Systems and methods for character correction in communication devices

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/867,684 2007-10-04
US11/867,684 US8077983B2 (en) 2007-10-04 2007-10-04 Systems and methods for character correction in communication devices

Publications (3)

Publication Number Publication Date
WO2009044275A2 WO2009044275A2 (en) 2009-04-09
WO2009044275A8 WO2009044275A8 (en) 2009-05-28
WO2009044275A3 WO2009044275A3 (en) 2009-07-09

Family

ID=40523297

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2008/002737 WO2009044275A2 (en) 2007-10-04 2008-09-25 Systems and methods for character correction in communication devices

Country Status (3)

Country Link
US (3) US8077983B2 (en)
EP (1) EP2206058A4 (en)
WO (1) WO2009044275A2 (en)

Families Citing this family (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8117540B2 (en) * 2005-05-18 2012-02-14 Neuer Wall Treuhand Gmbh Method and device incorporating improved text input mechanism
US9606634B2 (en) 2005-05-18 2017-03-28 Nokia Technologies Oy Device incorporating improved text input mechanism
US8036878B2 (en) 2005-05-18 2011-10-11 Neuer Wall Treuhand GmbH Device incorporating improved text input mechanism
US8374846B2 (en) 2005-05-18 2013-02-12 Neuer Wall Treuhand Gmbh Text input device and method
US8077983B2 (en) * 2007-10-04 2011-12-13 Zi Corporation Of Canada, Inc. Systems and methods for character correction in communication devices
JP5327054B2 (en) * 2007-12-18 2013-10-30 日本電気株式会社 Pronunciation variation rule extraction device, pronunciation variation rule extraction method, and pronunciation variation rule extraction program
US8063879B2 (en) 2007-12-20 2011-11-22 Research In Motion Limited Method and handheld electronic device including first input component and second touch sensitive input component
DE602008005428D1 (en) 2008-06-11 2011-04-21 Exb Asset Man Gmbh Apparatus and method with improved text input mechanism
US8896470B2 (en) * 2009-07-10 2014-11-25 Blackberry Limited System and method for disambiguation of stroke input
ATE550710T1 (en) * 2009-07-10 2012-04-15 Research In Motion Ltd System and method for disambiguation of stroke input
MY156899A (en) * 2009-09-24 2016-04-15 Nec Corp Word recognition apparatus, word recognition method, non-transitory computer readable medium storing word recognition program, and delivery item sorting apparatus
US8379801B2 (en) 2009-11-24 2013-02-19 Sorenson Communications, Inc. Methods and systems related to text caption error correction
US8782556B2 (en) 2010-02-12 2014-07-15 Microsoft Corporation User-centric soft keyboard predictive technologies
US20120290291A1 (en) * 2011-05-13 2012-11-15 Gabriel Lee Gilbert Shelley Input processing for character matching and predicted word matching
US9164983B2 (en) 2011-05-27 2015-10-20 Robert Bosch Gmbh Broad-coverage normalization system for social media language
US20120324391A1 (en) * 2011-06-16 2012-12-20 Microsoft Corporation Predictive word completion
US8640026B2 (en) 2011-07-11 2014-01-28 International Business Machines Corporation Word correction in a multi-touch environment
US11195057B2 (en) 2014-03-18 2021-12-07 Z Advanced Computing, Inc. System and method for extremely efficient image and pattern recognition and artificial intelligence platform
US8311973B1 (en) 2011-09-24 2012-11-13 Zadeh Lotfi A Methods and systems for applications for Z-numbers
US8873813B2 (en) 2012-09-17 2014-10-28 Z Advanced Computing, Inc. Application of Z-webs and Z-factors to analytics, search engine, learning, recognition, natural language, and other utilities
US11914674B2 (en) 2011-09-24 2024-02-27 Z Advanced Computing, Inc. System and method for extremely efficient image and pattern recognition and artificial intelligence platform
US11074495B2 (en) 2013-02-28 2021-07-27 Z Advanced Computing, Inc. (Zac) System and method for extremely efficient image and pattern recognition and artificial intelligence platform
US9916538B2 (en) 2012-09-15 2018-03-13 Z Advanced Computing, Inc. Method and system for feature detection
US9317606B1 (en) * 2012-02-03 2016-04-19 Google Inc. Spell correcting long queries
US8996356B1 (en) * 2012-04-10 2015-03-31 Google Inc. Techniques for predictive input method editors
US8881005B2 (en) * 2012-04-20 2014-11-04 King Abdulaziz City For Science And Technology Methods and systems for large-scale statistical misspelling correction
JP5910345B2 (en) * 2012-06-21 2016-04-27 富士通株式会社 Character input program, information processing apparatus, and character input method
CN103514236B (en) * 2012-06-30 2017-06-09 重庆新媒农信科技有限公司 Search condition error correcting prompt processing method based on phonetic in retrieval application
US9081821B2 (en) 2012-07-25 2015-07-14 Ebay Inc. Spell check using column cursor
US9158768B2 (en) 2012-07-25 2015-10-13 Paypal, Inc. System and methods to configure a query language using an operator dictionary
US8713433B1 (en) * 2012-10-16 2014-04-29 Google Inc. Feature-based autocorrection
US8612213B1 (en) 2012-10-16 2013-12-17 Google Inc. Correction of errors in character strings that include a word delimiter
US9047268B2 (en) * 2013-01-31 2015-06-02 Google Inc. Character and word level language models for out-of-vocabulary text input
US9454240B2 (en) 2013-02-05 2016-09-27 Google Inc. Gesture keyboard input of non-dictionary character strings
US8756499B1 (en) * 2013-04-29 2014-06-17 Google Inc. Gesture keyboard input of non-dictionary character strings using substitute scoring
KR101332757B1 (en) * 2013-05-16 2013-11-25 주식회사 큐키 Type correction method for word or letters without deletion input and device for implementing the method
JP2014235612A (en) 2013-06-03 2014-12-15 富士通株式会社 Terminal device, correction method, and correction program
US9262748B2 (en) 2013-08-15 2016-02-16 International Business Machines Corporation Identifying locations of potential user errors during manipulation of multimedia content
KR101381101B1 (en) * 2013-11-13 2014-04-02 주식회사 큐키 Error revising method through correlation decision between character strings
US9037967B1 (en) * 2014-02-18 2015-05-19 King Fahd University Of Petroleum And Minerals Arabic spell checking technique
US9361531B2 (en) 2014-07-21 2016-06-07 Optum, Inc. Targeted optical character recognition (OCR) for medical terminology
CN104268166B (en) * 2014-09-09 2017-04-19 北京搜狗科技发展有限公司 Input method, device and electronic device
US10032071B2 (en) * 2014-12-23 2018-07-24 Lenovo (Singapore) Pte. Ltd. Candidate handwriting words using optical character recognition and spell check
CN105468468B (en) * 2015-12-02 2018-07-27 北京光年无限科技有限公司 Data error-correcting method towards question answering system and device
CN107729318B (en) * 2017-10-17 2021-04-20 语联网(武汉)信息技术有限公司 Method for automatically correcting partial characters by Chinese part-of-speech judgment
CN107967303B (en) * 2017-11-10 2021-03-26 传神语联网网络科技股份有限公司 Corpus display method and apparatus
CN108052609A (en) * 2017-12-13 2018-05-18 武汉烽火普天信息技术有限公司 A kind of address matching method based on dictionary and machine learning
CN108153735B (en) * 2017-12-28 2021-05-18 北京奇艺世纪科技有限公司 Method and system for acquiring similar meaning words
CN109783811B (en) * 2018-12-26 2023-10-31 东软集团股份有限公司 Method, device, equipment and storage medium for identifying text editing errors
CN109918987B (en) * 2018-12-29 2021-05-14 中国电子科技集团公司信息科学研究院 Video subtitle keyword identification method and device
US11562731B2 (en) 2020-08-19 2023-01-24 Sorenson Ip Holdings, Llc Word replacement in transcriptions
CN113255330B (en) * 2021-05-31 2021-09-24 中南大学 Chinese spelling checking method based on character feature classifier and soft output

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5774588A (en) * 1995-06-07 1998-06-30 United Parcel Service Of America, Inc. Method and system for comparing strings with entries of a lexicon
US5953541A (en) 1997-01-24 1999-09-14 Tegic Communications, Inc. Disambiguating system for disambiguating ambiguous input sequences by displaying objects associated with the generated input sequences in the order of decreasing frequency of use
US6782505B1 (en) * 1999-04-19 2004-08-24 Daniel P. Miranker Method and system for generating structured data from semi-structured data sources
US6556841B2 (en) 1999-05-03 2003-04-29 Openwave Systems Inc. Spelling correction for two-way mobile communication devices
JP4519381B2 (en) 1999-05-27 2010-08-04 テジック コミュニケーションズ インク Keyboard system with automatic correction
US7254773B2 (en) 2000-12-29 2007-08-07 International Business Machines Corporation Automated spell analysis
US7117144B2 (en) * 2001-03-31 2006-10-03 Microsoft Corporation Spell checking for text input via reduced keypad keys
US20050032978A1 (en) * 2003-08-06 2005-02-10 O'dwyer James B. Etherified carbamate functional copolymers of isobutylene type monomers, and their use in curable compositions
US7508324B2 (en) 2004-08-06 2009-03-24 Daniel Suraqui Finger activated reduced keyboard and a method for performing text input
US7599828B2 (en) 2005-03-01 2009-10-06 Microsoft Corporation Grammatically correct contraction spelling suggestions for french
US9606634B2 (en) * 2005-05-18 2017-03-28 Nokia Technologies Oy Device incorporating improved text input mechanism
EP1855210B1 (en) * 2006-05-11 2018-01-03 Dassault Systèmes Spell checking
US7558725B2 (en) * 2006-05-23 2009-07-07 Lexisnexis, A Division Of Reed Elsevier Inc. Method and apparatus for multilingual spelling corrections
WO2008043582A1 (en) * 2006-10-13 2008-04-17 International Business Machines Corporation Systems and methods for building an electronic dictionary of multi-word names and for performing fuzzy searches in said dictionary
US8201087B2 (en) 2007-02-01 2012-06-12 Tegic Communications, Inc. Spell-check for a keyboard system with automatic correction
US8077983B2 (en) * 2007-10-04 2011-12-13 Zi Corporation Of Canada, Inc. Systems and methods for character correction in communication devices

Also Published As

Publication number Publication date
WO2009044275A3 (en) 2009-07-09
US9460066B2 (en) 2016-10-04
US20090092323A1 (en) 2009-04-09
US20120130706A1 (en) 2012-05-24
EP2206058A4 (en) 2011-02-23
EP2206058A2 (en) 2010-07-14
US20130205204A1 (en) 2013-08-08
US8401314B2 (en) 2013-03-19
WO2009044275A2 (en) 2009-04-09
US8077983B2 (en) 2011-12-13

Similar Documents

Publication Publication Date Title
WO2009044275A8 (en) Systems and methods for character correction in communication devices
US8542195B2 (en) Method for optimization of soft keyboards for multiple languages
CN101241514B (en) Method for creating error-correcting database, automatic error correcting method and system
JP6687741B2 (en) Information mining method, system, electronic device, and readable storage medium
CN103400576B (en) Based on speech model update method and the device of User action log
CN102135814B (en) A kind of character and word input method and system
AU2003299969A1 (en) Method and system for computer aided manufacturing
CN105261358A (en) N-gram grammar model constructing method for voice identification and voice identification system
WO2009066501A1 (en) Information search method, device, and program, and computer-readable recording medium
CN104508739A (en) Dynamic language model
CA2567958A1 (en) Multimodal disambiguation of speech recognition
CN107168957A (en) A kind of Chinese word cutting method
CN102830924A (en) Method and device for adjusting input method keyboards
WO2009152874A3 (en) Method and device for handwriting detection
CN102422245A (en) Input method editor
CN104008166A (en) Dialogue short text clustering method based on form and semantic similarity
CN102591880A (en) Information providing method and device
CN105354199A (en) Scene information based entity meaning identification method and system
US20170046330A1 (en) Context specific language model for input method editor
CN103389915A (en) Input error correcting method, input error correcting device, input error correcting server and input error correcting server system
CN105005563A (en) Abstract generation method and apparatus
CN103186522A (en) Electronic device and natural language analyzing method thereof
CN103246714B (en) Method and system of Chinese search engine inquiring and error correcting based on error pattern searching
CN103076894A (en) Method and equipment for building input entries for object identity information according to object identity information
CN104090865A (en) Text similarity calculation method and device

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 08807230; Country of ref document: EP; Kind code of ref document: A2)
NENP Non-entry into the national phase (Ref country code: DE)
WWE WIPO information: entry into national phase (Ref document number: 2008807230; Country of ref document: EP)