WO2007136951A3 - Efficient storage and search of word lists and other text - Google Patents

Efficient storage and search of word lists and other text Download PDF

Info

Publication number
WO2007136951A3
WO2007136951A3 PCT/US2007/066979 US2007066979W WO2007136951A3 WO 2007136951 A3 WO2007136951 A3 WO 2007136951A3 US 2007066979 W US2007066979 W US 2007066979W WO 2007136951 A3 WO2007136951 A3 WO 2007136951A3
Authority
WO
WIPO (PCT)
Prior art keywords
search
text
word
columns
words
Prior art date
Application number
PCT/US2007/066979
Other languages
French (fr)
Other versions
WO2007136951B1 (en
WO2007136951A2 (en
Inventor
Erland Unruh
David Jon Kay
Original Assignee
Tegic Communications Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tegic Communications Inc filed Critical Tegic Communications Inc
Priority to EP07797256A priority Critical patent/EP2011003A2/en
Publication of WO2007136951A2 publication Critical patent/WO2007136951A2/en
Publication of WO2007136951A3 publication Critical patent/WO2007136951A3/en
Publication of WO2007136951B1 publication Critical patent/WO2007136951B1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures
    • G06F16/328Management therefor
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99936Pattern matching access
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99942Manipulating data structure, e.g. compression, compaction, compilation

Abstract

A computer readable storage medium tangibly embodying machine-readable digital data arranged to facilitate expedited searching. The data includes a plurality of words residing in a table having rows and columns, each word residing in a different row and each letter of the word occupying a different column in that row. Each continuous run of same letters in a column forms an interval. The words are positioned relative to each other to maximize lengths of the intervals, and/or optimize efficiency of compression of the columns by run length encoding.
PCT/US2007/066979 2006-04-19 2007-04-19 Efficient storage and search of word lists and other text WO2007136951A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP07797256A EP2011003A2 (en) 2006-04-19 2007-04-19 Efficient storage and search of word lists and other text

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/379,354 2006-04-19
US11/379,354 US7580925B2 (en) 2006-04-19 2006-04-19 Efficient storage and search of word lists and other text

Publications (3)

Publication Number Publication Date
WO2007136951A2 WO2007136951A2 (en) 2007-11-29
WO2007136951A3 true WO2007136951A3 (en) 2008-04-17
WO2007136951B1 WO2007136951B1 (en) 2008-06-12

Family

ID=38620662

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2007/066979 WO2007136951A2 (en) 2006-04-19 2007-04-19 Efficient storage and search of word lists and other text

Country Status (4)

Country Link
US (3) US7580925B2 (en)
EP (1) EP2011003A2 (en)
CN (1) CN101432686A (en)
WO (1) WO2007136951A2 (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8117540B2 (en) * 2005-05-18 2012-02-14 Neuer Wall Treuhand Gmbh Method and device incorporating improved text input mechanism
US8036878B2 (en) * 2005-05-18 2011-10-11 Never Wall Treuhand GmbH Device incorporating improved text input mechanism
US8374846B2 (en) 2005-05-18 2013-02-12 Neuer Wall Treuhand Gmbh Text input device and method
US9606634B2 (en) 2005-05-18 2017-03-28 Nokia Technologies Oy Device incorporating improved text input mechanism
US7580925B2 (en) * 2006-04-19 2009-08-25 Tegic Communications, Inc. Efficient storage and search of word lists and other text
US20080089299A1 (en) * 2006-10-13 2008-04-17 Motorola, Inc. Method and system for distributing content in Ad-hoc networks using super peers
US8299943B2 (en) 2007-05-22 2012-10-30 Tegic Communications, Inc. Multiple predictions in a reduced keyboard disambiguating system
DE602008005428D1 (en) 2008-06-11 2011-04-21 Exb Asset Man Gmbh Apparatus and method with improved text input mechanism
US8108361B2 (en) * 2008-07-31 2012-01-31 Microsoft Corporation Efficient column based data encoding for large-scale data storage
EP2354976A1 (en) * 2010-02-09 2011-08-10 ExB Asset Management GmbH Online analysis and display of correlated information
US8782556B2 (en) * 2010-02-12 2014-07-15 Microsoft Corporation User-centric soft keyboard predictive technologies
US9734194B1 (en) 2013-03-14 2017-08-15 Google Inc. Encoding time interval information
US11294905B2 (en) * 2019-01-07 2022-04-05 Optumsoft, Inc. Sparse data index table
CN110928874B (en) * 2019-10-14 2021-09-14 京东数字科技控股有限公司 Information processing method, device and system, electronic equipment and storage medium
CN116016397B (en) * 2023-03-28 2023-05-30 湖南毂梁微电子有限公司 Method and circuit for quickly searching message transmission buffer area

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4882703A (en) * 1984-09-17 1989-11-21 Nicolai Robert L Procedure for fining all words contained within any given word including creation of a dictionary
US5369577A (en) * 1991-02-01 1994-11-29 Wang Laboratories, Inc. Text searching system
US5701459A (en) * 1993-01-13 1997-12-23 Novell, Inc. Method and apparatus for rapid full text index creation

Family Cites Families (63)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4299578A (en) * 1978-01-20 1981-11-10 Wayman Paul L Crossword system and game apparatus
US4903269A (en) * 1988-05-16 1990-02-20 General Electric Company Error detector for encoded digital signals
US5109352A (en) * 1988-08-09 1992-04-28 Dell Robert B O System for encoding a collection of ideographic characters
US6119120A (en) * 1996-06-28 2000-09-12 Microsoft Corporation Computer implemented methods for constructing a compressed data structure from a data string and for using the data structure to find data patterns in the data string
US6023670A (en) * 1996-08-19 2000-02-08 International Business Machines Corporation Natural language determination using correlation between common words
US5952942A (en) * 1996-11-21 1999-09-14 Motorola, Inc. Method and device for input of text messages from a keypad
US6009444A (en) * 1997-02-24 1999-12-28 Motorola, Inc. Text input device and method
US6054941A (en) * 1997-05-27 2000-04-25 Motorola, Inc. Apparatus and method for inputting ideographic characters
US7257528B1 (en) * 1998-02-13 2007-08-14 Zi Corporation Of Canada, Inc. Method and apparatus for Chinese character text input
US6104317A (en) * 1998-02-27 2000-08-15 Motorola, Inc. Data entry device and method
US6169538B1 (en) * 1998-08-13 2001-01-02 Motorola, Inc. Method and apparatus for implementing a graphical user interface keyboard and a text buffer on electronic devices
SE514377C2 (en) * 1998-08-26 2001-02-19 Gunnar Sparr character recognition
US6362752B1 (en) * 1998-12-23 2002-03-26 Motorola, Inc. Keypad with strokes assigned to key for ideographic text input
US6801659B1 (en) * 1999-01-04 2004-10-05 Zi Technology Corporation Ltd. Text input system for ideographic and nonideographic languages
US6204848B1 (en) * 1999-04-14 2001-03-20 Motorola, Inc. Data entry apparatus having a limited number of character keys and method
US6172625B1 (en) * 1999-07-06 2001-01-09 Motorola, Inc. Disambiguation method and apparatus, and dictionary data compression techniques
US6424743B1 (en) * 1999-11-05 2002-07-23 Motorola, Inc. Graphical handwriting recognition user interface
US6686852B1 (en) * 2000-09-15 2004-02-03 Motorola, Inc. Keypad layout for alphabetic character input
SE521911C2 (en) * 2001-01-15 2003-12-16 Decuma Ab Ideon Res Park Method, device and computer program for recognizing a handwritten character
SE519014C2 (en) * 2001-03-07 2002-12-23 Decuma Ab Ideon Res Park Method and apparatus for recognizing a handwritten pattern
US6982658B2 (en) * 2001-03-22 2006-01-03 Motorola, Inc. Keypad layout for alphabetic symbol input
US6502118B1 (en) * 2001-03-22 2002-12-31 Motorola, Inc. Fast system and method for producing a logarithmic signal approximation with variable precision
US20030023426A1 (en) * 2001-06-22 2003-01-30 Zi Technology Corporation Ltd. Japanese language entry mechanism for small keypads
US6947771B2 (en) * 2001-08-06 2005-09-20 Motorola, Inc. User interface for a portable electronic device
US6757544B2 (en) * 2001-08-15 2004-06-29 Motorola, Inc. System and method for determining a location relevant to a communication device and/or its associated user
US20030054830A1 (en) * 2001-09-04 2003-03-20 Zi Corporation Navigation system for mobile communication devices
US7075520B2 (en) * 2001-12-12 2006-07-11 Zi Technology Corporation Ltd Key press disambiguation using a keypad of multidirectional keys
US7949513B2 (en) * 2002-01-22 2011-05-24 Zi Corporation Of Canada, Inc. Language module and method for use with text processing devices
US6912581B2 (en) * 2002-02-27 2005-06-28 Motorola, Inc. System and method for concurrent multimodal communication session persistence
US6807529B2 (en) * 2002-02-27 2004-10-19 Motorola, Inc. System and method for concurrent multimodal communication
US6864809B2 (en) * 2002-02-28 2005-03-08 Zi Technology Corporation Ltd Korean language predictive mechanism for text entry by a user
US7272564B2 (en) * 2002-03-22 2007-09-18 Motorola, Inc. Method and apparatus for multimodal communication with user control of delivery modality
US7020849B1 (en) * 2002-05-31 2006-03-28 Openwave Systems Inc. Dynamic display for communication devices
US7095403B2 (en) * 2002-12-09 2006-08-22 Motorola, Inc. User interface of a keypad entry system for character input
US20040163032A1 (en) 2002-12-17 2004-08-19 Jin Guo Ambiguity resolution for predictive text entry
US20040153963A1 (en) * 2003-02-05 2004-08-05 Simpson Todd G. Information entry mechanism for small keypads
US20040153975A1 (en) * 2003-02-05 2004-08-05 Williams Roland E. Text entry mechanism for small keypads
US7256769B2 (en) * 2003-02-24 2007-08-14 Zi Corporation Of Canada, Inc. System and method for text entry on a reduced keyboard
US6955602B2 (en) * 2003-05-15 2005-10-18 Zi Technology Corporation Ltd Text entry within a video game
WO2004111871A1 (en) 2003-06-18 2004-12-23 Zi Corporation Configurable information identification system and method
US20060236239A1 (en) * 2003-06-18 2006-10-19 Zi Corporation Text entry system and method
US7057607B2 (en) * 2003-06-30 2006-06-06 Motorola, Inc. Application-independent text entry for touch-sensitive display
US7389235B2 (en) * 2003-09-30 2008-06-17 Motorola, Inc. Method and system for unified speech and graphic user interfaces
US6973332B2 (en) * 2003-10-24 2005-12-06 Motorola, Inc. Apparatus and method for forming compound words
US8136050B2 (en) * 2003-11-21 2012-03-13 Nuance Communications, Inc. Electronic device and user interface and input method therefor
US7890526B1 (en) * 2003-12-30 2011-02-15 Microsoft Corporation Incremental query refinement
US7603705B2 (en) * 2004-05-04 2009-10-13 Next It Corporation Methods and systems for enforcing network and computer use policy
WO2006026908A1 (en) 2004-08-25 2006-03-16 Dong Li A chinese characters inputting method which uses continuous phonetic letters in a portable terminal
US7890492B2 (en) * 2004-11-15 2011-02-15 Zi Corporation Of Canada, Inc. Organizing pointers to objects in an array to improve the speed of object retrieval
US20060129928A1 (en) * 2004-12-02 2006-06-15 Weigen Qiu Use of pointing device to identify ideographic characters
EP1836564A4 (en) * 2004-12-07 2011-07-27 Nuance Communications Inc User interface with augmented searching characteristics
US20060155536A1 (en) * 2004-12-20 2006-07-13 Williams Roland E Method and device for displaying a telephone number
US7466859B2 (en) * 2004-12-30 2008-12-16 Motorola, Inc. Candidate list enhancement for predictive text input in electronic devices
WO2006091156A1 (en) * 2005-02-28 2006-08-31 Zi Decuma Ab Recognition graph
JP4787275B2 (en) * 2005-02-28 2011-10-05 ザイ デクマ アクチボラゲット Segmentation-based recognition
US8117540B2 (en) * 2005-05-18 2012-02-14 Neuer Wall Treuhand Gmbh Method and device incorporating improved text input mechanism
JP5118054B2 (en) * 2005-11-21 2013-01-16 ザイ コーポレイション オブ カナダ インコーポレイテッド Communication information distribution system and communication information distribution method
US8018439B2 (en) * 2006-03-22 2011-09-13 Zi Corporation Of Canada, Inc. Mobile appliance system and method enabling efficient entry
US7972205B2 (en) * 2006-03-31 2011-07-05 Vijay Krishna Search engine crossword puzzle
US7774746B2 (en) * 2006-04-19 2010-08-10 Apple, Inc. Generating a format translator
US7580925B2 (en) * 2006-04-19 2009-08-25 Tegic Communications, Inc. Efficient storage and search of word lists and other text
US8166418B2 (en) 2006-05-26 2012-04-24 Zi Corporation Of Canada, Inc. Device and method of conveying meaning
CN101611417B (en) * 2006-12-01 2012-07-04 Zi德库玛股份公司 Method for character recognition

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4882703A (en) * 1984-09-17 1989-11-21 Nicolai Robert L Procedure for fining all words contained within any given word including creation of a dictionary
US5369577A (en) * 1991-02-01 1994-11-29 Wang Laboratories, Inc. Text searching system
US5701459A (en) * 1993-01-13 1997-12-23 Novell, Inc. Method and apparatus for rapid full text index creation

Also Published As

Publication number Publication date
US8204921B2 (en) 2012-06-19
EP2011003A2 (en) 2009-01-07
CN101432686A (en) 2009-05-13
US20120259888A1 (en) 2012-10-11
US20070250469A1 (en) 2007-10-25
US8676779B2 (en) 2014-03-18
US7580925B2 (en) 2009-08-25
US20090037371A1 (en) 2009-02-05
WO2007136951B1 (en) 2008-06-12
WO2007136951A2 (en) 2007-11-29

Similar Documents

Publication Publication Date Title
WO2007136951A3 (en) Efficient storage and search of word lists and other text
WO2005013046A3 (en) Ranking search results using conversion data
WO2004114077A3 (en) Method of improving reading of a text
Evans et al. Assessing the effect of economic inequality
Heads A new pygmy mole cricket in cretaceous amber from Burma (Orthoptera: Tridactylidae).
Mead A summary of welfare reform
Throsby Dance in Australia: A profile
Morley Reconnecting with Cuba: How Washington Lost a Cold War in Latin America
Osman The impact of Information Technology on SMEs in Oman
Kristensen et al. Innovation, management and sustainability-change processes in the food service sector
Thorseth et al. Dynamic miniature lighting system with low correlated colour temperature and high colour rendering index for museum lighting of fragile artefacts
Hedling The Battle of Dybbøl Revisited: The Danish Press Reception of the TV-series 1864
Vosters et al. Historical-sociolinguistic aspects of language variation and change in Belgium. Embracing negation in 18th and 19th-century Flemish language norms and use
Young et al. Union and identity: Scotland in a social and insitutional context
Ahlava Interview with Antti Ahlava
Mastoridis Thessaloniki, a special case in contemporary Greek visual communication
Cre Ioannis Rusbrocchii De ornatu spiritualium nuptiarium Whilhelmi Iordani interprete, ed. Kees Schepers, CCCM 207 (Turnhout, Brepols, 2004)
Rudy The Fagel Missal
Salgado The Aesthetics of Participative Design Pieces: Two Case Studies in Museums
Janiszewski et al. Greek-Australians
TH85093S (en) Wine rack
Bekers Reasonably Self-Willed: Interview with Nawal El Saadawi, Brussels, 26 November 2007
Dahl et al. Inertia and advance in the organic sector: food education in Denmark
Janiszewski et al. Miner Ashton (Minas Aslanis)
FR2868178A1 (en) Text document e.g. email, searching process for e.g. microcomputer, involves displaying information related to searched document as table with rows and columns activated/deactivated to mask documents which do not contain identification unit

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07797256

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2007797256

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 200780015700.6

Country of ref document: CN