WO2007136951A3 - Efficient storage and search of word lists and other text - Google Patents
Efficient storage and search of word lists and other text Download PDFInfo
- Publication number
- WO2007136951A3 WO2007136951A3 PCT/US2007/066979 US2007066979W WO2007136951A3 WO 2007136951 A3 WO2007136951 A3 WO 2007136951A3 US 2007066979 W US2007066979 W US 2007066979W WO 2007136951 A3 WO2007136951 A3 WO 2007136951A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- search
- text
- word
- columns
- words
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/316—Indexing structures
- G06F16/328—Management therefor
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99942—Manipulating data structure, e.g. compression, compaction, compilation
Abstract
A computer readable storage medium tangibly embodying machine-readable digital data arranged to facilitate expedited searching. The data includes a plurality of words residing in a table having rows and columns, each word residing in a different row and each letter of the word occupying a different column in that row. Each continuous run of same letters in a column forms an interval. The words are positioned relative to each other to maximize lengths of the intervals, and/or optimize efficiency of compression of the columns by run length encoding.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP07797256A EP2011003A2 (en) | 2006-04-19 | 2007-04-19 | Efficient storage and search of word lists and other text |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/379,354 | 2006-04-19 | ||
US11/379,354 US7580925B2 (en) | 2006-04-19 | 2006-04-19 | Efficient storage and search of word lists and other text |
Publications (3)
Publication Number | Publication Date |
---|---|
WO2007136951A2 WO2007136951A2 (en) | 2007-11-29 |
WO2007136951A3 true WO2007136951A3 (en) | 2008-04-17 |
WO2007136951B1 WO2007136951B1 (en) | 2008-06-12 |
Family
ID=38620662
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2007/066979 WO2007136951A2 (en) | 2006-04-19 | 2007-04-19 | Efficient storage and search of word lists and other text |
Country Status (4)
Country | Link |
---|---|
US (3) | US7580925B2 (en) |
EP (1) | EP2011003A2 (en) |
CN (1) | CN101432686A (en) |
WO (1) | WO2007136951A2 (en) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8117540B2 (en) * | 2005-05-18 | 2012-02-14 | Neuer Wall Treuhand Gmbh | Method and device incorporating improved text input mechanism |
US8036878B2 (en) * | 2005-05-18 | 2011-10-11 | Never Wall Treuhand GmbH | Device incorporating improved text input mechanism |
US8374846B2 (en) | 2005-05-18 | 2013-02-12 | Neuer Wall Treuhand Gmbh | Text input device and method |
US9606634B2 (en) | 2005-05-18 | 2017-03-28 | Nokia Technologies Oy | Device incorporating improved text input mechanism |
US7580925B2 (en) * | 2006-04-19 | 2009-08-25 | Tegic Communications, Inc. | Efficient storage and search of word lists and other text |
US20080089299A1 (en) * | 2006-10-13 | 2008-04-17 | Motorola, Inc. | Method and system for distributing content in Ad-hoc networks using super peers |
US8299943B2 (en) | 2007-05-22 | 2012-10-30 | Tegic Communications, Inc. | Multiple predictions in a reduced keyboard disambiguating system |
DE602008005428D1 (en) | 2008-06-11 | 2011-04-21 | Exb Asset Man Gmbh | Apparatus and method with improved text input mechanism |
US8108361B2 (en) * | 2008-07-31 | 2012-01-31 | Microsoft Corporation | Efficient column based data encoding for large-scale data storage |
EP2354976A1 (en) * | 2010-02-09 | 2011-08-10 | ExB Asset Management GmbH | Online analysis and display of correlated information |
US8782556B2 (en) * | 2010-02-12 | 2014-07-15 | Microsoft Corporation | User-centric soft keyboard predictive technologies |
US9734194B1 (en) | 2013-03-14 | 2017-08-15 | Google Inc. | Encoding time interval information |
US11294905B2 (en) * | 2019-01-07 | 2022-04-05 | Optumsoft, Inc. | Sparse data index table |
CN110928874B (en) * | 2019-10-14 | 2021-09-14 | 京东数字科技控股有限公司 | Information processing method, device and system, electronic equipment and storage medium |
CN116016397B (en) * | 2023-03-28 | 2023-05-30 | 湖南毂梁微电子有限公司 | Method and circuit for quickly searching message transmission buffer area |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4882703A (en) * | 1984-09-17 | 1989-11-21 | Nicolai Robert L | Procedure for fining all words contained within any given word including creation of a dictionary |
US5369577A (en) * | 1991-02-01 | 1994-11-29 | Wang Laboratories, Inc. | Text searching system |
US5701459A (en) * | 1993-01-13 | 1997-12-23 | Novell, Inc. | Method and apparatus for rapid full text index creation |
Family Cites Families (63)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4299578A (en) * | 1978-01-20 | 1981-11-10 | Wayman Paul L | Crossword system and game apparatus |
US4903269A (en) * | 1988-05-16 | 1990-02-20 | General Electric Company | Error detector for encoded digital signals |
US5109352A (en) * | 1988-08-09 | 1992-04-28 | Dell Robert B O | System for encoding a collection of ideographic characters |
US6119120A (en) * | 1996-06-28 | 2000-09-12 | Microsoft Corporation | Computer implemented methods for constructing a compressed data structure from a data string and for using the data structure to find data patterns in the data string |
US6023670A (en) * | 1996-08-19 | 2000-02-08 | International Business Machines Corporation | Natural language determination using correlation between common words |
US5952942A (en) * | 1996-11-21 | 1999-09-14 | Motorola, Inc. | Method and device for input of text messages from a keypad |
US6009444A (en) * | 1997-02-24 | 1999-12-28 | Motorola, Inc. | Text input device and method |
US6054941A (en) * | 1997-05-27 | 2000-04-25 | Motorola, Inc. | Apparatus and method for inputting ideographic characters |
US7257528B1 (en) * | 1998-02-13 | 2007-08-14 | Zi Corporation Of Canada, Inc. | Method and apparatus for Chinese character text input |
US6104317A (en) * | 1998-02-27 | 2000-08-15 | Motorola, Inc. | Data entry device and method |
US6169538B1 (en) * | 1998-08-13 | 2001-01-02 | Motorola, Inc. | Method and apparatus for implementing a graphical user interface keyboard and a text buffer on electronic devices |
SE514377C2 (en) * | 1998-08-26 | 2001-02-19 | Gunnar Sparr | character recognition |
US6362752B1 (en) * | 1998-12-23 | 2002-03-26 | Motorola, Inc. | Keypad with strokes assigned to key for ideographic text input |
US6801659B1 (en) * | 1999-01-04 | 2004-10-05 | Zi Technology Corporation Ltd. | Text input system for ideographic and nonideographic languages |
US6204848B1 (en) * | 1999-04-14 | 2001-03-20 | Motorola, Inc. | Data entry apparatus having a limited number of character keys and method |
US6172625B1 (en) * | 1999-07-06 | 2001-01-09 | Motorola, Inc. | Disambiguation method and apparatus, and dictionary data compression techniques |
US6424743B1 (en) * | 1999-11-05 | 2002-07-23 | Motorola, Inc. | Graphical handwriting recognition user interface |
US6686852B1 (en) * | 2000-09-15 | 2004-02-03 | Motorola, Inc. | Keypad layout for alphabetic character input |
SE521911C2 (en) * | 2001-01-15 | 2003-12-16 | Decuma Ab Ideon Res Park | Method, device and computer program for recognizing a handwritten character |
SE519014C2 (en) * | 2001-03-07 | 2002-12-23 | Decuma Ab Ideon Res Park | Method and apparatus for recognizing a handwritten pattern |
US6982658B2 (en) * | 2001-03-22 | 2006-01-03 | Motorola, Inc. | Keypad layout for alphabetic symbol input |
US6502118B1 (en) * | 2001-03-22 | 2002-12-31 | Motorola, Inc. | Fast system and method for producing a logarithmic signal approximation with variable precision |
US20030023426A1 (en) * | 2001-06-22 | 2003-01-30 | Zi Technology Corporation Ltd. | Japanese language entry mechanism for small keypads |
US6947771B2 (en) * | 2001-08-06 | 2005-09-20 | Motorola, Inc. | User interface for a portable electronic device |
US6757544B2 (en) * | 2001-08-15 | 2004-06-29 | Motorola, Inc. | System and method for determining a location relevant to a communication device and/or its associated user |
US20030054830A1 (en) * | 2001-09-04 | 2003-03-20 | Zi Corporation | Navigation system for mobile communication devices |
US7075520B2 (en) * | 2001-12-12 | 2006-07-11 | Zi Technology Corporation Ltd | Key press disambiguation using a keypad of multidirectional keys |
US7949513B2 (en) * | 2002-01-22 | 2011-05-24 | Zi Corporation Of Canada, Inc. | Language module and method for use with text processing devices |
US6912581B2 (en) * | 2002-02-27 | 2005-06-28 | Motorola, Inc. | System and method for concurrent multimodal communication session persistence |
US6807529B2 (en) * | 2002-02-27 | 2004-10-19 | Motorola, Inc. | System and method for concurrent multimodal communication |
US6864809B2 (en) * | 2002-02-28 | 2005-03-08 | Zi Technology Corporation Ltd | Korean language predictive mechanism for text entry by a user |
US7272564B2 (en) * | 2002-03-22 | 2007-09-18 | Motorola, Inc. | Method and apparatus for multimodal communication with user control of delivery modality |
US7020849B1 (en) * | 2002-05-31 | 2006-03-28 | Openwave Systems Inc. | Dynamic display for communication devices |
US7095403B2 (en) * | 2002-12-09 | 2006-08-22 | Motorola, Inc. | User interface of a keypad entry system for character input |
US20040163032A1 (en) | 2002-12-17 | 2004-08-19 | Jin Guo | Ambiguity resolution for predictive text entry |
US20040153963A1 (en) * | 2003-02-05 | 2004-08-05 | Simpson Todd G. | Information entry mechanism for small keypads |
US20040153975A1 (en) * | 2003-02-05 | 2004-08-05 | Williams Roland E. | Text entry mechanism for small keypads |
US7256769B2 (en) * | 2003-02-24 | 2007-08-14 | Zi Corporation Of Canada, Inc. | System and method for text entry on a reduced keyboard |
US6955602B2 (en) * | 2003-05-15 | 2005-10-18 | Zi Technology Corporation Ltd | Text entry within a video game |
WO2004111871A1 (en) | 2003-06-18 | 2004-12-23 | Zi Corporation | Configurable information identification system and method |
US20060236239A1 (en) * | 2003-06-18 | 2006-10-19 | Zi Corporation | Text entry system and method |
US7057607B2 (en) * | 2003-06-30 | 2006-06-06 | Motorola, Inc. | Application-independent text entry for touch-sensitive display |
US7389235B2 (en) * | 2003-09-30 | 2008-06-17 | Motorola, Inc. | Method and system for unified speech and graphic user interfaces |
US6973332B2 (en) * | 2003-10-24 | 2005-12-06 | Motorola, Inc. | Apparatus and method for forming compound words |
US8136050B2 (en) * | 2003-11-21 | 2012-03-13 | Nuance Communications, Inc. | Electronic device and user interface and input method therefor |
US7890526B1 (en) * | 2003-12-30 | 2011-02-15 | Microsoft Corporation | Incremental query refinement |
US7603705B2 (en) * | 2004-05-04 | 2009-10-13 | Next It Corporation | Methods and systems for enforcing network and computer use policy |
WO2006026908A1 (en) | 2004-08-25 | 2006-03-16 | Dong Li | A chinese characters inputting method which uses continuous phonetic letters in a portable terminal |
US7890492B2 (en) * | 2004-11-15 | 2011-02-15 | Zi Corporation Of Canada, Inc. | Organizing pointers to objects in an array to improve the speed of object retrieval |
US20060129928A1 (en) * | 2004-12-02 | 2006-06-15 | Weigen Qiu | Use of pointing device to identify ideographic characters |
EP1836564A4 (en) * | 2004-12-07 | 2011-07-27 | Nuance Communications Inc | User interface with augmented searching characteristics |
US20060155536A1 (en) * | 2004-12-20 | 2006-07-13 | Williams Roland E | Method and device for displaying a telephone number |
US7466859B2 (en) * | 2004-12-30 | 2008-12-16 | Motorola, Inc. | Candidate list enhancement for predictive text input in electronic devices |
WO2006091156A1 (en) * | 2005-02-28 | 2006-08-31 | Zi Decuma Ab | Recognition graph |
JP4787275B2 (en) * | 2005-02-28 | 2011-10-05 | ザイ デクマ アクチボラゲット | Segmentation-based recognition |
US8117540B2 (en) * | 2005-05-18 | 2012-02-14 | Neuer Wall Treuhand Gmbh | Method and device incorporating improved text input mechanism |
JP5118054B2 (en) * | 2005-11-21 | 2013-01-16 | ザイ コーポレイション オブ カナダ インコーポレイテッド | Communication information distribution system and communication information distribution method |
US8018439B2 (en) * | 2006-03-22 | 2011-09-13 | Zi Corporation Of Canada, Inc. | Mobile appliance system and method enabling efficient entry |
US7972205B2 (en) * | 2006-03-31 | 2011-07-05 | Vijay Krishna | Search engine crossword puzzle |
US7774746B2 (en) * | 2006-04-19 | 2010-08-10 | Apple, Inc. | Generating a format translator |
US7580925B2 (en) * | 2006-04-19 | 2009-08-25 | Tegic Communications, Inc. | Efficient storage and search of word lists and other text |
US8166418B2 (en) | 2006-05-26 | 2012-04-24 | Zi Corporation Of Canada, Inc. | Device and method of conveying meaning |
CN101611417B (en) * | 2006-12-01 | 2012-07-04 | Zi德库玛股份公司 | Method for character recognition |
-
2006
- 2006-04-19 US US11/379,354 patent/US7580925B2/en active Active
-
2007
- 2007-04-19 WO PCT/US2007/066979 patent/WO2007136951A2/en active Application Filing
- 2007-04-19 CN CNA2007800157006A patent/CN101432686A/en active Pending
- 2007-04-19 EP EP07797256A patent/EP2011003A2/en not_active Withdrawn
-
2008
- 2008-10-01 US US12/243,882 patent/US8204921B2/en active Active
-
2012
- 2012-06-18 US US13/526,396 patent/US8676779B2/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4882703A (en) * | 1984-09-17 | 1989-11-21 | Nicolai Robert L | Procedure for fining all words contained within any given word including creation of a dictionary |
US5369577A (en) * | 1991-02-01 | 1994-11-29 | Wang Laboratories, Inc. | Text searching system |
US5701459A (en) * | 1993-01-13 | 1997-12-23 | Novell, Inc. | Method and apparatus for rapid full text index creation |
Also Published As
Publication number | Publication date |
---|---|
US8204921B2 (en) | 2012-06-19 |
EP2011003A2 (en) | 2009-01-07 |
CN101432686A (en) | 2009-05-13 |
US20120259888A1 (en) | 2012-10-11 |
US20070250469A1 (en) | 2007-10-25 |
US8676779B2 (en) | 2014-03-18 |
US7580925B2 (en) | 2009-08-25 |
US20090037371A1 (en) | 2009-02-05 |
WO2007136951B1 (en) | 2008-06-12 |
WO2007136951A2 (en) | 2007-11-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2007136951A3 (en) | Efficient storage and search of word lists and other text | |
WO2005013046A3 (en) | Ranking search results using conversion data | |
WO2004114077A3 (en) | Method of improving reading of a text | |
Evans et al. | Assessing the effect of economic inequality | |
Heads | A new pygmy mole cricket in cretaceous amber from Burma (Orthoptera: Tridactylidae). | |
Mead | A summary of welfare reform | |
Throsby | Dance in Australia: A profile | |
Morley | Reconnecting with Cuba: How Washington Lost a Cold War in Latin America | |
Osman | The impact of Information Technology on SMEs in Oman | |
Kristensen et al. | Innovation, management and sustainability-change processes in the food service sector | |
Thorseth et al. | Dynamic miniature lighting system with low correlated colour temperature and high colour rendering index for museum lighting of fragile artefacts | |
Hedling | The Battle of Dybbøl Revisited: The Danish Press Reception of the TV-series 1864 | |
Vosters et al. | Historical-sociolinguistic aspects of language variation and change in Belgium. Embracing negation in 18th and 19th-century Flemish language norms and use | |
Young et al. | Union and identity: Scotland in a social and insitutional context | |
Ahlava | Interview with Antti Ahlava | |
Mastoridis | Thessaloniki, a special case in contemporary Greek visual communication | |
Cre | Ioannis Rusbrocchii De ornatu spiritualium nuptiarium Whilhelmi Iordani interprete, ed. Kees Schepers, CCCM 207 (Turnhout, Brepols, 2004) | |
Rudy | The Fagel Missal | |
Salgado | The Aesthetics of Participative Design Pieces: Two Case Studies in Museums | |
Janiszewski et al. | Greek-Australians | |
TH85093S (en) | Wine rack | |
Bekers | Reasonably Self-Willed: Interview with Nawal El Saadawi, Brussels, 26 November 2007 | |
Dahl et al. | Inertia and advance in the organic sector: food education in Denmark | |
Janiszewski et al. | Miner Ashton (Minas Aslanis) | |
FR2868178A1 (en) | Text document e.g. email, searching process for e.g. microcomputer, involves displaying information related to searched document as table with rows and columns activated/deactivated to mask documents which do not contain identification unit |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07797256 Country of ref document: EP Kind code of ref document: A2 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007797256 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 200780015700.6 Country of ref document: CN |