Patent US7031910 - Method and system for encoding and accessing linguistic frequency data
http://www.google.com/patents/US7031910

Linguistic frequency data is encoded by identifying a plurality of sets of character strings in a source text, where each set comprises at least a first and a second character string. Frequency data is obtained for each set and stored at a memory position in a first memory array that is assigned to each...
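
As a rough illustration of the idea in the abstract, the sketch below treats each "set" as a pair of adjacent words, counts how often each pair occurs, and stores the count at a position in a flat array. The function names (`encode_pair_frequencies`, `lookup`), the whitespace tokenisation, and the way positions are assigned (order of first appearance) are assumptions made for this example only; the abstract is truncated before it says how positions are assigned, so this should not be read as the patented method.

```python
from collections import Counter


def encode_pair_frequencies(text):
    """Count adjacent word pairs and store each count in a flat array.

    Hypothetical sketch: the position assignment used here is an
    assumption, not taken from the patent text.
    """
    tokens = text.lower().split()
    # Each "set" is a pair (first string, second string) of adjacent tokens.
    pairs = list(zip(tokens, tokens[1:]))
    counts = Counter(pairs)

    position = {}          # pair -> index into frequency_array
    frequency_array = []   # stands in for the "first memory array"
    for pair in pairs:
        if pair not in position:
            # Assumed scheme: assign positions in order of first appearance.
            position[pair] = len(frequency_array)
            frequency_array.append(counts[pair])
    return position, frequency_array


def lookup(position, frequency_array, first, second):
    """Return the stored frequency for a pair, or 0 if it was never seen."""
    idx = position.get((first, second))
    return 0 if idx is None else frequency_array[idx]


position, frequency_array = encode_pair_frequencies(
    "the cat sat on the mat and the cat ran"
)
print(lookup(position, frequency_array, "the", "cat"))  # prints 2
```

A dictionary is used for the pair-to-position mapping purely for brevity; a production encoder for large corpora would likely need a more compact index.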