This specification describes technologies relating to identifying nearest neighbors are provided. In one implementation, a method includes using a first and a second collections of n-grams and their associated probabilities to generate a plurality of randomized ranked collections of n-grams of each of...http://www.google.com/patents/US8175864?utm_source=gb-gplus-sharePatent US8175864 - Identifying nearest neighbors for machine translation