A system, framework, and algorithms for data deduplication are described. A declarative language, such as a Datalog-type logic language, is provided. Programs in the language describe the data to be deduplicated, along with hard constraints that must, and soft constraints that should, be satisfied by data deduplicated according to...

Patent US8200640 - Declarative framework for deduplication
http://www.google.com/patents/US8200640
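
To make the distinction between hard and soft constraints concrete, here is a minimal sketch in Python. It is not the patent's language or algorithm: the function name `deduplicate`, the pair-based constraint encoding, and the greedy merge strategy are all assumptions chosen for illustration. Soft constraints are treated as "these two records should end up in the same cluster", hard constraints as "these two records must never share a cluster".

```python
def deduplicate(records, soft_same, hard_different):
    """Greedy clustering sketch (hypothetical, for illustration only).

    records:        list of record ids
    soft_same:      (a, b) pairs that SHOULD share a cluster (soft constraints)
    hard_different: (a, b) pairs that MUST NOT share a cluster (hard constraints)
    """
    # Each record starts in its own cluster.
    cluster_of = {r: {r} for r in records}
    forbidden = {frozenset(pair) for pair in hard_different}

    for a, b in soft_same:
        ca, cb = cluster_of[a], cluster_of[b]
        if ca is cb:
            continue  # already merged
        # A soft constraint is dropped if merging would break a hard one.
        if any(frozenset((x, y)) in forbidden for x in ca for y in cb):
            continue
        merged = ca | cb
        for r in merged:
            cluster_of[r] = merged

    # Collect each distinct cluster once, in order of first appearance.
    clusters = []
    for r in records:
        c = cluster_of[r]
        if not any(c is seen for seen in clusters):
            clusters.append(c)
    return [sorted(c) for c in clusters]


records = ["r1", "r2", "r3", "r4"]
soft_same = [("r1", "r2"), ("r2", "r3"), ("r3", "r4")]
hard_different = [("r1", "r3")]
print(deduplicate(records, soft_same, hard_different))
```

In this run the soft constraint linking r2 and r3 is left unsatisfied, because honoring it would place r1 and r3 in the same cluster and violate the hard constraint. A declarative system would state such constraints as rules and leave the search strategy to the engine; the greedy loop here only stands in for that engine.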