A method and a system for information extraction from Web pages formatted with markup languages such as HTML [8]. A method and system for interactively and visually describing information patterns of interest based on visualized sample Web pages [5,6,16-29]. A method and data structure for representing...http://www.google.com/patents/US20050022115?utm_source=gb-gplus-sharePatent US20050022115 - Visual and interactive wrapper generation, automated information extraction from web pages, and translation into xml