A document processing apparatus for segmenting a color document image into regions obtains a binary image by binarizing a color image, and extracts regions having different background colors from the color image to generate region information indicating the position and size of each extracted region....http://www.google.com/patents/US7170647?utm_source=gb-gplus-sharePatent US7170647 - Document processing apparatus and method