US20010000314A1 - Iterative smoothing technique for pre-processing mixed raster content planes to improve the quality of a decompressed image and increase document compression ratios - Google Patents

Iterative smoothing technique for pre-processing mixed raster content planes to improve the quality of a decompressed image and increase document compression ratios Download PDF

Info

Publication number
US20010000314A1
US20010000314A1 US09/733,860 US73386000A US2001000314A1 US 20010000314 A1 US20010000314 A1 US 20010000314A1 US 73386000 A US73386000 A US 73386000A US 2001000314 A1 US2001000314 A1 US 2001000314A1
Authority
US
United States
Prior art keywords
image data
plane
image
reconstruction
locations
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US09/733,860
Other versions
US6334001B2 (en
Inventor
Ricardo Queiroz
Reiner Eschbach
William Fuss
Robert Buckley
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xerox Corp
Original Assignee
Xerox Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xerox Corp filed Critical Xerox Corp
Priority to US09/733,860 priority Critical patent/US6334001B2/en
Publication of US20010000314A1 publication Critical patent/US20010000314A1/en
Application granted granted Critical
Publication of US6334001B2 publication Critical patent/US6334001B2/en
Assigned to BANK ONE, NA, AS ADMINISTRATIVE AGENT reassignment BANK ONE, NA, AS ADMINISTRATIVE AGENT SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: XEROX CORPORATION
Assigned to JPMORGAN CHASE BANK, AS COLLATERAL AGENT reassignment JPMORGAN CHASE BANK, AS COLLATERAL AGENT SECURITY AGREEMENT Assignors: XEROX CORPORATION
Anticipated expiration legal-status Critical
Assigned to XEROX CORPORATION reassignment XEROX CORPORATION RELEASE BY SECURED PARTY (SEE DOCUMENT FOR DETAILS). Assignors: JPMORGAN CHASE BANK, N.A. AS SUCCESSOR-IN-INTEREST ADMINISTRATIVE AGENT AND COLLATERAL AGENT TO JPMORGAN CHASE BANK
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/40Picture signal circuits
    • H04N1/40062Discrimination between different image types, e.g. two-tone, continuous tone
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • G06T2207/10008Still image; Photographic image from scanner, fax or copier
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20048Transform domain processing
    • G06T2207/20052Discrete cosine transform [DCT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30176Document

Definitions

  • This invention relates generally to image processing and, more particularly, to techniques for compressing the digital representation of a color document.
  • LZW Lempel-Ziv Welch
  • MRC Mixed Raster Content
  • the foreground plane contains the color or gray scale information of foreground items such as text.
  • the background plane contains the color or gray scale information for the “background” of the page and the continuous tone pictures that are contained on the page.
  • the selector plane stores information for selecting from either the foreground plane or background plane during decompression.
  • the segmentation process leaves data in both planes in the areas that will not be chosen by the selector plane. This often causes an increase in the number of bits that are required to encode the entire image, thereby decreasing its compression ratio. This results in inconveniences to the user of a printer, fax machine, scanner or other device in which the technique has been incorporated. For this reason, it is advantageous to somehow reduce the amount of data residing on each plane prior to processing.
  • the present invention is directed to using the information that is contained in the selector plane to aid in reducing the amount of data residing on the foreground and/or background planes.
  • the invention takes advantage of the fact that when the selector plane designates a plane to provide information about a given pixel, the information on the other plane that pertains to the same pixel will not be used.
  • the invention provides improved compression of the multi-plane image by treating this useless data in the described manner.
  • U.S. Pat. No. 5,251,271 to Fling issued Oct. 5, 1993 discloses a method for registering digitized multi-plane color images.
  • the method designates one plane as the reference plane and registers each of the other warped planes with the reference plane.
  • Each plane comprises pixels representing luminosity values having scalar x and y coordinates representing positions in the horizontal and vertical directions, respectively, of the plane.
  • the planes are divided into regions. Correlation values are calculated for regions within the divisional region of the reference plane with a plurality of regions offset from the corresponding warped divisional region.
  • a warp error value is calculated for each pixel of each divisional region as a function of the scalar offset.
  • the warp error values are interpolated and added to the current position of each pixel of the warped plane.
  • U.S. Pat. No. 5,060,980 to Johnson et al. issued Oct. 29, 1991 which describes a “form” that includes user modifiable fields and an encoded description of the location, size, type, etc. of the fields to allow for direct programming of a form interpreter.
  • Other information including the processing of the form, encoded data, etc. may be included in the encoded information.
  • a system for creating forms carrying an encoded description of selected attributes of the fields includes means for selecting or creating fields and locating the fields on a form while generating, substantially simultaneously, the encoded description of the selected attributes.
  • a form composer then allows merging of the form and its encoded description for printing or electronic transmission.
  • a system for reading such forms includes a scanner, decoding device, and processor. By reading such forms, data may be entered into or recalled from a data processing system, or a form interpreter may be programmed, locally or remotely, for subsequent handling of forms.
  • U.S. Pat. No. 5,784,175 to Lee discloses a video compression encoder process for compressing digitized video signals representing display motion in video sequences of multiple image frames.
  • the encoder process utilizes object-based video compression to improve the accuracy and versatility of encoding interframe motion and intraframe image features.
  • Video information is compressed relative to objects of arbitrary configurations, rather than fixed, regular arrays of pixels as in conventional video compression methods. This reduces the error components and thereby improves the compression efficiency and accuracy.
  • object-based video compression of this invention provides interactive video editing capabilities for processing compressed video information.
  • U.S. Pat. No. 5,303,313 to Mark et al. issued Apr. 12, 1994 describes image compression based on symbol matching. An image is “pre-compressed” prior to symbol matching using run-length encoding. Symbols are then extracted from the run-length representation. A voting scheme is used in conjunction with a plurality of similarity tests to improve symbol matching accuracy. A template composition scheme wherein the template may be modified based on symbol matches is also disclosed.
  • the “upper” and “lower” planes contain the color or gray scale for the page as well as the continuous tone pictures that are contained on the page.
  • the selector plane stores information for selecting from either the foreground plane or background plane during decompression. Information contained in the selector plane is first used to pre-process the upper and lower planes to reduce the amount of data on each of the other two planes that will be subjected to further processing. Each of the pre-processed planes is compressed using a compression technique optimal for the type of data that resides upon it.
  • an iterative smoothing technique for processing mixed raster content planes includes the steps of replacing each image data signal on the image plane that does not correspond to the reconstruction identified locations with a signal equal to an average of all image data neighbor signals that correspond to locations previously identified for reconstruction, wherein a neighbor signal is defined as an image data signal that is horizontally, vertically or diagonally adjacent to the image data signal; identifying all of the replaced image data signals as reconstruction image data signals in the image data plane map; outputting the image plane if all locations in the image plane map are reconstruction signals, and repeating the image data enhancing signal replacing step if less than all locations in the image plane map are reconstruction signals.
  • an iterative smoothing technique for processing mixed raster content planes which includes the steps of replacing each image data signal on the image plane that does not correspond to the reconstruction identified locations with a signal equal to an average of all image data neighbor signals that correspond to locations previously identified for reconstruction, wherein a neighbor signal is defined as an image data signal that is horizontally, vertically or diagonally adjacent to the image data signal; applying a discrete cosine transformation to the image plane and quantizing the results of the transformation; inverse quantizing the transformation results and performing an inverse discrete cosine transform thereon, thereby generating a pseudo-plane that has image data signals that lie in the same locations as signals in the image data plane; replacing each image data signal on the image data plane that does not correspond to a reconstruction identified location on the image data plane map with an image data signal on the pseudo-plane that lies in a same location.
  • FIG. 1 illustrates a composite image and includes an example of how such an image may be decomposed into three MRC image planes, an upper plane, a lower plane, and a selector plane.
  • FIG. 2 contains a flowchart illustrating the basic steps for compressing a document according to the present invention.
  • FIG. 3 shows a detailed example of the typical contents of a selector plane for an 8 ⁇ 8 block of pixels.
  • FIG. 4 shows a detailed example of an image plane map which corresponds to the selector map of FIG. 3.
  • FIG. 5 depicts one embodiment of the present invention for pre-processing image planes.
  • FIG. 6 illustrates another embodiment of the present invention for pre-processing image planes.
  • FIG. 7 shows the manner in which a near non-destructive embodiment of the present invention may be used in conjunction with a JPEG compression system to pre-process image planes for subsequent JPEG compression.
  • FIG. 8 contains a detailed illustrations of an iterative smoothing technique that may be used in conjunction with the present invention.
  • FIG. 9 illustrates a typical device in which the present invention may be implemented.
  • the present invention is directed to a method and apparatus for separately processing the various types of data contained in a composite image. While the invention is described in conjunction with a Mixed Raster Content (MRC) representation technique, those skilled in the art will recognize that it may be adapted for use with other methods and apparatus' and the invention is therefore, not limited to this description.
  • MRC Mixed Raster Content
  • the technique described herein is suitable for use in various devices required to store or transmit color or grayscale documents such as facsimile devices, image storage devices and the like. It should be noted that the examples and illustrations presented in the figures are in gray scale, but the same concepts apply to color documents and conversely, those portions of the invention that are described with reference to color documents apply equally to gray scale documents.
  • a pixel map is one in which each pixel represents some “value” which indicates the color or, in the case of gray scale document, how light or dark the image is at that point.
  • most pixel maps have values that are taken from a set of discrete, non-negative integers.
  • a typical gray-scale pixel map might have values ranging from 0, representing black, to 255, representing the whitest tone possible.
  • the pixel maps of concern in the currently preferred embodiment are representations of “scanned” images. That is, images which are created by digitizing light reflected off of physical media using a digital scanner.
  • the term bitmap is used to mean a binary pixel map in which pixels can take one of two values, 1 or 0.
  • An example of a device that may be used to obtain such scanned images is illustrated in FIG. 8.
  • pixel map 10 representing a color or gray-scale document is preferably decomposed into a three plane page format.
  • the document format is comprised of an upper plane 12 , a lower plane 14 , and a selector plane 16 .
  • Upper plane 12 and lower plane 14 are typically stored at the same bit depth and number of colors as the original pixel map 10 , but usually at reduced resolution.
  • the processing of planes can include a reduction in the bit depth or a palette color encoding. It is important to recognize that while the terms “upper” and “lower” are used to describe the planes on which data resides, it is not intended to limit the invention to any particular arrangement. Further, it is also possible to practice the invention with planes that are composed of multiple superimposed separations. If this is the case, it is possible to apply the present invention to all separations together or to each color separation individually.
  • Processing typically occurs on a block by block basis, rather than by simultaneously processing all the image data. For example, if JPEG compression will be applied, 8 ⁇ 8 blocks must be provided. That is, the image data must be separated into groups of 64 pixels, with 8 pixels extending in the horizontal direction and 8 blocks extending in the vertical direction.
  • JPEG is merely an example of one compression format that may be used with the present invention. The blocks may be organized in another configuration if required by the technique that will be used. After all blocks are processed, any or all three planes may be compressed using a method suitable for the type data residing thereon. Continuing with the example already provided, upper plane 12 and lower plane 14 may be compressed and stored using JPEG, while selector plane 16 is compressed using a symbol-based compression format.
  • Lower plane 14 commonly contains both information that is pertinent to the background color of the page (including the color of tints, washes, etc.) and the continuous-tone pictures that are found on the page.
  • Upper plane 12 commonly contains the “ink colors” of foreground items such as text.
  • Selector plane 16 is typically stored at higher resolution than the upper and lower planes. Selector plane 16 is used to describe, for each pixel in the selector plane, whether to use the pixel value found in the lower plane or the upper plane during image reconstruction. If a “white” pixel in the selector plane (i.e. a logical zero value) means the pixel value should be taken from the corresponding pixel from the lower plane, a “black” pixel in the selector plane (i.e. a logical one value) means that the pixel value should be taken from the corresponding pixel from the upper plane.
  • FIG. 2 contains a flowchart depicting the basic steps for compressing a document using an embodiment of the present invention.
  • Blocks from an original pixel map 10 a pixel map representation of the original document to be compressed—are first obtained as indicated in step 102 . This may be through scanning an original, by retrieving a stored pixel map representation of the document, or by converting an electronic or page description language representation of an original document into a pixel map representation.
  • Pixel map 10 representation is then analyzed to generate the information for the three planes as indicated in steps 104 - 108 .
  • Selector plane 16 is implicitly or explicitly computed first, as indicated in step 104 and is used to create the other planes.
  • selector plane 16 can be generated, the invention may be accomplished by simply moving pixels from one plane to another, and marking the pixels that have been moved. Technically, this calculates one plane such as lower plane 14 first, but simultaneously it implicitly calculates selector plane 16 .
  • Selector plane 16 is typically a bitmap computed using a technique suitable for finding text or the like on original pixel map 10 . What results is a bitmap where pixels have a 1 value where they represent text and a 0 elsewhere. It should be noted that the term “text” refers to page objects that have text properties, such as sharp, high contrast edges, etc., including many other objects that to not qualify as “readable” text. Pixels are placed on either upper plane 12 or lower plane 14 according to the data on selector plane 16 .
  • An upper plane 12 typically stored at a reduced resolution relative to original pixel map 10 , contains color (or gray scale) information of upper items such as text is computed using selector plane as indicated in step 106 .
  • creating upper plane 12 involves creating an image containing the color of the objects (pixels) selected in the selector plane.
  • the method can be viewed as pouring ink contents of the upper plane through a mask located on the selector plane onto the background of the lower plane.
  • the ink colors are placed in a reduced-resolution “ink map” that will ultimately become upper plane 12 .
  • the empty values are typically filled in with pre-computed ink colors.
  • a lower plane 14 is then computed as indicated in step 108 .
  • one embodiment of the invention includes an image segmentation process that identifies the “image” or non-text portions. This information is used to create the reduced resolution lower map, which contains background color information as well as continuous tone image information. The result is an image that has all small, text-like features deleted, but which includes tints as well as color or gray scale data.
  • step 116 the compressed data representing each plane can be recombined at step 116 , after the necessary compression has taken place, in order to create a single representation of the data, for storage in a computer file, or transmission in a single channel. If case multiple transmission channels are available step 116 may not be necessary.
  • selector plane 16 includes a pattern of zeros and ones, dispersed in an 8 ⁇ 8 block.
  • An 8 ⁇ 8 block such as that illustrated here corresponds to an 8 ⁇ 8 block of data that is provided by the compressor which, in the preferred embodiment of the invention, will be a JPEG compressor. If a compression technique that provides data in another configuration is used, selector plane 16 will have the zeros and ones placed thereon, dispersed in a corresponding pattern.
  • a 0 on selector plane 16 means that the pixel value should be taken from the corresponding pixel from the lower plane 14
  • a 1 on the selector plane means that the pixel value should be taken from the corresponding pixel from upper plane 12 .
  • the block size used in the pre-processing step may be enlarged to compensate for the reduction in image size, so that the final processed block size matches the block size used for compressing the image plane.
  • image plane maps that identify the pixels in each block that will be used to reconstruct the final output image from the two planes is next created.
  • map 304 is created wherein an “N” is placed in every location in which a 1 was located on selector plane 16 to mark the pixels in the block that will not be used during image reconstruction.
  • a “Y” is placed in those locations in which 0's were located on selector plane 16 to show the pixels in the block that are to be retained for the output image.
  • map 302 is created and N's are placed in those locations which correspond to 0's on selector plane 16 , while Y's are placed in the locations that correspond to 1's.
  • the second map generated may be created by simply inverting the first map.
  • the first step 402 is to determine the number of locations in the block in the image plane map 302 or 304 that have been identified as disposable (“N” pixels). For simplicity, the invention will continue to be described with reference to a block in the lower plane 14 . As shown in step 406 , if no locations in image plane map 304 have been identified as N locations, the block is simply output as is. Note that the average “A” of the block is implicitly or explicitly computed before it is output in step 406 .
  • a “near non-destructive processing” technique is used to process image data according to the present invention.
  • the phrase “near non-destructive” is used to indicate that some of the Y labeled pixels in the block are likely to be slightly modified using this approach.
  • Near non-destructive processing is generally accomplished by determining how much variance there is between the Y labeled pixels on the image plane and then comparing that variance to some pre-determined threshold value. If the variance of the Y labeled pixels is small enough, processing time can be reduced by replacing the entire image data block with a block of pixels that has a uniform value.
  • the embodiment first requires inputting threshold and computing the variance of the block.
  • the threshold value indicates the maximum amount of distortion that is acceptable for decompression.
  • the variance indicates the activity of the block—whether there are large variations in the type (i.e. text, pictorial) of image data within the given block.
  • the process begins by determining the number of locations in the image plane map 304 for which the block has N identified pixels, as indicated in step 402 . If no locations in the block have been identified as N pixels, the block is output in step 406 and as before, if all locations in the block have been identified as N pixels in image plane map 304 , all of the pixels in the block are replaced at step 410 with pixels that have a constant value such as the average value for pixels in a previously processed block, or some other appropriate value.
  • Near non-destructive processing may be applied if image plane map 304 has neither all N marked pixels or all Y marked pixels. If this is the case, processing of the image is dependent upon the relationship between the variance and the threshold described above and illustrated in step 414 . If the variance is greater than or equal to the threshold, all pixels in lower plane 14 that are in locations which correspond to those identified with N's on image plane map 304 are again replaced with values that will enhance compression. As before, the preferred embodiment of the invention includes calculating these values using an iterative image smoothing technique such as the one described below. The block will then be output at step 406 .
  • step 416 If the variance is less than the threshold, the entire block will be replaced by a uniform block with pixels that have a constant value as indicated in step 416 . It should be noted here that the constant value used at step 416 will typically not be the same as that which would have been used if all of the pixels had been marked with N (step 410 ). While minimizing the amount of data that will be generated during image decompression is still the goal in this step, a different averaging technique will often be required to accomplish that task. In the preferred embodiment of the invention, the average of the pixels corresponding to locations marked with Y's will be calculated and that value will be the constant used in step 416 . Again, those skilled in the art will recognize that numerous methods may be used to calculate the most appropriate constant value and the invention is not limited to this embodiment.
  • the present invention will be implemented using JPEG compression to compress upper plane 12 and lower plane 14 .
  • a simplifying method can be applied by incorporating the present invention within the JPEG compression module.
  • An implementation of this embodiment is provided in FIG. 7. As stated above, the process begins by determining the number of N locations in the image plane map 304 for the block at step 402 . If there are no N locations in the block, the block is still output as before, the difference here being that, as shown in step 606 , output 620 is preceded by JPEG encoding. Thus the output module 620 actually outputs a variable amount of bits generated by the JPEG compression process.
  • the JPEG compliant bitstring relative to the 0 DC difference is output, followed by an end of block symbol.
  • This bitstring will be 010110 for default luminance tables as indicated in step 610 and is perhaps the shortest possible valid string to represent a block in JPEG.
  • the resulting data block plane will again be output at step 620 . The motivation for using the average of the previous block as opposed to the current one is now clear since by using this method, the amount of JPEG compressed data for the block being processed will be minimal.
  • the next step occurs when neither all nor none of the pixels on image plane map 304 has been identified as N pixels. What takes place during this next step again depends upon the relationship between the variance and the threshold described above. Looking first at step 414 if the variance is greater than or equal to the threshold, all pixels in the plane in locations which correspond to those identified with Y's on image plane map 304 are replaced with values that will enhance decompression (i.e. minimize the amount of generated data). Again, in the preferred embodiment of the invention, this will be an iterative smoothing technique. The “smoothed” block is then compressed using JPEG at step 612 , and the compressed data bits are output at step 620 .
  • the plane will be replaced by a uniform block of pixels at step 608 .
  • the value of the uniform block will be equal to the average of pixels in the block that have been marked with Ys in image plane map 304 .
  • Output using simplified JPEG encoding will take place at step 616 .
  • Use of the term “simplified” JPEG encoding means that the block average is used as the DC value of the discrete cosine transform (DCT) which is the only DCT value to be encoded and output. Therefore, the DCT computation and the quantization or encoding of DCT AC values for the block do not take place.
  • DCT discrete cosine transform
  • step 412 in FIGS. 5, 6 and 7 will now be described.
  • one way to enhance compression is to replace the N pixel values in the block with values that will compress better, since those values will not be used during reconstruction anyway.
  • the status of map 304 is checked at step 702 to determine whether all locations in image plane map 304 that correspond to locations in the block are identified with Y's. The process is repeated until this is the case (i.e. until there are no more N's on image map 304 in locations that correspond to those in the block). Once no more N locations the block i output as indicated in step 406 .
  • step 412 another way to perform iterative smoothing for step 412 is to use a discrete cosine transform.
  • a is discrete cosine transformation (DCT) is then applied to this new block, and the results of the transformation are quantized as indicated in step 804 .
  • DCT discrete cosine transformation
  • some of the high frequency coefficients are removed at step 806 . It is anticipated that several iterations will occur before this process has been completed. How many levels of high frequency coefficients that are removed will depend upon how many iterations have occurred, with the number of levels removed in direct proportion to the number of iterations.
  • an 8 ⁇ 8 block which describes frequency “levels” is provided.
  • the 0 level coefficient is the DCC.
  • the first level coefficients are marked by 1's, second level coefficients marked by 2's, third level marked by 3's etc., until coefficients in all 14 levels are identified.
  • three pixel values will be provided—those values marked with numbers less than or equal to 1. If the fourth level coefficients are to be used, fifteen pixel values, those marked with numbers less than or equal to 4, will be used.
  • step 806 is skipped, and all coefficients produced by the DCT are used for subsequent processing.
  • the plane is subjected to inverse quantization and inverse DCT to produce a pseudo-plane as indicated in step 808 .
  • the pixel values in the original plane that correspond to N locations in image plane map 304 are then replaced with pixels in the same locations in the pseudo-plane. As indicated earlier, this is an iterative process and it is repeated until a designated criteria is met, as shown in step 812 .
  • the process is repeated for a fixed number of iterations.
  • step 112 once each of the respective planes is generated, they are each compressed using a suitable compression technique, step 112 .
  • upper plane 12 and lower plane 14 are compressed using JPEG while the selector plane 16 is compressed using a symbol based pattern matching technique such as CCITT Group 4 or a method of classifying scanned symbols into equivalence classes such as that described in U.S. Pat. No. 5,778,095 to Davies issued Jul. 7, 1998, the contents of which are hereby incorporated by reference.
  • a pixel map representation such as this may include an image with an associated mask, where the mask is used to select an irregularly shaped area from the image.
  • the image pixels not selected by the mask correspond to N locations in the image plane and can be processed by any of the methods described in the present invention to increase the compression ratio of the single image plane and improve the quality of the decompressed image.
  • any or all of these methods may be implemented in a computer any other device capable of storing a set of instructions which may be executed by a machine.
  • the program storage device will tangibly embody this set of instructions (most often referred to as a software program) to perform the above previously recited steps for compressing a document image in the manner described in detail above with reference to the attached figures.
  • the present invention uses the selector plane to replace, for each plane, pixels that have been designated to be provided by the other plane by carefully chosen values. The previously existing data is completely ignored, and the newly chosen values are calculated for such that the number of bits that will be generated during the subsequent compression is minimized. While the present invention has been described in connection with a preferred embodiment thereof, it will be understood that it is not intended to limit the invention to that embodiment. On the contrary, it is intended to cover all alternatives, modifications and equivalents as may be included within the spirit and scope of the invention as defined by the appended claims.

Abstract

A method and apparatus for compressing a mixed raster content image that represents a color or gray scale a document is disclosed. The pixel map is decomposed into a three-plane representation—a reduced-resolution “upper” plane, a reduced-resolution “lower” plane, and a high-resolution binary selector plane. An iterative smoothing technique is then used to pre-process the upper and lower planes using the information contained in the selector plane, thereby reducing the amount of data that will be subjected to further processing.

Description

    FIELD OF THE INVENTION
  • 1. This invention relates generally to image processing and, more particularly, to techniques for compressing the digital representation of a color document.
  • BACKGROUND OF THE INVENTION
  • 2. Data contained in documents that has been scanned at high resolutions requires very large amounts of storage space. This data is typically subjected to some form of data compression in order to avoid the high costs that would be associated with storing it. “Lossless” compression methods such as Lempel-Ziv Welch (LZW) do not perform particularly well on portions of the document that are scanned pixel maps; “lossy” methods such as JPEG work fairly well on continuous-tone pixel maps, but they do not work particularly well on the parts of the document that contain text. To optimize image data compression, techniques, which can recognize the type of data being compressed, are needed.
  • 3. One approach to satisfy the compression needs of differing types of data has been to use Mixed Raster Content (MRC) which involves separating a composite image—one having text intermingled with color or gray scale information—into three planes, and separately applying an appropriate compression technique to each plane. An approach such as this is discussed in U.S. Pat. No. 5,778,092 to MacLeod et al. issued Jul. 7, 1998, which discloses a technique for compressing a color or gray scale pixel map that represents a document. The pixel map is decomposed into a three-plane representation—a reduced-resolution foreground plane, a reduced-resolution background plane, and a high-resolution binary selector plane. The foreground plane contains the color or gray scale information of foreground items such as text. The background plane contains the color or gray scale information for the “background” of the page and the continuous tone pictures that are contained on the page. The selector plane stores information for selecting from either the foreground plane or background plane during decompression.
  • 4. While the MRC technique has shown to be successful at separately processing planes, the segmentation process leaves data in both planes in the areas that will not be chosen by the selector plane. This often causes an increase in the number of bits that are required to encode the entire image, thereby decreasing its compression ratio. This results in inconveniences to the user of a printer, fax machine, scanner or other device in which the technique has been incorporated. For this reason, it is advantageous to somehow reduce the amount of data residing on each plane prior to processing. The present invention is directed to using the information that is contained in the selector plane to aid in reducing the amount of data residing on the foreground and/or background planes. More specifically, the invention takes advantage of the fact that when the selector plane designates a plane to provide information about a given pixel, the information on the other plane that pertains to the same pixel will not be used. The invention provides improved compression of the multi-plane image by treating this useless data in the described manner.
  • 5. The following disclosures may be relevant to aspects of the present invention:
  • 6. U.S. Pat. No. 5,251,271 to Fling issued Oct. 5, 1993 discloses a method for registering digitized multi-plane color images. The method designates one plane as the reference plane and registers each of the other warped planes with the reference plane. Each plane comprises pixels representing luminosity values having scalar x and y coordinates representing positions in the horizontal and vertical directions, respectively, of the plane. The planes are divided into regions. Correlation values are calculated for regions within the divisional region of the reference plane with a plurality of regions offset from the corresponding warped divisional region. A warp error value is calculated for each pixel of each divisional region as a function of the scalar offset. The warp error values are interpolated and added to the current position of each pixel of the warped plane.
  • 7. Separate processing of various types of data contained in a document is disclosed in U.S. Pat. No. 5,060,980 to Johnson et al. issued Oct. 29, 1991 which describes a “form” that includes user modifiable fields and an encoded description of the location, size, type, etc. of the fields to allow for direct programming of a form interpreter. Other information including the processing of the form, encoded data, etc. may be included in the encoded information. A system for creating forms carrying an encoded description of selected attributes of the fields includes means for selecting or creating fields and locating the fields on a form while generating, substantially simultaneously, the encoded description of the selected attributes. A form composer then allows merging of the form and its encoded description for printing or electronic transmission. A system for reading such forms includes a scanner, decoding device, and processor. By reading such forms, data may be entered into or recalled from a data processing system, or a form interpreter may be programmed, locally or remotely, for subsequent handling of forms.
  • 8. U.S. Pat. No. 5,784,175 to Lee, issued Jul. 21, 1998 discloses a video compression encoder process for compressing digitized video signals representing display motion in video sequences of multiple image frames. The encoder process utilizes object-based video compression to improve the accuracy and versatility of encoding interframe motion and intraframe image features. Video information is compressed relative to objects of arbitrary configurations, rather than fixed, regular arrays of pixels as in conventional video compression methods. This reduces the error components and thereby improves the compression efficiency and accuracy. As another benefit, object-based video compression of this invention provides interactive video editing capabilities for processing compressed video information.
  • 9. U.S. Pat. No. 5,303,313 to Mark et al. issued Apr. 12, 1994 describes image compression based on symbol matching. An image is “pre-compressed” prior to symbol matching using run-length encoding. Symbols are then extracted from the run-length representation. A voting scheme is used in conjunction with a plurality of similarity tests to improve symbol matching accuracy. A template composition scheme wherein the template may be modified based on symbol matches is also disclosed.
  • 10. Concurrently filed U.S. Patent Application by DeQueiroz et al. identified as attorney docket No. D/97636 entitled “Method and Apparatus for Pre-processing Mixed Raster Content Planes to Improve the Quality of a Decompressed Image and Increase Document Compression Ratios” and assigned to the assignee of the present invention discloses a technique for processing a color or gray scale pixel map representing a document is disclosed. The pixel map is decomposed into a three-plane representation, a reduced-resolution “upper” plane, a reduced-resolution “lower” plane, and a high-resolution binary selector plane. The “upper” and “lower” planes contain the color or gray scale for the page as well as the continuous tone pictures that are contained on the page. The selector plane stores information for selecting from either the foreground plane or background plane during decompression. Information contained in the selector plane is first used to pre-process the upper and lower planes to reduce the amount of data on each of the other two planes that will be subjected to further processing. Each of the pre-processed planes is compressed using a compression technique optimal for the type of data that resides upon it.
  • 11. All of the references cited herein are incorporated by reference for their teachings.
  • 12. Accordingly, although known apparatus and processes are suitable for their intended purposes, a need remains for a method and apparatus that can efficiently process digital image data by separately compressing the various portions of a composite image.
  • SUMMARY OF THE INVENTION
  • 13. In one embodiment of the invention, an iterative smoothing technique for processing mixed raster content planes is disclosed, which includes the steps of replacing each image data signal on the image plane that does not correspond to the reconstruction identified locations with a signal equal to an average of all image data neighbor signals that correspond to locations previously identified for reconstruction, wherein a neighbor signal is defined as an image data signal that is horizontally, vertically or diagonally adjacent to the image data signal; identifying all of the replaced image data signals as reconstruction image data signals in the image data plane map; outputting the image plane if all locations in the image plane map are reconstruction signals, and repeating the image data enhancing signal replacing step if less than all locations in the image plane map are reconstruction signals.
  • 14. In another embodiment of the invention an iterative smoothing technique for processing mixed raster content planes is disclosed, which includes the steps of replacing each image data signal on the image plane that does not correspond to the reconstruction identified locations with a signal equal to an average of all image data neighbor signals that correspond to locations previously identified for reconstruction, wherein a neighbor signal is defined as an image data signal that is horizontally, vertically or diagonally adjacent to the image data signal; applying a discrete cosine transformation to the image plane and quantizing the results of the transformation; inverse quantizing the transformation results and performing an inverse discrete cosine transform thereon, thereby generating a pseudo-plane that has image data signals that lie in the same locations as signals in the image data plane; replacing each image data signal on the image data plane that does not correspond to a reconstruction identified location on the image data plane map with an image data signal on the pseudo-plane that lies in a same location.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • 15.FIG. 1 illustrates a composite image and includes an example of how such an image may be decomposed into three MRC image planes, an upper plane, a lower plane, and a selector plane.
  • 16.FIG. 2 contains a flowchart illustrating the basic steps for compressing a document according to the present invention.
  • 17.FIG. 3 shows a detailed example of the typical contents of a selector plane for an 8×8 block of pixels.
  • 18.FIG. 4 shows a detailed example of an image plane map which corresponds to the selector map of FIG. 3.
  • 19.FIG. 5 depicts one embodiment of the present invention for pre-processing image planes.
  • 20.FIG. 6 illustrates another embodiment of the present invention for pre-processing image planes.
  • 21.FIG. 7 shows the manner in which a near non-destructive embodiment of the present invention may be used in conjunction with a JPEG compression system to pre-process image planes for subsequent JPEG compression.
  • 22.FIG. 8 contains a detailed illustrations of an iterative smoothing technique that may be used in conjunction with the present invention.
  • 23.FIG. 9 illustrates a typical device in which the present invention may be implemented.
  • DESCRIPTION OF THE INVENTION
  • 24. The present invention is directed to a method and apparatus for separately processing the various types of data contained in a composite image. While the invention is described in conjunction with a Mixed Raster Content (MRC) representation technique, those skilled in the art will recognize that it may be adapted for use with other methods and apparatus' and the invention is therefore, not limited to this description. The technique described herein is suitable for use in various devices required to store or transmit color or grayscale documents such as facsimile devices, image storage devices and the like. It should be noted that the examples and illustrations presented in the figures are in gray scale, but the same concepts apply to color documents and conversely, those portions of the invention that are described with reference to color documents apply equally to gray scale documents.
  • 25. A pixel map is one in which each pixel represents some “value” which indicates the color or, in the case of gray scale document, how light or dark the image is at that point. As those skilled in the art will appreciate, most pixel maps have values that are taken from a set of discrete, non-negative integers. For example, a typical gray-scale pixel map might have values ranging from 0, representing black, to 255, representing the whitest tone possible. The pixel maps of concern in the currently preferred embodiment are representations of “scanned” images. That is, images which are created by digitizing light reflected off of physical media using a digital scanner. The term bitmap is used to mean a binary pixel map in which pixels can take one of two values, 1 or 0. An example of a device that may be used to obtain such scanned images is illustrated in FIG. 8.
  • 26. Turning now to the drawings for a general description of the invention, as indicated in FIG. 1, pixel map 10 representing a color or gray-scale document is preferably decomposed into a three plane page format. The document format is comprised of an upper plane 12, a lower plane 14, and a selector plane 16. Upper plane 12 and lower plane 14 are typically stored at the same bit depth and number of colors as the original pixel map 10, but usually at reduced resolution. However as those skilled in the art will appreciate, the processing of planes can include a reduction in the bit depth or a palette color encoding. It is important to recognize that while the terms “upper” and “lower” are used to describe the planes on which data resides, it is not intended to limit the invention to any particular arrangement. Further, it is also possible to practice the invention with planes that are composed of multiple superimposed separations. If this is the case, it is possible to apply the present invention to all separations together or to each color separation individually.
  • 27. Processing typically occurs on a block by block basis, rather than by simultaneously processing all the image data. For example, if JPEG compression will be applied, 8×8 blocks must be provided. That is, the image data must be separated into groups of 64 pixels, with 8 pixels extending in the horizontal direction and 8 blocks extending in the vertical direction. JPEG is merely an example of one compression format that may be used with the present invention. The blocks may be organized in another configuration if required by the technique that will be used. After all blocks are processed, any or all three planes may be compressed using a method suitable for the type data residing thereon. Continuing with the example already provided, upper plane 12 and lower plane 14 may be compressed and stored using JPEG, while selector plane 16 is compressed using a symbol-based compression format. It would be apparent to one of skill in the art to compress and store the planes using other formats that are suitable for the intended use of the document. For example, in the Color Facsimile arena, group 4 (MMR) would preferably used for the selector plane, since the particular compression format used must be one of the approved formats (MMR, MR, MH, JPEG, JBIG, etc.) for facsimile data transmission.
  • 28. Lower plane 14 commonly contains both information that is pertinent to the background color of the page (including the color of tints, washes, etc.) and the continuous-tone pictures that are found on the page. Upper plane 12 commonly contains the “ink colors” of foreground items such as text. Selector plane 16 is typically stored at higher resolution than the upper and lower planes. Selector plane 16 is used to describe, for each pixel in the selector plane, whether to use the pixel value found in the lower plane or the upper plane during image reconstruction. If a “white” pixel in the selector plane (i.e. a logical zero value) means the pixel value should be taken from the corresponding pixel from the lower plane, a “black” pixel in the selector plane (i.e. a logical one value) means that the pixel value should be taken from the corresponding pixel from the upper plane.
  • 29.FIG. 2 contains a flowchart depicting the basic steps for compressing a document using an embodiment of the present invention. Blocks from an original pixel map 10—a pixel map representation of the original document to be compressed—are first obtained as indicated in step 102. This may be through scanning an original, by retrieving a stored pixel map representation of the document, or by converting an electronic or page description language representation of an original document into a pixel map representation. Pixel map 10 representation is then analyzed to generate the information for the three planes as indicated in steps 104-108. Selector plane 16 is implicitly or explicitly computed first, as indicated in step 104 and is used to create the other planes. Those skilled in the art will recognize that use of the phrase “implicitly or explicitly” refers to the fact that the invention does not require actual calculation and generation of selector plane 16. While selector plane 16 can be generated, the invention may be accomplished by simply moving pixels from one plane to another, and marking the pixels that have been moved. Technically, this calculates one plane such as lower plane 14 first, but simultaneously it implicitly calculates selector plane 16.
  • 30. Selector plane 16 is typically a bitmap computed using a technique suitable for finding text or the like on original pixel map 10. What results is a bitmap where pixels have a 1 value where they represent text and a 0 elsewhere. It should be noted that the term “text” refers to page objects that have text properties, such as sharp, high contrast edges, etc., including many other objects that to not qualify as “readable” text. Pixels are placed on either upper plane 12 or lower plane 14 according to the data on selector plane 16.
  • 31. An upper plane 12, typically stored at a reduced resolution relative to original pixel map 10, contains color (or gray scale) information of upper items such as text is computed using selector plane as indicated in step 106. Briefly, creating upper plane 12 involves creating an image containing the color of the objects (pixels) selected in the selector plane. Conceptually, the method can be viewed as pouring ink contents of the upper plane through a mask located on the selector plane onto the background of the lower plane. The ink colors are placed in a reduced-resolution “ink map” that will ultimately become upper plane 12. Without the present invention, the empty values are typically filled in with pre-computed ink colors.
  • 32. A lower plane 14, also typically stored at a lower resolution than original pixel map 10, is then computed as indicated in step 108. In this step, one embodiment of the invention includes an image segmentation process that identifies the “image” or non-text portions. This information is used to create the reduced resolution lower map, which contains background color information as well as continuous tone image information. The result is an image that has all small, text-like features deleted, but which includes tints as well as color or gray scale data.
  • 33. Once the three planes have been generated, either or all of them may be compressed at steps 110-114 using a technique suitable for compressing the type of data that lies thereon. The compressed data representing each plane can be recombined at step 116, after the necessary compression has taken place, in order to create a single representation of the data, for storage in a computer file, or transmission in a single channel. If case multiple transmission channels are available step 116 may not be necessary.
  • 34. The present invention includes a method and apparatus which pre-processes the data on upper plane 12 and lower plane 14 using the information contained on selector plane 16. Turning now to FIG. 3, as stated earlier selector plane 16 includes a pattern of zeros and ones, dispersed in an 8×8 block. An 8×8 block such as that illustrated here corresponds to an 8×8 block of data that is provided by the compressor which, in the preferred embodiment of the invention, will be a JPEG compressor. If a compression technique that provides data in another configuration is used, selector plane 16 will have the zeros and ones placed thereon, dispersed in a corresponding pattern. As stated earlier, it is assumed here that a 0 on selector plane 16 means that the pixel value should be taken from the corresponding pixel from the lower plane 14, while a 1 on the selector plane means that the pixel value should be taken from the corresponding pixel from upper plane 12.
  • 35. In the preferred embodiment of the present invention, when processing image planes that will be reduced for compression, the block size used in the pre-processing step may be enlarged to compensate for the reduction in image size, so that the final processed block size matches the block size used for compressing the image plane.
  • 36. Referring now to FIG. 4, image plane maps that identify the pixels in each block that will be used to reconstruct the final output image from the two planes is next created. For lower plane 14 map 304, is created wherein an “N” is placed in every location in which a 1 was located on selector plane 16 to mark the pixels in the block that will not be used during image reconstruction. A “Y” is placed in those locations in which 0's were located on selector plane 16 to show the pixels in the block that are to be retained for the output image. Similarly, for upper plane 12, map 302 is created and N's are placed in those locations which correspond to 0's on selector plane 16, while Y's are placed in the locations that correspond to 1's. Those skilled in the art will recognize that the second map generated may be created by simply inverting the first map.
  • 37. In one embodiment of the invention, referred to as non-destructive processing, retained (“Y” labeled) pixels are never modified. As indicated in FIG. 5, the first step 402 is to determine the number of locations in the block in the image plane map 302 or 304 that have been identified as disposable (“N” pixels). For simplicity, the invention will continue to be described with reference to a block in the lower plane 14. As shown in step 406, if no locations in image plane map 304 have been identified as N locations, the block is simply output as is. Note that the average “A” of the block is implicitly or explicitly computed before it is output in step 406. Those skilled in the art will recognize that average “A” could be obtained by be re-using the DC term of the JPEG compression, and that while an explicit calculation may occur, it is not necessary. On the other hand, if all locations in image plane map 304 have been identified as N locations, all of the pixels in the block that lie on lower plane 14 can be set to a constant value. In one embodiment of the invention, the constant value is set equal to the average of all pixels values in the previously processed block, i.e. set to “A”. Those skilled in the art will recognize that numerous methods can be used to calculate the most appropriate constant value, and that the invention is not limited to using this average. Lower plane 14 with its newly assigned values is then output at step 406.
  • 38. With continued reference to FIG. 5, if neither all nor none of the pixels on image plane map 304 in the block being processed have been identified as N pixels (i.e. the number of N identified pixels is not equal to either zero or the maximum value which, in the case of JPEG compression would be 64) all of the pixels in the block that correspond to Y locations on image plane map 304 are replaced with values that will enhance the compression of the block. Specifically, values placed on lower plane 14 will be those that will minimize the amount of data that will be generated during image compression. In the preferred embodiment of the invention, these values will be provided using an iterative image smoothing technique, which will be described in detail later (See FIG. 8 and corresponding discussion). Lower plane 14 with its newly updated values is then output at step 406.
  • 39. It is important to understand that even in the non-destructive case, artifacts can occur during decompression that are caused by pixel values at N locations. Assume for simplicity that all Y pixels have a value of 200. Filling all N pixels with value 55 will produce a ringing artifact that protrudes into the area of Y-marked pixels. It is therefor necessary and one intention of this invention, to use values for the N-marked pixels that optimize compression while not introducing artifacts in the Y-marked regions on decompression.
  • 40. Turning now to FIG. 6, in another embodiment of the invention a “near non-destructive processing” technique is used to process image data according to the present invention. The phrase “near non-destructive” is used to indicate that some of the Y labeled pixels in the block are likely to be slightly modified using this approach. Near non-destructive processing is generally accomplished by determining how much variance there is between the Y labeled pixels on the image plane and then comparing that variance to some pre-determined threshold value. If the variance of the Y labeled pixels is small enough, processing time can be reduced by replacing the entire image data block with a block of pixels that has a uniform value. Thus, the embodiment first requires inputting threshold and computing the variance of the block. The threshold value indicates the maximum amount of distortion that is acceptable for decompression. The variance indicates the activity of the block—whether there are large variations in the type (i.e. text, pictorial) of image data within the given block.
  • 41. As before, the process begins by determining the number of locations in the image plane map 304 for which the block has N identified pixels, as indicated in step 402. If no locations in the block have been identified as N pixels, the block is output in step 406 and as before, if all locations in the block have been identified as N pixels in image plane map 304, all of the pixels in the block are replaced at step 410 with pixels that have a constant value such as the average value for pixels in a previously processed block, or some other appropriate value.
  • 42. Near non-destructive processing may be applied if image plane map 304 has neither all N marked pixels or all Y marked pixels. If this is the case, processing of the image is dependent upon the relationship between the variance and the threshold described above and illustrated in step 414. If the variance is greater than or equal to the threshold, all pixels in lower plane 14 that are in locations which correspond to those identified with N's on image plane map 304 are again replaced with values that will enhance compression. As before, the preferred embodiment of the invention includes calculating these values using an iterative image smoothing technique such as the one described below. The block will then be output at step 406.
  • 43. If the variance is less than the threshold, the entire block will be replaced by a uniform block with pixels that have a constant value as indicated in step 416. It should be noted here that the constant value used at step 416 will typically not be the same as that which would have been used if all of the pixels had been marked with N (step 410). While minimizing the amount of data that will be generated during image decompression is still the goal in this step, a different averaging technique will often be required to accomplish that task. In the preferred embodiment of the invention, the average of the pixels corresponding to locations marked with Y's will be calculated and that value will be the constant used in step 416. Again, those skilled in the art will recognize that numerous methods may be used to calculate the most appropriate constant value and the invention is not limited to this embodiment.
  • 44. As explained earlier, in the preferred embodiment the present invention will be implemented using JPEG compression to compress upper plane 12 and lower plane 14. Thus, a simplifying method can be applied by incorporating the present invention within the JPEG compression module. An implementation of this embodiment is provided in FIG. 7. As stated above, the process begins by determining the number of N locations in the image plane map 304 for the block at step 402. If there are no N locations in the block, the block is still output as before, the difference here being that, as shown in step 606, output 620 is preceded by JPEG encoding. Thus the output module 620 actually outputs a variable amount of bits generated by the JPEG compression process.
  • 45. If all locations in image plane map 304 for the block have been identified as N pixels, the JPEG compliant bitstring relative to the 0 DC difference is output, followed by an end of block symbol. When decoding the image data, those two symbols indicate that the current block is uniform and has the same average as the previously coded block. This bitstring will be 010110 for default luminance tables as indicated in step 610 and is perhaps the shortest possible valid string to represent a block in JPEG. The resulting data block plane will again be output at step 620. The motivation for using the average of the previous block as opposed to the current one is now clear since by using this method, the amount of JPEG compressed data for the block being processed will be minimal.
  • 46. Still referring to FIG. 7, assuming the near non-destructive processing method is being used, the next step occurs when neither all nor none of the pixels on image plane map 304 has been identified as N pixels. What takes place during this next step again depends upon the relationship between the variance and the threshold described above. Looking first at step 414 if the variance is greater than or equal to the threshold, all pixels in the plane in locations which correspond to those identified with Y's on image plane map 304 are replaced with values that will enhance decompression (i.e. minimize the amount of generated data). Again, in the preferred embodiment of the invention, this will be an iterative smoothing technique. The “smoothed” block is then compressed using JPEG at step 612, and the compressed data bits are output at step 620.
  • 47. If the variance is less than the threshold, the plane will be replaced by a uniform block of pixels at step 608. In one embodiment of the invention, the value of the uniform block will be equal to the average of pixels in the block that have been marked with Ys in image plane map 304. Output using simplified JPEG encoding will take place at step 616. Use of the term “simplified” JPEG encoding means that the block average is used as the DC value of the discrete cosine transform (DCT) which is the only DCT value to be encoded and output. Therefore, the DCT computation and the quantization or encoding of DCT AC values for the block do not take place.
  • 48. Referring now to FIG. 8, the details of one embodiment of an iterative image smoothing technique, step 412 in FIGS. 5, 6 and 7 will now be described. As indicated above, one way to enhance compression is to replace the N pixel values in the block with values that will compress better, since those values will not be used during reconstruction anyway.
  • 49. The fact that iterative smoothing is being applied means that there are initially at least some N's on the map. Those N locations that have at least one vertical or horizontal Y neighbor are noted. It should be pointed out that diagonal neighbors are not counted during this part of the process. Next, the values of all pixels in the block that correspond to the selected N locations will be replaced by the average of all of their neighboring pixels that correspond to previously identified Y locations as indicated in step 706. Diagonal as well as vertical and horizontal neighbors may be included in this averaging. The replaced pixels are identified with Y's in corresponding locations in image plane map block 304 as indicated in step 708. The status of map 304 is checked at step 702 to determine whether all locations in image plane map 304 that correspond to locations in the block are identified with Y's. The process is repeated until this is the case (i.e. until there are no more N's on image map 304 in locations that correspond to those in the block). Once no more N locations the block i output as indicated in step 406.
  • 50. Turning now to FIG. 9, another way to perform iterative smoothing for step 412 is to use a discrete cosine transform. As before, N locations that have at least one vertical or horizontal Y neighbor replaced by the average of all of pixels that correspond to Y locations as indicated in step 802. A is discrete cosine transformation (DCT) is then applied to this new block, and the results of the transformation are quantized as indicated in step 804.
  • 51. In one embodiment of the invention, some of the high frequency coefficients are removed at step 806. It is anticipated that several iterations will occur before this process has been completed. How many levels of high frequency coefficients that are removed will depend upon how many iterations have occurred, with the number of levels removed in direct proportion to the number of iterations.
  • 52. Turning for a moment to FIG. 10, an 8×8 block which describes frequency “levels” is provided. As shown, the 0 level coefficient is the DCC. The first level coefficients are marked by 1's, second level coefficients marked by 2's, third level marked by 3's etc., until coefficients in all 14 levels are identified. Thus, when it is desired to use only the first level coefficients, three pixel values will be provided—those values marked with numbers less than or equal to 1. If the fourth level coefficients are to be used, fifteen pixel values, those marked with numbers less than or equal to 4, will be used.
  • 53. Turning back to FIG. 9, in another embodiment of the invention, step 806 is skipped, and all coefficients produced by the DCT are used for subsequent processing.
  • 54. Next, the plane is subjected to inverse quantization and inverse DCT to produce a pseudo-plane as indicated in step 808. As indicated in step 810, the pixel values in the original plane that correspond to N locations in image plane map 304 are then replaced with pixels in the same locations in the pseudo-plane. As indicated earlier, this is an iterative process and it is repeated until a designated criteria is met, as shown in step 812.
  • 55. In one embodiment of the invention, the process is repeated for a fixed number of iterations. An example of this embodiment is to perform processing only once (stop when K=2) and the results of that single iteration can be used. In another embodiment, processing stops after a comparison of either Y or N identified pixels in consecutive iterations takes place, and it is determined that a designated amount of improvement or change has occurred.
  • 56. Turning again to FIG. 2, once each of the respective planes is generated, they are each compressed using a suitable compression technique, step 112. In the currently preferred embodiment, upper plane 12 and lower plane 14 are compressed using JPEG while the selector plane 16 is compressed using a symbol based pattern matching technique such as CCITT Group 4 or a method of classifying scanned symbols into equivalence classes such as that described in U.S. Pat. No. 5,778,095 to Davies issued Jul. 7, 1998, the contents of which are hereby incorporated by reference.
  • 57. While this invention has been described in terms of compressing a pixel map that is represented as a selector plane and two image planes, those skilled in the art will recognize that it can be adapted to compress a pixel map that is represented as a selector plane and a single image plane. A pixel map representation such as this may include an image with an associated mask, where the mask is used to select an irregularly shaped area from the image. In a representation such as that described, the image pixels not selected by the mask correspond to N locations in the image plane and can be processed by any of the methods described in the present invention to increase the compression ratio of the single image plane and improve the quality of the decompressed image.
  • 58. In the preferred embodiment of the invention, any or all of these methods may be implemented in a computer any other device capable of storing a set of instructions which may be executed by a machine. The program storage device will tangibly embody this set of instructions (most often referred to as a software program) to perform the above previously recited steps for compressing a document image in the manner described in detail above with reference to the attached figures.
  • 59. In summary, the present invention uses the selector plane to replace, for each plane, pixels that have been designated to be provided by the other plane by carefully chosen values. The previously existing data is completely ignored, and the newly chosen values are calculated for such that the number of bits that will be generated during the subsequent compression is minimized. While the present invention has been described in connection with a preferred embodiment thereof, it will be understood that it is not intended to limit the invention to that embodiment. On the contrary, it is intended to cover all alternatives, modifications and equivalents as may be included within the spirit and scope of the invention as defined by the appended claims.

Claims (16)

What is claimed is:
1. A method of processing mixed master content planes which represent a compressed pixel map representation of a document, wherein the mixed raster content planes include an upper plane a lower plane and a selector plane, and wherein the upper and lower planes have been processed based upon data contained in said selector plane comprising the steps of:
a) generating an image plane map which identifies locations on an associated image data plane of image data signals that shall be used to reconstruct said digital image; and
b) replacing all image data signals that correspond to locations not identified for image reconstruction using an iterative smoothing technique; and
c) outputting said image plane.
2. A method of processing mixed raster content planes as claimed in
claim 1
wherein for image data signal replacing step, said iterative smoothing technique comprises the steps of:
a) replacing each image data signal on said image plane that does not correspond to said reconstruction identified locations with a signal equal to an average of all image data neighbor signals that correspond to locations previously identified for reconstruction, wherein a neighbor signal is defined as an image data signal that is horizontally, vertically or diagonally adjacent to said image data signal;
b) identifying all of said replaced image data signals as reconstruction image data signals in said image data plane map;
c) outputting said image plane if all locations in said image plane map are reconstruction signals, and repeating said image data enhancing signal replacing step if less than all locations in said image plane map are reconstruction signals.
3. A method of processing mixed raster content planes as claimed in
claim 1
wherein for said image data signal replacing step said iterative smoothing technique comprises the steps of:
a) replacing each image data signal on said image plane that does not correspond to said reconstruction identified locations with a signal equal to an average of all image data neighbor signals that correspond to locations previously identified for reconstruction, wherein a neighbor signal is defined as an image data signal that is horizontally, vertically or diagonally adjacent to said image data signal;
b) applying a discrete cosine transformation to said image plane and quantizing the results of said transformation;
c) inverse quantizing said transformation results and performing an inverse discrete cosine transform thereon, thereby generating a pseudo-plane that has image data signals that lie in the same locations as signals in said image data plane;
d) replacing each image data signal on said image data plane that does not correspond to a reconstruction identified location on said image data plane map with an image data signal on said pseudo-plane that lies in a same location.
4. A method of processing a compressed pixel map as claimed in
claim 3
wherein for said image data signal replacing step, said iterative smoothing step further comprises after once completing said replacing step, repeating said applying step, said inverse quantizing step and said replacing step until a designated stop criteria is met.
5. A method of processing a compressed pixel map as claimed in
claim 4
wherein for said iterative smoothing step, said designated stop criteria further comprises:
a) comparing a magnitude of image data signals identified for reconstruction in consecutive processing operations;
b) measuring a difference in said consecutive image data signal magnitudes; and
c) canceling said repeating step when said measured difference is less than a predetermined value.
6. A method of processing a compressed pixel map as claimed in
claim 3
further comprising removing all but the lowest frequency coefficients, thereby generating a pseudo plane.
7. A method of processing a compressed pixel map as claimed in
claim 6
wherein for said image data replacing step, said iterative smoothing technique further comprises after once completing said replacing step, repeating said applying step, said lowest frequency removing step, said inverse quantizing step and said replacing step until a designated stop criteria is met.
8. A method of processing a compressed pixel map as claimed in
claim 7
wherein for said iterative smoothing step, said designated stop criteria further comprises:
a) comparing a magnitude of image data signals identified for reconstruction in consecutive processing operations;
b) measuring a difference in said consecutive image data signal magnitudes; and
c) canceling said repeating step when said measured difference is less than a predetermined value.
9. A method of processing mixed master content planes which represent a compressed pixel map representation of a document, wherein the mixed raster content planes include an upper plane a lower plane and a selector plane, and wherein the upper and lower planes have been processed based upon data contained in said selector plane comprising the steps of:
a) inputting a threshold signal, which indicates an acceptable level of distortion for a subsequent processing operation;
b) inputting a variance signal, which indicates a maximum acceptable magnitude difference between an image data signal and a threshold signal;
c) generating an image plane map which identifies locations on an associated image data plane of image data signals that shall be used to reconstruct said digital image;
d) replacing all image data signals that correspond to locations not identified for image reconstruction using an iterative smoothing technique if said variance signal is greater than or equal to said threshold signal; and
e) outputting said image plane.
10. A method of processing a compressed pixel map as claimed in
claim 9
wherein for said image data signal replacing step, said iterative smoothing technique comprises the steps of:
a) replacing each image data signal on said image plane that does not correspond to said reconstruction identified locations with a signal equal to an average of all image data neighbor signals that correspond to locations previously identified for reconstruction, wherein a neighbor signal is defined as an image data signal that is horizontally, vertically or diagonally adjacent to said image data signal;
b) identifying all of said replaced image data signals as reconstruction image data signals in said image data plane map;
c) outputting said image plane if all locations in said image plane map are reconstruction signals, and repeating replacing step if less than all locations in said image plane map are reconstruction signals.
11. A method of processing a compressed pixel map as claimed in
claim 9
wherein for said image data signal replacing step, said iterative smoothing technique further comprises the steps of:
a) replacing each image data signal on said image plane that does not correspond to said reconstruction identified locations with a signal equal to an average of all image data neighbor signals that correspond to locations previously identified for reconstruction, wherein a neighbor signal is defined as an image data signal that is horizontally, vertically or diagonally adjacent to said image data signal;
b) applying a discrete cosine transformation to said image plane and quantizing the results of said transformation;
c) inverse quantizing said transformation results and performing an inverse discrete cosine transform thereon, thereby generating a pseudo-plane that has image data signals that lie in the same locations as signals in said image data plane;
d) replacing each image data signal on said image data plane that does not correspond to a reconstruction identified location on said image data plane map with an image data signal on said pseudo-plane that lies in a same location.
12. A method of processing a compressed pixel map as claimed in
claim 11
wherein for said image data signal replacing step, said iterative smoothing step further comprises after once completing said replacing step, repeating said applying step, said inverse quantizing step and said replacing step until a designated stop criteria is met.
13. A method of processing a compressed pixel map as claimed in
claim 12
wherein for said iterative smoothing step, said designated stop criteria further comprises:
a) comparing a magnitude of image data signals identified for reconstruction in consecutive processing operations;
b) measuring a difference in said consecutive image data signal magnitudes; and
c) canceling said repeating step when said measured difference is less than a predetermined value.
14. A method of processing a compressed pixel map as claimed in
claim 11
further comprising removing all but the lowest frequency coefficients, thereby generating a pseudo plane.
15. A method of processing a compressed pixel map as claimed in
claim 14
wherein for said image data signal replacing step, said iterative smoothing step further comprises after once completing said replacing step, repeating said applying step, said lowest frequency removing step, said inverse quantizing step and said replacing step until a designated stop criteria is met.
16. A method of processing a compressed pixel map as claimed in
claim 15
wherein for said iterative smoothing step, said designated stop criteria further comprises:
a) comparing a magnitude of image data signals identified for reconstruction in consecutive processing operations;
b) measuring a difference in said consecutive image data signal magnitudes; and
c) canceling said repeating step when said measured difference is less than a predetermined value.
US09/733,860 1998-12-07 2000-12-07 Iterative smoothing technique for pre-processing mixed raster content planes to improve the quality of a decompressed image and increase document compression ratios Expired - Lifetime US6334001B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US09/733,860 US6334001B2 (en) 1998-12-07 2000-12-07 Iterative smoothing technique for pre-processing mixed raster content planes to improve the quality of a decompressed image and increase document compression ratios

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US20648898A 1998-12-07 1998-12-07
US09/733,860 US6334001B2 (en) 1998-12-07 2000-12-07 Iterative smoothing technique for pre-processing mixed raster content planes to improve the quality of a decompressed image and increase document compression ratios

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US20648898A Division 1998-12-02 1998-12-07

Publications (2)

Publication Number Publication Date
US20010000314A1 true US20010000314A1 (en) 2001-04-19
US6334001B2 US6334001B2 (en) 2001-12-25

Family

ID=22766626

Family Applications (1)

Application Number Title Priority Date Filing Date
US09/733,860 Expired - Lifetime US6334001B2 (en) 1998-12-07 2000-12-07 Iterative smoothing technique for pre-processing mixed raster content planes to improve the quality of a decompressed image and increase document compression ratios

Country Status (3)

Country Link
US (1) US6334001B2 (en)
JP (1) JP2000175053A (en)
BR (1) BR9907400A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030133617A1 (en) * 2002-01-14 2003-07-17 Debargha Mukherjee Coder matched layer separation and interpolation for compression of compound documents
US20050141035A1 (en) * 2003-12-04 2005-06-30 Xerox Corporation System and method for processing portions of documents using variable data
US20070013951A1 (en) * 2002-04-30 2007-01-18 Microsoft Corporation Mixed raster content files
US20070189615A1 (en) * 2005-08-12 2007-08-16 Che-Bin Liu Systems and Methods for Generating Background and Foreground Images for Document Compression
US20070217701A1 (en) * 2005-08-12 2007-09-20 Che-Bin Liu Systems and Methods to Convert Images into High-Quality Compressed Documents
US20080298718A1 (en) * 2007-05-31 2008-12-04 Che-Bin Liu Image Stitching
CN101505358B (en) * 2008-02-06 2011-01-12 株式会社Pfu Image processor, image processing method
TWI398816B (en) * 2004-02-12 2013-06-11 Xerox Corp Systems and methods for adjusting image data to form highly compressible image planes
US20140317345A1 (en) * 2013-04-18 2014-10-23 Xerox Corporation Method and apparatus for an efficient hardware implementation of dictionary based lossless compression

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3608356B2 (en) * 1997-11-18 2005-01-12 富士ゼロックス株式会社 Image processing apparatus, image processing method, image transmission apparatus, and image transmission method
US7120167B1 (en) * 1999-06-03 2006-10-10 Matsushita Electric Industrial Co., Ltd. Broadcasting system and its method
US7869462B2 (en) * 1999-06-03 2011-01-11 Panasonic Corporation Broadcast system and method therefor
US6591019B1 (en) * 1999-12-07 2003-07-08 Nintendo Co., Ltd. 3D transformation matrix compression and decompression
US7062087B1 (en) * 2000-05-16 2006-06-13 International Busniness Machines Corporation System and method for optimizing color compression using transparency control bits
JP2002369010A (en) 2001-06-05 2002-12-20 Nec Corp Image coder and image decoder
US7027647B2 (en) * 2001-12-31 2006-04-11 Hewlett-Packard Development Company, L.P. Coder matched layer separation for compression of compound documents
MXPA04007916A (en) * 2002-01-16 2005-05-16 Cornerstone Group Ltd Optimized data transmission system and method.
US7164797B2 (en) * 2002-04-25 2007-01-16 Microsoft Corporation Clustering
US7392472B2 (en) 2002-04-25 2008-06-24 Microsoft Corporation Layout analysis
US7110596B2 (en) 2002-04-25 2006-09-19 Microsoft Corporation System and method facilitating document image compression utilizing a mask
US7263227B2 (en) * 2002-04-25 2007-08-28 Microsoft Corporation Activity detector
US7043079B2 (en) 2002-04-25 2006-05-09 Microsoft Corporation “Don't care” pixel interpolation
US7024039B2 (en) * 2002-04-25 2006-04-04 Microsoft Corporation Block retouching
US7120297B2 (en) * 2002-04-25 2006-10-10 Microsoft Corporation Segmented layered image system
US7194143B2 (en) * 2002-04-26 2007-03-20 Pegasus Imaging Corporation Method of enhancement of the visual display of images and other visual data records
US7345782B2 (en) * 2002-05-13 2008-03-18 Texas Instruments Incorporated Efficient implementation of raster operations flow
US7174049B2 (en) * 2002-12-11 2007-02-06 Seiko Epson Corporation Image upscaling by joint optimization of low and mid-level image variables
US7139442B2 (en) * 2002-12-16 2006-11-21 Xerox Corporation Template matching applied to selector planes for multiple raster content (MRC) representation of documents
US7212676B2 (en) * 2002-12-30 2007-05-01 Intel Corporation Match MSB digital image compression
US8086050B2 (en) 2004-08-25 2011-12-27 Ricoh Co., Ltd. Multi-resolution segmentation and fill
JP2006121645A (en) * 2004-09-24 2006-05-11 Fuji Photo Film Co Ltd Image compression apparatus and image compression program
US9230161B2 (en) 2013-12-06 2016-01-05 Xerox Corporation Multiple layer block matching method and system for image denoising
US9445108B1 (en) 2015-05-26 2016-09-13 International Business Machines Corporation Document compression with neighborhood biased pixel labeling

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3752300T2 (en) * 1986-08-29 2000-04-13 Canon Kk Input / output device and method for processing image data
US5359676A (en) * 1993-07-19 1994-10-25 Xerox Corporation Decompression of standard ADCT-compressed document images
US5778092A (en) * 1996-12-20 1998-07-07 Xerox Corporation Method and apparatus for compressing color or gray scale documents
US6006226A (en) * 1997-09-24 1999-12-21 Ricoh Company Limited Method and system for document image feature extraction

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030133617A1 (en) * 2002-01-14 2003-07-17 Debargha Mukherjee Coder matched layer separation and interpolation for compression of compound documents
WO2003061292A1 (en) * 2002-01-14 2003-07-24 Hewlett-Packard Company Coder matched layer separation and interpolation for compression of compound documents
US6941024B2 (en) 2002-01-14 2005-09-06 Hewlett-Packard Development Company, L.P. Coder matched layer separation and interpolation for compression of compound documents
CN100373948C (en) * 2002-01-14 2008-03-05 惠普公司 Coder matched layer separation and interpolation for compression of compound documents
US20070013951A1 (en) * 2002-04-30 2007-01-18 Microsoft Corporation Mixed raster content files
US20050141035A1 (en) * 2003-12-04 2005-06-30 Xerox Corporation System and method for processing portions of documents using variable data
US8144360B2 (en) 2003-12-04 2012-03-27 Xerox Corporation System and method for processing portions of documents using variable data
TWI398816B (en) * 2004-02-12 2013-06-11 Xerox Corp Systems and methods for adjusting image data to form highly compressible image planes
US7899258B2 (en) 2005-08-12 2011-03-01 Seiko Epson Corporation Systems and methods to convert images into high-quality compressed documents
US7783117B2 (en) 2005-08-12 2010-08-24 Seiko Epson Corporation Systems and methods for generating background and foreground images for document compression
US20070217701A1 (en) * 2005-08-12 2007-09-20 Che-Bin Liu Systems and Methods to Convert Images into High-Quality Compressed Documents
US20070189615A1 (en) * 2005-08-12 2007-08-16 Che-Bin Liu Systems and Methods for Generating Background and Foreground Images for Document Compression
US7894689B2 (en) 2007-05-31 2011-02-22 Seiko Epson Corporation Image stitching
US20080298718A1 (en) * 2007-05-31 2008-12-04 Che-Bin Liu Image Stitching
CN101505358B (en) * 2008-02-06 2011-01-12 株式会社Pfu Image processor, image processing method
US20140317345A1 (en) * 2013-04-18 2014-10-23 Xerox Corporation Method and apparatus for an efficient hardware implementation of dictionary based lossless compression
US9015429B2 (en) * 2013-04-18 2015-04-21 Xerox Corporation Method and apparatus for an efficient hardware implementation of dictionary based lossless compression

Also Published As

Publication number Publication date
JP2000175053A (en) 2000-06-23
BR9907400A (en) 2000-09-12
US6334001B2 (en) 2001-12-25

Similar Documents

Publication Publication Date Title
US6275620B2 (en) Method and apparatus for pre-processing mixed raster content planes to improve the quality of a decompressed image and increase document compression ratios
US6334001B2 (en) Iterative smoothing technique for pre-processing mixed raster content planes to improve the quality of a decompressed image and increase document compression ratios
US6324305B1 (en) Method and apparatus for segmenting a composite image into mixed raster content planes
US6373981B1 (en) Method and apparatus for segmenting data to create mixed raster content planes
US6400844B1 (en) Method and apparatus for segmenting data to create mixed raster content planes
US5432870A (en) Method and apparatus for compressing and decompressing images of documents
US6556711B2 (en) Image processing apparatus and method
US6307962B1 (en) Document data compression system which automatically segments documents and generates compressed smart documents therefrom
US7489830B2 (en) Methods for generating anti-aliased text and line graphics in compressed document images
US6751356B2 (en) Image processing apparatus and method
De Queiroz et al. Optimizing block-thresholding segmentation for multilayer compression of compound images
US20030202697A1 (en) Segmented layered image system
US6608928B1 (en) Generic pre-processing of mixed raster content planes
US5854857A (en) Using encoding cost data for segmentation and background suppression in JPEG-compressed images
US5442459A (en) Process for encoding a half tone image considering similarity between blocks
Huttenlocher et al. Digipaper: A versatile color document image representation
US6594385B2 (en) Image compression of background and text tiles
US6728412B1 (en) Method and apparatus for on-the-fly image coding
EP1006714A2 (en) Method of processing mixed raster content planes
EP0902398B1 (en) Method and system for compressing and decompressing binary representations of dithered images
de Queiroz et al. Compressing compound documents
EP1006717B1 (en) Method and apparatus for segmenting data
EP1006711A2 (en) Method and apparatus for processing a pixel map
JPH10108011A (en) Data-processing unit
JP3647071B2 (en) Image processing apparatus and method

Legal Events

Date Code Title Description
STCF Information on status: patent grant

Free format text: PATENTED CASE

AS Assignment

Owner name: BANK ONE, NA, AS ADMINISTRATIVE AGENT, ILLINOIS

Free format text: SECURITY INTEREST;ASSIGNOR:XEROX CORPORATION;REEL/FRAME:013153/0001

Effective date: 20020621

AS Assignment

Owner name: JPMORGAN CHASE BANK, AS COLLATERAL AGENT, TEXAS

Free format text: SECURITY AGREEMENT;ASSIGNOR:XEROX CORPORATION;REEL/FRAME:015134/0476

Effective date: 20030625

Owner name: JPMORGAN CHASE BANK, AS COLLATERAL AGENT,TEXAS

Free format text: SECURITY AGREEMENT;ASSIGNOR:XEROX CORPORATION;REEL/FRAME:015134/0476

Effective date: 20030625

FPAY Fee payment

Year of fee payment: 4

FPAY Fee payment

Year of fee payment: 8

FPAY Fee payment

Year of fee payment: 12

AS Assignment

Owner name: XEROX CORPORATION, CONNECTICUT

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:JPMORGAN CHASE BANK, N.A. AS SUCCESSOR-IN-INTEREST ADMINISTRATIVE AGENT AND COLLATERAL AGENT TO JPMORGAN CHASE BANK;REEL/FRAME:066728/0193

Effective date: 20220822