WO1999034319A1 - Image subregion querying using color correlograms - Google Patents

Image subregion querying using color correlograms Download PDF

Info

Publication number
WO1999034319A1
WO1999034319A1 PCT/US1998/027671 US9827671W WO9934319A1 WO 1999034319 A1 WO1999034319 A1 WO 1999034319A1 US 9827671 W US9827671 W US 9827671W WO 9934319 A1 WO9934319 A1 WO 9934319A1
Authority
WO
WIPO (PCT)
Prior art keywords
color
correlogram
image object
image
values
Prior art date
Application number
PCT/US1998/027671
Other languages
French (fr)
Inventor
Jing Huang
Shanmugasundaram Ravi Kumar
Mandar Mitra
Wei-Jing Zhu
Original Assignee
Cornell Research Foundation, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cornell Research Foundation, Inc. filed Critical Cornell Research Foundation, Inc.
Priority to AU22075/99A priority Critical patent/AU2207599A/en
Publication of WO1999034319A1 publication Critical patent/WO1999034319A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/90Determination of colour characteristics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5838Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using colour
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/56Extraction of image or video features relating to colour
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image

Definitions

  • This invention relates generally to data management, and, more particularly to retrieving specific portions of images using color correlograms.
  • Image databases are becoming larger and more widespread, and there is a growing need for effective and efficient image retrieval systems.
  • Image retrieval systems are systems that extract from a large collection of images ones that are "similar" to an image of interest to the user.
  • Most existing image retrieval systems adopt the following two-step approach to search image databases: (i) indexing: for each image in the database, a feature vector capturing certain essential properties of the image is computed and stored in a featurebase, and (ii) searching: given a query image, its feature vector is computed, compared to the feature vectors in the featurebase, and images most similar to the query image are returned to the user.
  • the feature defined for an image should have certain desirable qualities: (i) the difference between pre-selected features of two images should be large if and only if the images are not "similar", (ii) the feature should be fast to compute, and (iii) the size of the feature should be small.
  • target searching the user specifies a subregion (usually an interesting object) of an image as a query. For example, a user might wish to find pictures in which a given object appears, or scenes in a video with a given appearance of a person.
  • the system should then retrieve images containing this subregion, or object from the database.
  • This task called image subregion querying, is made challenging by the wide variety of effects, such as different viewing positions, camera noise and variation, and object occlusion, that cause the same object to have a different appearance in different images.
  • Color histograms are commonly used as feature vectors for image retrieval and for detecting cuts in video processing because histograms are efficient to compute and insensitive to camera motions. Histograms are not robust to local changes in images, so false positives easily occur using histograms. Though the histogram is easy to compute and seemingly effective, it is liable to cause false positive matches, especially where databases are large, and is not robust to large appearance changes . Another disadvantage of the color histogram is insensitivity to illumination changes. Recently, several approaches have attempted to improve upon the histogram by incorporating spatial information with color.
  • the color coherence vector (CCV) method uses the image feature (s), e.g. spatial coherence of colors and pixel position, to refine the histogram.
  • the image subregion retrieval system should also be able to solve the location problem, i.e. the system should be able to find the location of the object in the image.
  • the location problem arises in tasks such as real-time object tracking and video searching, where it is necessary to localize the position of an object in a sequence of frames.
  • Template matching is one approach used to solve the location problem. This method generally yields good results but is computationally expensive.
  • a refined form of template matching is the histogram backprojection method. The method of histogram backprojection is to first compute a "goodness value" for each pixel in an image (the goodness of each pixel is the likelihood that this pixel is in the target) and then to obtain the subimage and therefore the location whose pixels have the highest goodness values.
  • Histogram backprojection however gives the same goodness value to all pixels of the same color.
  • the technique emphasizes colors that appear frequently in the image. This may result in overemphasizing certain colors in the object Q. If the image has a subimage that has many pixels of color c, then this method tends to identify Q with this subimage, even though the two objects may be unrelated, thus causing an error in some cases.
  • Cut detection is the process of segmenting a video into different camera shots which allows the extraction of key frames for video parsing and querying.
  • a flexible tool for browsing video databases should also provide users with the capability to place object-level queries that have semantic content, such as "track this person in a sequence of video".
  • object-level queries that have semantic content, such as "track this person in a sequence of video”.
  • the system has to find which frames contain the specific object or person, and has to locate the object in those frames.
  • the problems of image retrieval are solved by the present invention of providing and using a color correlogram to query objects in images.
  • the color correlogram of the present invention is a three-dimensional representation indexed by color and distance between pixels which expresses how the spatial correlation of color changes with distance in a stored image.
  • the color correlogram includes spatial correlation of colors, combines both the global and local distributions of colors, is easy to compute, and is small from a data storage perspective.
  • the color correlogram is robust in tolerating large changes in the appearance of a scene caused by changes in viewing positions, changes in the background scene, partial occlusions, and magnification that causes radical changes in shape .
  • the colors in the image are quantized into m color values, c. ... c ra .
  • the distance values ⁇ e[ ⁇ j to be used in the correlogram are determined where [d] is the set of distances between pixels in the image, and where dmax is the maximum distance measurement between pixels in the image.
  • Each entry in the color correlogram is the probability of finding a pixel of color c at a selected distance k from a pixel of color c t .
  • a color autocorrelogram as provided in this invention, is a restricted version of the color correlogram that considers color pairs of the form (i,i) only.
  • the color correlogram may be used to query objects in images as well as entire images stored in a database. Extensions to the color correlogram may also be used in object retrieval tasks. The general theme behind the extensions are the improvement of storage efficiency of the correlogram without compromising the image discrimination capability of the correlogram and the use of additional information (such as an edge) to further refine the correlogram which improves image retrieval performance.
  • the correlogram intersection is used for image subregion querying. Using the correlogram intersection, the relative counts of color pairs in the images being compared are determined. The comparison easily eliminates the images which do not match.
  • the correlogram may also be used in locating objects in images.
  • the location problem arises in tasks such as realtime object tracking or video searching, where it is necessary to localize the position of an object in a sequence of frames. Efficiency is also required in location because large amounts of data must be processed.
  • Any norm for comparing vectors may be used to compare color correlograms/color autocorrelograms .
  • Figure 3 is a graphical representation of a plurality of autocorrelograms according to principles of the present invention.
  • Figure 4 is a flow chart of the process of retrieving from a database images matching a query image using the color correlogram according to principles of the present invention.
  • FIG. 1 illustrates a graphic representation of the color correlogram 10 of the present invention.
  • the color correlogram 10 is a three-dimensional table indexed by color value i, color value j, and by distance k between pixels in an image.
  • the color correlogram 10 expresses how the spatial correlation of color changes with distance in the image.
  • the spatial correlation of color in a particular image is a feature which may be used to distinguish the image from other images .
  • Putting the spatial correlation of colors data into the format of the color correlogram creates a data object associated with the image which may be stored in a database and queried.
  • the color correlogram embodies color characteristics of an image in a way which distinguishes the image from other images while tolerating large changes in appearance of the image content due to changes in, but not limited to, viewing positions, changes in the background scene, partial occlusions, and camera zoom that causes radical changes in shape.
  • the color correlogram of the present invention includes spatial correlation of colors, combines both the global and local distributions of colors, is easy to compute, and is small from a data storage perspective.
  • the colors in the image are quantized into m color values, c x ... c ra .
  • the distance values Dc[J] to be used in the correlogram are determined where [d] is the set of distances between pixels in the image, and where dmax is the maximum distance measurement between pixels in the image.
  • an image I for example, is an n x n matrix (square for the sake of simplicity) .
  • the image I has a set of values of distances between pixels [d] , the maximum value of d being the largest distance between pixels in the image.
  • the color values and distances are used to index the correlogram as shown in Figure 1.
  • the value in each entry (c i t c ⁇ k) of the correlogram 10, such as the entry (c., c x , 3) 15, is the probability Pr of finding a pixel of a color value c ⁇ at a distance k away from a pixel of color value c t .
  • a color autocorrelogram may also be used with the concepts of this invention to distinguish an image from other images.
  • the color autocorrelogram is a restricted version of the color correlogram that considers only same-color pairs, that is color values of the form (c i# c ) .
  • a comprehensive correlogram identification of the image I involves calculating correlograms from a number of distances k from the set of [d] for all of the quantized color pairs ( c , c ) .
  • Experimental evidence has indicated, however, that only the autocorrelogram, which uses same color-value color-pairs, and a few values of k are needed to produce a useful image identifier.
  • FIG. 3 shows several example autocorrelograms where probability is plotted against distance k.
  • the solid line 60 in the graph is representative of the autocorrelogram for a first color value in a first exemplary image.
  • the dot- dash line 65 in the graph yields the autocorrelogram for a second color in the first exemplary image.
  • the dotted line 70 in the graph gives the autocorrelogram for the first color in a second exemplary image.
  • the images are identifiable from their correlogram and may be compared using their correlograms .
  • the straightforward method for calculating the color correlogram of this invention is to take a first pixel of the color c A in the image I, and for each selected k in the set of [d] , to count all pixels of color c ⁇ which are k distance away from the first pixel . This process is repeated for each pixel in the image over all of the selected values k in the set of [d] . This method takes a long time.
  • This quantity represents those pixels in the image of color c. Then the following quantities are defined:
  • the correlogram entry (c ⁇ c ⁇ k) can be computed as ⁇ * ( ( 'j. ⁇ I8& H (i) ⁇ where f ⁇ is the number of pixels of the color c i in the image.
  • the color correlogram and the autocorrelogram may be stored in a database and queried in order to identify matching images .
  • Figure 4 shows a flow chart of the method of this invention of image retrieval, using color correlograms, from a database having stored color correlograms.
  • an input query image is provided, block 100.
  • the correlogram of the input query image is computed, block 110, using one of the methods described above, depending on the type of correlograms stored in the database.
  • the correlogram of the input query image is compared to the correlograms stored in the database, block 115.
  • the standard x norm is used to compare color correlograms and color autocorrelograms however any method for comparing vectors may be used.
  • the L is used to compare color correlograms and color autocorrelograms however any method for comparing vectors may be used.
  • the distance commonly used to compare vectors, is the sum of absolute differences of the components of the vectors being compared.
  • the relative distance between two numbers x and y is given by the expression
  • the relative distance measure calculates the sum of the relative differences of the components of the vectors and in most cases performs better than the absolute measure.
  • the resulting distances are sorted by increasing order, block 120. Generally, a number of top matches is preselected and this number of images are presented as an output of images matching the input query image, block 125.
  • the color correlogram may be used to query objects in images as well as entire images stored in a database.
  • the image subregion querying problem may be defined as follows: given as an input a subregion query Q of an image I and an image set S, retrieve from S those images Q ' in which the query Q appears according to human perception (denoted Q ' QQ) .
  • the set of images might consist of a database of still images, or videos, or some combination of both. The problem is made even more difficult than image retrieval by a wide variety of effects on the appearance of an object, such as changing viewpoint, camera noise and occlusion.
  • intersection correlogram A solution to the image subregion querying problem is the intersection correlogram.
  • the intersection correlogram is defined as the correlogram of the intersection Q ]I .
  • the color pair count in the nonintersection correlogram is defined as:
  • intersection correlogram is defined as:
  • the image T should have at least as many counts of correlating color pairs as the object Q.
  • the counts r and H for Qf]I becomes exactly the correlogram of Q, giving
  • the difference between the correlogram of the object Q and the intersection correlogram of the object Q and the image I is zero, there is a match of the object Q with the image I.
  • the distance between Q and Qf)I vanishes when Q is actually a subset of T. This affirms the fact that the correlogram is a stable property of images.
  • the stability of the correlogram is not satisfied by all image features. For example, spatial coherence is not preserved under subset operations .
  • the color correlogram may also be used to find the location of an object in an image.
  • the location problem may be defined as follows: given a query image (also called a target or a model) Q and an image I such that Q c / , find the location in I where Q is present.
  • the mathematical location is defined at the center of the target for convenience .
  • a correlogram backprojection is combined with a correlogram correction in order to incorporate local spatial correlation information.
  • the objective is to integrate discriminating local characteristics while avoiding local color template matching.
  • the image and the image object are correlogrammed according to the principles of the present invention. Then, each color value is assigned a frequency value according to how often the color value appears in the object versus the background of the image in which the object appears. The frequency values are back- projected into the image so that each point in the resulting image correlogram has a color-frequency value as well as a color value, the color-frequency value representing the degree to which a particular color is a useful object indicator.
  • a local correlogram contribution is defined by the autocorrelogram of the subimage I ⁇ p so that the goodness of a pixel depends on its position in addition to its color.
  • the local autocorrelogram 0.p is computed for each distance k e [d] ( [d] should contain only small values so that otp captures local information for each pixel) .
  • the contribution of p is the L x -similarity between the local autocorrelogram at p and the part of the autocorrelogram for Q that corresponds to the color of p.
  • a final goodness value of a subimage I ⁇ p ⁇ that is the values to be back- projected onto the image, is given by the equation
  • C b is defined as a less dominant color, e.g. the background color, that has a high autocorrelogram. If image T has a subimage l ⁇ p (which may be totally irrelevant to object Q) that has many pixels of color c b with high autocorrelations, then the correlogram backprojection has a tendency to identify Q with T
  • the increasing availability of video data makes automated video analysis a necessity.
  • the step to automate video content analysis is to segment a video into camera shots (also known as key frame extraction) .
  • a camera shot is an unbroken sequence of frames from one camera and a cut is said to occur when two consecutive frames are from different shots.
  • Cut detection algorithms usually work as follows: adjacent frames are compared using some image feature and frames that are sufficiently similar are assumed to belong to the same shot, and dissimilar frames are taken to signify a cut. Different cut detectors use different features to compute the similarity between consecutive frames, e.g. pixel difference, statistical differences, histogram comparisons, edge differences. Correlograms have been shown to be robust to large appearance changes for image retrieval and correlograms are used for cut detection. In a sequence of video frames that have some number of cuts in the sequence, a pair of adjacent images are evaluated using color correlograms using a preselected feature f to determine if the cut occurs between the two images. If the frames do not match according to the feature f, then the cut does occur between the two adjacent images.

Abstract

A color correlogram (110) is a representation expressing the spatial correlation of color and distance between pixels in a stored image. The color correlogram may be used to distinguish objects in an image as well as between images in a plurality of images. By intersecting a color correlogram of an image object (115) with correlograms of images to be searched (100), those images which contain the objects are identified by the intersection correlogram (125).

Description

IMAGE SUBREGION QUERYING USING COLOR CORRELOGRAMS
CROSS REFERENCE TO RELATED APPLICATIONS This application claims priority of U.S. provisional applications Serial No. 60/068,915 entitled, "Technique for Image Subregion Querying" filed December 29, 1997 by the present applicants, and Serial No. 60/089,684, entitled "Image Indexing Using Color Correlograms" filed June 17, 1998 by the present applicants . This application is also related to co-pending application Serial No. , entitled, "Image
Indexing Using Color Correlograms" by the present applicants.
STATEMENT OF GOVERNMENT INTEREST
This invention was partially funded by the Government under a grant from DARPA/ARL, ONR Young Investigator Award N00014-93-1-0590, NSF grants DMI-91157199 and IRI 93-00124, career grant CCR-9624552, and DOE grant DEFG02-89ER45405. The Government has certain rights in portions of the invention.
BACKGROUND OF THE INVENTION This invention relates generally to data management, and, more particularly to retrieving specific portions of images using color correlograms.
With the rapid proliferation of the Internet and the World- ide Web, the amount of digital image data accessible to users has grown enormously. Image databases are becoming larger and more widespread, and there is a growing need for effective and efficient image retrieval systems. Image retrieval systems are systems that extract from a large collection of images ones that are "similar" to an image of interest to the user. Most existing image retrieval systems adopt the following two-step approach to search image databases: (i) indexing: for each image in the database, a feature vector capturing certain essential properties of the image is computed and stored in a featurebase, and (ii) searching: given a query image, its feature vector is computed, compared to the feature vectors in the featurebase, and images most similar to the query image are returned to the user.
For a retrieval system to be successful, the feature defined for an image should have certain desirable qualities: (i) the difference between pre-selected features of two images should be large if and only if the images are not "similar", (ii) the feature should be fast to compute, and (iii) the size of the feature should be small. While most image retrieval systems retrieve images based on overall image comparison, users are typically interested in target searching such as in a database of images or in video browsing. In target searching, the user specifies a subregion (usually an interesting object) of an image as a query. For example, a user might wish to find pictures in which a given object appears, or scenes in a video with a given appearance of a person. In response to the user's query, the system should then retrieve images containing this subregion, or object from the database. This task, called image subregion querying, is made challenging by the wide variety of effects, such as different viewing positions, camera noise and variation, and object occlusion, that cause the same object to have a different appearance in different images.
Color histograms are commonly used as feature vectors for image retrieval and for detecting cuts in video processing because histograms are efficient to compute and insensitive to camera motions. Histograms are not robust to local changes in images, so false positives easily occur using histograms. Though the histogram is easy to compute and seemingly effective, it is liable to cause false positive matches, especially where databases are large, and is not robust to large appearance changes . Another disadvantage of the color histogram is insensitivity to illumination changes. Recently, several approaches have attempted to improve upon the histogram by incorporating spatial information with color.
Many of these methods are still unable to handle large changes in appearance. For instance, the color coherence vector (CCV) method uses the image feature (s), e.g. spatial coherence of colors and pixel position, to refine the histogram. These additional features improve performance, but also require increased storage and computation time.
The image subregion retrieval system should also be able to solve the location problem, i.e. the system should be able to find the location of the object in the image. The location problem arises in tasks such as real-time object tracking and video searching, where it is necessary to localize the position of an object in a sequence of frames. Template matching is one approach used to solve the location problem. This method generally yields good results but is computationally expensive. A refined form of template matching is the histogram backprojection method. The method of histogram backprojection is to first compute a "goodness value" for each pixel in an image (the goodness of each pixel is the likelihood that this pixel is in the target) and then to obtain the subimage and therefore the location whose pixels have the highest goodness values. Histogram backprojection however gives the same goodness value to all pixels of the same color. The technique emphasizes colors that appear frequently in the image. This may result in overemphasizing certain colors in the object Q. If the image has a subimage that has many pixels of color c, then this method tends to identify Q with this subimage, even though the two objects may be unrelated, thus causing an error in some cases.
Another task requiring object retrieval from images is cut detection in video processing. Cut detection is the process of segmenting a video into different camera shots which allows the extraction of key frames for video parsing and querying.
A flexible tool for browsing video databases should also provide users with the capability to place object-level queries that have semantic content, such as "track this person in a sequence of video". To handle to queries, the system has to find which frames contain the specific object or person, and has to locate the object in those frames.
It remains desirable to have an efficient and accurate means of identifying and retrieving objects in images which allows for changes in the appearance of the image content such as viewing angle and magnification.
It is therefore an object of the present invention to provide a method and apparatus to perform efficient image comparisons in order to retrieve objects in images.
It is a further object of the present invention to provide a method and apparatus to provide to perform image comparisons for image subregion querying which allow for significant changes in the image such as viewing position, background, and focus.
It is another object of the present invention to provide a method and apparatus which enables efficient image subregion retrieval from a database.
SUMMARY OF THE INVENTION
The objects set forth above as well as further and other objects and advantages of the present invention are achieved by the embodiments of the invention described hereinbelow. The problems of image retrieval are solved by the present invention of providing and using a color correlogram to query objects in images. The color correlogram of the present invention is a three-dimensional representation indexed by color and distance between pixels which expresses how the spatial correlation of color changes with distance in a stored image. The color correlogram includes spatial correlation of colors, combines both the global and local distributions of colors, is easy to compute, and is small from a data storage perspective. The color correlogram is robust in tolerating large changes in the appearance of a scene caused by changes in viewing positions, changes in the background scene, partial occlusions, and magnification that causes radical changes in shape .
To create a color correlogram, the colors in the image are quantized into m color values, c. ... cra. Also, the distance values κe[αj to be used in the correlogram are determined where [d] is the set of distances between pixels in the image, and where dmax is the maximum distance measurement between pixels in the image. Each entry in the color correlogram is the probability of finding a pixel of color c at a selected distance k from a pixel of color ct. A color autocorrelogram, as provided in this invention, is a restricted version of the color correlogram that considers color pairs of the form (i,i) only.
The color correlogram may be used to query objects in images as well as entire images stored in a database. Extensions to the color correlogram may also be used in object retrieval tasks. The general theme behind the extensions are the improvement of storage efficiency of the correlogram without compromising the image discrimination capability of the correlogram and the use of additional information (such as an edge) to further refine the correlogram which improves image retrieval performance.
The correlogram intersection is used for image subregion querying. Using the correlogram intersection, the relative counts of color pairs in the images being compared are determined. The comparison easily eliminates the images which do not match.
The correlogram may also be used in locating objects in images. The location problem arises in tasks such as realtime object tracking or video searching, where it is necessary to localize the position of an object in a sequence of frames. Efficiency is also required in location because large amounts of data must be processed.
Any norm for comparing vectors, for example the standard L1 norm, may be used to compare color correlograms/color autocorrelograms .
Experimental evidence shows that the color correlogram outperforms not only color histograms but also more recent histogram refinements such as the color coherence vector method for image indexing and retrieval . The present invention together with the above and other advantages may best be understood from the following detailed description of the embodiments of the invention illustrated in the drawings, wherein: BRIEF DESCRIPTION OF THE DRAWINGS Figure 1 is a graphic representation of a color correlogram according to principles of the invention; Figure 2 is an image X;
Figure 3 is a graphical representation of a plurality of autocorrelograms according to principles of the present invention; and,
Figure 4 is a flow chart of the process of retrieving from a database images matching a query image using the color correlogram according to principles of the present invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Figure 1 illustrates a graphic representation of the color correlogram 10 of the present invention. The color correlogram 10 is a three-dimensional table indexed by color value i, color value j, and by distance k between pixels in an image. The color correlogram 10 expresses how the spatial correlation of color changes with distance in the image.
The spatial correlation of color in a particular image is a feature which may be used to distinguish the image from other images . Putting the spatial correlation of colors data into the format of the color correlogram creates a data object associated with the image which may be stored in a database and queried. The color correlogram embodies color characteristics of an image in a way which distinguishes the image from other images while tolerating large changes in appearance of the image content due to changes in, but not limited to, viewing positions, changes in the background scene, partial occlusions, and camera zoom that causes radical changes in shape. In sum, the color correlogram of the present invention includes spatial correlation of colors, combines both the global and local distributions of colors, is easy to compute, and is small from a data storage perspective. To create a color correlogram as defined in this invention, the colors in the image are quantized into m color values, cx ... cra. Also, the distance values Dc[J] to be used in the correlogram are determined where [d] is the set of distances between pixels in the image, and where dmax is the maximum distance measurement between pixels in the image. In Figure 2, an image I, for example, is an n x n matrix (square for the sake of simplicity) . The distance between pixels px and p2, where p. = (x., yx) and p2 = (x2, y2) , is
I Pi - P2 I = max{|x1 - x21 , |y1 - y21 } (1).
The image I has a set of values of distances between pixels [d] , the maximum value of d being the largest distance between pixels in the image.
The color values and distances are used to index the correlogram as shown in Figure 1. The value in each entry (ci t c^ k) of the correlogram 10, such as the entry (c., cx , 3) 15, is the probability Pr of finding a pixel of a color value c^ at a distance k away from a pixel of color value ct .
A color autocorrelogram may also be used with the concepts of this invention to distinguish an image from other images. The color autocorrelogram is a restricted version of the color correlogram that considers only same-color pairs, that is color values of the form (ci# c ) . A comprehensive correlogram identification of the image I involves calculating correlograms from a number of distances k from the set of [d] for all of the quantized color pairs ( c , c ) . Experimental evidence has indicated, however, that only the autocorrelogram, which uses same color-value color-pairs, and a few values of k are needed to produce a useful image identifier. The simplified nature of the autocorrelogram facilitates a two-dimensional representation which is shown graphically in Figure 3. Figure 3 shows several example autocorrelograms where probability is plotted against distance k. The solid line 60 in the graph is representative of the autocorrelogram for a first color value in a first exemplary image. The dot- dash line 65 in the graph yields the autocorrelogram for a second color in the first exemplary image. The dotted line 70 in the graph gives the autocorrelogram for the first color in a second exemplary image. The images are identifiable from their correlogram and may be compared using their correlograms . Referring once again to Figure 1, the straightforward method for calculating the color correlogram of this invention, is to take a first pixel of the color cA in the image I, and for each selected k in the set of [d] , to count all pixels of color c^ which are k distance away from the first pixel . This process is repeated for each pixel in the image over all of the selected values k in the set of [d] . This method takes a long time.
To reduce the time of the correlogram calculation, the following algorithm is used. First, Ic is defined as an n x n 0-1 matrix such that lc p) = 1 <= l{p = c . This quantity represents those pixels in the image of color c. Then the following quantities are defined:
χ*y)(ky.-
Figure imgf000010_0001
≤ i ≤ k}\ (2 )
λZy)(k) v\{(x,y + i)
Figure imgf000010_0002
≤ j ≤ k}\ (3)
These quantities count the number of pixels of a given color c within a given distance k from a fixed pixel (x,y) in the positive horizontal and vertical directions. These expressions, equations 2 and 3, represent a restricted count the number of pixels of a particular color within a specified distance k from a selected pixel in the positive horizontal and vertical directions instead of all the pixels in a radius around the first pixel as described above. The method of calculating the color correlogram works by first computing ^ ''v and '' where pixel p = (x,y) .
Figure imgf000010_0003
with the initial condition ^ J' (0) = 1 if P ^ JC and for each k = 1 ... d using equation 4. In a similar manner, V can also be efficiently computed.
The modulo boundaries are defined as follows:
!, =
Σ( λ<r 2* - 2) + K -k)w + λ k,y+k)m + x^ 2k -2))
from which the correlogram entry (c^c^k) can be computed as Λ*( ('j. ÷ I8& H (i)\ where f{ is the number of pixels of the color ci in the image. As stated above, the color correlogram and the autocorrelogram may be stored in a database and queried in order to identify matching images .
Figure 4 shows a flow chart of the method of this invention of image retrieval, using color correlograms, from a database having stored color correlograms. First, an input query image is provided, block 100. The correlogram of the input query image is computed, block 110, using one of the methods described above, depending on the type of correlograms stored in the database. Then the correlogram of the input query image is compared to the correlograms stored in the database, block 115. In the present embodiment of the invention, the standard x norm is used to compare color correlograms and color autocorrelograms however any method for comparing vectors may be used. The L. distance, commonly used to compare vectors, is the sum of absolute differences of the components of the vectors being compared. The relative distance between two numbers x and y is given by the expression |x-y | / (1+x+y) . The relative distance measure calculates the sum of the relative differences of the components of the vectors and in most cases performs better than the absolute measure. The resulting distances are sorted by increasing order, block 120. Generally, a number of top matches is preselected and this number of images are presented as an output of images matching the input query image, block 125. The color correlogram may be used to query objects in images as well as entire images stored in a database. The image subregion querying problem may be defined as follows: given as an input a subregion query Q of an image I and an image set S, retrieve from S those images Q ' in which the query Q appears according to human perception (denoted Q ' QQ) . The set of images might consist of a database of still images, or videos, or some combination of both. The problem is made even more difficult than image retrieval by a wide variety of effects on the appearance of an object, such as changing viewpoint, camera noise and occlusion.
A solution to the image subregion querying problem is the intersection correlogram. The intersection correlogram is defined as the correlogram of the intersection Q ]I . The color pair count in the nonintersection correlogram is defined as:
Figure imgf000012_0001
Using this the intersection correlogram is defined as:
Figure imgf000012_0002
The presence of object Q in I is measured by the distance
Figure imgf000012_0003
If Q ς_ I , then the image T should have at least as many counts of correlating color pairs as the object Q. Thus the counts r and H for Qf]I becomes exactly the correlogram of Q, giving |_2- In other words, where the difference
Figure imgf000012_0004
between the correlogram of the object Q and the intersection correlogram of the object Q and the image I is zero, there is a match of the object Q with the image I. The distance between Q and Qf)I vanishes when Q is actually a subset of T. This affirms the fact that the correlogram is a stable property of images. The stability of the correlogram, however, is not satisfied by all image features. For example, spatial coherence is not preserved under subset operations .
The color correlogram may also be used to find the location of an object in an image. The location problem may be defined as follows: given a query image (also called a target or a model) Q and an image I such that Q c / , find the location in I where Q is present. The mathematical location is defined at the center of the target for convenience .
To locate objects in an image, a correlogram backprojection is combined with a correlogram correction in order to incorporate local spatial correlation information. The objective is to integrate discriminating local characteristics while avoiding local color template matching. To create a color correlogram backprojection, the image and the image object are correlogrammed according to the principles of the present invention. Then, each color value is assigned a frequency value according to how often the color value appears in the object versus the background of the image in which the object appears. The frequency values are back- projected into the image so that each point in the resulting image correlogram has a color-frequency value as well as a color value, the color-frequency value representing the degree to which a particular color is a useful object indicator.
A local correlogram contribution is defined by the autocorrelogram of the subimage I \p so that the goodness of a pixel depends on its position in addition to its color.
For each pixel p e I, the local autocorrelogram 0.p is computed for each distance k e [d] ( [d] should contain only small values so that otp captures local information for each pixel) . The contribution of p is the Lx-similarity between the local autocorrelogram at p and the part of the autocorrelogram for Q that corresponds to the color of p. A final goodness value of a subimage I \p ι that is the values to be back- projected onto the image, is given by the equation
Figure imgf000014_0001
where 0 < β < 1. The correlogram contribution to correlogram correction by itself is sensitive and may in some cases overemphasize less dominant colors. Cb is defined as a less dominant color, e.g. the background color, that has a high autocorrelogram. If image T has a subimage l\ p (which may be totally irrelevant to object Q) that has many pixels of color cb with high autocorrelations, then the correlogram backprojection has a tendency to identify Q with T|p thus causing an error. Because the problems with histograms and correlograms are somewhat complementary to each other, the best results are obtained when the goodness of a pixel is given by a weighted linear combination of the histogram and correlogram backprojection contributions. This is called a correlogram correction. The color histogram is known in the art. The best weight can be determined by experimental means and is dependent on the particular application and the database being used.
The increasing availability of video data makes automated video analysis a necessity. The step to automate video content analysis is to segment a video into camera shots (also known as key frame extraction) . A camera shot is an unbroken sequence of frames from one camera and a cut is said to occur when two consecutive frames are from different shots.
Cut detection algorithms usually work as follows: adjacent frames are compared using some image feature and frames that are sufficiently similar are assumed to belong to the same shot, and dissimilar frames are taken to signify a cut. Different cut detectors use different features to compute the similarity between consecutive frames, e.g. pixel difference, statistical differences, histogram comparisons, edge differences. Correlograms have been shown to be robust to large appearance changes for image retrieval and correlograms are used for cut detection. In a sequence of video frames that have some number of cuts in the sequence, a pair of adjacent images are evaluated using color correlograms using a preselected feature f to determine if the cut occurs between the two images. If the frames do not match according to the feature f, then the cut does occur between the two adjacent images.
It is to be understood that the above-described embodiments are simply illustrative of the principles of the invention. Various and other modifications and changes may be made by those skilled in the art which will embody the principles of the invention and fall within the spirit and scope thereof :
What is claimed is:

Claims

1. A computer-implemented method for retrieving an image object from a plurality of images, comprising the steps of: providing an image object color correlogram; providing a plurality of color values; selecting a distance value to be used as the distance between pixels, in the image object and in the plurality of images, to be evaluated for color value; analyzing said image object according to said color values and said selected distance value; determining in response to the analyzing step a probability of finding a pixel of a particular color value at said distance value from a pre-selected pixel of a pre- selected color value; entering said probability into the image object color correlogram; providing color correlograms for each of said plurality of images; and intersecting the image object color correlogram with correlograms of the plurality of images to produce an intersection result, wherein the image object is distinguished by the intersection result from the images which do not contain the image object.
2. The method of claim 1 wherein the intersecting step further comprises comparing a count of a first plurality of color pairs in the image object with a count of a second plurality of color pairs in the image, the first plurality of color pairs correlating to the second plurality of color pairs .
3. The method of claim 1 wherein the step of providing color correlograms for each of said plurality of images further comprises storing the provided correlograms in a database.
4. The method of claim 1 further comprising the steps of: selecting a plurality of distance values; and performing said analyzing step, said determining step and said entering step using said plurality of distance values.
5. A system for retrieving an image object from a plurality of images, comprising: means for providing an image object color correlogram; means for providing a plurality of color values; means for selecting a distance value to be used as the distance between pixels to be evaluated for color value in the image object and in the plurality of images; means for analyzing said image object according to said color values and said selected distance value; means for determining in response to said means for analyzing, a probability of finding a pixel of a particular color value at said distance value from a pre-selected pixel of a pre-selected color value; means for entering said probability into said image object color correlogram; means for providing color correlograms for each of said plurality of images; and means for intersecting said image object color correlogram with correlograms of the plurality of images to produce an intersection result, wherein the image object is distinguished from the images which do not contain the image object by the intersection result.
6. The system of claim 1 wherein said means for intersecting further comprises a means for comparing a count of a first plurality of color pairs in the image object with a count of a second plurality of color pairs in the image, the first plurality of color pairs correlating to the second plurality of color pairs.
7. The system of claim 1 further comprising a database for storing said provided correlograms of said plurality of images .
8. The system of claim 1 further comprising: means for selecting a plurality of distance values; means for analyzing said image object and said plurality of images according to said color values and said plurality of distance values; and means for determining, in response to said analyzing means, a probability of finding a pixel of a particular color value for each of said plurality of distance values from a selected pixel of a selected color value.
9. A computer-implemented method of locating an image object in an image comprising the steps of: providing a plurality of color values and at least one distance value; computing a color correlogram for the image object using the plurality of color values and the at least one distance value; computing a color correlogram for the image using the plurality of color values and the at least one distance value; analyzing the image object and the image to determine a color frequency value for each color value; assigning the color-frequency value to each pixel in the image object to make a back-projection image object correlogram; and combining the back-projection image object correlogram with the image correlogram to create a correlogram backprojection indicating the location of the image object in the image .
10. The method of claim 9 wherein said step of computing the image object color correlogram further comprises computing an autocorrelogram for the image object; and said step of computing the image color correlogram further comprises computing an autocorrelogram for the image.
11. The method of claim 9 further comprising the step of: locating the image object by the mathematical center of the image object.
12. The method of claim 9 further comprising the step of: combining the back-projection image object correlogram with a color histogram of the image object to obtain correction values; and combining the correction values with the color correlogram of the image to accurately locate the image obj ect .
13. The method of claim 12 further comprising the step of weighting the values of the back-projection image object correlogram and weighting the values of the color histogram to produce weighted correction values to be combined with the color correlogram of the image.
14. A system for locating an image object in an image, comprising: means for providing a plurality of color values and at least one distance value; means for computing a color correlogram for the image object using the plurality of color values and the at least one distance value; means for computing a color correlogram for the image using the plurality of color values and the at least one distance value; means for analyzing the image object and the image to determine a color frequency value for each color value; means for assigning the color-frequency value to each pixel in the image object to make a back-projection image object correlogram; and means for combining the back-projection image object 18 PCMJS98/27671 correlogram with the image correlogram to create a correlogram backprojection indicating the location of the image object in the image .
15. The system of claim 14 wherein said means for computing the image object color correlogram further comprises means for computing an autocorrelogram for the image object; and said means for computing the image color correlogram further comprises means for computing an autocorrelogram for the image .
16. The system of claim 14 further comprising: means for combining the back-projection image object correlogram with a color histogram of the image object to obtain correction values; and means for combining the correction values with the color correlogram of the image to accurately locate the image object.
17. The system of claim 16 further comprising: means for weighting the values of the back-projection image object correlogram; means for weighting the values of the color histogram to produce weighted correction values to be combined with the color correlogram of the image.
18. A method for detecting cuts in a sequence of video frames comprising the steps of: providing an image object; computing a color correlogram of the image object; computing a color correlogram of a first video frame; intersecting the image object color correlogram with the first video frame color correlogram to obtain a first result determining the presence or absence of the image object in the first video frame; computing a color correlogram of a second video frame, the second video frame being adjacent to the first video frame in the sequence of video frames; intersecting the image object color correlogram with the second video frame color correlogram to obtain a second result determining the presence or absence of the image object in the second video frame; and, comparing the first result with the second result in order to determine a cut between the first video frame and the second video frame, wherein a cut occurs where the image object is present in one of the adjacent video frames and not present in the other adjacent video frame.
PCT/US1998/027671 1997-12-29 1998-12-28 Image subregion querying using color correlograms WO1999034319A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU22075/99A AU2207599A (en) 1997-12-29 1998-12-28 Image subregion querying using color correlograms

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US6891597P 1997-12-29 1997-12-29
US60/068,915 1997-12-29
US8968498P 1998-06-17 1998-06-17
US60/089,684 1998-06-17

Publications (1)

Publication Number Publication Date
WO1999034319A1 true WO1999034319A1 (en) 1999-07-08

Family

ID=26749511

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1998/027671 WO1999034319A1 (en) 1997-12-29 1998-12-28 Image subregion querying using color correlograms

Country Status (3)

Country Link
US (2) US6430312B1 (en)
AU (1) AU2207599A (en)
WO (1) WO1999034319A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001011489A2 (en) * 1999-08-09 2001-02-15 Almen Laboratories, Inc. Retrieving images by defining an image content comprising objects
EP1225546A2 (en) * 2001-01-18 2002-07-24 Lg Electronics Inc. Method for setting dominant color using spatial coherency
US6584465B1 (en) 2000-02-25 2003-06-24 Eastman Kodak Company Method and system for search and retrieval of similar patterns
EP2165525A1 (en) * 2007-06-04 2010-03-24 Enswers Co., Ltd. Method of processing moving picture and apparatus thereof

Families Citing this family (119)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7280251B1 (en) 1996-02-26 2007-10-09 Rah Color Technologies System and method for calibrating color printers
US7728845B2 (en) 1996-02-26 2010-06-01 Rah Color Technologies Llc Color calibration of color image rendering devices
US6430312B1 (en) * 1997-12-29 2002-08-06 Cornell Research Foundation, Inc. Image subregion querying using color correlograms
US6411730B1 (en) * 1999-01-15 2002-06-25 Adobe Systems Incorporated Histogram for generating a palette of colors
US7417640B1 (en) * 1999-01-29 2008-08-26 Lg Electronics Inc. Method for dominant color setting of video region and data structure and method of confidence measure extraction
US6477269B1 (en) * 1999-04-20 2002-11-05 Microsoft Corporation Method and system for searching for images based on color and shape of a selected image
US7102648B1 (en) * 2000-04-11 2006-09-05 Rah Color Technologies Llc Methods and apparatus for calibrating a color display
KR100448452B1 (en) 2000-06-09 2004-09-13 엘지전자 주식회사 Method for supporting menu of a high-density recording medium
US7191103B2 (en) * 2001-08-08 2007-03-13 Hewlett-Packard Development Company, L.P. Predominant color identification in digital images
US6993180B2 (en) * 2001-09-04 2006-01-31 Eastman Kodak Company Method and system for automated grouping of images
US7415153B2 (en) * 2002-09-20 2008-08-19 International Business Machines Corporation Color naming, color categorization and describing color composition of images
EP1547067A4 (en) * 2002-10-02 2009-06-24 Lg Electronics Inc Recording medium having a data structure for managing reproduction of graphic data and recording and reproducing methods and apparatuses
CN100487807C (en) * 2002-10-04 2009-05-13 Lg电子有限公司 Recording medium having a data structure for managing reproduction of graphic data and recording and reproducing methods and apparatuses
US7263220B2 (en) * 2003-02-28 2007-08-28 Eastman Kodak Company Method for detecting color objects in digital images
EP1618562A4 (en) * 2003-04-29 2011-03-16 Lg Electronics Inc Recording medium having a data structure for managing reproduction of graphic data and methods and apparatuses of recording and reproducing
US7616865B2 (en) * 2003-04-30 2009-11-10 Lg Electronics Inc. Recording medium having a data structure for managing reproduction of subtitle data and methods and apparatuses of recording and reproducing
US8948468B2 (en) 2003-06-26 2015-02-03 Fotonation Limited Modification of viewing parameters for digital images using face detection information
US9692964B2 (en) 2003-06-26 2017-06-27 Fotonation Limited Modification of post-viewing parameters for digital images using image region or feature information
US7440593B1 (en) 2003-06-26 2008-10-21 Fotonation Vision Limited Method of improving orientation and color balance of digital images using face detection information
US8553949B2 (en) 2004-01-22 2013-10-08 DigitalOptics Corporation Europe Limited Classification and organization of consumer digital images using workflow, and face detection and recognition
US7587068B1 (en) 2004-01-22 2009-09-08 Fotonation Vision Limited Classification database for consumer digital images
US9129381B2 (en) 2003-06-26 2015-09-08 Fotonation Limited Modification of post-viewing parameters for digital images using image region or feature information
US8896725B2 (en) 2007-06-21 2014-11-25 Fotonation Limited Image capture device with contemporaneous reference image capture mechanism
US8989453B2 (en) 2003-06-26 2015-03-24 Fotonation Limited Digital image processing using face detection information
US7565030B2 (en) * 2003-06-26 2009-07-21 Fotonation Vision Limited Detecting orientation of digital images using face detection information
US8593542B2 (en) 2005-12-27 2013-11-26 DigitalOptics Corporation Europe Limited Foreground/background separation using reference images
US7620218B2 (en) 2006-08-11 2009-11-17 Fotonation Ireland Limited Real-time face tracking with reference images
US7616233B2 (en) 2003-06-26 2009-11-10 Fotonation Vision Limited Perfecting of digital image capture parameters within acquisition devices using face detection
US8494286B2 (en) 2008-02-05 2013-07-23 DigitalOptics Corporation Europe Limited Face detection in mid-shot digital images
US7844076B2 (en) 2003-06-26 2010-11-30 Fotonation Vision Limited Digital image processing using face detection and skin tone information
US7574016B2 (en) 2003-06-26 2009-08-11 Fotonation Vision Limited Digital image processing using face detection information
US8682097B2 (en) 2006-02-14 2014-03-25 DigitalOptics Corporation Europe Limited Digital image enhancement with reference images
US8363951B2 (en) 2007-03-05 2013-01-29 DigitalOptics Corporation Europe Limited Face recognition training method and apparatus
US8498452B2 (en) 2003-06-26 2013-07-30 DigitalOptics Corporation Europe Limited Digital image processing using face detection information
US7471846B2 (en) 2003-06-26 2008-12-30 Fotonation Vision Limited Perfecting the effect of flash within an image acquisition devices using face detection
US8330831B2 (en) 2003-08-05 2012-12-11 DigitalOptics Corporation Europe Limited Method of gathering visual meta data using a reference image
US8155397B2 (en) 2007-09-26 2012-04-10 DigitalOptics Corporation Europe Limited Face tracking in a camera processor
US7792970B2 (en) 2005-06-17 2010-09-07 Fotonation Vision Limited Method for establishing a paired connection between media devices
US7269292B2 (en) 2003-06-26 2007-09-11 Fotonation Vision Limited Digital image adjustable compression and resolution using face detection information
KR20050005074A (en) * 2003-07-01 2005-01-13 엘지전자 주식회사 Method for managing grahics data of high density optical disc, and high density optical disc therof
KR20050004339A (en) 2003-07-02 2005-01-12 엘지전자 주식회사 Method for managing grahics data of high density optical disc, and high density optical disc therof
EP2293250B1 (en) 2003-07-04 2012-05-09 Mitsubishi Electric Information Technology Centre Europe B.V. Method and apparatus for representing a group of images
US7379627B2 (en) * 2003-10-20 2008-05-27 Microsoft Corporation Integrated solution to digital image similarity searching
KR20050064150A (en) * 2003-12-23 2005-06-29 엘지전자 주식회사 Method for managing and reproducing a menu information of high density optical disc
US7558408B1 (en) 2004-01-22 2009-07-07 Fotonation Vision Limited Classification system for consumer digital images using workflow and user interface modules, and face detection and recognition
US7551755B1 (en) 2004-01-22 2009-06-23 Fotonation Vision Limited Classification and organization of consumer digital images using workflow, and face detection and recognition
US7564994B1 (en) 2004-01-22 2009-07-21 Fotonation Vision Limited Classification system for consumer digital images using automatic workflow and face detection and recognition
US7555148B1 (en) 2004-01-22 2009-06-30 Fotonation Vision Limited Classification system for consumer digital images using workflow, face detection, normalization, and face recognition
CA2554778C (en) * 2004-03-05 2010-12-21 Samsung Electronics Co., Ltd. System and method for handover to minimize service delay due to ping pong effect in bwa communication system
US7697785B2 (en) * 2004-03-31 2010-04-13 Fuji Xerox Co., Ltd. Generating a highly condensed visual summary
US7848567B2 (en) * 2004-09-23 2010-12-07 Fuji Xerox Co., Ltd. Determining regions of interest in synthetic images
US9405751B2 (en) 2005-08-23 2016-08-02 Ricoh Co., Ltd. Database for mixed media document system
US8156116B2 (en) 2006-07-31 2012-04-10 Ricoh Co., Ltd Dynamic presentation of targeted information in a mixed media reality recognition system
US10192279B1 (en) 2007-07-11 2019-01-29 Ricoh Co., Ltd. Indexed document modification sharing with mixed media reality
US7812986B2 (en) 2005-08-23 2010-10-12 Ricoh Co. Ltd. System and methods for use of voice mail and email in a mixed media environment
US9384619B2 (en) 2006-07-31 2016-07-05 Ricoh Co., Ltd. Searching media content for objects specified using identifiers
US9171202B2 (en) 2005-08-23 2015-10-27 Ricoh Co., Ltd. Data organization and access for mixed media document system
US7702673B2 (en) 2004-10-01 2010-04-20 Ricoh Co., Ltd. System and methods for creation and use of a mixed media environment
US8825682B2 (en) * 2006-07-31 2014-09-02 Ricoh Co., Ltd. Architecture for mixed media reality retrieval of locations and registration of images
US8856108B2 (en) 2006-07-31 2014-10-07 Ricoh Co., Ltd. Combining results of image retrieval processes
US8176054B2 (en) 2007-07-12 2012-05-08 Ricoh Co. Ltd Retrieving electronic documents by converting them to synthetic text
US8868555B2 (en) 2006-07-31 2014-10-21 Ricoh Co., Ltd. Computation of a recongnizability score (quality predictor) for image retrieval
US9530050B1 (en) 2007-07-11 2016-12-27 Ricoh Co., Ltd. Document annotation sharing
US8838591B2 (en) * 2005-08-23 2014-09-16 Ricoh Co., Ltd. Embedding hot spots in electronic documents
US9373029B2 (en) 2007-07-11 2016-06-21 Ricoh Co., Ltd. Invisible junction feature recognition for document security or annotation
US8949287B2 (en) 2005-08-23 2015-02-03 Ricoh Co., Ltd. Embedding hot spots in imaged documents
US8965145B2 (en) 2006-07-31 2015-02-24 Ricoh Co., Ltd. Mixed media reality recognition using multiple specialized indexes
US8320641B2 (en) * 2004-10-28 2012-11-27 DigitalOptics Corporation Europe Limited Method and apparatus for red-eye detection using preview or other reference images
US7715597B2 (en) 2004-12-29 2010-05-11 Fotonation Ireland Limited Method and component for image recognition
US7315631B1 (en) 2006-08-11 2008-01-01 Fotonation Vision Limited Real-time face tracking in a digital image acquisition device
US8503800B2 (en) 2007-03-05 2013-08-06 DigitalOptics Corporation Europe Limited Illumination detection using classifier chains
US9020326B2 (en) 2005-08-23 2015-04-28 At&T Intellectual Property Ii, L.P. System and method for content-based navigation of live and recorded TV and video programs
US9042703B2 (en) * 2005-10-31 2015-05-26 At&T Intellectual Property Ii, L.P. System and method for content-based navigation of live and recorded TV and video programs
US7904455B2 (en) * 2005-11-03 2011-03-08 Fuji Xerox Co., Ltd. Cascading cluster collages: visualization of image search results on small displays
US8078618B2 (en) 2006-01-30 2011-12-13 Eastman Kodak Company Automatic multimode system for organizing and retrieving content data files
DE602007012246D1 (en) 2006-06-12 2011-03-10 Tessera Tech Ireland Ltd PROGRESS IN EXTENDING THE AAM TECHNIQUES FROM GRAY CALENDAR TO COLOR PICTURES
US9176984B2 (en) 2006-07-31 2015-11-03 Ricoh Co., Ltd Mixed media reality retrieval of differentially-weighted links
US9063952B2 (en) 2006-07-31 2015-06-23 Ricoh Co., Ltd. Mixed media reality recognition with image tracking
US8201076B2 (en) 2006-07-31 2012-06-12 Ricoh Co., Ltd. Capturing symbolic information from documents upon printing
US9020966B2 (en) 2006-07-31 2015-04-28 Ricoh Co., Ltd. Client device for interacting with a mixed media reality recognition system
US8489987B2 (en) 2006-07-31 2013-07-16 Ricoh Co., Ltd. Monitoring and analyzing creation and usage of visual content using image and hotspot interaction
US7515740B2 (en) 2006-08-02 2009-04-07 Fotonation Vision Limited Face recognition with combined PCA-based datasets
US7916897B2 (en) 2006-08-11 2011-03-29 Tessera Technologies Ireland Limited Face tracking for controlling imaging parameters
US7403643B2 (en) 2006-08-11 2008-07-22 Fotonation Vision Limited Real-time face tracking in a digital image acquisition device
US8055067B2 (en) 2007-01-18 2011-11-08 DigitalOptics Corporation Europe Limited Color segmentation
ATE472140T1 (en) 2007-02-28 2010-07-15 Fotonation Vision Ltd SEPARATION OF DIRECTIONAL ILLUMINATION VARIABILITY IN STATISTICAL FACIAL MODELING BASED ON TEXTURE SPACE DECOMPOSITIONS
EP2123008A4 (en) 2007-03-05 2011-03-16 Tessera Tech Ireland Ltd Face categorization and annotation of a mobile phone contact list
US8649604B2 (en) 2007-03-05 2014-02-11 DigitalOptics Corporation Europe Limited Face searching and detection in a digital image acquisition device
US7916971B2 (en) 2007-05-24 2011-03-29 Tessera Technologies Ireland Limited Image processing method and apparatus
US7945576B2 (en) * 2007-05-29 2011-05-17 Microsoft Corporation Location recognition using informative feature vocabulary trees
US8111912B2 (en) * 2008-02-15 2012-02-07 Yahoo! Inc. Cost-effective image metadata creation using near-duplicate image detection
US7855737B2 (en) 2008-03-26 2010-12-21 Fotonation Ireland Limited Method of making a digital camera image of a scene including the camera user
US8131066B2 (en) * 2008-04-04 2012-03-06 Microsoft Corporation Image classification
US20090263014A1 (en) * 2008-04-17 2009-10-22 Yahoo! Inc. Content fingerprinting for video and/or image
JP5547730B2 (en) 2008-07-30 2014-07-16 デジタルオプティックス・コーポレイション・ヨーロッパ・リミテッド Automatic facial and skin beautification using face detection
US8422731B2 (en) * 2008-09-10 2013-04-16 Yahoo! Inc. System, method, and apparatus for video fingerprinting
WO2010063463A2 (en) 2008-12-05 2010-06-10 Fotonation Ireland Limited Face recognition using face tracker classifier data
WO2010136593A2 (en) * 2009-05-29 2010-12-02 Tessera Technologies Ireland Limited Methods and apparatuses for foreground, top-of-the-head separation from background
JP5337252B2 (en) * 2009-09-18 2013-11-06 株式会社東芝 Feature extraction device
US8379917B2 (en) 2009-10-02 2013-02-19 DigitalOptics Corporation Europe Limited Face recognition performance using additional image features
EP2323069A2 (en) * 2009-11-17 2011-05-18 Samsung Electronics Co., Ltd. Method, device and system for content based image categorization field
US11409825B2 (en) 2009-12-18 2022-08-09 Graphika Technologies, Inc. Methods and systems for identifying markers of coordinated activity in social media movements
US10324598B2 (en) 2009-12-18 2019-06-18 Graphika, Inc. System and method for a search engine content filter
US8971628B2 (en) 2010-07-26 2015-03-03 Fotonation Limited Face detection using division-generated haar-like features for illumination invariance
US8970770B2 (en) 2010-09-28 2015-03-03 Fotonation Limited Continuous autofocus based on face detection and tracking
US9552442B2 (en) * 2010-10-21 2017-01-24 International Business Machines Corporation Visual meme tracking for social media analysis
US8648959B2 (en) 2010-11-11 2014-02-11 DigitalOptics Corporation Europe Limited Rapid auto-focus using classifier chains, MEMS and/or multiple object focusing
US8659697B2 (en) 2010-11-11 2014-02-25 DigitalOptics Corporation Europe Limited Rapid auto-focus using classifier chains, MEMS and/or multiple object focusing
US8508652B2 (en) 2011-02-03 2013-08-13 DigitalOptics Corporation Europe Limited Autofocus method
US9058331B2 (en) 2011-07-27 2015-06-16 Ricoh Co., Ltd. Generating a conversation in a social network based on visual search results
US8897553B2 (en) 2011-12-13 2014-11-25 The Nielsen Company (Us), Llc Image comparison using color histograms
US8750613B2 (en) 2011-12-13 2014-06-10 The Nielsen Company (Us), Llc Detecting objects in images using color histograms
US8897554B2 (en) 2011-12-13 2014-11-25 The Nielsen Company (Us), Llc Video comparison using color histograms
KR20130085316A (en) * 2012-01-19 2013-07-29 한국전자통신연구원 Apparatus and method for acquisition of high quality face image with fixed and ptz camera
ES2530687B1 (en) * 2013-09-04 2016-08-19 Shot & Shop. S.L. Method implemented by computer for image recovery by content and computer program of the same
CN105141903B (en) * 2015-08-13 2018-06-19 中国科学院自动化研究所 A kind of method for carrying out target retrieval in video based on colouring information
CN105205171B (en) * 2015-10-14 2018-09-21 杭州中威电子股份有限公司 Image search method based on color characteristic
US11216505B2 (en) * 2019-09-05 2022-01-04 Adobe Inc. Multi-resolution color-based image search
US11887217B2 (en) 2020-10-26 2024-01-30 Adobe Inc. Text editing of digital images

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5652881A (en) * 1993-11-24 1997-07-29 Hitachi, Ltd. Still picture search/retrieval method carried out on the basis of color information and system for carrying out the same
US5689575A (en) * 1993-11-22 1997-11-18 Hitachi, Ltd. Method and apparatus for processing images of facial expressions
US5828779A (en) * 1995-05-05 1998-10-27 Siemens Aktiengesellschaft Method for constructing a color table in a computer unit for the classification of picture elements in an image
US5845009A (en) * 1997-03-21 1998-12-01 Autodesk, Inc. Object tracking system using statistical modeling and geometric relationship

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63276676A (en) 1986-09-26 1988-11-14 Olympus Optical Co Ltd Detecting system for interpicture corresponding area
JP2603946B2 (en) 1986-09-26 1997-04-23 オリンパス光学工業株式会社 Apparatus for detecting corresponding areas between images
JP2603947B2 (en) 1986-09-26 1997-04-23 オリンパス光学工業株式会社 Apparatus for detecting corresponding areas between primary color images
US4998286A (en) 1987-02-13 1991-03-05 Olympus Optical Co., Ltd. Correlation operational apparatus for multi-dimensional images
US5321470A (en) 1988-05-13 1994-06-14 Canon Kabushiki Kaisha Apparatus with anti-forgery provision
JPH03218581A (en) 1989-11-01 1991-09-26 Hitachi Ltd Picture segmentation method
US5420979A (en) * 1989-12-22 1995-05-30 Eastman Kodak Company Method and apparatus for using composite transforms to form intermediary image data metrics which achieve device/media compatibility for subsequent imaging applications
DE69127591T2 (en) 1990-06-22 1998-01-22 Canon Kk Device and method for processing images
US5208911A (en) * 1990-09-28 1993-05-04 Eastman Kodak Company Method and apparatus for storing and communicating a transform definition which includes sample values representing an input/output relation of an image transformation
US5432906A (en) * 1990-09-28 1995-07-11 Eastman Kodak Company Color image processing system for preparing a composite image transformation module for performing a plurality of selected image transformations
JPH0514683A (en) 1991-07-01 1993-01-22 Canon Inc Picture processing unit
US5481620A (en) 1991-09-27 1996-01-02 E. I. Du Pont De Nemours And Company Adaptive vision system
US5245589A (en) * 1992-03-20 1993-09-14 Abel Jonathan S Method and apparatus for processing signals to extract narrow bandwidth features
US5343538A (en) 1992-10-02 1994-08-30 International Remote Imaging Systems, Inc. Method and an apparatus for identifying an object using quantile partitions
JP3234064B2 (en) 1993-09-02 2001-12-04 キヤノン株式会社 Image retrieval method and apparatus
US5537488A (en) 1993-09-16 1996-07-16 Massachusetts Institute Of Technology Pattern recognition system with statistical classification
JP3026712B2 (en) 1993-12-09 2000-03-27 キヤノン株式会社 Image search method and apparatus
JPH07212585A (en) 1994-01-25 1995-08-11 Dainippon Screen Mfg Co Ltd Picture recording device
US5630037A (en) 1994-05-18 1997-05-13 Schindler Imaging, Inc. Method and apparatus for extracting and treating digital images for seamless compositing
US6043909A (en) * 1996-02-26 2000-03-28 Imagicolor Corporation System for distributing and controlling color reproduction at multiple sites
US5963203A (en) * 1997-07-03 1999-10-05 Obvious Technology, Inc. Interactive video icon with designated viewing position
US6181817B1 (en) * 1997-11-17 2001-01-30 Cornell Research Foundation, Inc. Method and system for comparing data objects using joint histograms
US6430312B1 (en) * 1997-12-29 2002-08-06 Cornell Research Foundation, Inc. Image subregion querying using color correlograms

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5689575A (en) * 1993-11-22 1997-11-18 Hitachi, Ltd. Method and apparatus for processing images of facial expressions
US5652881A (en) * 1993-11-24 1997-07-29 Hitachi, Ltd. Still picture search/retrieval method carried out on the basis of color information and system for carrying out the same
US5828779A (en) * 1995-05-05 1998-10-27 Siemens Aktiengesellschaft Method for constructing a color table in a computer unit for the classification of picture elements in an image
US5845009A (en) * 1997-03-21 1998-12-01 Autodesk, Inc. Object tracking system using statistical modeling and geometric relationship

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DATABASE INSPEC 1 January 1900 (1900-01-01), JONG-WAN KIM, CHEOL-HI LEE, HEE-YEUNG HWANG: "Color Image Segmentation by Detecting Three Dimensional Clusters of RGB Components", XP002917573, Database accession no. 94:4603406 *
LUPATINI G, SARACENO C, LEONARDI R: "SCENE BREAK DETECTION: A COMPARISON", INTERNATIONAL WORKSHOP ON RESEARCH ISSUES IN DATA ENGINEERING.CONTINUOUS-MEDIA DATABASES AND APPLICATIONS, XX, XX, 1 February 1998 (1998-02-01), XX, pages 34 - 41, XP002917572, DOI: 10.1109/RIDE.1998.658276 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6941323B1 (en) 1999-08-09 2005-09-06 Almen Laboratories, Inc. System and method for image comparison and retrieval by enhancing, defining, and parameterizing objects in images
US8775451B2 (en) 1999-08-09 2014-07-08 Almen Laboratories, Inc. Object based image retrieval
WO2001011489A2 (en) * 1999-08-09 2001-02-15 Almen Laboratories, Inc. Retrieving images by defining an image content comprising objects
WO2001011489A3 (en) * 1999-08-09 2004-04-22 Almen Lab Inc Retrieving images by defining an image content comprising objects
US6584465B1 (en) 2000-02-25 2003-06-24 Eastman Kodak Company Method and system for search and retrieval of similar patterns
EP1225546A3 (en) * 2001-01-18 2003-10-29 Lg Electronics Inc. Method for setting dominant color using spatial coherency
EP1530160A1 (en) * 2001-01-18 2005-05-11 Lg Electronics Inc. Method for setting dominant color using spatial coherency
US7006687B2 (en) 2001-01-18 2006-02-28 Lg Electronics Inc. Method for setting dominant color using spatial coherency
US7079683B2 (en) 2001-01-18 2006-07-18 Lg Electronics Inc. Method for setting dominant color using spatial coherency
US7321684B2 (en) 2001-01-18 2008-01-22 Lg Electronics Inc. Method for setting dominant color using spatial coherency
EP1225546A2 (en) * 2001-01-18 2002-07-24 Lg Electronics Inc. Method for setting dominant color using spatial coherency
EP2165525A1 (en) * 2007-06-04 2010-03-24 Enswers Co., Ltd. Method of processing moving picture and apparatus thereof
EP2165525A4 (en) * 2007-06-04 2013-09-11 Enswers Co Ltd Method of processing moving picture and apparatus thereof

Also Published As

Publication number Publication date
US6246790B1 (en) 2001-06-12
AU2207599A (en) 1999-07-19
US6430312B1 (en) 2002-08-06

Similar Documents

Publication Publication Date Title
US6430312B1 (en) Image subregion querying using color correlograms
JP3568117B2 (en) Method and system for video image segmentation, classification, and summarization
US6819797B1 (en) Method and apparatus for classifying and querying temporal and spatial information in video
US6965645B2 (en) Content-based characterization of video frame sequences
US7643686B2 (en) Multi-tiered image clustering by event
US7016916B1 (en) Method of searching multimedia data
US7877414B2 (en) Method and apparatus for representing and searching for an object using shape
JP4973188B2 (en) Video classification device, video classification program, video search device, and video search program
JP5711387B2 (en) Method and apparatus for comparing pictures
US20040175058A1 (en) System and method for adaptive video fast forward using scene generative models
US8249353B2 (en) Method for finding representative vectors in a class of vector spaces
US20030110163A1 (en) System and method for efficiently finding near-similar images in massive databases
JP2002125178A (en) Media segmentation system and related method
EP1494136A1 (en) Method and device for measuring visual similarity of images
WO2010119410A1 (en) Key frames extraction for video content analysis
Xiong et al. Automatic video data structuring through shot partitioning and key-frame computing
CN111581423B (en) Target retrieval method and device
Chaira et al. Fuzzy measures for color image retrieval
Sabbar et al. Video summarization using shot segmentation and local motion estimation
Hanjalic et al. Template-based detection of anchorperson shots in news programs
US6882746B1 (en) Normalized bitmap representation of visual object&#39;s shape for search/query/filtering applications
JP2002513487A (en) Algorithms and systems for video search based on object-oriented content
EP2465056B1 (en) Method, system and controller for searching a database
Zhang et al. Shot boundary detection based on block-wise principal component analysis
Chua et al. Color-based pseudo object model for image retrieval with relevance feedback

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
NENP Non-entry into the national phase

Ref country code: KR

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase