Publication number | USRE42367 E1 |
Publication type | Grant |
Application number | US 12/188,654 |
Publication date | May 17, 2011 |
Filing date | Aug 8, 2008 |
Priority date | Oct 31, 2001 |
Fee status | Paid |
Also published as | US7088863, US20030190075 |
Publication number | 12188654, 188654, US RE42367 E1, US RE42367E1, US-E1-RE42367, USRE42367 E1, USRE42367E1 |
Inventors | Amir Averbuch, Ofer Miller |
Original Assignee | Ponset Mgmt. Limited Liability Company |
Export Citation | BiBTeX, EndNote, RefMan |
Patent Citations (6), Non-Patent Citations (2), Referenced by (2), Classifications (11), Legal Events (4) | |
External Links: USPTO, USPTO Assignment, Espacenet | |
Change detection between a pair of digital images is used for automatic detection of new blobs appearing in a given scene. A blob is a connected component that has visible local saliency. An object includes one or more blobs. Therefore, in the context of the present invention blob extraction includes extraction of parts or the entire object. In other words, the term object extraction is also covered by the term “blob extraction” as used herein.
The problem of change detection that is addressed herein is defined as follows: a pair of registered gray-level images of the same scene, with different illumination, where each image may contain blobs that are not contained in the other, is given as input. The set of all blobs that do not exist in both images is defined as the change. Prior knowledge about the image, its statistics, and the changes is not given. A typical example is demonstrated in
It is important to emphasize that since non-constant (i.e. different) illumination is assumed between images, methods that are based on image subtraction will yield many false alarms. More sophisticated methods that are based on gray-level surface approximation may fail to detect changes that are caused because of small blobs, since surface approximations tend to smooth the boundaries of the blobs and reduce the ability to detect small blobs
1. Prior Art Methods
Most of the prior art dealing with the problem of identification of change detection does not address the situation of existing non-constant (different) illumination between two images. Many of the prior art works focus on situations of change detection in video. Usually, the illumination in the two images is assumed to be identical. When the change detection in moving video is motivated by compression, the goal is to detect areas of change, and not blobs of change. An area of a change can include parts of a complete blob.
The known techniques for change detection can be classified into the following categories:
1.1 | Pixel-level change detection. |
1.2 | Surface modeling. |
1.3 | Comparison among derivative images. |
1.4 | Contrast invariant representation. |
1.5 | Region based comparison of first or higher order statistics. |
All these methods are region based, in contrast with the method of the present invention, which is blob based. The regions are independent of the image content. It is important to note that none of the reviewed prior art methods use segmentation, since exact image segmentation of noisy scenes is very complicated, and it is still considered a difficult problem. Many of the reviewed prior art methods use polynomial approximation on the image surface. This is another drawback, since this approximation smoothes the image, which leads to less accurate results than a blob-based approach.
1.1. Pixel-Level Change Detection
Change between two images or frames can be detected by comparing the differences in intensity values of corresponding pixels in the two frames. An algorithm counts the number of the changed pixels, and a camera break is declared if the percentage of the total number of pixels changed exceeds a certain threshold [R. Kasturi and R. Jain, “Dynamic Vision”, Computer Vision: Principles, Eds. R. Kasturi, R. Jain, IEEE Computer Society Press, Washington, pp. 469-480, 1991 (hereinafter KAS91); A. Nagasaka, and Y. Tanaka, “Automatic Video Indexing and Full-Video Search for Blob Appearances”, Visual Database Systems, II, Eds. E. Knuth, and L. M. Wegner, Elsevier Science Publishers B. V., IFIP, pp. 113-127, 1992 (hereinafter NAG92); H. J. Zhang, A. Kankanhalli, and S. W. Smoliar, “Automatic Partitioning of Full-Motion Video”, ACM/Springer Multi-media Systems, Vol. 1, No. 1, pp. 10-28, 1993 (hereinafter ZHA93)].
Mathematically, the difference in pixels and the threshold calculation can be represented by Eqs. 1 and 2.
In Eq. 1, F_{i}(x,y) is the intensity value of the pixel in frame i at the coordinates (x,y). If the difference between the corresponding pixels in the two consecutive frames is above a certain minimum intensity value, then DP_{i}(x,y), the difference picture, is set to one. In Eq. 2, the percentage difference between the pixels in the two frames is calculated by summing the difference picture and dividing by the total number of pixels in a frame. If this percentage is above a certain threshold T, a camera break is declared.
Camera movement, e.g., pan or zoom, can have an effect of a large number of pixel changes, and hence a segment will be detected. Fast moving blobs also have the same effect. If the mean intensity values of the pixels and their connected pixels are compared [ZHA93], then the effects of the camera and blob motion are reduced.
1.2. Surface Modeling
Here the idea is to model the gray-level surface of a pair of images such that the surface of the errors between the images is negligible. Hsu Y. Z., Nagel H. H, and Rekers G., “New likelihood test methods for change detection in image sequences”, Computer Vision Graphics Image Processing, vol. 26, pp. 73-106, 1984 (hereinafter HSU84)] model the gray-level surface by patches of a second order bivariate polynomial in the pixel coordinates. Given two corresponding regions, R_{1}(x_{0},y_{0}) in the image I^{(1) }and R_{2}(x_{0},y_{0}) in I^{(2)}, they represent each region by a set of seven parameters—the six coefficients of the quadratic polynomial patch, and the sum of square differences between the polynomial patch and the gray-levels. Under the assumption that the approximating patch represents the gray-level surface up to uncorrected noise errors, a likelihood test to the two hypotheses is made:
H_{0}: R_{1}(x_{0},y_{0}) and R_{2}(x_{0},y_{0}) come from the same gray-value distribution.
H_{0}: R_{1}(x_{0},y_{0}) and R_{2}(x_{0},y_{0}) come from different gray-value distributions.
This method is not adequate to handle changes in illumination in the pair of images, as shown in Skifstad Kurt and Jain Ramesh, “Illumination Independent Change Detection for Real World Image Sequences”, Computer Vision, Graphics, and Image Processing, Vol. 46, pp. 387-399, 1989 (hereinafter SKI89).
1.3. Comparison among Derivative Images
These methods are based on the derivative images instead of working on the original gray-level images. A version of this concept is used by [SKI89]. They partition the image into regions, and each surface in each region is approximated by polynomials. Then, the derivatives of each patch are computed. If the images of the derivatives are denoted by I^{(D1) }and I^{(D2)}, then a threshold is used in order to create a binary image from the image of differences, I^{(D1)−I(D2)}. Areas of change are supposed to be white regions in this binary image. This method is inadequate for noisy inputs.
1.4. Contrast—Invariant Representation
Another method that can be used to perform change detection is described in P. Monasse, F. Guichard, “Fast Computation of a Contrast-Invariant Image Representation”, IEEE Trans. on Image Processing, Vol. 9, No. 5, 860-872, 2000 (hereinafter MON00). This paper sets out a new representation of an image, which is contrast-independent. The image is decomposed into a tree of “shapes” based on connected components of level sets, which provides a full and un-redundant representation of the image. This method, which is based on invariance under change of contrast, can be used to perform some kind of change detection between images that have different illumination. However, the formulation of the solution using a level set method cannot handle efficiently many blobs at the same time.
1.5. Region Based Comparison of First or Higher Order Statistics
The input images are divided into regions, usually squares of m×m pixels. Denote by R_{1}(x_{0},y_{0}) the square in the image I^{(1) }that its center is the pixel with coordinates (x_{0},y_{0}), and similarly denote by R_{2}(x_{0},y_{0}) the corresponding square in the image I^{(2)}. The gray-levels in the region R_{1}(x_{0},y_{0}) are normalized such that the mean gray-level and the variance of the gray-levels of R_{1}(x_{0},y_{0}) are the same as the mean and variance gray-level of R_{2}(x_{0},y_{0}). Then, the image I^{(2) }is compare to the image I^{(1)}. The normalization process of this statistical method is supposed to be a variation of illumination correction.
The shading model method was suggested by SKI89. Each gray-level is basically the product of two components: (1) the amount of source light incident on the scene and (2) the amount of light reflected by the blobs in the scene. The amount of source light incident on a small region of the scene is approximately uniform, but the reflected light of two adjacent blobs may be different. Denote by i(x_{0},y_{0}) the amount of source light incident on point (x_{0},y_{0}) in the scene, and by r(x_{0},y_{0}) the amount of reflected light from the point (x_{0},y_{0}) in the scene. Let I^{(1) }and I^{(2) }be two images with corresponding functions, i_{1}(x,y), i_{2}(x,y), r_{1}(x,y) and r_{2}(x,y). If at pixel (x_{0},y_{0}) both images contain the same blob, then the following is satisfied:
since the amount of reflected light from point (x_{0},y_{0}) depends on the blob itself. Let F be the image of real numbers that is the result of the division of the two images, I^{(1) }and I^{(2)}, that is:
where F is assumed to have accuracy of real numbers. Let R_{F}(x_{0},y_{0}) be a small neighborhood around the point (x_{0},y_{0}) in the image F. Then, for a point (x0,y0) that belongs to the same blob in both images, I^{(1) }and I^{(2)}, the surface patch that is composed of the values in the region R_{F}(x_{0},y_{0}) is expected to be a smooth and slow varying surface, since the change of the illumination in a small region is slow-varying. On the other hand, for a pixel (x_{0},y_{0}) that belongs to a different blob in each image, the surface patch that is composed of the values in R_{F}(x_{0},y_{0}) is expected to be much less smooth, since the region R_{F}(x_{0},y_{0}) can include a transition from one blob to another. The method in [SKI89] proposes to examine the variance in each pixel (x_{0},y_{0}) of the region R_{F}(x_{0},y_{0}). If the variance is higher than some pre-specified threshold, then the pixel is considered as belonging to a region of change. The change detection mask of this method is defined for each pixel by the following formula:
where μ_{i }is the average value of the ratio of intensities, E is the expectation, N is the size of the image, and “Λ” is a 5×5 region. Among all the other reviewed methods, this method, and the statistical method that will be introduced in the next paragraph, are the only ones that address directly the problem of change in illumination. This method is based on the assumption that the division of the images cancels the difference in the illumination between the two images, which does not always hold in practice. Moreover, the variance inside a region R_{F}(x_{0},y_{0}), whose size is not based on the image content, adds inaccuracies of its own.
Another method of statistical model-based change detection was proposed by Til Aach, Andre Kaup and Rudolf Mester, “Statistical model-based change detection in moving video”, Signal Processing, Vol. 31, pp. 165-180, 1993 (hereinafter AKM93). Given two successive frames I_{(k) }and I_{(k+1)}, let
d_{k}=(x, y)=I_{k+1}(x, y)+I_{k}(x, y) (6)
denote the image of gray level differences between frames I_{(k) }and I_{(k+1)}. Under the
hypothesis than no changes occurred at position (x,y) (the null hypothesis H_{0}), the corresponding difference d_{k}(x,y) follows a zero-mean Gaussian distribution
where the noise variance σ^{2 }is equal to twice the variance of the camera noise, assuming that the camera noise is white. Rather than performing the significance test on the values d_{k}(x,y), it is better to evaluate a local sum of normalized differences:
where w(x,y) is a window of observation centered at (x,y). Under the assumption that no changes occur within the windows, the normalized differences d_{k}/σ obey a Gaussian distribution N(0,1) and are spatially uncorrelated. Thus, the local sum Δ_{k}(x,y) follows a x^{2 }distribution with N degrees of freedom, N being the number of pixels within the windows w(x,y). With the distribution p(Δ_{k}(x,y)) known, a decision rule for each pixel can obtained by a significance test on Δ_{k}(x,y). For a specific level a one can compute a corresponding threshold T_{α }using:
α=P_{r}{Δ_{k}(x, y)>T_{α|H} _{0}} (9)
The significance level α is in fact the false alarm rate associate with the statistical test. The higher the value of α, the more likely is the classification of unchanged pixels as change. It is obvious that the significance test depends on the noise variance α^{2}. Thus, an accurate estimate of the noise variance is crucial for performance of the test. To ensure that, the variance is estimated only within the background region of the current frame, to remove the influence change region. The background regions are determined according to the tracked mask of the previous frame. One of the problems of this concept is the initial step when the background regions are not yet known: it requires a heuristics method that is strongly based on a threshold for estimating the background region.
The likelihood ratio approach is suggested based on the assumption of uniform second-order statistics over a region [KAS91; N. H. Nagel, “Formulation of a blob concept by analysis of systematic time variation in the optically perceptible environment”, Computer Graphics and Image Processing, Vol. 7. pp. 149-194, 1978 (hereinafter NAG78); ZHA93]. The frames can be subdivided into blocks, and then the blocks are compared on the basis of the statistical characteristics of their intensity levels. Eq. (10) represents the formula that calculates the likelihood function. Let μ_{i }and μ_{i+1 }be the mean intensity values for a given region in two consecutive frames, and σ_{i }and σ_{i+1 }be the corresponding variances. The number of the blocks that exceed a certain threshold t are counted. If the number of blocks exceeds a certain value (dependent on the number of blocks), a segment is declared. A subset of the blocks can be used to detect the difference between the images so as to expedite the process of block matching.
a camera break is declared. This approach increases the tolerance against noise associated with camera and blob movement. It is possible that even though the two corresponding blocks are different, they can have the same density function. In such cases no change is detected.
Another method based on statistical computations was suggested by Sze-Chu Liu, Chang-Wu Fu, and Shyang Chang, “Statistical change detection with moments under time-varying illumination”, IEEE Transactions On Image Processing, Vol. 7, No. 9, September 1998 (hereinafter SZE98). The proposed method consists of two parts. First, based on the defined circular shift moments, this method tries to distinguish the structural changes from those caused by illumination in a noise-free case, which is mentioned by [SKI89]. Moreover, the amount of computation in calculating higher-order circular shift moments can be reduced via a set of iterative formulae. Therefore, the time required for the computation is less than that of the shading model [SKI89]. Second, in accordance with the characteristics of the defined moments, SZE98 also propose a statistical decision rule to cope with the effects of noise. The change detection problem can be treated as one of hypothesis testing. Critical values are determined according to the desired level of significance. This does not perform change detection well, and there are many “false alarms”.
There is thus a widely recognized need for, and it would be highly advantageous to have, a fast and robust illumination-insensitive method for extracting blobs that appear in only one of a pair of images, a method that has a low time complexity, and is exact, robust and fast.
According to the present invention there is provided a method for illumination-independent change detection between a pair of registered images, comprising: a) providing a first original gray-level image, a second original gray-level image, a first negative image related to the first original image and a second negative images related to the second original image, b) extracting respective pluralities of blobs from each of the first and second original images and each of the first and second negative images, c) matching each extracted blob in the first original and negative images with each extracted blob in the second original and negative images to obtain matched and unmatched blobs, and d) testing all the unmatched blobs to identify blobs of change, whereby the blobs of change indicate a change detected by a method that is exact, fast, robust, illumination-insensitive and has low time-complexity.
According to the present invention there is provided a method for change detection in images, comprising: a) providing a pair of first and second registered gray level images, b) extracting respective first and second pluralities of blobs from each of the images using a modified connectivity along gray levels (CAG) analysis, c) locating at least one unmatched blob in at least one of the images, and d) identifying at least one blob of change related to the images by applying a test on the at least one unmatched blob.
The invention is herein described, by way of example only, with reference to the accompanying drawings, wherein:
The present invention is of a method to extract blobs that appear in only one of two images of any registered pair of images. Specifically, the present invention can be used for illumination-independent change detection in a pair of gray images based on connectivity analysis along gray-levels. The principles and operation of a method to extract blobs that appear in only one of two images according to the present invention may be better understood with reference to the drawings and the accompanying description.
The first step (the blob extraction step) of the present invention is preferably based on algorithm for blobs extraction based on connectivity analysis along gray-levels (CAG) as shown in Pikaz Arie, “Connectivity Analysis in Digital Gray-Levels Images and Its Applications”, Ph.D. thesis, Tel-Aviv University, Israel, February 1998 (hereinafter PIK98). The original CAG algorithm is preferably used herein as a starting point for detecting, with an appropriate set of parameters, all visually conspicuous blobs. It is worthwhile pointing out that the CAG algorithm does not deal with change detection. Thus, all the steps of the present method beyond the blob extraction step are novel.
The change detection method of the present invention works in general as follows: Let I^{(1) }and I^{(2) }be the input gray-level images. Denote by {C_{i} ^{(1)}}_{i=1} ^{N} ^{ 1 }. the set of N_{1 }blobs that where detected and extracted by the blobs extracting algorithm in image I^{(1)}, and similarly by {C_{j} ^{(2)}}_{j=1} ^{N} ^{ 2 }the set of N_{2 }blobs that were detected and extracted in I^{(2)}. The blobs that define the change belong to either {C_{i} ^{(1)}}_{i=1} ^{N} ^{ 1 }or {C_{j} ^{(2)}}_{j=1} ^{N} ^{ 2 }. The original problem of change detection is reduced to the problem of detecting blobs in the set {C_{j} ^{(2)}}_{j=1} ^{N} ^{ 2 }that are not included in the set {C_{i} ^{(1)}}_{i=1} ^{N} ^{ 1 }and vice versa. A perfect match between corresponding blobs is not expected. More than that, several blobs from one image may be connected or united into a big blob in the other image (thus, even if no change occurred, the values N_{1 }and N_{2 }might be different). For each blob in {C_{i} ^{(1)}}_{i=1} ^{N} ^{ 1 }the change detection method searches for a corresponding blob in {C_{j} ^{(2)}}_{j=1} ^{N} ^{ 2 }, and vice versa. Recall that the pair of images is registered, thus the coordinates of the corresponding blobs should be the same in both images. If sufficient number of pixels of both blobs has the same coordinates (“sufficient” is determined by a pre-defined parameter), the examined pair of blobs is considered as a match. If not, then the blob from one image is “marked” in the second image. The image gradients along its boundary are examined. A measure of saliency is defined according to the distribution of the magnitudes of these gradients. If this saliency measure is sufficiently high, then the blob is classified as existing in both images. Otherwise, it represents a change. The proposed change detection method is very efficient and robust, and it is adequate for real-time applications.
2. The Original CAG Blob Extraction Algorithm
Given a digital gray-level image I, I_{t }denotes a binary image that is the result of thresholding image I with a threshold t. In a binary image, a segment is defined as a set of black pixels in which there exists a 4-connected path of black pixels between each two pixels of the set. It is clear that a binary image can be represented by the set of all segments that it contains. A gray-level image I with G gray-levels can be represented by the set {I_{t}}_{t=0}, . . . , of G binary images.
Generally, a blob in a gray-level image is visually conspicuous if one or more of the following exist:
Let C_{i} ^{(t) }be the i^{th }connected-segment in the binary image I_{t}. For a value {tilde over (t)}>t there exists a (single) connected segment C_{j} ^{({tilde over (t)}) }in the binary image I_{{tilde over (t)} that contains Ci} ^{(t)}. If we consider t as a time parameter, connected segments spread out as the time advances. As t increases, connected segments are united into larger connected segments until finally, for t=G−1, the whole image becomes one connected segment.
For any given significant blob O there exists a sequence of segments (C^{(t)})_{t=t} _{ 0 } ^{G−1}, where C^{(t) }is a connected segment in I_{t}, that satisfies the following:
There is a value of t (or sub-interval of values) for which C^{(t) }represents best the significant blob. If t is too small, the corresponding segment C^{(t) }is only an inner part of the blob. If t is too large, the blob is an inner part of C^{(t)}. The spread-out of connected segments of I_{t }as t is increased is demonstrated in
As part of the blob extraction process, the threshold value t that corresponds to each significant blob has to be detected automatically. For this purpose, we define a weight function that is attached to each segment. The weight is a function of the threshold parameter t. It is denoted by w_{c}(t), where C is the relevant segment. The function w_{c}(t) is defined as follows: in the binary image I_{t }there exists at most a single segment C′ that satisfies C′∩C≠Ø. The value of w_{c}(t) is defined as the average value of the gradient magnitudes along the boundary of the segment C′.
We choose the gradient magnitude as the weight of each pixel. The weight of a blob is defined as the average weight of the pixels along the blob boundary. This weight is expected to be proportional to the blob saliency, defined hereinbelow. Let C^{(t) }be a segment that corresponds to a “real blob”. Its weight is expected to be the maximal weight among the weights of all the clusters that are not disjoint to C^{({tilde over (t)})}. Any connected cluster C^{(t) }satisfies one of the following: C^{(t)}∩C^{({tilde over (t)})}=Ø, or C^{(t)} ⊂C^{({tilde over (t)}) }and then t≦{tilde over (t)} or C^{({tilde over (t)})} ⊂C^{(t) }and then {tilde over (t)}≦t.
The weight function w_{c}(t) is expected to have local maxima at values of t that correspond to the binary image I_{t }that contains the significant blob. For illustration, the weight function that corresponds to blob 304 in
A weight is attached to each pixel in the input gray-level image I. The weight that is attached to a pixel is a measure of edge saliency. In particular, a pixel that resides on an edge gets assigned a higher weight than a non-edge pixel. A reasonable choice for the image of weights is the magnitudes of the gradients of I. There are many ways to compute an approximation of the gradient at a pixel. We preferably approximate the gradient vector at a pixel (u,v) according to
where ƒ(u,v) is the gray-level of the pixel at column u and row v in the image I. Denote the image of the magnitudes of the gradients by I^{(MAG)}. For a given pixel p _{i}=(u_{i}, v_{i}) let w(p _{i}) be the intensity value at column u_{i }and now v_{i }in the image I^{(MAG)}. The weight of a given segment C is defined by
where ∂C is the set of boundary pixels of the segment C and |∂C| is the size of this set. A pixel q is defined as a boundary pixel of the segment C if it belongs to C and at least one of its four nearest neighbors does not belong to C. The definition in Eq. 13 has the following convenient property. The weight of the union of a segment C with a segment that is composed of a single pixel can be computed in a constant number of operations. Let p be a pixel that is united with the segment C. Let C′ be the result of the union between the segment C and {p}. The weight of C′ satisfies
where s(C) is the number of pixels that are contained in segment C. It is clear that the set {q|qε∂C and qε∂C′} is composed only of pixels that are nearest-neighbors of the pixel p. Therefore, only a constant number of operations is required to compute
The same is true for w(C′).
If an examined blob O corresponds to a connected segment C in a binary image I_{{tilde over (t)}}, then its gray-levels must differ from the gray-levels of its local background. From the definition of the function w_{c}(t) we conclude that if a local maximum of w_{c}(t) exists at point {tilde over (t)}, then the blob O is salient related to its local background. Thus, the combination of connectivity and gradients along the boundary of the connected segments is a powerful measure of the significance of a given blob.
3. The Change Detection Algorithm
3.1 Modifications of the CAG Blob Extraction Algorithm
The change detection starts with a novel and much more efficient method and algorithm of the present invention disclose a much more efficient way to extract blobs than the original CAG algorithm in PIK98. The present invention significantly adds to, and substantially enhances the capabilities of the original CAG algorithm with the following features: 1) the CAG of the present invention selects the conspicuous blobs based on local considerations; 2) local considerations are added to the original algorithm so that the analysis of the image is more reliable; 3) the present algorithm is more robust, this being achieved by changing the formulae and computations of local weights. 4) the computation of connected components is based herein on lower and upper bounds, which increases the accuracy of the detected blobs. In the original CAG algorithm, this computation was based only on a lower bound, which could not guarantee the accuracy of the detected blobs. The use of lower and upper bounds yields an enhanced detection capability for very small blobs, on the order of ca. 30 pixels. As suggested in the original CAG algorithm, the lower bound is computed by increasing the value of the threshold t from 0. This yields a binary image according to the threshold t. Then a weight is computed according to Eqs. 11 and 12. As t increases, one gets a monotonically increasing function until t reaches the maximum. This is illustrated in
3.2—Steps of the Change Detection Method
Within the change detection method of the present invention, the modified blob extraction algorithm is first applied on two input images I_{1 }and I_{2}(herein “original” images), and on their negatives
A blob extraction step 22 is separately applied on each of the input images I_{1}, I_{2 }and their negatives
Next, in a first unification step 26, lists SOL_{1 }and
In a matching step 30, unique to the present invention, each blob O_{i} ^{(1) }in SOL^{1 }is trial matched with a blob O_{j} ^{(2) }in SOL^{2}. If blobs O_{i} ^{(1) }and O_{j} ^{(2) }overlap (have the same coordinates) by at least α % of pixels (where α ranges typically from 75% to 100%, and preferably between 90-100%, i.e. where preferably the lower bound of α is about 90%) then a logical check step 32 checks that blob O_{i} ^{(1) }also exists in SOL^{2}. A logical yes answer in checking step 32 leads to a “no change” step 34 that determines that this is not a blob of change. This leads back (loop) to step 30 where another blob is taken from the list. If the answer in step 32 is a logical no, the algorithm proceeds to another novel step, unique to the present invention: a fitness-measuring step 36, which computes a fitness measure fm_{o} _{ 1 } ^{(I} ^{ 1 } ^{,I} ^{ 2 } ^{) }of blob O_{i} ^{(1) }in SOL^{2}. The fitness measure is then compared to a pre-defined parameter (threshold) γ in a comparison step 38. If fm_{o} _{ 1 } ^{(I} ^{ 1 } ^{,I} ^{ 2 } ^{)}<γ, then blob O_{i} ^{(1) }is determined as existing in both images (yes), i.e. this is not a blob of change, and the algorithm proceeds back to step 30 to process another blob. If fm_{o} _{ 1 } ^{(i} ^{ t } ^{I} ^{ 2 } ^{)}>γ (no) then the blob is declared in a step 40 as a blob that exists in one list and not in the other. Therefore, this is a blob of change, and the algorithm returns to process another blob in step 30.
The parameter γ is preferably between 0 and 1, and most preferably about 0.6.
3.3. Determination of Match Between Corresponding Blobs
In the following there is a detailed description of a preferred embodiment of a process of matching blobs in SOL^{1 }with blobs in SOL^{2}. The same procedure is valid for the match among blobs in SOL^{2 }and blobs in SOL^{1}. Let O_{i} ^{(1) }be a blob from SOL^{1}. The matching procedure is composed of two phases.
We look for a matched blob O_{j} ^{(2) }in SOL^{2}. In an ideal situation, each pixel of O_{j} ^{(2) }has a corresponding pixel in O^{i} ^{ (1) }with identical coordinates, but usually this is not the case. In our implementation we claim that the two blobs O^{i} ^{ (2) }and O_{i} ^{(1) }match if the coordinates of at least α % of the pixels from both blobs are identical. In real experiments, a is typically chosen to be 90.
It is possible that the blob O_{i} ^{(1) }has no matching blob in SOL^{2 }but it has a corresponding blob in the image I^{(2)}. An example for such a case is shown in
Next, we deal with all the blobs that remain unmatched after the first step. If a blob in one image exists also in the other image, then its saliency measure must be sufficiently high also in the second image. The saliency measure of a blob is a function of the magnitudes of the gradients of its boundary pixels. In the original CAG algorithm, the saliency measure of a blob O with boundary ∂O was defined as:
where |∂O| is the number of boundary pixels, ∇I(x,y) is the gradient vector of the image I at pixel (x,y) and |∇I(x,y)| is the magnitude of the gradient. The saliency measure of a blob O_{i} ^{(1) }in an image I^{(2) }is computed according to the gradient values of the pixels (x,y)ε∂O_{i} ^{(1) }in image I^{(2)}. In the example of
For each blob O_{i} ^{(1) }of SOL^{1 }that remains unmatched after the first step, we perform the following: assume, without loss of generality, that O_{i} ^{(1) }is contained in SOL^{1}. Compute the values sal(O_{i} ^{(1)};I^{(1)}) and sal(O_{i} ^{(1)};I^{(2)}). If the ratio
is sufficiently high (above some pre-defined threshold), the blob O_{i} ^{(1) }is declared as an “blob of change”. Otherwise, analysis of a second order statistic is required.
3.4 The use of Gradient Distribution Vector for the Matching Process
Assume |∂O|=n. Let {i_{1 }. . . , i_{m}} be the indices of the pixels in ∂O with gradient magnitude greater than the sal(O;I). Then, the indices of the m pixels, i_{1}, . . . , i_{m}, should be uniformly distributed.
Let:
It is clear that for each k,
Denote by E the entropy of p_{1}, . . . , P_{m}, that is:
The maximal entropy is achieved when p_{k }are uniformly distributed: p_{k}=1/m. Denote by E_{m }this maximal entropy value, that is:
When there are more p_{k }s that are uniformly distributed we get higher value for the gradients distribution and it is denoted by:
m/n>γ where γε[0,1]. We preferably choose γ to be 0.6. Eq. 15 enables to determine whether the blob is a change when the sm_{o} ^{(I} ^{ 1, } ^{I} ^{ 2 } ^{) }function yields a value that is less than β, hence the “distribution-measure” of the gradients is defined as:
where the ratio in dm_{o} _{ 1 } ^{(I} ^{ 1 } ^{I} ^{ 2 } ^{) }(Eq. 17) along the gradients boundary pixels is considered only if sm_{o} _{ 1 } ^{(I} ^{ 1 } ^{,I} ^{ 2 } ^{) }(Eq. 15) is less than β and βε[0,1]. We preferably choose β to be 0.6.
3.5 The Fitness Measure
The fitness measure, denoted by fm, constitutes the foundation for the second phase of the matching phase, and contains the functions in Eqns. 15 and 17 above:
fm_{o} _{ 1 } ^{(I} ^{ 1 } ^{,I} ^{ 2 } ^{)}=sm_{o} _{ 1 } ^{(I} ^{ 1 } ^{,I} ^{ 2 } ^{)}*dm_{o} _{ 1 } ^{(I} ^{ 1 } ^{,I} ^{ 2 } ^{) } (18)
fm enables to decide whether blobs exist in both images or only in one of them. According to the definition of (Eq. 17), when the value of sm (Eq. 15) is higher than βfm will be based only on the ratio of the sm_{o} ^{(I} ^{ 1 } ^{,I} ^{ 2 } ^{) }functions. Otherwise, the multiplication of dm (Eq. 17) with sm (Eq. 15) intends to offset the outcome of sm_{o} ^{(I} ^{ 1 } ^{,I} ^{ 2 } ^{) }when two blobs are attached to each other, and thus prevent a misdetection of the attached blob. This happens only when the value of sm_{o} ^{(I} ^{ 1 } ^{,I} ^{ 2 } ^{) }is low. Therefore, the fitness measure is defined as fm_{o} _{ 1 } ^{(I} ^{ 1 } ^{,I} ^{ 2 } ^{) }which verifies whether the blob O(i) in I^{(1) }also exists in I^{(2)}:
4. Implementation of the Change Detection Algorithm
4.1 The Pseudo-code
Input: Let I^{(1) }and I^{(2) }be the inputs pair of registered gray-levels images. Let
Output: SOL^{(out) }are the list of the output blobs, that is, the final list of “blobs of change”. SOL^{(out) }is initialized to be an empty list.
Process:
The worst-case complexity of this pass is linear in the number of pixels of the blob.
4.2. Complexity Analysis
The overall time complexity of the algorithm is almost linear in the image size, n. Specifically, it is O(n·α(n,n)), where α(n,n) is the inverse of the Ackermman function [COR90, chapter 22.4], which is almost a constant. Following is the time complexity analysis of each of the three steps listed above in 5.1. The numbers correspond to the step numbers that appeared in the pseudo-code description.
The construction of each of the four SOL lists takes O(n·α(n,n)) operations in the worst-case, where α(n,n) is the inverse of the Ackermman function. Therefore, the worst-case time complexity for the creation of SOL^{1 }and SOL^{2 }is O(n·α(n,n)).
1. In the following we compute the complexity analysis of steps 30 and 32 in
In this section we present four examples, and analyze the performance of the change detection algorithm. The first example, in section 5.1, presents the complete process of the change detection algorithm. In section 5.2 we demonstrate the robustness of the algorithm, and its insensitivity to change in the illumination. In section 5.3, we focus on step 36,
5.1 Example I
Blobs in SOL^{1 }that have no corresponding blobs in SOL^{2 }(e.g. blob 1002) with α % (α=90) of overlap (the first step in the matching phase) are passed to step 36 in
In view of the test above (step 36), the blobs that correspond to the two vehicles in I^{(1) }(802 and 804 in
5.2. Example II
This example demonstrates the robustness and the insensitivity of the algorithm of the present invention to change in illumination. Two generated images I^{(1) }(a) and I^{(2) }(b) having extreme differences between their illuminations are shown in
Step 24 (
In matching step 30, each blob in SOL^{1 }is searched for a corresponding blob in SOL^{2}, and vice versa. Seven blobs that correspond to the letters “DISK 2/2”, and the blob for capital letter “A” in SOL^{2 }(
5.3. Example III
In this example we focus on the “fitness measure” as described above.
5.4 Example IV: Comparison with the “Shading Model” Method
This example demonstrates the difference between the method of the present invention and the prior art “Shading Model” described above.
In summary, the present invention introduces an efficient and robust method that provides a novel algorithm for performing change detection between a pair of registered gray-level images, under severe differences in the illumination of the images. The output of the algorithm is a set of connected components, where each component describes a “blob of change”, which is a blob that exists in only one of the two images.
The time complexity of the change detection algorithm is almost linear in the image size. Therefore, it is suitable for real-time applications. The examples detailed above demonstrate its robustness even when extreme changes in the illumination exist. The main advantages of the method disclosed herein include:
All publications, patents and patent applications mentioned in this specification are herein incorporated in their entirety by reference into the specification, to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated herein by reference. In addition, citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present invention.
While the invention has been described with respect to a limited number of embodiments, it will be appreciated that many variations, modifications and other applications of the invention may be made.
Cited Patent | Filing date | Publication date | Applicant | Title |
---|---|---|---|---|
US5627907 * | Dec 1, 1994 | May 6, 1997 | University Of Pittsburgh | Computerized detection of masses and microcalcifications in digital mammograms |
US6028970 | Oct 14, 1997 | Feb 22, 2000 | At&T Corp | Method and apparatus for enhancing optical character recognition |
US6691126 | Jun 14, 2000 | Feb 10, 2004 | International Business Machines Corporation | Method and apparatus for locating multi-region objects in an image or video database |
US6701007 * | Mar 10, 2000 | Mar 2, 2004 | Fuji Photo Film Co., Ltd. | Image processing device, image processing method, and recording medium for correcting color balance of output images |
US6751345 * | Feb 12, 2001 | Jun 15, 2004 | Koninklijke Philips Electronics N.V. | Method and apparatus for improving object boundaries extracted from stereoscopic images |
US6829371 | Nov 30, 2000 | Dec 7, 2004 | Cognex Corporation | Auto-setup of a video safety curtain system |
Reference | ||
---|---|---|
1 | * | Kurt et al. "Illumination independent change detection for real world image sequences", Computer Vision, Graphics, and Image Processing, vol. 46, pp. 387-399, 1989. |
2 | * | Teasdale et al. "Functional MRI Study of the Cognitive Generation of Affect", American Journal of Psychiatry (GPSI), v156 n2, p. 209-215, Feb. 7, 1999. |
Citing Patent | Filing date | Publication date | Applicant | Title |
---|---|---|---|---|
US8643363 * | Nov 27, 2009 | Feb 4, 2014 | Syntheticmr Ab | Method of visualizing segmented MR images with absolute-scale values independent of MR scanner settings |
US20100127704 * | Nov 27, 2009 | May 27, 2010 | Marcel Warntjes | Method of Visualizing Segmented MR Images |
U.S. Classification | 382/209, 382/217 |
International Classification | G06K9/62, G06T5/00, G06T7/20 |
Cooperative Classification | G06T2207/10152, G06T2207/10016, G06K9/38, G06T7/2053 |
European Classification | G06K9/38, G06T7/20D |
Date | Code | Event | Description |
---|---|---|---|
Jan 29, 2009 | AS | Assignment | Owner name: PONSET MGMT. LIMITED LIABILITY COMPANY, DELAWARE Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INFOWRAP SYSTEMS, LTD.;REEL/FRAME:022187/0936 Effective date: 20080526 |
Aug 30, 2011 | CC | Certificate of correction | |
Jan 28, 2014 | FPAY | Fee payment | Year of fee payment: 8 |
Dec 11, 2015 | AS | Assignment | Owner name: INTELLECTUAL VENTURES FUND 83 LLC, NEVADA Free format text: MERGER;ASSIGNOR:PONSET MGMT. LIMITED LIABILITY COMPANY;REEL/FRAME:037274/0785 Effective date: 20150826 |