CA2124712C - Block transform coder for arbitrarily shaped image segments - Google Patents

Block transform coder for arbitrarily shaped image segments

Info

Publication number
CA2124712C
CA2124712C CA002124712A CA2124712A CA2124712C CA 2124712 C CA2124712 C CA 2124712C CA 002124712 A CA002124712 A CA 002124712A CA 2124712 A CA2124712 A CA 2124712A CA 2124712 C CA2124712 C CA 2124712C
Authority
CA
Canada
Prior art keywords
tcs
transform
pel
transform coefficients
region block
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CA002124712A
Other languages
French (fr)
Other versions
CA2124712A1 (en
Inventor
Homer H. Chen
Mehmet Reha Civanlar
Barin Geoffry Haskell
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
American Telephone and Telegraph Co Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by American Telephone and Telegraph Co Inc filed Critical American Telephone and Telegraph Co Inc
Publication of CA2124712A1 publication Critical patent/CA2124712A1/en
Application granted granted Critical
Publication of CA2124712C publication Critical patent/CA2124712C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/48Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/115Selection of the code volume for a coding unit prior to coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/154Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/189Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • H04N19/192Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/20Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding

Abstract

The present invention is directed at a Transform Coder Unit (TCU) to transform an arbitrarily shaped image into optimal transform coefficients (OTC) for data transmission. The TCU comprises a forward transform which transforms the image to transform coefficients, and a TCS
generator which generates a transform coefficient set (TCS) from the transform coefficients. The TCU also contains an inverse transform which transforms the TCS to a computed region block having computed pel values.
Finally, the TCU comprises a replacer which replaces those computed pel values corresponding to the interior pel set with the original pel values to form a modified computed region block which is re-iterated until optimal transform coefficients are determined.
The present invention is also directed at a process for determining optimal transform coefficients using the aforementioned device.

Description

' 212~712 BLOC~ T~t-~O ~ ~O~ FOR
ARBIT~RTT-Y SEULP~D I~LaG~ S~GME~NTS

Fi-ld of the Invention The present invention relates generally to a method and device to code images for data transmission, and more specifically to a method and device to determine the optimal transform coefficients for an irregular shaped image for low bit-rate transmission using standard transforms.

Information Disclosur- gtat _ nt Although current video coding st~n~rds may operate at very low bitrates, the trade-off between temporal and spatial resolution results in visually annoying motion or spatial artifacts. Therefore, the International Organization for Standardization is considering developing a new standard for very low bitrate A/V coding. ISO/IEC
JTC1/SC29/WG11 MPEG 92/699, "Project Description for Very-Low Bitrate A/V Coding" (Nov. 5, 1992). This document reviews the state of the art and proposes a direction for future research.
In typical image coding systems, the image to be coded is usually processed using N X N blocks of picture elements (pels) regardless of the image content. This approach, however, may lead to visible distortions known as blocking and mosquito effects, particularly at low bit-rates. To avoid these visual artifacts, region-based image representation partitions the image into regions of similar motion or texture, yielding image segments of arbitrary shape instead of fixed (rectangular) blocks.
Such image representation offers several advantages over the conventional block-based representation such as adaptation to local image characteristics. Consequently, region-based image representation has received considerable attention in MPEG4 video coding st~n~rd work for very low bitrate coding.
A flln~mental issue in region-based image compression is the coding of arbitrarily shaped image segments. An arbitrarily shaped image segment f(x,y) can be approximated by a set of basis functions optimized for the shape of the image segment to be coded:
f(x,y) = ~ ai~i(X Y) (1) where x,y ~ S, S is the region occupied by the image segment, f (x~ y) is the approximation of the image segment, and ~i's are the basis functions. However, such shape-adapted transform techniques require a large amount of memory for storing the set of basis functions. As a result, these techniques are only suitable for small regions. Furthermore, for each new segment a new set of basis functions has to be computed. Thus, extensive computation is involved. Since no fast algorithms exist, these techniques are not attractive for practical use.
Another popular approach is to use one of the most popular image compression techniques, transform coding.
In transform coding, an image is transformed from the image intensity domain to a new domain prior to coding and transmission. The new domain is selected so that the energy of the image becomes concentrated to a small region in the new domain. Among the various transforms, the discrete cosine transform (DCT) is the most widely used transform. It has become the industry standard because it provides a good approximation of the optimal Karhunen-Loeve transform (KLT) for a certain class of images, andcan be computed by means of fast algorithms.
With block transform coding, the image segment can be approximated by a set of two-dimensional basis functions defined on a rectangular block "B" which circumscribes the '' 212g712 image:
f (x,Y) = ~ i(x,y) (2) where x,y ~ S, and ~i's are the basis functions defined on the full block B. The best approximation f (x,y) of an image segment can be found by m;n;m; zing the squared error between the image segment and the approximation, i.e., error = ~ (f(x,y) -f (x,y) ) .2 (3) This is equivalent to solving the Gaussian normal equations. Note that the summation is taken over the region defined by the image segment; pels outside the region are discarded. Since the number of pels of the image segment is usually less than the number of basis functions, the problem is undetermined, and several solutions are possible. To arrive at a single solution, the problem can be solved by successive approximation.
This involves starting with a small subset of basis functions and exhaustively searching for the best solution. Although successive progression will yield a solution, the computational cost is high. Furthermore, like the shape-adapted techniques, no fast algorithms are available to make real-time implementation possible.
A more efficient approach is to perform the transform on the entire block, f (x,y, ) = ~ i(X~Y) (4) where x, y ~ B, and B is the area of the block. The transform can be performed in real-time by special purpose chips designed for block transforms. However, this technique requires that the pels outside the image segment be initialized before the transform occurs. The outside pels can be chosen such that the sum of squared errors _ 4 ~ 47 ~
over the image segment expresses by Equation (3) is minimized. This approach enables the transform spectrum to be optimized by choosing appropriate pel values outside the image segment. To this end, zeroing the outside pels would be an easy way to initialize them. This approach, however, introduces discontinuities at the boundary of the image segment, yielding high frequency components that degrade the coding performance. To alleviate the problem, the image segments can be extrapolated outside the boundary by mirroring or pel repetition such that a smoother transformation can be obtained. This ad hoc approach though, fails to provide consistent, satisfactory results. Consequently, a more promising method is needed.
The present invention fulfills this need.
The present invention utilizes the theory of successive projection onto convex sets (POCS). In Patrick L. Combettes, "The Foundation of Set Theoretic Estimation," Proceedings of the IEEE, Vol. 81, No. 2 (Feb.
1993), this theory is described in a theoretical sense.
The present invention applies this theory in a practical sense to image coding.

Summary of The Invention In accordance with one aspect of the present invention there is provided an apparatus for selecting image data representing an arbitrarily shaped image for optimizing transmission of said image data said apparatus comprising: a. first means for transforming said arbitrarily shaped image to transform coefficients; b.
second means coupled to said first means for generating a transform coefficient set (TCS) from said transform coefficients, said TCS generator being configured to output said TCS when said TCS represents said selected image data, and to send said TCS to an inverse transform when said TCS does not represent said selected image data;
c. third means coupled to said second means for transforming said TCS to a computed region block having . _~ . bS~
,~ ", 2~ ~7 ~

computed pel values; and d. fourth means coupled to said third means for replacing computed pel values corresponding to an interior pel set of said arbitrarily shaped image with original pel values of said arbitrarily shaped image so as to form a modified computed region block (MCRB), said fourth means being configured to send a modified computed region block to a reiterative forward transform for reiteration.
In accordance with another aspect of the present invention there is provided an apparatus for selecting image data representing an arbitrarily shaped image for optimizing low-data rate transmission of said image data, said apparatus comprising: (a) generating means for generating original pel values, said generating means including; (i) circumscribing means for circumscribing said arbitrarily shaped image with a rectangular region block, thereby creating an internal pel set which lies within said arbitrarily shaped image and within said region block, and an external pel set which lies outside said arbitrarily shaped image within said region block;
and (ii) initializing means for initializing pel values of said external pel set by extrapolating the pel values of said internal pel set; (b) operating means for operating a transform coder unit (TCU) which calculates optimal transform coefficients, said operating means including;
(i) means for performing a forward transform on said region block to generate transform coefficient; (ii) means for generating a transform coefficient set (TCS) from transform coefficients; (iii) means for performing an inverse transform on said TCS thereby generating a computed region block having computed pel values; (iv) means for replacing those computed pel values ~ corresponding to said internal pel set with original pel values to form a modified computer region block (MCRB);
(v) means for determining whether said TCS represents said OTC; (vi) means for reiterating steps (i) and (ii) on said modified computed region block and outputting said TCS

~A~' - 5a _ ~ ~ 2~ 7 ~ ~
when said TCS represents OTC; and (vii) reiterating steps (i) through (vii) on said modified computed region block when said TCS values do not represent said OTC.
In accordance with yet another aspect of the present invention there is provided an apparatus for selecting image data representing an arbitrarily shaped image for optimizing transmission of said image data, said apparatus comprising: (a) means for generating original pel values, said means including: (i) means for circumscribing said arbitrarily shaped image with a rectangular region block, thereby creating an internal pel set which lies within said arbitrarily shaped image and within said region block, and an external pel set which lies outside said arbitrarily shaped image and within said region block; and (ii) means for initializing pel values of said external pel set by extrapolating the pel values of said internal pel set; (b) means for operating a transform coder unit (TCU) for calculating optimal transform coefficients, said means for operating a TCU including: (i) means for performing a forward transform on said region block to generate transform coefficients; (ii) means for generating a transform coefficient set (TCS) from said transform coefficients; (iii) means for determining whether said TCS
represents optimal transform coefficients (OTC); (iv) means for outputting said TCS when said TCS represents said OTC; (v) means for performing an inverse transform on said set TCS when said TCS does not represent said OTC, said inverse transform generates a computed region block having computed pel values; (vi) means for replacing those computed pel values corresponding to said internal pel set with original pel values so as to form a modified computed region block; and (vii) means for reiterating steps (i) through (vii) on said modified computed region block.

Brief DeRcriPtion of The Drawin~R
Fig. 1 depicts an arbitrary shape and the circumscribed rectangular region.

. ~ ,.

~ ~ ~ 4 7 i ~

Fig. 2 shows a preferred embodiment of the TCU which detects convergence in the image domain.
Fig. 3 shows another preferred embodiment of the TCU
which detects convergence in the transform domain.
Fig. 4 shows another preferred embodiment of the present invention wherein a multiplicity of TCU are connected in series.

Detailed Description of The Present Invention The present invention relates to an iterative technique to determine optimal transform coefficient values for the coding of arbitrarily shaped images. The convergence of the iteration to the optimal solution is guaranteed by the theory of successive projection onto convex sets (POCS). The technique can be described within the POCS context by using two sets of images.
The first set is defined based on a basic premise of transform coding -- the energy compaction property of transform coefficients. This property provides that a large amount of energy is concentrated in a small fraction of the transform coefficients, and only these coefficients need to be kept for coding the image. The set of images which can be represented using a selected group of transform coefficients constitute the first set and will be referred to as the transform coefficients set (TCS).
This set is convex for all linear and some non-linear transformations. The projection of an arbitrarily shaped ~, ,, . .

212~712 image block onto this set can be determined by computing the block transform and selecting and ret~;n;ng high energy coefficients. The r~m~;n;ng, non-selected coefficients are zeroed (region-zeroing in the frequency domain).
The second set is derived form the fact that the values of the pels outside of the arbitrary shaped region are irrelevant to coding. Thus, the second set becomes the set of images whose pel values within the arbitrarily shaped region are specified by the image to be coded.
This set is referred to as the region of support set (RSS). This set is convex. The projection of an arbitrarily shaped region onto this set can be obtained by replacing those pel values corresponding to the image~s interior pels with the original pel values (region-enforcing in the space domain). This theory provides the basis for the present invention.
The present invention basically comprises two parts.
Fig. 1 depicts the first part which involves generating and preparing the data to be coded. In this step, a rectangular region block is circumscribed around an arbitrarily shaped image 2. This defines an original internal pel set 3 which lies within arbitrarily shaped image 2 and within region block 1, and an original external pel set 4 which lies outside arbitrarily shaped image 2 and within region block 1.
To initialize the pel values of external pel set 4, an extrapolator 5 extrapolates the pel values of internal pel set 3. Examples of extrapolation methods include mirroring or pel repetition of the segments of internal pel set 3. Once external pel set 4 is initialized, the image data can be manipulated in the second part.
The second part involves a transform coder unit (TCU) 6 performing a POCS iteration loop on the image data. TCU
6 is shown in Fig. 2. TCU 6 comprises a forward transform 212~712 7, which operates at real-time and transforms the image from the image ~om~in 30 to the transform domain 31.
Next, a TCS generator 8 generates a transform coefficient set (TCS) from the transform coefficients.
This can be accomplished in a couple of ways. First, TCS
generator 8 may contain a quantizer which generates the TCS by quantizing the transform coefficients. There is no convergence guarantee, however, under this alternative. A
more preferred embodiment utilizes the energy compaction property of transform coefficients. This property holds that a large amount of energy is concentrated in a small fraction of the transform coefficients. Therefore, TCS
generator 8 need only select and retain these coefficients for coding the image. The r~m~;n;ng transform coefficients can be zeroed.
If the energy compaction property is used to generate the TCS, then the number of coefficients to retain should be established. This may accomplished via a rate controller 12. Rate controller 12 can establish the threshold energy level at which to retain coefficients based on the size of the arbitrarily shaped image, and the bit budget of the encoder which will eventually code the transform coefficients. Alternatively, the number of transform coefficients to retain can be established independently via a TCS limiter 13 at the beginning of each iteration. A combination of both these mechanisms could be used as well.
TCS generator 8 outputs the TCS from the TCU if the TCS represents the optimal transform coefficients (OTC).
Otherwise, TCS generator 8 sends the TCS to an inverse transform 9. Inverse transform 9 converts the TCS from transform domain 31 to image domain 30, thereby producing a computed regional block having computed pel values.
A replacer 10 replaces those computed pel values corresponding with internal pel set 3 with the original 21247t2 pel values, thereby forming a modified computed regional block (MCRB). The MCRB is then re-iterated through a re-iterative forward transform. In the preferred embodiment of Figs. 2 and 3, the re-iterative forward transform and forward transform 7 are the same. Thus, the same TCU will re-iterate the MCRB.
The re-iterative forward transform and forward transform 7, however, can be different. For example, Fig.
4 shows a successive connection of TCUs 201-204. In this configuration, the re-iterative forward transform of TCU
201 is the forward transform of succeeding TCU 202. Thus, the modified computed region block is re-iterated through different TCUs. The number of TCUs in series determines the number of iterations performed.
Although the number of iterations depends upon the number of successive TCUs in the embodiment of Fig. 4, the number of iterations is variable in the embodiments of Figs. 2 and 3. Consequently, an iteration controller 11 is employed in both embodiments. Referring only to Fig.
2, iteration controller 11 controls switch 15 which has a first position 19 and a second position 20. First position 19 directs the TCS from TCS generator 8 to inverse transform 9 when the TCS does not represent the OTC. Second position 20 directs the TCS from TCS
generator 8 to a quantizer when the TCS represents the OTC.
Iteration controller 11 may control the switching of switch 15 through a couple of mechanisms. As Fig. 2 shows, an iteration counter 14 can be used to count the number of iterations. When a pre-determined number is reached, iteration counter 14 will signal iteration controller 11 which will move switch 15 from first position 19 to second position 20.
Fig. 2 depicts another method of controlling switch 15 by monitoring image domain 30 of the TCU. Here, a 212~712 g convergence detector 21, and a frame buffer 17 are employed. Frame buffer 17 stores the pel values of the previous iteration. Convergence detector 21 switches switch 15 from first position 19 to second position 20 when the mean squared difference between the computed pel values stored in frame buffer 17 and those of the current iteration reaches a pre-determined level.
Fig. 3 depicts a device which also controls switch 115, but does so by monitoring transform ~om~;n 131 of TCU
106 using a convergence detector 121, and a frame buffer 117. Frame buffer 117 stores the TCS of the previous iteration. Convergence detector 121 switches switch 115 from first position 119 to second position 120 when the mean squared difference between the TCS stored in frame buffer 117 and that of the current iteration reaches a pre-determined level.
Obviously, numerous modifications and variations of the present invention are possible in light of the above teachings. It is therefore understood that within the scope of the appended claims, the invention may be practiced otherwise than as specifically described herein.

Claims (20)

1. An apparatus for selecting image data representing an arbitrarily shaped image for optimizing transmission of said image data said apparatus comprising:
a. first means for transforming said arbitrarily shaped image to transform coefficients;
b. second means coupled to said first means for generating a transform coefficient set (TCS) from said transform coefficients, said TCS generator being configured to output said TCS when said TCS represents said selected image data, and to send said TCS to an inverse transform when said TCS does not represent said selected image data;
c. third means coupled to said second means for transforming said TCS to a computed region block having computed pel values; and d. fourth means coupled to said third means for replacing computed pel values corresponding to an interior pel set of said arbitrarily shaped image with original pel values of said arbitrarily shaped image so as to form a modified computed region block (MCRB), said fourth means being configured to send a modified computed region block to a reiterative forward transform for reiteration.
2. The apparatus of claim 1, wherein said second means includes a quantizer which generates said TCS by quantizing said transform coefficients.
3. The apparatus of claim 1, wherein said second means generates said TCS by selecting and retaining those transform coefficients which have high energy according to the energy compaction property of transform coefficients, and by zeroing all the non-selected transform coefficients.
4. The apparatus of claim 3, wherein said second means comprises a rate controller to establish a threshold energy level at which said TCS selector retains transform coefficients, said rate controller establishes said level based on the bit budget of an encoder and the size of said arbitrarily shaped image.
5. The apparatus of claim 3, wherein said second means comprises a TCS limiter to independently establish the number of transform coefficients to retain.
6. The apparatus of claim 1, wherein said reiterative forward transform and said forward transform are one in the same, and further comprising:
e. an iteration controller which controls an iteration switch having a first position and a second position, said first position directs TCS from said TCS
generator to said inverse transform when said TCS does not represent said selected image data, said second position directs said TCS from said TCS generator to output of said TCU.
7. The apparatus of claim 6, wherein said iteration controller comprises an iteration counter to independently establish the number of iterations to perform, after said apparatus performs the established number of iterations, said switch switches to said second position.
8. The apparatus of claim 6, wherein said iteration controller contains a convergence detector, and a frame buffer, said frame buffer stores the pel values of a previous iteration, said convergence detector switches said switch from said first position to said second position when the mean squared difference between said MCRB stored in said frame buffer and that of the current iteration reaches a pre-determined level.
9. The apparatus of claim 6, wherein said iteration controller contains a convergence detector, and a frame buffer, said frame buffer stores the TCS of a previous iteration, said convergence detector switches said switch from said first position to said second position when the mean squared difference between the TCS stored in said frame buffer and that of the current iteration reaches a pre-determined level.
10. The apparatus of claim 1, wherein said reiterative forward transform comprises a forward transform of a succeeding apparatus, said succeeding apparatus connected in series with said apparatus.
11. The apparatus of claim 1, wherein said forward transform is a discrete cosine transform (DCT) chip.
12. An apparatus for selecting image data representing an arbitrarily shaped image for optimizing low-data rate transmission of said image data, said apparatus comprising:
(a) generating means for generating original pel values, said generating means including;
(i) circumscribing means for circumscribing said arbitrarily shaped image with a rectangular region block, thereby creating an internal pel set which lies within said arbitrarily shaped image and within said region block, and an external pel set which lies outside said arbitrarily shaped image within said region block;
and (ii) initializing means for initializing pel values of said external pel set by extrapolating the pel values of said internal pel set;
(b) operating means for operating a transform coder unit (TCU) which calculates optimal transform coefficients, said operating means including;

(i) means for performing a forward transform on said region block to generate transform coefficient;
(ii) means for generating a transform coefficient set (TCS) from transform coefficients;
(iii) means for performing an inverse transform on said TCS thereby generating a computed region block having computed pel values;
(iv) means for replacing those computed pel values corresponding to said internal pel set with original pel values to form a modified computer region block (MCRB);
(v) means for determining whether said TCS
represents said OTC;
(vi) means for reiterating steps (i) and (ii) on said modified computed region block and outputting said TCS when said TCS represents OTC; and (vii) reiterating steps (i) through (vii) on said modified computed region block when said TCS
values do not represent said OTC.
13. The apparatus as recited in claim 12, wherein said means for performing a forward transform includes a discrete cosine transform (DCT) chip.
14. The apparatus as recited in claim 12, wherein said means for generating said TCS is configured to quantize said transform coefficients.
15. The apparatus as recited in claim 14, wherein said means for generating said TCS is further configured to select and retain those transform coefficients which have high energy according to the energy compaction property of transform coefficients, and zeroing the non-selected transform coefficients.
16. The apparatus as recited in claim 15, wherein said TCS includes a rate controller to establish a threshold energy level at which transform coefficients are retained, said-rate controller being configured to establish said level based upon the bit budget of an encoder and the size of said arbitrarily shaped image.
17. The apparatus as recited in claim 15, wherein said means for generating said TCS is further configured to independent establish a number of transform coefficients to retain.
18. The apparatus as recited in claim 12, wherein said means for determining whether said TCS represents said OTC is configured to independently establish the number of iterations to perform.
19. The apparatus as recited in claim 18, wherein said means for determining whether said TCS represents said OTC is further configured to determine when the means squared difference between said MCRB of one iteration and that of a subsequent iteration reaches a predetermined threshold.
20. An apparatus for selecting image data representing an arbitrarily shaped image for optimizing transmission of said image data, said apparatus comprising:
(a) means for generating original pel values, said means including:
(i) means for circumscribing said arbitrarily shaped image with a rectangular region block, thereby creating an internal pel set which lies within said arbitrarily shaped image and within said region block, and an external pel set which lies outside said arbitrarily shaped image and within said region block; and (ii) means for initializing pel values of said external pel set by extrapolating the pel values of said internal pel set;
(b) means for operating a transform coder unit (TCU) for calculating optimal transform coefficients, said means for operating a TCU including:
(i) means for performing a forward transform on said region block to generate transform coefficients;
(ii) means for generating a transform coefficient set (TCS) from said transform coefficients;
(iii) means for determining whether said TCS represents optimal transform coefficients (OTC);
(iv) means for outputting said TCS when said TCS represents said OTC;
(v) means for performing an inverse transform on said set TCS when said TCS does not represent said OTC, said inverse transform generates a computed region block having computed pel values;
(vi) means for replacing those computed pel values corresponding to said internal pel set with original pel values so as to form a modified computed region block; and (vii) means for reiterating steps (i) through (vii) on said modified computed region block.
CA002124712A 1993-10-15 1994-05-31 Block transform coder for arbitrarily shaped image segments Expired - Fee Related CA2124712C (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US138,295 1993-10-15
US08/138,295 US5422963A (en) 1993-10-15 1993-10-15 Block transform coder for arbitrarily shaped image segments

Publications (2)

Publication Number Publication Date
CA2124712A1 CA2124712A1 (en) 1995-04-16
CA2124712C true CA2124712C (en) 1999-06-29

Family

ID=22481384

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002124712A Expired - Fee Related CA2124712C (en) 1993-10-15 1994-05-31 Block transform coder for arbitrarily shaped image segments

Country Status (5)

Country Link
US (1) US5422963A (en)
EP (1) EP0649258B1 (en)
JP (1) JP3078460B2 (en)
CA (1) CA2124712C (en)
DE (1) DE69420662T2 (en)

Families Citing this family (81)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3202433B2 (en) * 1993-09-17 2001-08-27 株式会社リコー Quantization device, inverse quantization device, image processing device, quantization method, inverse quantization method, and image processing method
JP2720926B2 (en) * 1993-10-26 1998-03-04 富士ゼロックス株式会社 Image coding device
JP3195142B2 (en) * 1993-10-29 2001-08-06 キヤノン株式会社 Image processing method and apparatus
US6580819B1 (en) 1993-11-18 2003-06-17 Digimarc Corporation Methods of producing security documents having digitally encoded data and documents employing same
US7171016B1 (en) 1993-11-18 2007-01-30 Digimarc Corporation Method for monitoring internet dissemination of image, video and/or audio files
US6122403A (en) 1995-07-27 2000-09-19 Digimarc Corporation Computer system linked by using information in data objects
US5748763A (en) 1993-11-18 1998-05-05 Digimarc Corporation Image steganography system featuring perceptually adaptive and globally scalable signal embedding
US6614914B1 (en) 1995-05-08 2003-09-02 Digimarc Corporation Watermark embedder and reader
US6424725B1 (en) 1996-05-16 2002-07-23 Digimarc Corporation Determining transformations of media signals with embedded code signals
US5768426A (en) 1993-11-18 1998-06-16 Digimarc Corporation Graphics processing system employing embedded code signals
US6983051B1 (en) 1993-11-18 2006-01-03 Digimarc Corporation Methods for audio watermarking and decoding
US5832119C1 (en) 1993-11-18 2002-03-05 Digimarc Corp Methods for controlling systems using control signals embedded in empirical data
US5636292C1 (en) 1995-05-08 2002-06-18 Digimarc Corp Steganography methods employing embedded calibration data
US5748783A (en) 1995-05-08 1998-05-05 Digimarc Corporation Method and apparatus for robust information coding
US5710834A (en) 1995-05-08 1998-01-20 Digimarc Corporation Method and apparatus responsive to a code signal conveyed through a graphic image
US6516079B1 (en) 2000-02-14 2003-02-04 Digimarc Corporation Digital watermark screening and detecting strategies
US5822436A (en) 1996-04-25 1998-10-13 Digimarc Corporation Photographic products and methods employing embedded information
US5862260A (en) 1993-11-18 1999-01-19 Digimarc Corporation Methods for surveying dissemination of proprietary empirical data
US5841886A (en) 1993-11-18 1998-11-24 Digimarc Corporation Security system for photographic identification
US6408082B1 (en) 1996-04-25 2002-06-18 Digimarc Corporation Watermark detection using a fourier mellin transform
CA2174413C (en) 1993-11-18 2009-06-09 Geoffrey B. Rhoads Steganographic methods and apparatuses
US6611607B1 (en) 1993-11-18 2003-08-26 Digimarc Corporation Integrating digital watermarks in multimedia content
US5841978A (en) 1993-11-18 1998-11-24 Digimarc Corporation Network linking method using steganographically embedded data objects
US6522770B1 (en) 1999-05-19 2003-02-18 Digimarc Corporation Management of documents and other objects using optical devices
US5608458A (en) * 1994-10-13 1997-03-04 Lucent Technologies Inc. Method and apparatus for a region-based approach to coding a sequence of video images
US6560349B1 (en) 1994-10-21 2003-05-06 Digimarc Corporation Audio monitoring using steganographic information
US5978514A (en) 1994-11-10 1999-11-02 Kabushiki Kaisha Toshiba Image data coding and decoding system for efficiently compressing information using the shape and position of the image content
JP3169783B2 (en) * 1995-02-15 2001-05-28 日本電気株式会社 Video encoding / decoding system
US5852681A (en) * 1995-04-20 1998-12-22 Massachusetts Institute Of Technology Method and apparatus for eliminating artifacts in data processing and compression systems
US6760463B2 (en) 1995-05-08 2004-07-06 Digimarc Corporation Watermarking methods and media
US6721440B2 (en) 1995-05-08 2004-04-13 Digimarc Corporation Low visibility watermarks using an out-of-phase color
US6744906B2 (en) 1995-05-08 2004-06-01 Digimarc Corporation Methods and systems using multiple watermarks
US6788800B1 (en) 2000-07-25 2004-09-07 Digimarc Corporation Authenticating objects using embedded data
US6411725B1 (en) 1995-07-27 2002-06-25 Digimarc Corporation Watermark enabled video objects
US6577746B1 (en) 1999-12-28 2003-06-10 Digimarc Corporation Watermark-based object linking and embedding
US6829368B2 (en) 2000-01-26 2004-12-07 Digimarc Corporation Establishing and interacting with on-line media collections using identifiers in media signals
US6408331B1 (en) * 1995-07-27 2002-06-18 Digimarc Corporation Computer linking methods using encoded graphics
JP3094390B2 (en) * 1995-11-29 2000-10-03 三星電子株式会社 Transform coding apparatus for block including boundary of object of arbitrary form
DE19609860C1 (en) * 1996-03-13 1997-09-04 Siemens Ag Process for processing pixels of an image segment by a computer
KR100209411B1 (en) * 1996-05-10 1999-07-15 전주범 Method for processing image signals using contour information
US6381341B1 (en) 1996-05-16 2002-04-30 Digimarc Corporation Watermark encoding method exploiting biases inherent in original signal
DE19625402A1 (en) * 1996-06-25 1998-01-02 Siemens Ag Process for processing pixels of an image segment by a computer
JP3474707B2 (en) * 1996-07-04 2003-12-08 シャープ株式会社 Image encoding device and image decoding device
FR2752474B1 (en) * 1996-08-14 1998-12-31 Iona Donescu PROCESS FOR TRANSFORMING THE IMAGE SIGNAL ON ARBITRARY MEDIA
FR2758636B1 (en) * 1997-01-21 2000-12-29 France Telecom PROCESSING OF IMAGES BY REGIONS USING DISCRETE TRANSFORMATION ON FINISHED SEGMENTS WITHOUT EXTENSION
US7054463B2 (en) 1998-01-20 2006-05-30 Digimarc Corporation Data encoding using frail watermarks
US6058214A (en) * 1998-01-20 2000-05-02 At&T Corp. Compression of partially masked still images
US6456744B1 (en) 1999-12-30 2002-09-24 Quikcat.Com, Inc. Method and apparatus for video compression using sequential frame cellular automata transforms
US6330283B1 (en) 1999-12-30 2001-12-11 Quikcat. Com, Inc. Method and apparatus for video compression using multi-state dynamical predictive systems
US6400766B1 (en) 1999-12-30 2002-06-04 Quikcat.Com, Inc. Method and apparatus for digital video compression using three-dimensional cellular automata transforms
US6625297B1 (en) 2000-02-10 2003-09-23 Digimarc Corporation Self-orienting watermarks
US6804377B2 (en) 2000-04-19 2004-10-12 Digimarc Corporation Detecting information hidden out-of-phase in color channels
US6718066B1 (en) 2000-08-14 2004-04-06 The Hong Kong University Of Science And Technology Method and apparatus for coding an image object of arbitrary shape
US6959113B2 (en) * 2000-09-29 2005-10-25 Pentax Corporation Arbitrary-shape image-processing device and arbitrary-shape image-reproducing device
JP2002300581A (en) * 2001-03-29 2002-10-11 Matsushita Electric Ind Co Ltd Image-coding apparatus and image-coding program
DK1456810T3 (en) 2001-12-18 2011-07-18 L 1 Secure Credentialing Inc Multiple image security features to identify documents and methods of producing them
WO2003056500A1 (en) 2001-12-24 2003-07-10 Digimarc Id Systems, Llc Covert variable information on id documents and methods of making same
EP1459246B1 (en) 2001-12-24 2012-05-02 L-1 Secure Credentialing, Inc. Method for full color laser marking of id documents
US7694887B2 (en) 2001-12-24 2010-04-13 L-1 Secure Credentialing, Inc. Optically variable personalized indicia for identification documents
US7728048B2 (en) 2002-12-20 2010-06-01 L-1 Secure Credentialing, Inc. Increasing thermal conductivity of host polymer used with laser engraving methods and compositions
US6862371B2 (en) 2001-12-31 2005-03-01 Hewlett-Packard Development Company, L.P. Method of compressing images of arbitrarily shaped objects
US7824029B2 (en) 2002-05-10 2010-11-02 L-1 Secure Credentialing, Inc. Identification card printer-assembler for over the counter card issuing
AU2003298731A1 (en) 2002-11-26 2004-06-18 Digimarc Id Systems Systems and methods for managing and detecting fraud in image databases used with identification documents
US7712673B2 (en) 2002-12-18 2010-05-11 L-L Secure Credentialing, Inc. Identification document with three dimensional image of bearer
US7225991B2 (en) 2003-04-16 2007-06-05 Digimarc Corporation Three dimensional data storage
US7744002B2 (en) 2004-03-11 2010-06-29 L-1 Secure Credentialing, Inc. Tamper evident adhesive and identification document including same
US8467447B2 (en) * 2004-05-07 2013-06-18 International Business Machines Corporation Method and apparatus to determine prediction modes to achieve fast video encoding
US7810748B2 (en) 2004-06-01 2010-10-12 Kabushiki Kaisha Towani Scrapping machine
US8798131B1 (en) 2010-05-18 2014-08-05 Google Inc. Apparatus and method for encoding video using assumed values with intra-prediction
US9210442B2 (en) 2011-01-12 2015-12-08 Google Technology Holdings LLC Efficient transform unit representation
US9380319B2 (en) 2011-02-04 2016-06-28 Google Technology Holdings LLC Implicit transform unit representation
US9219915B1 (en) 2013-01-17 2015-12-22 Google Inc. Selection of transform size in video coding
US9967559B1 (en) 2013-02-11 2018-05-08 Google Llc Motion vector dependent spatial transformation in video coding
US9544597B1 (en) 2013-02-11 2017-01-10 Google Inc. Hybrid transform in video encoding and decoding
US9674530B1 (en) 2013-04-30 2017-06-06 Google Inc. Hybrid transforms in video coding
US9565451B1 (en) 2014-10-31 2017-02-07 Google Inc. Prediction dependent transform coding
US9769499B2 (en) 2015-08-11 2017-09-19 Google Inc. Super-transform video coding
US10277905B2 (en) 2015-09-14 2019-04-30 Google Llc Transform selection for non-baseband signal coding
US9807423B1 (en) 2015-11-24 2017-10-31 Google Inc. Hybrid transform scheme for video coding
WO2018141385A1 (en) 2017-02-02 2018-08-09 Huawei Technologies Co., Ltd. Image and video processing apparatuses and methods
US11122297B2 (en) 2019-05-03 2021-09-14 Google Llc Using border-aligned block functions for image compression

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE4136636A1 (en) * 1991-11-07 1993-07-01 Bosch Gmbh Robert Image signal encoder for data-reduced transmission of moving images - uses block encoding of image zones not qualifying for object-oriented encoding
JP3068304B2 (en) * 1992-01-21 2000-07-24 日本電気株式会社 Video coding and decoding systems

Also Published As

Publication number Publication date
US5422963A (en) 1995-06-06
EP0649258A2 (en) 1995-04-19
CA2124712A1 (en) 1995-04-16
DE69420662T2 (en) 2000-04-27
JP3078460B2 (en) 2000-08-21
EP0649258B1 (en) 1999-09-15
EP0649258A3 (en) 1995-05-17
JPH07177516A (en) 1995-07-14
DE69420662D1 (en) 1999-10-21

Similar Documents

Publication Publication Date Title
CA2124712C (en) Block transform coder for arbitrarily shaped image segments
Chen et al. Block transform coder for arbitrarily shaped image segments
US5608458A (en) Method and apparatus for a region-based approach to coding a sequence of video images
US6084908A (en) Apparatus and method for quadtree based variable block size motion estimation
Gerken Object-based analysis-synthesis coding of image sequences at very low bit rates
CA2295689C (en) Apparatus and method for object based rate control in a coding system
Pao et al. Modeling DCT coefficients for fast video encoding
US6160846A (en) Apparatus and method for optimizing the rate control in a coding system
EP0705035B1 (en) Encoding data represented as a multi-dimensional array
Chen et al. A block transform coder for arbitrarily shaped image segments
Kaup Object-based texture coding of moving video in MPEG-4
US20080008246A1 (en) Optimizing video coding
Kaup et al. Coding of segmented images using shape-independent basis functions
JP2000511366A6 (en) Apparatus and method for variable block size motion estimation based on quadrant tree
Ribas-Corbera et al. Optimizing motion-vector accuracy in block-based video coding
EP1012778A1 (en) Apparatus and method for macroblock based rate control in a coding system
US20120082217A1 (en) Motion compensation using decoder-defined vector quantized interpolation filters
Banham et al. A selective update approach to matching pursuits video coding
KR100571920B1 (en) Video encoding method for providing motion compensation method based on mesh structure using motion model and video encoding apparatus therefor
EP0734166A2 (en) Apparatus for encoding an image signal having a still object
Ribas-Corbera et al. Optimal motion vector accuracy for block-based motion-compensated video coders
Hsia et al. Efficient postprocessor for blocky effect removal based on transform characteristics
KR100209411B1 (en) Method for processing image signals using contour information
Ropert et al. RD Optimization of uniform threshold scalar quantization for Laplacian distributions
Wu et al. Additive vector decoding of transform coded images

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed