US20020031181A1 - Coding a sequence of pictures - Google Patents

Coding a sequence of pictures Download PDF

Info

Publication number
US20020031181A1
US20020031181A1 US09/976,226 US97622601A US2002031181A1 US 20020031181 A1 US20020031181 A1 US 20020031181A1 US 97622601 A US97622601 A US 97622601A US 2002031181 A1 US2002031181 A1 US 2002031181A1
Authority
US
United States
Prior art keywords
block
pixels
motion vector
motion
macro
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US09/976,226
Inventor
Anne-Claude Doux
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
US Philips Corp
Original Assignee
US Philips Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by US Philips Corp filed Critical US Philips Corp
Priority to US09/976,226 priority Critical patent/US20020031181A1/en
Publication of US20020031181A1 publication Critical patent/US20020031181A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction

Definitions

  • the invention relates to coding a sequence of pictures.
  • the invention may be applied, for example, in a video coder operating in accordance with a standard fixed by the Moving Pictures Experts Group (MPEG).
  • MPEG Moving Pictures Experts Group
  • Video coding in accordance with the MPEG-4 standard will comprise functions such as motion estimation, motion compensation, discrete-cosine transformation (DCT) and quantization.
  • the motion estimation establishes a motion vector for a block of pixels in a current picture to be coded.
  • the motion vector indicates a block of pixels in a previous picture which is similar to the block of pixels in the current picture to be coded.
  • the motion compensation provides a prediction-error block which is the difference between the two aforementioned blocks of pixels.
  • the prediction-error block undergoes a DCT.
  • the result is a block of DCT coefficients.
  • the quantization involves dividing each DCT coefficient by a quantization parameter and rounding off the result to the nearest integer.
  • the invention takes the following aspects into consideration.
  • the perceived picture quality at a decoding end is mainly determined by certain portions in a picture which a viewer regards as being “interesting”. In many cases, a viewer will focus on moving objects rather than on stationary objects. Thus, the precision with which moving objects are coded will greatly influence the perceived picture quality.
  • a motion vector is established indicating a block of pixels in a previous picture which is similar to a block of pixels in a current picture to be coded.
  • Data relating to the block of pixels in the current picture is compressed, the extent to which the data is compressed depending on a compression parameter.
  • These features are comprised in, for example, MPEG video-coding in which blocks of DCT coefficients constitute the data to be compressed, and the quantization parameter constitutes the compression parameter.
  • the compression parameter is controlled on the basis of the motion vector. Accordingly, data relating to a block of pixels representing a moving object can be compressed to a relatively small extent, whereas data relating to a block of pixels representing a stationary object can be compressed to a relatively large extent.
  • Such a motion-vector based control of the quantization parameter makes that a moving object is coded with a relatively great precision.
  • the invention allows a relatively good picture quality at a decoding end.
  • a stationary camera provides a sequence of pictures to be coded. Then, a block of pixels representing a moving object will have a relatively large motion vector, and a block of pixels representing a background will have a relatively small motion vector. Consequently, data should be compressed to a relatively small extent if the motion vector is large and, conversely, data should be compressed to a relatively great extent if the motion vector is relatively small.
  • a camera that is tracking a moving object provides a sequence of pictures to be coded.
  • a block of pixels representing a moving object will have a relatively small motion vector
  • a block of pixels representing the background will have a relatively large motion vector. Consequently, data should be compressed to a relatively small extent if the motion vector is relatively small and, conversely, it should be compressed to a relatively great extent if the motion vector is relatively large.
  • FIG. 1 is a conceptual diagram illustrating basic features of the invention as claimed in claim 1 ;
  • FIGS. 2 to 6 are conceptual diagrams illustrating additional features as claimed in claims 2 to 6 , respectively, and;
  • FIG. 7 is a functional block diagram of an example of an MPEG video coder in accordance with the invention.
  • FIGS 8 a to 8 d illustrate the operation of a filter in the MPEG video coder.
  • FIG. 1 illustrates basic features of the invention.
  • a motion vector MV is established for a block of pixels B(*,*;n) in a current picture P(n) to be coded.
  • the motion vector MV indicates a block of pixels B(*,*;n ⁇ 1) in a previous picture P(n ⁇ 1) which is similar to the block of pixels B(*,*;n) in the current picture P(n) to be coded.
  • a data compressor CMP compresses data D relating to the block of pixels B(*,*;n) in the current picture P(n). The extent to which the data D is compressed depends on a compression parameter CP.
  • a controller CON controls the compression parameter CP the basis of the motion vector MV.
  • the number of bits obtained by coding a sequence of pictures should preferably not exceed a certain limit. Stated otherwise, a certain number of bits is available for coding the sequence of pictures. It is beneficial to the picture quality, if a relatively great portion of the available bits is used for coding moving objects. The smaller the portion of the available bits which is used for coding stationary objects, the greater the portion which can be used for coding moving objects. Thus, it is desirable to prevent that data which relates to a block of pixels not representing a moving object, is compressed to a relatively small extent only.
  • the sequence of pictures P comprises a moving object.
  • the object will cover various blocks of pixels in each picture. If, in a particular picture, motion vectors are established for these blocks of pixels, these motions vectors will not be too different. Conversely, if a motion vector differs from its neighboring motion vectors to a relatively large extent, it is safe to assume that the block of pixels to which the motion vector belongs, does not represent a moving object or a portion thereof.
  • FIG. 2 illustrates the following features in addition to the features illustrated in FIG. 1.
  • a motion indication MI is established for the block of pixels B(*,*;n) in the current picture P(n) on the basis of the motion vector MV belonging to the block of pixels B(*,*;n) and motion vectors MV+,MV ⁇ belonging to neighboring blocks of pixels.
  • the controller CON controls the compression parameter CP in dependence on the motion indication MI.
  • the features illustrated in FIG. 2 provide the following advantageous effects.
  • the motion indication MI allows a distinction between a block of pixels having a motion vector which is similar to its neighboring motion vectors, and a block of pixels having a motion vector which is different from its neighboring motion vectors. Consequently, the motion indication MI allows a distinction between a block of pixels which probably represents a portion of a moving object, and a block of pixels which does probably not represent a moving object, although its motion vector as such suggests the contrary. Since the compression parameter CP is controlled in dependence on the motion indication MI, it can be prevented that data relating to a block of pixels which does not represent a moving object, is compressed to a relatively small extent only. This allows a relatively great portion of the available bits to be used for coding moving objects. Thus, the features illustrated in FIG. 2 contribute to the picture quality.
  • FIG. 3 illustrates the following features in addition to the features illustrated in FIG. 1.
  • An average length AL of motion vectors MV belonging to a group GRP of blocks of pixels is calculated. If the average length AL is below a threshold TH, the controller CON maintains the compression parameter CP at a global value VAL_glob for compressing data D relating to the group GRP of blocks of pixels.
  • the features illustrated in FIG. 3 provide the following advantageous effects. If the average length AL of the motion vectors MV is below the threshold TH, it is likely the that the group GRP of blocks of pixels represents a scene which is rather static. Since, in that case, the compression parameter CP is maintained at the global value VAL_glob, and thus is not adjusted block-by-block, it is prevented that random effects influence the compression of data D. Thus, the features illustrated in FIG. 3 contribute to picture quality.
  • FIG. 4 illustrates the following features in addition to the features which have been discussed with reference to FIG. 1.
  • the controller CON controls the compression parameter CP on the basis of the length L and the orientation O of the motion vector MV.
  • the features illustrated in FIG. 4 provide the following advantageous effects. Since the compression parameter CP is controlled not only on the basis of the length L of the motion vector MV, but also on the basis of its orientation O, data can be compressed to a greater or to a smaller extent depending on whether it represents an object moving in a substantially diagonal direction, or an object moving in a substantially horizontal or vertical direction, respectively. Thus, the data compression can be controlled in accordance with the properties of human vision so as to make an efficient use of the available bits. Consequently, the features illustrated in FIG. 4 contribute to the picture quality.
  • FIG. 5 illustrates the following features in addition to the features illustrated in FIG. 1.
  • a picture-detail indication PDI is established for the block of pixels B(*,*;n) to be coded.
  • the compression parameter CP is controlled on the basis of the motion vector MV and the picture-detail indication PDI.
  • the additional features illustrated in FIG. 5 provide the following advantageous effects.
  • the motion vector MV and the picture-detail indication PDI in combination, provide a relatively precise indication of whether the block of pixels is ‘interesting’ and, if so, to what extent. Since the compression parameter CP is controlled on the basis of this combination, a close correspondence can be achieved between the extent to which the block of pixels is interesting, and the extent to which the data relating to it is compressed. Thus, the FIG. 5 features contribute to the picture quality.
  • the compression parameter CP is adjusted on a block-by-block basis, there is a potential risk that, on average, its value gradually increases or decreases throughout the coding of a picture, without this being necessitated by the motion vectors concerned. In other words, there is a risk that there is a certain drift in the value of the compression parameter CP. Such a drift will generally adversely affect the picture quality.
  • FIG. 6 illustrates the following features in addition to the features illustrated in FIG. 1.
  • a calculator CAL1 calculates a global value VAL_glob for the compression parameter CP.
  • the global value VAL_glob is valid for all blocks of pixels in the current picture P(n).
  • a calculator CAL2 calculates a delta value VAL_delta for the compression parameter CP on the basis of the motion vector MV.
  • the controller CON controls the compression parameter CP in dependence on a sum of the global value VAL_glob and the delta value VAL_delta.
  • the features illustrated in FIG. 6 provide the following advantageous effects.
  • the compression parameter CP will have a value which may fluctuate but which, on average, will be substantially equal to the global value VAL_glob for the picture concerned. This prevents the compression parameter CP from drifting away in value throughout the coding of the picture and, consequently, it ensures a sufficiently homogeneous picture quality.
  • the features illustrated in FIG. 6 contribute to the picture quality.
  • FIG. 7 illustrates an example of an MPEG video coder which recaptures the features illustrated in FIGS. 1 - 6 and discussed hereinbefore.
  • the MPEG video coder provides an MPEG data stream DS representing a sequence of pictures P.
  • FIG. 7 is a functional block schematic diagram in which a block having the letter F in its interior represents a data processing function.
  • the MPEG video coder illustrated in FIG. 7 comprises the following data processing functions: motion estimation MEST, vector length calculation VLEC, average calculation AVC, motion detection MDET, filtering FIL, motion compensation MCOMP, subtraction SUB, picture-detail calculation PDC, discrete cosine transformation DCT, quantization parameter control QPC, quantization Q in dependence on a quantization parameter QP, variable length coding VLC, de-quantization DQ, inverse discrete cosine transformation IDCT, and addition ADD.
  • the other blocks represent memories MEM for storing data to be processed or data which has been processed, or both.
  • Memory MEM1 contains a current picture P(n) which has to be coded.
  • Memory MEM2 contains a coded-and-decoded version of a previous picture PCD(n ⁇ 1).
  • the current picture P(n) as well as the coded-and decoded version of the previous picture PCD(n ⁇ 1), comprise macro-blocks of 16 by 16 pixels.
  • FIG. 7 comprises two main parts: a part I and a part II.
  • the part I comprises data processing functions which constitute a preparation for coding a group of macro-blocks in the current picture P(n).
  • the part II comprises data processing functions which constitute the actual coding of each macro-block. That is, the processing functions in part I are first applied to the group of macro-blocks in the current picture P(n), after which the processing functions in part II are applied to each individual macro-block.
  • the motion estimation MEST searches a similar block of 16-by-16 pixels in the coded-and-decoded version of the previous picture PCD(n ⁇ 1).
  • the result of this search is a motion vector MV which indicates the similar block found in the search.
  • the motion vector MV may comprise x,y coordinates -1,2 indicating that the similar block of 16-by-16 pixels in the coded-and decoded version of the previous picture PCD(n ⁇ 1), is one position to the left and two position higher with respect to the macro-block in the current picture P(n) for which the search has been carried out.
  • the motion vectors MV thus obtained are stored in memory MEM3.
  • the memory MEM3 will comprise a motion vector map MVM for the group of macro-blocks for which the motion estimation MEST has been carried out.
  • the motion vector map MVM is in fact a matrix, each element of which is a motion vector MV for a different macro-block.
  • the vector length calculation VLEC provides a motion vector length MVL for each motion vector in the motion vector map MVM. Accordingly, a motion vector length map MVLM is obtained for the group of macro-blocks concerned.
  • MVL 2 which is indeed the real motion vector length.
  • the motion vector length MVL calculated will exceed the real motion vector length by an amount which increases as the motion vector's orientation is more horizontal or vertical than diagonal.
  • a greater weight is given to motion in a horizontal or a vertical direction than to motion in a diagonal direction. This corresponds to human visual perception in the sense that the eyes' resolution is not isotropic: the eyes are able to distinguish higher frequencies in a horizontal or vertical direction than in a diagonal direction.
  • the average calculation AVC calculates an average of the motion vector lengths MVL contained in the motion vector length map MVLM. As a result, an average motion vector length AMVL is obtained. The average motion vector length AMVL is compared with a threshold value, for example 2. The result of this comparison has an impact on the data processing functions comprised in part II as will be discussed hereinafter.
  • the motion detection MDET provides a binary motion indication BMI for each motion vector length MVL contained in the motion vector length map MVLM. It does so by comparing the motion vector length MVL with a threshold value. If the motion vector length MVL is above the threshold value, the binary motion indication BMI has the value 1. If not, the binary motion indication BMI has the value 0. Accordingly, a binary motion indication map BMIM is obtained for the group of macro-blocks concerned, in which each binary motion indication BMI is associated with a different macro-block and indicates whether the macro-block represents a moving object, or rather a portion thereof, or not.
  • the filtering FIL compares each binary motion indication BMI with one or more of its neighboring binary motion indications and provides a filtered motion indication FMI in dependence on the outcome of this comparison. Accordingly, a filtered motion indication map FMIM is obtained.
  • the filtered motion indication FMI will have a value which is different from the corresponding binary motion indication BMI.
  • the filtered motion indication FMI will have a value which is identical to the value of the corresponding binary motion indication BMI.
  • FIG. 8 a illustrates case 1 in which the binary motion indication BMI belonging to a certain macro-block is 0, but the binary motion indication belonging to the preceding macro-block and that belonging to the following macro-block are both 1.
  • the filtered motion indication FMI for the macro-block in question is given the value 2.
  • the value 2 indicates that, although the motion vector length is below the threshold value, the macro-block probably represents a portion of a moving object.
  • the moving object should, in its entirety, be coded with greater precision than stationary objects. As will be shown hereinafter, this is achieved by giving the value 2 to the filtered motion indication FMI for the macro-block concerned.
  • An example of a macro-block for which the filtered motion indication FMI may be 2 is a macro-block representing a cheek of a person who is talking in a videophone picture sequence.
  • the cheek is a moving object and, consequently, most of the macro-blocks representing it will have a binary motion indication BMI which is 1.
  • the middle of the cheek will generally move to a relatively little extent.
  • the binary motion indication BMI for a macro-block in the middle of the cheek may be 0, whereas the binary motion indications for neighboring macro-blocks are 1.
  • the filtered motion indication FMI for the macro-block in the middle of the cheek will be 2. It will hereinafter be shown that, as a result, the entire cheek is coded with sufficient precision which contributes to the perceived overall picture quality.
  • FIG. 8 b illustrates case 2 in which the binary motion indication BMI belonging to a certain macro-block is 1, but the binary motion indication belonging to the preceding macro-block and that belonging to the next macro-block are both 0.
  • the filtered motion indication FMI for the macro-block concerned is given the value 0.
  • the value 0 is given because the macro-block concerned does probably not represent a portion of a moving object. Otherwise at least one of the neighboring binary motion indications would have had the value 1.
  • the fact that the binary motion indication BMI for the macro-block concerned is 1 instead of 0, which is the value of its neighbors, is probably due to random effects.
  • the value 0 to the filtered motion indication FMI for the macro-block concerned, it is prevented that the quantization parameter is adjusted as a result of random effects which is undesirable.
  • FIG. 8 c illustrates case 3 in which the binary motion indication BMI belonging to a certain macro-block is 0, but the binary motion indications belonging to the two following macro-blocks are 1.
  • the filtered motion indication FMI for the macro-block concerned is given the value 1.
  • a succession of two or more macro-blocks having binary motion indications which are 1, generally represents a moving object.
  • the moving object may need to be coded with much greater precision than the background surrounding it.
  • the coding precision cannot be increased to any given extent from one macro-block to a next macro-block.
  • a solution is to already increase the coding precision for the macro-block concerned which precedes the succession of two or more macro-blocks representing the moving object. As will be shown hereinafter, this is achieved by giving the value 1 to the filtered motion indication FMI for the macro-block concerned.
  • FIG. 8 d illustrates case 4 in which the binary motion indication BMI belonging to a certain macro-block is 0, but the binary motion indications belonging to the two preceding macro-blocks are 1.
  • the filtered motion indication FMI for the macro-block concerned is given the value 1.
  • the macro-block concerned is adjacent to a moving object.
  • the macro-block concerned in case 3 illustrated by FIG. 8 c is also adjacent to a moving object. Any of these macro-blocks may represent a part of a background which, in previously coded pictures, has been shielded off by the moving object. Consequently, any of these macro-blocks may constitute new information.
  • the macro-block constituting the new information is coded with relatively great precision in order to avoid so-called block effects. As will be shown hereinafter, this is achieved by giving the value 1 to the filtered motion indication FMI for the macro-blocks concerned in cases 3 and 4 illustrated by FIGS. 8 c and 8 d , respectively.
  • the picture-detail calculation PDC and the quantization parameter control QPC is not carried out if the average motion vector length AMVL is below the threshold value. If the average motion vector length AMVL is below the threshold value, a global value VAL_glob for the quantization parameter QP is applied for the quantization Q of each macro-block in the group. In any case, the global value VAL_glob is calculated before the coding of the current picture P(n) starts and is valid during the coding of the current picture P(n).
  • the global value VAL_glob for the quantization parameter QP may be calculated on the basis of the amount of data which has been obtained by coding the previous picture P(n ⁇ 1), or a portion thereof, the value of the quantization parameter QP which was then used, the bit rate of the MPEG data stream DS, and the amount of data contained in memory MEM7 which constitutes an output buffer for providing coded pictures in the form of a regular bit stream.
  • EP-A-0 670 663 (attorney's docket PHF 94,510) describes an example of such a quantization parameter calculation.
  • the motion compensation MCOMP reads from the motion vector map NIVM, the motion vector MV which belongs to the macro-block to be coded. It uses the motion vector MV to provide the similar block of 16-by-16 pixels contained in the coded-and-decoded version of the previous picture PCD(n ⁇ 1).
  • the subtraction SUB of the similar block of 16-by-16 pixels from the macro-block to be coded provides a prediction-error macro-block.
  • the discrete-cosine transformation DCT transforms the prediction-error macro-block into a block of DCT-coefficients.
  • the picture-detail calculation PDC provides a sum of the squared values in the prediction error macro-block which has been provided by the subtraction SUB. This sum is hereinafter referred to as the sum of absolute differences SAD. As mentioned hereinbefore, the picture-detail calculation PDC is carried out only if the average motion vector length AMVL is above the threshold value.
  • the quantization parameter control QPC which is carried out only if the average motion vector length AMVL is above the threshold value, first calculates a desired value VAL_des for the quantization parameter QP.
  • the delta value VAL_delta depends on the sum of absolute differences SAD, the motion vector length MVL, and the filtered motion indication FMI belonging to the macro-block to be coded.
  • the sum of absolute differences SAD is classified as low, average or high depending on its value.
  • the motion vector length MVL is classified as null, average or high depending on its value.
  • a macro-block will not be coded if the sum of absolute differences SAD is low and the filtered motion indication FMI has the value 0. If the latter conditions apply, the macro-block represents a rather static scene. Not coding the macro-block means that all DCT coefficients are set to zero which makes that the macro-block can be represented in the MPEG data stream DS with relatively few bits. This leaves more room, in terms of number of bits, for coding macro-blocks representing moving objects which contributes to the picture quality.
  • the quantization parameter control QPC establishes an adapted value VAL_adp for the quantization parameter QP.
  • the adapted value VAL_adp is equal to the desired value VAL_des if the following condition fulfilled. The difference between the desired value VAL_des and the adapted value applied for the previous macro-block, is less than 2. If this condition is not fulfilled, the adapted value VAL_adp is the adapted value applied for the previous macro-block plus 2 or minus 2, depending on whether the desired value VAL_des is above or below the previously adapted value, respectively.
  • the adapted value VAL_adp is not always automatically equal to the desired value VAL_des is because, in MPEG-coding, the quantization parameter's value is transmitted in a differential form. That is, a differential value is transmitted which is the difference between current quantization parameter's value and the previous quantization parameter's value.
  • the differential value must be in a range between ⁇ 2 and 2, boundaries included. Thus, any change in the quantization parameter's value between two adjacent macro-blocks, may not exceed 2.
  • the quantization Q divides each DCT-coefficient by the quantization parameter QP and by rounds off the result of this division to the nearest integer.
  • the quantization parameter QP is either given the adapted value VAL_adp, when the average motion vector length AMVL is above the threshold value, or the global value VAL_glob, when the latter condition is not fulfilled. Accordingly, the quantization Q provides a block of quantized DCT coefficients.
  • variable-length coding VLC transforms the block of quantized DCT-coefficients into variable length codes.
  • the variable length codes are written into the memory MEM7 and are later removed from the memory MEM7 so as to form part of the MPEG data stream DS.
  • a decoded version is of the just coded macro-block is established for storage in the memory MEM2.
  • the de-quantization DQ transforms the quantized block of DCT-coefficients into a de-quantized block of DCT-coefficients.
  • the inverse cosine transformation IDCT transforms the de-quantized block of DCT-coefficients into a-decoded error-prediction macro-block.
  • the memory MEM2 will contain the coded-and-decoded version of the current picture, which version will be used for coding a next picture P(n+1).
  • the MPEG video coder illustrated in FIG. 7 recaptures the features illustrated in FIGS. 1 to 6 .
  • the prediction-error macro-block provided by the subtractor SUB in the MPEG video coder illustrated in FIG. 7, constitutes data D which relates to the macro-block to be coded as indicated in FIG. 1.
  • the discrete cosine transformation DCT, the quantization Q in dependence on a quantization parameter QP, and the variable length coding VLC, constitute the compressor CMP as illustrated in FIG. 1.

Abstract

In coding a sequence of pictures (P), a motion vector (MV) is established for a block of pixels (B(*,*;n)) in a current picture (P(n)) to be coded. The motion vector (MV) indicates a block of pixels (B(*,*;n−1)) in a previous picture (P(n−1)) which is similar to the block of pixels (B(*,*;n)) in the current picture (P(n)) to be coded. Data (D) relating to the block of pixels (B(*,*;n)) in the current picture (P(n)), is compressed (CMP). The extent to which the data (D) is compressed depends on a compression parameter (CP). The compression parameter (CP) is controlled on the basis of the motion vector (MV). Accordingly, data relating to a block of pixels representing a moving object can be compressed to a relatively small extent, whereas data relating to a block of pixels representing a stationary object can be compressed to a relatively large extent. This allows a relatively good picture quality at a decoding end.

Description

    FIELD OF THE INVENTION
  • The invention relates to coding a sequence of pictures. The invention may be applied, for example, in a video coder operating in accordance with a standard fixed by the Moving Pictures Experts Group (MPEG). [0001]
  • BACKGROUND ART
  • Video coding in accordance with the MPEG-4 standard will comprise functions such as motion estimation, motion compensation, discrete-cosine transformation (DCT) and quantization. The motion estimation establishes a motion vector for a block of pixels in a current picture to be coded. The motion vector indicates a block of pixels in a previous picture which is similar to the block of pixels in the current picture to be coded. The motion compensation provides a prediction-error block which is the difference between the two aforementioned blocks of pixels. The prediction-error block undergoes a DCT. The result is a block of DCT coefficients. The quantization involves dividing each DCT coefficient by a quantization parameter and rounding off the result to the nearest integer. [0002]
  • In MPEG-4 information “Q2 Core Experiment: macroblock rate control”, ISO/IEC JTC1/SC29/WG11 MPEG96/M2159, April 1997, Bristol (UK), a proposal has been made for controlling the quantization parameter block-by-block. For each block of pixels, a sum of squared values in the prediction-error block is calculated. This sum is referred to as the sum of absolute differences (SAD). The quantization parameter is controlled on the basis of the SAD. [0003]
  • SUMMMARY OF THE INVENTION
  • It is an object of the invention to code of a sequence of pictures such that a relatively good picture quality can be obtained at a decoding end. [0004]
  • The invention takes the following aspects into consideration. The perceived picture quality at a decoding end is mainly determined by certain portions in a picture which a viewer regards as being “interesting”. In many cases, a viewer will focus on moving objects rather than on stationary objects. Thus, the precision with which moving objects are coded will greatly influence the perceived picture quality. [0005]
  • The invention builds on the following features. A motion vector is established indicating a block of pixels in a previous picture which is similar to a block of pixels in a current picture to be coded. Data relating to the block of pixels in the current picture is compressed, the extent to which the data is compressed depending on a compression parameter. These features are comprised in, for example, MPEG video-coding in which blocks of DCT coefficients constitute the data to be compressed, and the quantization parameter constitutes the compression parameter. [0006]
  • In accordance with the invention, the compression parameter is controlled on the basis of the motion vector. Accordingly, data relating to a block of pixels representing a moving object can be compressed to a relatively small extent, whereas data relating to a block of pixels representing a stationary object can be compressed to a relatively large extent. Such a motion-vector based control of the quantization parameter makes that a moving object is coded with a relatively great precision. Thus, the invention allows a relatively good picture quality at a decoding end. [0007]
  • It should be noted that there are two basic methods of controlling the compression parameter on the basis of the motion vector. In one basic method, a stationary camera provides a sequence of pictures to be coded. Then, a block of pixels representing a moving object will have a relatively large motion vector, and a block of pixels representing a background will have a relatively small motion vector. Consequently, data should be compressed to a relatively small extent if the motion vector is large and, conversely, data should be compressed to a relatively great extent if the motion vector is relatively small. In the other method, a camera that is tracking a moving object provides a sequence of pictures to be coded. Then, a block of pixels representing a moving object will have a relatively small motion vector, and a block of pixels representing the background will have a relatively large motion vector. Consequently, data should be compressed to a relatively small extent if the motion vector is relatively small and, conversely, it should be compressed to a relatively great extent if the motion vector is relatively large. [0008]
  • The invention and additional features, which may be optionally used to implement the invention to advantage, are apparent from and elucidated with reference to the drawings described hereinafter. [0009]
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a conceptual diagram illustrating basic features of the invention as claimed in [0010] claim 1;
  • FIGS. [0011] 2 to 6 are conceptual diagrams illustrating additional features as claimed in claims 2 to 6, respectively, and;
  • FIG. 7 is a functional block diagram of an example of an MPEG video coder in accordance with the invention. [0012]
  • FIGS [0013] 8 a to 8 d illustrate the operation of a filter in the MPEG video coder.
  • DETAILED DESCRIPTION OF THE DRAWINGS
  • First, some remarks will be made on the use of reference signs. Similar entities are denoted with an identical letter code throughout the drawings. In a single drawing, various similar entities may be shown. In that case, a suffix is added to the letter code, to distinguish similar entities from each other. The suffix will be between parentheses if the number of similar entities is a running parameter. In the description and the claims, any suffix in a reference sign may be omitted if this is appropriate. [0014]
  • FIG. 1 illustrates basic features of the invention. In coding a sequence of pictures P, a motion vector MV is established for a block of pixels B(*,*;n) in a current picture P(n) to be coded. The motion vector MV indicates a block of pixels B(*,*;n−1) in a previous picture P(n−1) which is similar to the block of pixels B(*,*;n) in the current picture P(n) to be coded. A data compressor CMP compresses data D relating to the block of pixels B(*,*;n) in the current picture P(n). The extent to which the data D is compressed depends on a compression parameter CP. A controller CON controls the compression parameter CP the basis of the motion vector MV. [0015]
  • The following aspects have been taken into consideration with regard to the basic features illustrated in FIG. 1. In many applications, the number of bits obtained by coding a sequence of pictures should preferably not exceed a certain limit. Stated otherwise, a certain number of bits is available for coding the sequence of pictures. It is beneficial to the picture quality, if a relatively great portion of the available bits is used for coding moving objects. The smaller the portion of the available bits which is used for coding stationary objects, the greater the portion which can be used for coding moving objects. Thus, it is desirable to prevent that data which relates to a block of pixels not representing a moving object, is compressed to a relatively small extent only. [0016]
  • Let it be assumed that the sequence of pictures P comprises a moving object. In many cases, the object will cover various blocks of pixels in each picture. If, in a particular picture, motion vectors are established for these blocks of pixels, these motions vectors will not be too different. Conversely, if a motion vector differs from its neighboring motion vectors to a relatively large extent, it is safe to assume that the block of pixels to which the motion vector belongs, does not represent a moving object or a portion thereof. [0017]
  • FIG. 2 illustrates the following features in addition to the features illustrated in FIG. 1. A motion indication MI is established for the block of pixels B(*,*;n) in the current picture P(n) on the basis of the motion vector MV belonging to the block of pixels B(*,*;n) and motion vectors MV+,MV− belonging to neighboring blocks of pixels. The controller CON controls the compression parameter CP in dependence on the motion indication MI. [0018]
  • The features illustrated in FIG. 2 provide the following advantageous effects. The motion indication MI allows a distinction between a block of pixels having a motion vector which is similar to its neighboring motion vectors, and a block of pixels having a motion vector which is different from its neighboring motion vectors. Consequently, the motion indication MI allows a distinction between a block of pixels which probably represents a portion of a moving object, and a block of pixels which does probably not represent a moving object, although its motion vector as such suggests the contrary. Since the compression parameter CP is controlled in dependence on the motion indication MI, it can be prevented that data relating to a block of pixels which does not represent a moving object, is compressed to a relatively small extent only. This allows a relatively great portion of the available bits to be used for coding moving objects. Thus, the features illustrated in FIG. 2 contribute to the picture quality. [0019]
  • The following aspects have also been taken into consideration with regard to the basic features illustrated in FIG. 1. It may happen that a sequence of pictures, or a portion thereof, represents a scene which is rather static. In that case, most of the motion vectors in a picture will be due to random effects, rather than to moving objects. In other words, most of the motion vectors constitute noise. If the compression parameter CP were controlled in dependence on such motion vectors, its value would vary block-by-block in a random manner. Such a random variation of the compression parameter CP will generally have an adverse effect on the picture quality at a decoding end. This is all the more true because, if the compression parameter CP is adjusted in value, this adjustment has to be coded which requires a certain number of bits that otherwise might be used for coding the picture itself. [0020]
  • FIG. 3 illustrates the following features in addition to the features illustrated in FIG. 1. An average length AL of motion vectors MV belonging to a group GRP of blocks of pixels, is calculated. If the average length AL is below a threshold TH, the controller CON maintains the compression parameter CP at a global value VAL_glob for compressing data D relating to the group GRP of blocks of pixels. [0021]
  • The features illustrated in FIG. 3 provide the following advantageous effects. If the average length AL of the motion vectors MV is below the threshold TH, it is likely the that the group GRP of blocks of pixels represents a scene which is rather static. Since, in that case, the compression parameter CP is maintained at the global value VAL_glob, and thus is not adjusted block-by-block, it is prevented that random effects influence the compression of data D. Thus, the features illustrated in FIG. 3 contribute to picture quality. [0022]
  • The following aspects have also been taken into consideration with regard to the basic features illustrated in FIG. 1. Human vision has a resolution which is greater in a horizontal and a vertical direction than in a diagonal direction. A viewer will therefore be able to distinguish finer details of an object if it substantially moves in horizontal or vertical direction than if it moves in a diagonal direction. [0023]
  • FIG. 4 illustrates the following features in addition to the features which have been discussed with reference to FIG. 1. The controller CON controls the compression parameter CP on the basis of the length L and the orientation O of the motion vector MV. The features illustrated in FIG. 4 provide the following advantageous effects. Since the compression parameter CP is controlled not only on the basis of the length L of the motion vector MV, but also on the basis of its orientation O, data can be compressed to a greater or to a smaller extent depending on whether it represents an object moving in a substantially diagonal direction, or an object moving in a substantially horizontal or vertical direction, respectively. Thus, the data compression can be controlled in accordance with the properties of human vision so as to make an efficient use of the available bits. Consequently, the features illustrated in FIG. 4 contribute to the picture quality. [0024]
  • The following aspects have also been taken into consideration with regard to the features illustrated in FIG. 1. It has been explained hereinbefore that the picture quality is substantially influenced by the extent to which ‘interesting’ blocks of pixels are compressed. A block of pixels representing a moving object is ‘interesting’. However, a block of pixels comprising many (new) details is also ‘interesting’. [0025]
  • FIG. 5 illustrates the following features in addition to the features illustrated in FIG. 1. A picture-detail indication PDI is established for the block of pixels B(*,*;n) to be coded. The compression parameter CP is controlled on the basis of the motion vector MV and the picture-detail indication PDI. [0026]
  • The additional features illustrated in FIG. 5 provide the following advantageous effects. The motion vector MV and the picture-detail indication PDI, in combination, provide a relatively precise indication of whether the block of pixels is ‘interesting’ and, if so, to what extent. Since the compression parameter CP is controlled on the basis of this combination, a close correspondence can be achieved between the extent to which the block of pixels is interesting, and the extent to which the data relating to it is compressed. Thus, the FIG. 5 features contribute to the picture quality. [0027]
  • The following aspects have also been taken into consideration with regard to the features illustrated in FIG. 1. In order to achieve a sufficiently homogeneous picture quality, it is desirable that the extent to which the data D is compressed, is substantially independent from the position of the current picture P(n) to be coded in the sequence of pictures, and is substantially independent from the position of the blocks of pixels B(*,*;n) in the picture itself. If the compression parameter CP is adjusted on a block-by-block basis, there is a potential risk that, on average, its value gradually increases or decreases throughout the coding of a picture, without this being necessitated by the motion vectors concerned. In other words, there is a risk that there is a certain drift in the value of the compression parameter CP. Such a drift will generally adversely affect the picture quality. [0028]
  • FIG. 6 illustrates the following features in addition to the features illustrated in FIG. 1. A calculator CAL1 calculates a global value VAL_glob for the compression parameter CP. The global value VAL_glob is valid for all blocks of pixels in the current picture P(n). A calculator CAL2 calculates a delta value VAL_delta for the compression parameter CP on the basis of the motion vector MV. The controller CON controls the compression parameter CP in dependence on a sum of the global value VAL_glob and the delta value VAL_delta. [0029]
  • The features illustrated in FIG. 6 provide the following advantageous effects. The compression parameter CP will have a value which may fluctuate but which, on average, will be substantially equal to the global value VAL_glob for the picture concerned. This prevents the compression parameter CP from drifting away in value throughout the coding of the picture and, consequently, it ensures a sufficiently homogeneous picture quality. Thus, the features illustrated in FIG. 6 contribute to the picture quality. [0030]
  • FIG. 7 illustrates an example of an MPEG video coder which recaptures the features illustrated in FIGS. [0031] 1-6 and discussed hereinbefore. The MPEG video coder provides an MPEG data stream DS representing a sequence of pictures P.
  • FIG. 7 is a functional block schematic diagram in which a block having the letter F in its interior represents a data processing function. The MPEG video coder illustrated in FIG. 7 comprises the following data processing functions: motion estimation MEST, vector length calculation VLEC, average calculation AVC, motion detection MDET, filtering FIL, motion compensation MCOMP, subtraction SUB, picture-detail calculation PDC, discrete cosine transformation DCT, quantization parameter control QPC, quantization Q in dependence on a quantization parameter QP, variable length coding VLC, de-quantization DQ, inverse discrete cosine transformation IDCT, and addition ADD. The other blocks represent memories MEM for storing data to be processed or data which has been processed, or both. A reference sign in the interior of such a block indicates which type of data is stored in the memory MEM. Memory MEM1 contains a current picture P(n) which has to be coded. Memory MEM2 contains a coded-and-decoded version of a previous picture PCD(n−1). The current picture P(n) as well as the coded-and decoded version of the previous picture PCD(n−1), comprise macro-blocks of 16 by 16 pixels. [0032]
  • FIG. 7 comprises two main parts: a part I and a part II. The part I comprises data processing functions which constitute a preparation for coding a group of macro-blocks in the current picture P(n). The part II comprises data processing functions which constitute the actual coding of each macro-block. That is, the processing functions in part I are first applied to the group of macro-blocks in the current picture P(n), after which the processing functions in part II are applied to each individual macro-block. [0033]
  • The processing functions comprised in the part I will now be discussed in greater detail. For each macro-block in the group, the motion estimation MEST searches a similar block of 16-by-16 pixels in the coded-and-decoded version of the previous picture PCD(n−1). The result of this search is a motion vector MV which indicates the similar block found in the search. For example, the motion vector MV may comprise x,y coordinates -1,2 indicating that the similar block of 16-by-16 pixels in the coded-and decoded version of the previous picture PCD(n−1), is one position to the left and two position higher with respect to the macro-block in the current picture P(n) for which the search has been carried out. The motion vectors MV thus obtained are stored in memory MEM3. As a result, the memory MEM3 will comprise a motion vector map MVM for the group of macro-blocks for which the motion estimation MEST has been carried out. The motion vector map MVM is in fact a matrix, each element of which is a motion vector MV for a different macro-block. [0034]
  • The vector length calculation VLEC provides a motion vector length MVL for each motion vector in the motion vector map MVM. Accordingly, a motion vector length map MVLM is obtained for the group of macro-blocks concerned. The motion vector length MVL is calculated as follows: MVL=¦MVx¦+¦MVy¦+¦¦MVx¦−¦Mvy¦¦ with MVx and MVy being the x and the y component, respectively, of the motion vector MV. The motion vector length MVL thus calculated does generally not correspond to the real length of the motion vector MV. For example, let it be assumed that ¦MVx¦=0 and that ¦MVy¦=1. In that case MVL=2, whereas the real motion vector length is 1. If, however, ¦MVx¦=1 and ¦Mvy=1¦¦, MVL=2 which is indeed the real motion vector length. Generally, the motion vector length MVL calculated will exceed the real motion vector length by an amount which increases as the motion vector's orientation is more horizontal or vertical than diagonal. Thus, a greater weight is given to motion in a horizontal or a vertical direction than to motion in a diagonal direction. This corresponds to human visual perception in the sense that the eyes' resolution is not isotropic: the eyes are able to distinguish higher frequencies in a horizontal or vertical direction than in a diagonal direction. [0035]
  • The average calculation AVC calculates an average of the motion vector lengths MVL contained in the motion vector length map MVLM. As a result, an average motion vector length AMVL is obtained. The average motion vector length AMVL is compared with a threshold value, for example 2. The result of this comparison has an impact on the data processing functions comprised in part II as will be discussed hereinafter. [0036]
  • The motion detection MDET provides a binary motion indication BMI for each motion vector length MVL contained in the motion vector length map MVLM. It does so by comparing the motion vector length MVL with a threshold value. If the motion vector length MVL is above the threshold value, the binary motion indication BMI has the [0037] value 1. If not, the binary motion indication BMI has the value 0. Accordingly, a binary motion indication map BMIM is obtained for the group of macro-blocks concerned, in which each binary motion indication BMI is associated with a different macro-block and indicates whether the macro-block represents a moving object, or rather a portion thereof, or not.
  • The filtering FIL compares each binary motion indication BMI with one or more of its neighboring binary motion indications and provides a filtered motion indication FMI in dependence on the outcome of this comparison. Accordingly, a filtered motion indication map FMIM is obtained. In any of the following [0038] cases 1, 2, 3, and 4, the filtered motion indication FMI will have a value which is different from the corresponding binary motion indication BMI. In any other case, the filtered motion indication FMI will have a value which is identical to the value of the corresponding binary motion indication BMI. FIG. 8a illustrates case 1 in which the binary motion indication BMI belonging to a certain macro-block is 0, but the binary motion indication belonging to the preceding macro-block and that belonging to the following macro-block are both 1. In this case, the filtered motion indication FMI for the macro-block in question is given the value 2. The value 2 indicates that, although the motion vector length is below the threshold value, the macro-block probably represents a portion of a moving object. The moving object should, in its entirety, be coded with greater precision than stationary objects. As will be shown hereinafter, this is achieved by giving the value 2 to the filtered motion indication FMI for the macro-block concerned.
  • An example of a macro-block for which the filtered motion indication FMI may be 2, is a macro-block representing a cheek of a person who is talking in a videophone picture sequence. The cheek is a moving object and, consequently, most of the macro-blocks representing it will have a binary motion indication BMI which is 1. However, the middle of the cheek will generally move to a relatively little extent. Accordingly, the binary motion indication BMI for a macro-block in the middle of the cheek may be 0, whereas the binary motion indications for neighboring macro-blocks are 1. In that case, the filtered motion indication FMI for the macro-block in the middle of the cheek will be 2. It will hereinafter be shown that, as a result, the entire cheek is coded with sufficient precision which contributes to the perceived overall picture quality. [0039]
  • FIG. 8[0040] b illustrates case 2 in which the binary motion indication BMI belonging to a certain macro-block is 1, but the binary motion indication belonging to the preceding macro-block and that belonging to the next macro-block are both 0. In this case, the filtered motion indication FMI for the macro-block concerned is given the value 0. The value 0 is given because the macro-block concerned does probably not represent a portion of a moving object. Otherwise at least one of the neighboring binary motion indications would have had the value 1. The fact that the binary motion indication BMI for the macro-block concerned is 1 instead of 0, which is the value of its neighbors, is probably due to random effects. As will be shown hereinafter, by giving the value 0 to the filtered motion indication FMI for the macro-block concerned, it is prevented that the quantization parameter is adjusted as a result of random effects which is undesirable.
  • FIG. 8[0041] c illustrates case 3 in which the binary motion indication BMI belonging to a certain macro-block is 0, but the binary motion indications belonging to the two following macro-blocks are 1. In this case, the filtered motion indication FMI for the macro-block concerned is given the value 1. A succession of two or more macro-blocks having binary motion indications which are 1, generally represents a moving object. The moving object may need to be coded with much greater precision than the background surrounding it. However, in MPEG-coding, the coding precision cannot be increased to any given extent from one macro-block to a next macro-block. A solution is to already increase the coding precision for the macro-block concerned which precedes the succession of two or more macro-blocks representing the moving object. As will be shown hereinafter, this is achieved by giving the value 1 to the filtered motion indication FMI for the macro-block concerned.
  • FIG. 8[0042] d illustrates case 4 in which the binary motion indication BMI belonging to a certain macro-block is 0, but the binary motion indications belonging to the two preceding macro-blocks are 1. In this case, the filtered motion indication FMI for the macro-block concerned is given the value 1. The macro-block concerned is adjacent to a moving object. The macro-block concerned in case 3 illustrated by FIG. 8c, is also adjacent to a moving object. Any of these macro-blocks may represent a part of a background which, in previously coded pictures, has been shielded off by the moving object. Consequently, any of these macro-blocks may constitute new information. If this is so, there will generally be no macro-block in the previous picture which is substantially similar to the macro-block constituting the new information. For that reason, it is preferable that the macro-block constituting the new information is coded with relatively great precision in order to avoid so-called block effects. As will be shown hereinafter, this is achieved by giving the value 1 to the filtered motion indication FMI for the macro-blocks concerned in cases 3 and 4 illustrated by FIGS. 8c and 8 d, respectively.
  • The processing functions comprised in the part II will now be discussed in greater detail. As mentioned hereinbefore, these processing functions are carried out for each individual macro-block instead of for a group of macro-blocks as the processing functions comprised in part I. [0043]
  • The picture-detail calculation PDC and the quantization parameter control QPC is not carried out if the average motion vector length AMVL is below the threshold value. If the average motion vector length AMVL is below the threshold value, a global value VAL_glob for the quantization parameter QP is applied for the quantization Q of each macro-block in the group. In any case, the global value VAL_glob is calculated before the coding of the current picture P(n) starts and is valid during the coding of the current picture P(n). The global value VAL_glob for the quantization parameter QP may be calculated on the basis of the amount of data which has been obtained by coding the previous picture P(n−1), or a portion thereof, the value of the quantization parameter QP which was then used, the bit rate of the MPEG data stream DS, and the amount of data contained in memory MEM7 which constitutes an output buffer for providing coded pictures in the form of a regular bit stream. EP-A-0 670 663 (attorney's docket PHF 94,510) describes an example of such a quantization parameter calculation. [0044]
  • The motion compensation MCOMP reads from the motion vector map NIVM, the motion vector MV which belongs to the macro-block to be coded. It uses the motion vector MV to provide the similar block of 16-by-16 pixels contained in the coded-and-decoded version of the previous picture PCD(n−1). The subtraction SUB of the similar block of 16-by-16 pixels from the macro-block to be coded, provides a prediction-error macro-block. The discrete-cosine transformation DCT transforms the prediction-error macro-block into a block of DCT-coefficients. The picture-detail calculation PDC provides a sum of the squared values in the prediction error macro-block which has been provided by the subtraction SUB. This sum is hereinafter referred to as the sum of absolute differences SAD. As mentioned hereinbefore, the picture-detail calculation PDC is carried out only if the average motion vector length AMVL is above the threshold value. [0045]
  • The quantization parameter control QPC, which is carried out only if the average motion vector length AMVL is above the threshold value, first calculates a desired value VAL_des for the quantization parameter QP. The desired value VAL_des is the sum of the global value VAL_glob and a delta value VAL_delta: VAL_des=VAL_glob+VAL_delta. The delta value VAL_delta depends on the sum of absolute differences SAD, the motion vector length MVL, and the filtered motion indication FMI belonging to the macro-block to be coded. The sum of absolute differences SAD is classified as low, average or high depending on its value. Similarly, the motion vector length MVL is classified as null, average or high depending on its value. The table below shows which delta value VAL_delta applies. [0046]
    FMI = 1 & MVL =
    FMI = 0 average; or FMI = 2 FMI = 1 & MVL = high
    Low SAD +range_up 0 −int_down
    Average +int_up −range_down −range_down
    SAD
    High SAD −int_down −range_down −range_down
  • A macro-block will not be coded if the sum of absolute differences SAD is low and the filtered motion indication FMI has the [0047] value 0. If the latter conditions apply, the macro-block represents a rather static scene. Not coding the macro-block means that all DCT coefficients are set to zero which makes that the macro-block can be represented in the MPEG data stream DS with relatively few bits. This leaves more room, in terms of number of bits, for coding macro-blocks representing moving objects which contributes to the picture quality.
  • Once the desired value VAL_des for the quantization parameter QP has been calculated, the quantization parameter control QPC establishes an adapted value VAL_adp for the quantization parameter QP. The adapted value VAL_adp is equal to the desired value VAL_des if the following condition fulfilled. The difference between the desired value VAL_des and the adapted value applied for the previous macro-block, is less than 2. If this condition is not fulfilled, the adapted value VAL_adp is the adapted value applied for the previous macro-block plus 2 or [0048] minus 2, depending on whether the desired value VAL_des is above or below the previously adapted value, respectively.
  • The fact that the adapted value VAL_adp is not always automatically equal to the desired value VAL_des is because, in MPEG-coding, the quantization parameter's value is transmitted in a differential form. That is, a differential value is transmitted which is the difference between current quantization parameter's value and the previous quantization parameter's value. The differential value must be in a range between −2 and 2, boundaries included. Thus, any change in the quantization parameter's value between two adjacent macro-blocks, may not exceed 2. [0049]
  • The quantization Q divides each DCT-coefficient by the quantization parameter QP and by rounds off the result of this division to the nearest integer. The quantization parameter QP is either given the adapted value VAL_adp, when the average motion vector length AMVL is above the threshold value, or the global value VAL_glob, when the latter condition is not fulfilled. Accordingly, the quantization Q provides a block of quantized DCT coefficients. [0050]
  • The variable-length coding VLC transforms the block of quantized DCT-coefficients into variable length codes. The variable length codes are written into the memory MEM7 and are later removed from the memory MEM7 so as to form part of the MPEG data stream DS. [0051]
  • Finally, a decoded version is of the just coded macro-block is established for storage in the memory MEM2. To that end, the de-quantization DQ transforms the quantized block of DCT-coefficients into a de-quantized block of DCT-coefficients. The inverse cosine transformation IDCT transforms the de-quantized block of DCT-coefficients into a-decoded error-prediction macro-block. The addition ADD of this decoded error-prediction macro-block and the similar block of 16-by-16 pixels, which has been provided by the motion compensation MCOMP discussed hereinbefore, results in a decoded version of the macro-block just coded. Thus, when all the macro-blocks of the current picture P(n) have been coded, the memory MEM2 will contain the coded-and-decoded version of the current picture, which version will be used for coding a next picture P(n+1). [0052]
  • As mentioned hereinbefore, the MPEG video coder illustrated in FIG. 7 recaptures the features illustrated in FIGS. [0053] 1 to 6. With regard to the features illustrated in FIG. 1, the following is noted. The prediction-error macro-block provided by the subtractor SUB in the MPEG video coder illustrated in FIG. 7, constitutes data D which relates to the macro-block to be coded as indicated in FIG. 1. The discrete cosine transformation DCT, the quantization Q in dependence on a quantization parameter QP, and the variable length coding VLC, constitute the compressor CMP as illustrated in FIG. 1.
  • The drawings and their description hereinbefore illustrate rather than limit the invention. It will be evident that there are numerous alternatives which fall within the scope of the appended claims. In this respect, the following closing remarks are made. [0054]
  • There are numerous ways of physically spreading functions or functional elements over various units. In this respect, the drawings are very diagrammatic, each representing only one possible embodiment of the invention. Thus, although a drawing shows different functional elements as different blocks, this no by means excludes that some functional elements, or all functional elements, may be implemented as a single physical unit. [0055]

Claims (7)

1. A method of coding a sequence of pictures (P), the method comprising the steps of:
establishing a motion vector (MV) indicating a block of pixels (B(*,*;n−1)) in a previous picture (P(n−1)) which is similar to a block of pixels (B(*,*;n) in a current picture P(n) to be coded; and
compressing data (D) relating to the block of pixels B(*,*;n) in the current picture P(n), the extent to which the data (D) is compressed depending on a compression parameter (CP);
characterized in that the method comprises the step of:
controlling the compression parameter (CP) on the basis of the motion vector (MV).
2. A method as claimed in claim 1, characterized in that the method comprises the steps of:
establishing a motion indication (MI) for the block of pixels (B(*,*;n)) in the current picture P(n) on the basis of the motion vector (MV) belonging to the block of pixels (B(*,*;n) and motion vectors (MV+,MV−) belonging to neighboring blocks of pixels; and
controlling the compression parameter (CP) in dependence on the motion indication (MI).
3. A method as claimed in claim 1, characterized in that it comprises the steps of:
calculating an average length (AL) of motion vectors (MV) belonging to a group (GRP) of blocks of pixels; and
maintaining the compression parameter (CP) at a global value (VAL_glob) for compressing data (D) relating to the group (GRP) of blocks of pixels, if the average length (AL) is below a threshold (TH).
4. A method as claimed in claim 1, characterized in that it comprises the step of:
controlling the compression parameter (CP) on the basis of the length (L) and the orientation (O) of the motion vector (MV).
5. A method as claimed in claim 1, characterized in that it comprises the steps of:
establishing a picture-detail indication (PDI) for the block of pixels B(*,*;n) to be coded; and
controlling the compression parameter (CP) on the basis of the motion vector (MV) and the picture-detail indication (PDI) of the data (D) to be compressed.
6. A method as claimed in claim 1, characterized in that it comprises the steps of:
calculating a global value (VAL_glob) for the compression parameter (CP) which is valid for all blocks of pixels in the current picture P(n);
calculating a delta value (VAL_delta) for the compression parameter (CP) on the basis of the motion vector (MV); and
controlling the compression parameter (CP) in dependence on a sum of the global value (VAL_glob) and the delta value (VAL_delta).
7. A video coder for coding a sequence of pictures (P), the video coder comprising:
a motion estimator for establishing a motion vector (MV) indicating a block of pixels (B(*,*;n−1)) in a previous picture (P(n−1)) which is similar to a block of pixels (B(*,*;n) in a current picture P(n) to be coded; and
a data compressor for compressing data (D) relating to the block of pixels B(*,*;n) in the current picture P(n), the extent to which the data (D) is compressed depending on a compression parameter (CP);
characterized in that the video coder comprises:
a controller for controlling the compression parameter (CP) on the basis of the motion vector (MV).
US09/976,226 1997-11-07 2001-10-12 Coding a sequence of pictures Abandoned US20020031181A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US09/976,226 US20020031181A1 (en) 1997-11-07 2001-10-12 Coding a sequence of pictures

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
EP97402681.7 1997-11-07
EP97402681 1997-11-07
US09/186,601 US6363115B1 (en) 1997-11-07 1998-11-05 Coding a sequence of pictures
US09/976,226 US20020031181A1 (en) 1997-11-07 2001-10-12 Coding a sequence of pictures

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US09/186,601 Continuation US6363115B1 (en) 1997-11-07 1998-11-05 Coding a sequence of pictures

Publications (1)

Publication Number Publication Date
US20020031181A1 true US20020031181A1 (en) 2002-03-14

Family

ID=8229897

Family Applications (2)

Application Number Title Priority Date Filing Date
US09/186,601 Expired - Fee Related US6363115B1 (en) 1997-11-07 1998-11-05 Coding a sequence of pictures
US09/976,226 Abandoned US20020031181A1 (en) 1997-11-07 2001-10-12 Coding a sequence of pictures

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US09/186,601 Expired - Fee Related US6363115B1 (en) 1997-11-07 1998-11-05 Coding a sequence of pictures

Country Status (7)

Country Link
US (2) US6363115B1 (en)
EP (1) EP0951780B1 (en)
JP (1) JP2001507908A (en)
KR (1) KR100590436B1 (en)
CN (1) CN1164122C (en)
DE (1) DE69828950T2 (en)
WO (1) WO1999025122A2 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040228404A1 (en) * 2003-05-12 2004-11-18 Lg Electronics Inc. Moving picture coding method
US20040228403A1 (en) * 2003-05-12 2004-11-18 Lg Electronics Inc. Moving picture coding method
US20090190038A1 (en) * 2008-01-25 2009-07-30 Mediatek Inc. Method, video encoder, and integrated circuit for detecting non-rigid body motion
US20100086022A1 (en) * 2004-06-03 2010-04-08 Hillcrest Laboratories, Inc. Client-Server Architectures and Methods for Zoomable User Interfaces
US9236774B2 (en) 2011-03-09 2016-01-12 Yamaha Motor Electronics Co., Ltd. Stator-fixed structure and driving unit
US9380259B2 (en) 2011-12-22 2016-06-28 Fujitsu Limited Information processing device and information processing method
US11620269B2 (en) * 2020-05-29 2023-04-04 EMC IP Holding Company LLC Method, electronic device, and computer program product for data indexing

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6731818B1 (en) * 1999-06-30 2004-05-04 Realnetworks, Inc. System and method for generating video frames
US6735338B1 (en) 1999-06-30 2004-05-11 Realnetworks, Inc. System and method for generating video frames and detecting text
US6760378B1 (en) 1999-06-30 2004-07-06 Realnetworks, Inc. System and method for generating video frames and correcting motion
US6753865B1 (en) 1999-06-30 2004-06-22 Realnetworks, Inc. System and method for generating video frames and post filtering
FI116819B (en) 2000-01-21 2006-02-28 Nokia Corp Procedure for transferring images and an image encoder
GB2362533A (en) * 2000-05-15 2001-11-21 Nokia Mobile Phones Ltd Encoding a video signal with an indicator of the type of error concealment used
US7113545B2 (en) * 2000-08-28 2006-09-26 Thomson Licensing Method and apparatus for motion compensated temporal interpolation of video sequences
US6765964B1 (en) 2000-12-06 2004-07-20 Realnetworks, Inc. System and method for intracoding video data
US6891889B2 (en) * 2001-09-05 2005-05-10 Intel Corporation Signal to noise ratio optimization for video compression bit-rate control
CN101626480B (en) * 2004-08-17 2011-04-13 松下电器产业株式会社 Image encoding device, and image decoding device
US7587091B2 (en) * 2004-10-29 2009-09-08 Intel Corporation De-interlacing using decoder parameters
FR2947134A1 (en) * 2009-06-23 2010-12-24 France Telecom METHODS OF ENCODING AND DECODING IMAGES, CODING AND DECODING DEVICES, DATA STREAMS AND CORRESPONDING COMPUTER PROGRAM.
US9654792B2 (en) 2009-07-03 2017-05-16 Intel Corporation Methods and systems for motion vector derivation at a video decoder
JP5272940B2 (en) * 2009-07-17 2013-08-28 富士通株式会社 Image encoding device

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH04137983A (en) * 1990-09-28 1992-05-12 Nippon Steel Corp Method and device for decoding moving picture
JP2587534B2 (en) * 1990-11-09 1997-03-05 シャープ株式会社 Image coding device
SE469866B (en) * 1991-04-12 1993-09-27 Dv Sweden Ab Method for estimating motion content in video signals
JPH05183892A (en) * 1991-12-27 1993-07-23 Sony Corp Image processor
JPH05111015A (en) * 1991-10-17 1993-04-30 Sony Corp Movement adaptive image encoder
JP3219205B2 (en) * 1992-02-17 2001-10-15 ソニー株式会社 Image data processing device
JPH0630402A (en) * 1992-07-13 1994-02-04 Canon Inc Image coding device
JPH0646401A (en) * 1992-07-24 1994-02-18 Casio Comput Co Ltd Image/animation image compressing device
DE69431226T2 (en) * 1993-09-28 2003-04-17 Canon Kk Image display device
US5473379A (en) * 1993-11-04 1995-12-05 At&T Corp. Method and apparatus for improving motion compensation in digital video coding
US6052414A (en) * 1994-03-30 2000-04-18 Samsung Electronics, Co. Ltd. Moving picture coding method and apparatus for low bit rate systems using dynamic motion estimation
JP3622226B2 (en) * 1994-04-13 2005-02-23 ソニー株式会社 Encoder
JP3265818B2 (en) * 1994-04-14 2002-03-18 松下電器産業株式会社 Video encoding method
JPH07322267A (en) * 1994-05-25 1995-12-08 Nippon Telegr & Teleph Corp <Ntt> Image signal coder
KR0148154B1 (en) * 1994-06-15 1998-09-15 김광호 Coding method and apparatus with motion dimensions
FR2725577B1 (en) * 1994-10-10 1996-11-29 Thomson Consumer Electronics CODING OR DECODING METHOD OF MOTION VECTORS AND CODING OR DECODING DEVICE USING THE SAME
US5627601A (en) * 1994-11-30 1997-05-06 National Semiconductor Corporation Motion estimation with bit rate criterion
JP3707118B2 (en) * 1995-04-28 2005-10-19 ソニー株式会社 Image coding method and apparatus
JP4223571B2 (en) * 1995-05-02 2009-02-12 ソニー株式会社 Image coding method and apparatus
US5745183A (en) * 1995-08-25 1998-04-28 Thomson Consumer Electronics, Inc. Image motion estimation system which derives candidate block from interpolated motion vectors
JP3183155B2 (en) * 1996-03-18 2001-07-03 株式会社日立製作所 Image decoding apparatus and image decoding method
US6272178B1 (en) * 1996-04-18 2001-08-07 Nokia Mobile Phones Ltd. Video data encoder and decoder

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040228404A1 (en) * 2003-05-12 2004-11-18 Lg Electronics Inc. Moving picture coding method
US20040228403A1 (en) * 2003-05-12 2004-11-18 Lg Electronics Inc. Moving picture coding method
US20100086022A1 (en) * 2004-06-03 2010-04-08 Hillcrest Laboratories, Inc. Client-Server Architectures and Methods for Zoomable User Interfaces
US20090190038A1 (en) * 2008-01-25 2009-07-30 Mediatek Inc. Method, video encoder, and integrated circuit for detecting non-rigid body motion
US8208552B2 (en) * 2008-01-25 2012-06-26 Mediatek Inc. Method, video encoder, and integrated circuit for detecting non-rigid body motion
US9236774B2 (en) 2011-03-09 2016-01-12 Yamaha Motor Electronics Co., Ltd. Stator-fixed structure and driving unit
US9380259B2 (en) 2011-12-22 2016-06-28 Fujitsu Limited Information processing device and information processing method
US11620269B2 (en) * 2020-05-29 2023-04-04 EMC IP Holding Company LLC Method, electronic device, and computer program product for data indexing

Also Published As

Publication number Publication date
CN1164122C (en) 2004-08-25
WO1999025122A2 (en) 1999-05-20
JP2001507908A (en) 2001-06-12
DE69828950T2 (en) 2006-01-12
KR20000069913A (en) 2000-11-25
EP0951780B1 (en) 2005-02-09
KR100590436B1 (en) 2006-06-19
WO1999025122A3 (en) 1999-07-15
DE69828950D1 (en) 2005-03-17
US6363115B1 (en) 2002-03-26
CN1249881A (en) 2000-04-05
EP0951780A2 (en) 1999-10-27

Similar Documents

Publication Publication Date Title
US6363115B1 (en) Coding a sequence of pictures
US5808700A (en) Motion video coding systems with motion vector detection
US5963673A (en) Method and apparatus for adaptively selecting a coding mode for video encoding
EP2259596B1 (en) Adaptive filtering based upon boundary strength
US7233706B1 (en) Method of reducing a blocking artifact when coding moving picture
EP1246131B1 (en) Method and apparatus for the reduction of artifact in decompressed images using post-filtering
EP0560577B1 (en) Video compression coding and decoding with automatic sub-pixel frame/field motion compensation
CA2218865A1 (en) Hybrid hierarchical/full-search mpeg encoder motion estimation
KR100415494B1 (en) Image encoding method and apparatus, recording apparatus, video signal encoding apparatus, processing apparatus and method, video data processing apparatus and method
US20060140277A1 (en) Method of decoding digital video and digital video decoder system thereof
EP1819173B1 (en) Motion vector predictive encoding apparatus and decoding apparatus
US6754270B1 (en) Encoding high-definition video using overlapping panels
JPH0984024A (en) Coder for moving image signal
JP3382292B2 (en) Image encoding apparatus and method
JP4359273B2 (en) Coding mode selection method
Grotz et al. A 64 kbit/s videophone codec with forward analysis and control
KR0122744B1 (en) Apparatus for cording picture by adaptive processing
Choi et al. Adaptive image quantization using total variation classification
JP4269405B2 (en) Encoding apparatus and method
JP2004328799A (en) Motion picture coding apparatus and motion picture coding method
JP4353928B2 (en) Data compression method, recording method, and transmission method
JP4359274B2 (en) Video compression encoding device
Chu et al. Reduction of blocking artifacts using motion-compensated spatial-temporal filtering
Hong et al. An efficient real time algorithm to simultaneously reduce blocking and ringing artifacts of compressed video
WO1999059342A1 (en) Method and system for mpeg-2 encoding with frame partitioning

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- AFTER EXAMINER'S ANSWER OR BOARD OF APPEALS DECISION