WO2000022833A1 - Motion vector detection with local motion estimator - Google Patents

Motion vector detection with local motion estimator Download PDF

Info

Publication number
WO2000022833A1
WO2000022833A1 PCT/SG1998/000085 SG9800085W WO0022833A1 WO 2000022833 A1 WO2000022833 A1 WO 2000022833A1 SG 9800085 W SG9800085 W SG 9800085W WO 0022833 A1 WO0022833 A1 WO 0022833A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
motion vectors
motion vector
data blocks
local motion
Prior art date
Application number
PCT/SG1998/000085
Other languages
French (fr)
Inventor
Yau Wai Lucas Hui
Hong Lye Oh
Jean-Michel Bard
Original Assignee
Stmicroelectronics Asia Pacific Pte Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Stmicroelectronics Asia Pacific Pte Ltd filed Critical Stmicroelectronics Asia Pacific Pte Ltd
Priority to PCT/SG1998/000085 priority Critical patent/WO2000022833A1/en
Priority to EP98951906A priority patent/EP1119975B1/en
Priority to US09/807,497 priority patent/US6876702B1/en
Priority to DE69813911T priority patent/DE69813911T2/en
Publication of WO2000022833A1 publication Critical patent/WO2000022833A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation

Definitions

  • the present invention relates to a method and apparatus for encoding moving pictures sequences.
  • the present invention relates to a method and apparatus for motion estimation and motion compensation in a video signal compression system.
  • Motion estimation/compensation, transform coding, and statistical coding are utilized to efficiently compress the input video sequence.
  • each picture from the input video sequence is partitioned into rows of smaller and non-overlapping macroblocks of picture elements (pixels). Macroblocks in each row may be grouped into one or more slices.
  • the compression is performed on each macroblock on a row-by-row basis starting from the leftmost macroblock to the rightmost macroblock, and the top row to the bottom row.
  • motion vectors are detected for each macroblock in a picture.
  • the coding mode for a macroblock e.g. intra-coded, forward- predicted, backward-predicted, or interpolated
  • the utilized motion vectors are differentially coded with variable length codes before outputting.
  • a typical motion vector detection process comprises determining, for each macroblock to be coded, a search window consisting of pixels from a reference picture and matching pixel vales of the macroblocks to blocks of pixel values obtained from the search window. This process is known to be computationally intensive. Particularly, the size of the search window has a direct impact to the computation load.
  • a search method may be selected based on its performance in terms of accuracy and computation complexity.
  • the telescopic search method comprises the steps of obtaining a motion vector from a co-sited macroblock from a closest coded picture; optional scaling of the obtained motion vector according to the picture distances between the reference picture, the closest coded picture, and the current picture; and defining the search window based on the centre position of the current macroblock plus an offset defined by the scaled motion vector.
  • the methods disclosed therein comprise the steps of calculating a global motion vector based on the motion vectors of a previous picture, and offsetting search windows of all macroblocks by the calculated global motion vector.
  • the global motion vector may be determined by the mean or the median function, or by the most common motion vector of the previous picture; it can be further normalized according to the picture distances.
  • the calculated global motion vector may then represent a global translational motion of objects from one picture to the other.
  • Apparatus implementing the motion estimator may include a search engine for performing the comparison of the current macroblock pixels with candidate pixel blocks from the search window.
  • An example implementation of a suitable search engine is a systolic array processor which calculates and compares sum of absolute difference of the current macroblock" with all candidate blocks.
  • a search window cache is coupled with the search engine to sustain the large input data bandwidth requirement of the search engine.
  • the search window cache is updated via a DMA with the new search window for each macroblock from a slower but larger frame memory where the reference picture is stored.
  • a programmable or fixed function controller with necessary RAM or ROM is used to determine the search windows, control DMA update of the cache with the search window, as well as monitor the search engine for resulting motion vectors.
  • the search window cache is designed to maximize the overlapping area of one search window and the next. As a row of macroblocks is processed from the left to right, only the rightmost part of the search window for each macroblock is loaded into the search window cache.
  • the amount of picture motion from one frame to another can be very large. particularly when the reference picture is two or more frames separated in sequence from the object picture.
  • increasing the size of the search window to better match the large amount of motion would escalate the implementation complexity and power consumption.
  • the aforementioned telescopic search method expands the possible search range by redefining the search window location for each macroblock. This method faces problems in picture regions with un-correlated motion wherein the search window has to be enlarged to account for the incorrectly defined search window location.
  • the expanded possible search range increases the search window cache size requirement and also the bandwidth requirement between the search window cache and the frame memory. This is so because the search window cache has to store data of all possible locations of search window for the next macroblock.
  • VLC variable length codes
  • a method for determining motion vectors for encoding picture data in a sequence of pictures wherein data representing a picture in the sequence is arranged in a plurality of adjacent data blocks comprising the steps of:
  • the present invention also provides a moving pictures encoder for encoding a sequence of pictures, including: a motion vector detector coupled to receive picture data from the sequence of pictures for determining respective motion vectors for data blocks in a picture of the picture sequence, wherein each motion vector is determined using a comparison of the data block with search window data from a reference picture; a local motion estimator coupled to the motion vector detector for determining and storing a local motion vector based on motion vectors from a group of adjacent data blocks from a picture; and wherein the motion vector detector utilises stored local motion vectors from a previously coded picture to determine the search window data.
  • the present invention further provides a coding apparatus for encoding picture data in a sequence of pictures wherein data representing a picture in the sequence is arranged in a plurality of adjacent data blocks and motion vectors are generated for the data blocks of a picture in the sequence, the coding apparatus including: a local motion estimator for generating respective local motion vectors for groups of adjacent data blocks in the object picture, wherein a local motion vector for each group of data blocks is generated according to individual motion vectors of the data blocks in the group; a local motion vector storage memory for storing the local motion vectors for the object picture; and a motion vector detector for generating motion vectors for data blocks in an object picture, the motion vector detector including a search window cache for caching a selected portion of a reference picture and a search engine for comparing data blocks of the object picture with the search window cache contents, wherein the contents of the search window cache are selected according to a local motion vector retrieved from the local motion vector storage memory from a previously coded picture in the picture sequence.
  • a local motion estimator for
  • An embodiment of the present invention utilizes a local motion estimator which determines a local motion vector for each row of macroblocks in a picture to be coded.
  • the local motion estimator may determine the local motion vectors based on detected motion vectors from the corresponding macroblock rows of a previously processed or coded picture. Each determined local motion vector provides an offset for search windows of all macroblocks in a given row. The local motion vectors better adapt to motion variations within a picture, and enable expansion of the effective search range of the motion estimator without increasing the search window size.
  • each local motion vector defines a fixed offset to all search windows within a macroblock row, and therefore the amounts of caching and cache updating remain constant from one macroblock to the next within the row given that the search window size is also fixed.
  • This is particularly advantageous because the cache size as well as the cache update bandwidth requirements are the highest at this point. Due to reduced search window sizes for macroblocks near the leftmost and rightmost sides of the row, the caching and cache update bandwidth requirements are reduced on the average when the motion estimation process is moving from one macroblock row to the next. Therefore, the cache size and the cache update bandwidth requirements can be optimized based on only the search window size itself.
  • a maximum offset vector can be found with the determined set of local motion vectors for a picture to be coded.
  • the maximum offset vector may be the maximum of absolute values of the horizontal and vertical components from the set of local motion vectors.
  • the maximum possible motion vector size is determined and therefore a corresponding VLC table can be selected for coding of motion vectors for the picture.
  • the selection of VLC tables which are optimized for different maximum possible motion vectors can be made adaptive according to the amount of motion from picture to picture.
  • a picture sequence encoder encodes each input picture by determining its picture coding type (I-, P-, or B-picture), obtaining macroblocks from the picture, subjecting macroblocks of P-picture or B-picture to a motion vector detection process (frame and/or field, forward and/or backward) using a local motion estimator, performing necessary motion compensation (predicted or interpolated) using the detected motion vectors, subjecting all macroblocks to a transform coder followed by a statistical coder.
  • the local motion estimator is updated with motion vectors from past processed pictures by the motion vector detector.
  • the motion vectors are used to generate a local motion vector for each row of macroblocks in a new picture to be coded. Generated local motion vectors are used to offset search windows of all macroblocks in the corresponding macroblock row.
  • Embodiments of the present invention are able to provide an improved picture sequence encoding method or apparatus containing a motion estimator with an expanded search range and a minimized search window. Further, embodiments of the invention can provide an apparatus for encoding a picture sequence which includes a motion estimator with optimized search window cache size and frame buffer memory bandwidth requirements. The embodiments of the invention described herein therefore enable an efficient method for motion vectors coding.
  • FIG. 1 is a functional block diagram of a video encoder with a local motion estimator according to one embodiment of the present invention
  • FIG. 2 illustrates forward and backward motion vector detection for B-pictures and P-picture according to the present invention
  • FIG. 3 illustrates how a search window is defined by a local motion vector within a reference picture according to the present invention
  • FIG. 4 shows a block diagram of a circuit for implementing the motion vector detector and local motion estimator as part of a moving pictures encoding system apparatus according to an embodiment of the present invention.
  • FIG. 1 A block diagram of a picture sequence encoder according to a preferred embodiment of the present invention is illustrated in FIG. 1.
  • a moving pictures sequence which may be in the form of video signals, is input to a frame buffer (A) 101 for re-ordering and processing requirements (eg. raster to macroblock conversion).
  • the moving pictures sequence may be pre-processed before inputting.
  • Useful pre-processing techniques may include noise reduction, inverse telecine, and chroma subsampling.
  • a picture coding type (I, P, or B-picture) is determined using known methods according to application needs for each picture in the input picture sequence.
  • An I-picture is an intra- coded picture used mainly for random access or scene update
  • a P-picture uses forward motion predictive coding with reference to a previously coded I or P-picture (anchor picture)
  • a B-picture uses forward and backward motion predictive/interpolative coding with reference to previously coded I and/or P-pictures.
  • An input pictures sequence may be either a field or frame structured sequence coming from an interlaced or progressive source.
  • Macroblocks containing blocks of pixel values are derived from a picture to be coded from picture data stored in the frame buffer (A) 101.
  • the macroblocks are derived on a row by row basis starting from the leftmost macroblock to the rightmost macroblock, and the top row to the bottom row.
  • Macroblocks belonging to an I-picture are passed directly to a transform coder 105, whilst macroblocks belonging to a P-picture or B-picture are passed to a motion vector detector 102. It is also possible to subject macroblocks belonging to an I-picture to the motion vector detector 102 for generation of error concealment motion vectors such as defined in the MPEG 2 standard.
  • All necessary motion vectors, for example the frame and/or field, forward and/or backward, and 16x16/16x8/8x8 motion vectors, of each macroblock are detected by the motion vector detector 102 by matching the macroblock to candidate blocks obtained from a search window obtained from a reference picture stored in a frame buffer (B) 103.
  • Different matching methods such as the exhaustive search method, the logarithmic search method, multi-steps or hierarchical search method, and search window sizes may be utilized in the motion vector detector 102 depending on application as well as implementation needs.
  • Matching methods may also be implemented in various pixel resolutions for example integer, half pel or quarter pel resolution.
  • the matching criterion may be based on minimum of absolute errors, square errors, or other suitable distortion functions.
  • methods of checking neighbouring macroblock motion vector correlation may be applied. Details of various described matching methods and matching criterion which can be utilised in conjunction with embodiments of the present invention are well known or will be readily apparent to one skilled in the art.
  • the motion vector detector 102 is coupled to a motion compensation processor 104.
  • a macroblock coding mode which can be intra-coded, frame/field forward/backward predicted, or frame/field inte ⁇ olated is first decided for each macroblock based on the detected motion vectors from the motion vector detector 102. Any necessary prediction errors are then generated based on the decided macroblock coding mode.
  • An example macroblock coding mode decision method may be found in the MPEG2 Test Model 5 specification.
  • Resultant macroblocks from the motion compensation process 104 are subjected to a transform coder 105 which exploits correlation within each macroblock and also its psycho- visual effects. Examples of popular transform coders may be found in the MPEG1 , MPEG2, H.261 , and H.263 standards.
  • One embodiment of a transform coder 105 according to the MPEG2 Test Model 5 includes discrete cosine transformation (DCT), a quantizer, rate controller with adaptive quantization, inverse quantizer, and inverse DCT.
  • DCT discrete cosine transformation
  • the transform coder 105 produces transformed and quantized coefficients for each macroblock which are then subjected to a statistical coder 106.
  • the transformed and quantized coefficients are inverse quantized and inverse transformed by the transform coder 105 to produce reconstructed macroblocks.
  • any necessary inverse of motion compensation is provided to each reconstructed macroblock by the motion compensation process 104.
  • Reference pictures are formed by the reconstructed macroblocks for use in the motion vector detector 102 and motion compensation process 104. In some applications or coding instances, input pictures can be taken directly as reference pictures for motion vector detector' 102.
  • the statistical coder 106 exploits all statistical redundancies in the received data, and multiplexes the results to produce the final compressed output bitstream.
  • the statistical coder 106 provides the zig-zag scanning and run- length encoding of the transformed and quantized coefficients, differential coding of the utilized motion vectors, and multiplexing of all results and necessary side information (eg. sequence/GOP/picture/slice/macroblock layer header information, picture coding types, macroblock coding modes, etc).
  • the statistical coder 106 utilizes variable length codes (VLC) from VLC tables 110.
  • VLC variable length codes
  • a local motion estimator 107 determines a local motion vector for each row of macroblocks from an input picture.
  • motion vectors from a same macroblock row of a picture as detected by the motion vector detector 102 are averaged to generate the local motion vector for the corresponding macroblock row of a new input picture.
  • the median motion vector or the most common vector may be selected, instead of the average, to be output local motion vector.
  • Further methods of generating the local motion vector of each macroblock row according to the present invention are possible with motion vectors from neighbouring (top or bottom or additionally neighbour picture) macroblock rows, or with independent row motion detectors; however, there may be little or no implementation advantage for doing so.
  • Each generated local motion vector may be normalized according to the picture distances involved.
  • Figure 2 illustrates as an example how local motion vectors (MV) can be generated using results of previous forward and/or backward motion vector detection according to the present invention.
  • two B-pictures 202 and 203 (Bl and B2 respectively) are in between two anchor I or P-pictures 201 and 204.
  • the pictures are shown in display order representing a small part of an input picture sequence.
  • Forward MV detection is performed for picture 202 (Bl), 203 (B2), and 204 (P) represented by Bl-Forward 206, B2-Forward 207, and P-Forward 208 respectively.
  • Macroblock rows 211, 212, 213, and 214 represent macroblocks rows of the same vertical position or row number in pictures 201, 202, 203, 204 respectively.
  • a local MV to be used for future MV detection is generated by motion vectors from a very nearby picture.
  • a local MV for use in B2-Forward 207 of macroblock row 213 may be generated by motion vectors resulting from Bl-Forward 206 for row 212.
  • the following table summarizes the selection of motion vectors for local MV generation as one out of many possible embodiments of the local motion estimator according to the present invention.
  • the resulting local motion vectors are stored in a local MV storage 108 illustrated in Figure 1.
  • the local motion vectors are retrieved by the motion vector detector 102 to offset search windows of the corresponding macroblock rows.
  • An optional maximum search range detector 109 may be used according to the present invention to determine the maximum horizontal and vertical search ranges of all motion vectors of a picture such that optimal VLCs can be used from the VLC tables 110 for coding of motion vectors in the statistical coder 106.
  • the maximum search range independently defined for horizontal and vertical motion, is the maximum range of all local motion vectors for a picture plus the maximum range of the search window for the picture.
  • VLCs and VLC tables are adaptively extended when motion within the sequence or scenes become larger.
  • the maximum search range information may be coded or transmitted at the picture layer. Methods of VLC coding for motion vectors may be found in the MPEG2 standard.
  • FIG. 3 the procedure for determination of a search window using a local MV is illustrated.
  • a search window for a current macroblock (MB) to be coded is determined from a reference picture 301 by first determining the position of a co-sited MB 302 on the reference picture 301. The position is then offset by a local motion vector 303 generated for the macroblock row where the current MV belongs to a new offset MB position 304 as illustrated. Finally, the horizontal and vertical search ranges are included centred around the offset MB position 304 to form the search window 305 for the current MB. The search window 305 is then used for motion vector detection for the current MB.
  • MB macroblock
  • Figure 4 shows a block diagram of a circuit which can be employed for implementing the motion vector detector and local motion estimator as part of a moving pictures encoding system apparatus according to an embodiment of the present invention.
  • the input picture and reference picture are stored in a large frame buffer 405 which may be implemented using DRAM (dynamic random access memory).
  • a controller 401 which can be a microprocessor, a digital signal processor (DSP), or a fixed function sequencer, is used to control a DMA (direct memory access controller) 404 for accessing the frame buffer 405.
  • DMA direct memory access controller
  • Instruction codes needed for the controller 401 can be stored in an instruction ROM (read only memory) 402.
  • a current MB to be coded is determined and loaded into a search engine 407 by the controller 401 and DMA 404.
  • the controller then obtains a previously determined local motion vector 5 for the corresponding macroblock row from data in a RAM 403.
  • the controller defines a search window and transfers the corresponding data from a reference picture stored in the frame buffer 405 to a search window cache 406 using the DMA 404.
  • the search window size is fixed according to application or implementation needs, for 10 subsequent MBs only a fraction of the defined search window must be loaded into the search window cache 406.
  • the rest of the defined search window overlaps the search window of the previous MB and therefore is already stored in the search window cache 406.
  • the search engine 407 performs motion vector detection based on the
  • the search engine 407 may be implemented to perform specific search methods such as the exhaustive search method or the multi-steps/multi-resolution hierarchical search methods, or specific search criterion such as using minimum sum of square errors or absolute errors.
  • a systolic array processor architecture may be used in the search engine 407 to the meet the computation power
  • the search engine 407 provides as output all detected motion vectors to the controller 401.
  • the results of search criterion computation may be provided for macroblock coding mode decision processes.
  • the controller 401 determines a local motion vector using the output motion vectors for each macroblock row and stores the local motion vector into data RAM
  • embodiments of the present invention provide an improved method and apparatus for motion vector detection.
  • a local motion estimator is used to better predict picture to picture motion which consists of not just
  • the present invention optionally allows maximum horizontal and vertical motion search ranges to be adaptively determined using the local motion vectors.
  • This method provides an efficient way of coding motion vectors using variable length codes. Sequences or scenes with little motion are better coded with motion vector bitrate reduction, whilst on the other hand, motion vector bitrate is increased to handle larger motion within sequence or scenes.

Abstract

A method and apparatus of encoding digital video, for example according to the ISO/IEC MPEG standards (ISO/IEC 11172-2 MPEG-1 and ISO/IEC 13818-2 MPEG-2), which employs a local motion estimator for determining respective local motion vectors for groups of adjacent macroblocks in a picture. A local motion vector is determined for each row of macroblocks in a picture from the motion vectors of the macroblocks in that row. Then, for macroblocks in the corresponding row of a subsequently coded picture, the local motion vector can be used to select the search window used for determining the motion vectors. The local motion estimator improves search range and accuracy of macroblock motion vector detection without increasing the search window accessing bandwidth and caching requirement, and computation lode of searching.

Description

MOTION VECTOR DETECTION WITH LOCAL MOTION ESTIMATOR
Field of the Invention
The present invention relates to a method and apparatus for encoding moving pictures sequences. In particular, the present invention relates to a method and apparatus for motion estimation and motion compensation in a video signal compression system.
Background of the Invention Methods for encoding moving pictures or video such as the MPEG1, MPEG2, H.261, and H.263 standards had been developed for efficient transmission and storage. A detailed description of one such encoding method is found in MPEG2 Test Model 5, ISO/IEC JTC1/SC29/WG11/N0400, April 1993, and the disclosure of that document is hereby expressly incorporated herein by reference. In the described encoding method, an input video sequence is organized into a sequence layer, group-of-pictures (GOP), pictures, slices, macroblocks, and finally block layer. Each picture is coded according to its determined picture coding type. The picture coding types used include intra-coded picture (I-picture), predictive-coded picture (P-picture), and bi-directionally predictive-coded picture (B-picture).
Motion estimation/compensation, transform coding, and statistical coding are utilized to efficiently compress the input video sequence. For example in MPEG2 Test Model 5, each picture from the input video sequence is partitioned into rows of smaller and non-overlapping macroblocks of picture elements (pixels). Macroblocks in each row may be grouped into one or more slices. The compression is performed on each macroblock on a row-by-row basis starting from the leftmost macroblock to the rightmost macroblock, and the top row to the bottom row.
In the motion estimation/compensation method, motion vectors are detected for each macroblock in a picture. The coding mode for a macroblock (e.g. intra-coded, forward- predicted, backward-predicted, or interpolated) is decided based on the detected motion vectors and the determined picture coding type. The utilized motion vectors are differentially coded with variable length codes before outputting.
A typical motion vector detection process comprises determining, for each macroblock to be coded, a search window consisting of pixels from a reference picture and matching pixel vales of the macroblocks to blocks of pixel values obtained from the search window. This process is known to be computationally intensive. Particularly, the size of the search window has a direct impact to the computation load.
Many methods of matching the pixel blocks are available, such as an exhaustive search method which compares every definable block within the search window, a logarithmic search method, a hierarchial search, and various other possible derivations. Depending on application requirements, a search method may be selected based on its performance in terms of accuracy and computation complexity.
To cater for sequences with large object movements between pictures, methods exist to increase the search range without enlarging the search window. These methods provide greater accuracy motion vectors for picture sequences with large movements without a large increase in computation load. One such method is the telescopic search method in which the motion vectors of macroblocks from a previously coded or matched picture are used to generate a new search window for each current macroblock. The telescopic search method comprises the steps of obtaining a motion vector from a co-sited macroblock from a closest coded picture; optional scaling of the obtained motion vector according to the picture distances between the reference picture, the closest coded picture, and the current picture; and defining the search window based on the centre position of the current macroblock plus an offset defined by the scaled motion vector.
Alternate methods of determining search windows are disclosed in United States Patents
5.473,379 and 5,657,087, for example. The methods disclosed therein comprise the steps of calculating a global motion vector based on the motion vectors of a previous picture, and offsetting search windows of all macroblocks by the calculated global motion vector. The global motion vector may be determined by the mean or the median function, or by the most common motion vector of the previous picture; it can be further normalized according to the picture distances. The calculated global motion vector may then represent a global translational motion of objects from one picture to the other.
Apparatus implementing the motion estimator may include a search engine for performing the comparison of the current macroblock pixels with candidate pixel blocks from the search window. An example implementation of a suitable search engine is a systolic array processor which calculates and compares sum of absolute difference of the current macroblock" with all candidate blocks. A search window cache is coupled with the search engine to sustain the large input data bandwidth requirement of the search engine. The search window cache is updated via a DMA with the new search window for each macroblock from a slower but larger frame memory where the reference picture is stored. A programmable or fixed function controller with necessary RAM or ROM is used to determine the search windows, control DMA update of the cache with the search window, as well as monitor the search engine for resulting motion vectors.
To minimize the bandwidth between the cache and frame memory, the search window cache is designed to maximize the overlapping area of one search window and the next. As a row of macroblocks is processed from the left to right, only the rightmost part of the search window for each macroblock is loaded into the search window cache.
In some instances the amount of picture motion from one frame to another can be very large. particularly when the reference picture is two or more frames separated in sequence from the object picture. However, increasing the size of the search window to better match the large amount of motion would escalate the implementation complexity and power consumption.
The aforementioned telescopic search method expands the possible search range by redefining the search window location for each macroblock. This method faces problems in picture regions with un-correlated motion wherein the search window has to be enlarged to account for the incorrectly defined search window location. In terms of implementation, the expanded possible search range increases the search window cache size requirement and also the bandwidth requirement between the search window cache and the frame memory. This is so because the search window cache has to store data of all possible locations of search window for the next macroblock.
Methods utilizing the global motion vector such as disclosed in US patents 5,473,379 and 5,657,087 may be used to minimize the search window cache size as well as the bandwidth requirement from the frame memory while expanding the actual search range. These methods fix the offset of the search window for all macroblocks in a picture. However, given that the only a single global motion vector is used to offset all of the macroblock search windows, the search range expansion works well only with pictures containing uniform translational motion. Pictures with zooming, rotational motion, and shearing effects, for example, are not well dealt with using this technique.
Finally, all of the detected and utilized motion vectors are differentially coded with variable length codes (VLC) to reduce the coding bit rate. Expanding the search range may produce larger motion vectors which require bigger VLC tables to be selected at the picture level to code the motion vectors. In turn, the bit rate for motion vector coding is increased.
Summary of the Invention
In accordance with the present invention, there is provided a method for determining motion vectors for encoding picture data in a sequence of pictures wherein data representing a picture in the sequence is arranged in a plurality of adjacent data blocks, the method comprising the steps of:
(a) encoding a first picture in the sequence with motion vectors for the data blocks in the first picture;
(b) generating respective local motion vectors for groups of adjacent data blocks in the first picture, wherein a local motion vector for each group of data blocks is generated according to the individual motion vectors of the data blocks in the group;
(c) selecting a data block of an object picture to be coded;
(d) determining a window of search data from a reference picture on the basis of the corresponding position of the selected data block and the local motion vector which corresponds to the selected data block; and
(e) comparing the selected data block with the search data from the reference picture in order to determine a motion vector for the selected data block.
The present invention also provides a moving pictures encoder for encoding a sequence of pictures, including: a motion vector detector coupled to receive picture data from the sequence of pictures for determining respective motion vectors for data blocks in a picture of the picture sequence, wherein each motion vector is determined using a comparison of the data block with search window data from a reference picture; a local motion estimator coupled to the motion vector detector for determining and storing a local motion vector based on motion vectors from a group of adjacent data blocks from a picture; and wherein the motion vector detector utilises stored local motion vectors from a previously coded picture to determine the search window data.
The present invention further provides a coding apparatus for encoding picture data in a sequence of pictures wherein data representing a picture in the sequence is arranged in a plurality of adjacent data blocks and motion vectors are generated for the data blocks of a picture in the sequence, the coding apparatus including: a local motion estimator for generating respective local motion vectors for groups of adjacent data blocks in the object picture, wherein a local motion vector for each group of data blocks is generated according to individual motion vectors of the data blocks in the group; a local motion vector storage memory for storing the local motion vectors for the object picture; and a motion vector detector for generating motion vectors for data blocks in an object picture, the motion vector detector including a search window cache for caching a selected portion of a reference picture and a search engine for comparing data blocks of the object picture with the search window cache contents, wherein the contents of the search window cache are selected according to a local motion vector retrieved from the local motion vector storage memory from a previously coded picture in the picture sequence.
An embodiment of the present invention utilizes a local motion estimator which determines a local motion vector for each row of macroblocks in a picture to be coded. The local motion estimator may determine the local motion vectors based on detected motion vectors from the corresponding macroblock rows of a previously processed or coded picture. Each determined local motion vector provides an offset for search windows of all macroblocks in a given row. The local motion vectors better adapt to motion variations within a picture, and enable expansion of the effective search range of the motion estimator without increasing the search window size.
In the preferred embodiment, each local motion vector defines a fixed offset to all search windows within a macroblock row, and therefore the amounts of caching and cache updating remain constant from one macroblock to the next within the row given that the search window size is also fixed. This is particularly advantageous because the cache size as well as the cache update bandwidth requirements are the highest at this point. Due to reduced search window sizes for macroblocks near the leftmost and rightmost sides of the row, the caching and cache update bandwidth requirements are reduced on the average when the motion estimation process is moving from one macroblock row to the next. Therefore, the cache size and the cache update bandwidth requirements can be optimized based on only the search window size itself.
In addition, a maximum offset vector can be found with the determined set of local motion vectors for a picture to be coded. The maximum offset vector may be the maximum of absolute values of the horizontal and vertical components from the set of local motion vectors. Combining maximum offset vector and the search window size, the maximum possible motion vector size is determined and therefore a corresponding VLC table can be selected for coding of motion vectors for the picture. In this case, the selection of VLC tables which are optimized for different maximum possible motion vectors can be made adaptive according to the amount of motion from picture to picture.
A picture sequence encoder according to an embodiment of the present invention encodes each input picture by determining its picture coding type (I-, P-, or B-picture), obtaining macroblocks from the picture, subjecting macroblocks of P-picture or B-picture to a motion vector detection process (frame and/or field, forward and/or backward) using a local motion estimator, performing necessary motion compensation (predicted or interpolated) using the detected motion vectors, subjecting all macroblocks to a transform coder followed by a statistical coder. The local motion estimator is updated with motion vectors from past processed pictures by the motion vector detector. The motion vectors are used to generate a local motion vector for each row of macroblocks in a new picture to be coded. Generated local motion vectors are used to offset search windows of all macroblocks in the corresponding macroblock row.
Embodiments of the present invention are able to provide an improved picture sequence encoding method or apparatus containing a motion estimator with an expanded search range and a minimized search window. Further, embodiments of the invention can provide an apparatus for encoding a picture sequence which includes a motion estimator with optimized search window cache size and frame buffer memory bandwidth requirements. The embodiments of the invention described herein therefore enable an efficient method for motion vectors coding.
Brief description of the drawings
The invention is described in greater detail hereinafter, by way of example only, through the following description of preferred embodiments thereof and with reference to the accompanying drawings, wherein:
FIG. 1 is a functional block diagram of a video encoder with a local motion estimator according to one embodiment of the present invention; FIG. 2 illustrates forward and backward motion vector detection for B-pictures and P-picture according to the present invention;
FIG. 3 illustrates how a search window is defined by a local motion vector within a reference picture according to the present invention; and FIG. 4 shows a block diagram of a circuit for implementing the motion vector detector and local motion estimator as part of a moving pictures encoding system apparatus according to an embodiment of the present invention.
Detailed Description of the Preferred Embodiments A block diagram of a picture sequence encoder according to a preferred embodiment of the present invention is illustrated in FIG. 1. A moving pictures sequence, which may be in the form of video signals, is input to a frame buffer (A) 101 for re-ordering and processing requirements (eg. raster to macroblock conversion). The moving pictures sequence may be pre-processed before inputting. Useful pre-processing techniques may include noise reduction, inverse telecine, and chroma subsampling.
A picture coding type (I, P, or B-picture) is determined using known methods according to application needs for each picture in the input picture sequence. An I-picture is an intra- coded picture used mainly for random access or scene update, a P-picture uses forward motion predictive coding with reference to a previously coded I or P-picture (anchor picture), and a B-picture uses forward and backward motion predictive/interpolative coding with reference to previously coded I and/or P-pictures. An input pictures sequence may be either a field or frame structured sequence coming from an interlaced or progressive source.
Macroblocks containing blocks of pixel values are derived from a picture to be coded from picture data stored in the frame buffer (A) 101. The macroblocks are derived on a row by row basis starting from the leftmost macroblock to the rightmost macroblock, and the top row to the bottom row. Macroblocks belonging to an I-picture are passed directly to a transform coder 105, whilst macroblocks belonging to a P-picture or B-picture are passed to a motion vector detector 102. It is also possible to subject macroblocks belonging to an I-picture to the motion vector detector 102 for generation of error concealment motion vectors such as defined in the MPEG 2 standard.
All necessary motion vectors, for example the frame and/or field, forward and/or backward, and 16x16/16x8/8x8 motion vectors, of each macroblock are detected by the motion vector detector 102 by matching the macroblock to candidate blocks obtained from a search window obtained from a reference picture stored in a frame buffer (B) 103. Different matching methods such as the exhaustive search method, the logarithmic search method, multi-steps or hierarchical search method, and search window sizes may be utilized in the motion vector detector 102 depending on application as well as implementation needs. Matching methods may also be implemented in various pixel resolutions for example integer, half pel or quarter pel resolution. The matching criterion may be based on minimum of absolute errors, square errors, or other suitable distortion functions. In addition, methods of checking neighbouring macroblock motion vector correlation may be applied. Details of various described matching methods and matching criterion which can be utilised in conjunction with embodiments of the present invention are well known or will be readily apparent to one skilled in the art.
The motion vector detector 102 is coupled to a motion compensation processor 104. In the motion compensation process 104, a macroblock coding mode which can be intra-coded, frame/field forward/backward predicted, or frame/field inteφolated is first decided for each macroblock based on the detected motion vectors from the motion vector detector 102. Any necessary prediction errors are then generated based on the decided macroblock coding mode. An example macroblock coding mode decision method may be found in the MPEG2 Test Model 5 specification.
Resultant macroblocks from the motion compensation process 104 are subjected to a transform coder 105 which exploits correlation within each macroblock and also its psycho- visual effects. Examples of popular transform coders may be found in the MPEG1 , MPEG2, H.261 , and H.263 standards. One embodiment of a transform coder 105 according to the MPEG2 Test Model 5 includes discrete cosine transformation (DCT), a quantizer, rate controller with adaptive quantization, inverse quantizer, and inverse DCT. The transform coder 105 produces transformed and quantized coefficients for each macroblock which are then subjected to a statistical coder 106. In addition, the transformed and quantized coefficients are inverse quantized and inverse transformed by the transform coder 105 to produce reconstructed macroblocks. Before storing the reconstructed macroblocks to the frame buffer (B) 103 for future reference, any necessary inverse of motion compensation is provided to each reconstructed macroblock by the motion compensation process 104. Reference pictures are formed by the reconstructed macroblocks for use in the motion vector detector 102 and motion compensation process 104. In some applications or coding instances, input pictures can be taken directly as reference pictures for motion vector detector' 102.
The statistical coder 106 exploits all statistical redundancies in the received data, and multiplexes the results to produce the final compressed output bitstream. As an example in the MPEG2 Test Model 5, the statistical coder 106 provides the zig-zag scanning and run- length encoding of the transformed and quantized coefficients, differential coding of the utilized motion vectors, and multiplexing of all results and necessary side information (eg. sequence/GOP/picture/slice/macroblock layer header information, picture coding types, macroblock coding modes, etc). The statistical coder 106 utilizes variable length codes (VLC) from VLC tables 110.
A local motion estimator 107 determines a local motion vector for each row of macroblocks from an input picture. In an embodiment of the local motion estimator 107, motion vectors from a same macroblock row of a picture as detected by the motion vector detector 102 are averaged to generate the local motion vector for the corresponding macroblock row of a new input picture. Alternatively, the median motion vector or the most common vector may be selected, instead of the average, to be output local motion vector. Further methods of generating the local motion vector of each macroblock row according to the present invention are possible with motion vectors from neighbouring (top or bottom or additionally neighbour picture) macroblock rows, or with independent row motion detectors; however, there may be little or no implementation advantage for doing so. Each generated local motion vector may be normalized according to the picture distances involved.
Figure 2 illustrates as an example how local motion vectors (MV) can be generated using results of previous forward and/or backward motion vector detection according to the present invention. In this example, two B-pictures 202 and 203 (Bl and B2 respectively) are in between two anchor I or P-pictures 201 and 204. The pictures are shown in display order representing a small part of an input picture sequence. Forward MV detection is performed for picture 202 (Bl), 203 (B2), and 204 (P) represented by Bl-Forward 206, B2-Forward 207, and P-Forward 208 respectively. Similarly, backward MV detection is performed for picture 202 (Bl) and 203 (B2), and forward MV detection for picture 201 (I/P) with' respect to a previous reference picture which is represented by Previous /p-Forward 216. Macroblock rows 211, 212, 213, and 214 represent macroblocks rows of the same vertical position or row number in pictures 201, 202, 203, 204 respectively.
It is preferable that a local MV to be used for future MV detection is generated by motion vectors from a very nearby picture. As an example, a local MV for use in B2-Forward 207 of macroblock row 213 may be generated by motion vectors resulting from Bl-Forward 206 for row 212. With reference to Figure 2, the following table summarizes the selection of motion vectors for local MV generation as one out of many possible embodiments of the local motion estimator according to the present invention.
Figure imgf000013_0001
The resulting local motion vectors are stored in a local MV storage 108 illustrated in Figure 1. When they are available, the local motion vectors are retrieved by the motion vector detector 102 to offset search windows of the corresponding macroblock rows. An optional maximum search range detector 109 may be used according to the present invention to determine the maximum horizontal and vertical search ranges of all motion vectors of a picture such that optimal VLCs can be used from the VLC tables 110 for coding of motion vectors in the statistical coder 106. The maximum search range, independently defined for horizontal and vertical motion, is the maximum range of all local motion vectors for a picture plus the maximum range of the search window for the picture. With the maximum search range detector 109, shorter VLCs or lower bitrate can be used for coding of motion vectors for a sequence of scenes with little motion, and the VLCs and VLC tables are adaptively extended when motion within the sequence or scenes become larger. The maximum search range information may be coded or transmitted at the picture layer. Methods of VLC coding for motion vectors may be found in the MPEG2 standard.
In Figure 3, the procedure for determination of a search window using a local MV is illustrated. A search window for a current macroblock (MB) to be coded is determined from a reference picture 301 by first determining the position of a co-sited MB 302 on the reference picture 301. The position is then offset by a local motion vector 303 generated for the macroblock row where the current MV belongs to a new offset MB position 304 as illustrated. Finally, the horizontal and vertical search ranges are included centred around the offset MB position 304 to form the search window 305 for the current MB. The search window 305 is then used for motion vector detection for the current MB.
Figure 4 shows a block diagram of a circuit which can be employed for implementing the motion vector detector and local motion estimator as part of a moving pictures encoding system apparatus according to an embodiment of the present invention. Typically the input picture and reference picture are stored in a large frame buffer 405 which may be implemented using DRAM (dynamic random access memory). A controller 401 , which can be a microprocessor, a digital signal processor (DSP), or a fixed function sequencer, is used to control a DMA (direct memory access controller) 404 for accessing the frame buffer 405. Instruction codes needed for the controller 401 can be stored in an instruction ROM (read only memory) 402.
A current MB to be coded is determined and loaded into a search engine 407 by the controller 401 and DMA 404. The controller then obtains a previously determined local motion vector 5 for the corresponding macroblock row from data in a RAM 403. Using the determined local motion vector, the controller defines a search window and transfers the corresponding data from a reference picture stored in the frame buffer 405 to a search window cache 406 using the DMA 404. Given that one local motion vector is used for the whole macroblock row and that the search window size is fixed according to application or implementation needs, for 10 subsequent MBs only a fraction of the defined search window must be loaded into the search window cache 406. The rest of the defined search window overlaps the search window of the previous MB and therefore is already stored in the search window cache 406.
With the current MB, the search engine 407 performs motion vector detection based on the
15 search window data in the search window cache 406. The search engine 407 may be implemented to perform specific search methods such as the exhaustive search method or the multi-steps/multi-resolution hierarchical search methods, or specific search criterion such as using minimum sum of square errors or absolute errors. A systolic array processor architecture may be used in the search engine 407 to the meet the computation power
20 requirements. The search engine 407 provides as output all detected motion vectors to the controller 401. In addition, the results of search criterion computation (minimum square errors, absolute errors, etc) may be provided for macroblock coding mode decision processes.
At the end of a row, the controller 401 determines a local motion vector using the output motion vectors for each macroblock row and stores the local motion vector into data RAM
25 403 for future use.
It will be appreciated from the foregoing description that embodiments of the present invention provide an improved method and apparatus for motion vector detection. A local motion estimator is used to better predict picture to picture motion which consists of not just
30 translation motion, but also zoom, sheer, or split screen motion. The better prediction of picture to picture motion allows smaller search windows to be used to calculate individual macroblock motion vectors, and therefore enables a reduction of computation load and implementation complexity. With a fixed local motion vector (fixed offset) for each macroblock row, the search window cache size can be minimized according to only the search window size. Furthermore, the updating of the search window cache is also minimized and hence a reduction of memory access bandwidth. These results provide additional reduction in implementation complexity.
The present invention optionally allows maximum horizontal and vertical motion search ranges to be adaptively determined using the local motion vectors. This method provides an efficient way of coding motion vectors using variable length codes. Sequences or scenes with little motion are better coded with motion vector bitrate reduction, whilst on the other hand, motion vector bitrate is increased to handle larger motion within sequence or scenes.
The foregoing detailed description of the embodiments of the present invention has been presented by way of example only, and is not intended to be considered limiting to the invention as defined in the claims appended hereto.

Claims

Claims
1. A method for determining motion vectors for encoding picture data in a sequence of pictures wherein data representing a picture in the sequence is arranged in a plurality of adjacent data blocks, the method comprising the steps of:
(a) encoding a first picture in the sequence with motion vectors for the data blocks in the first picture;
(b) generating respective local motion vectors for groups of adjacent data blocks in the first picture, wherein a local motion vector for each group of data blocks is generated according to the individual motion vectors of the data blocks in the group;
(c) selecting a data block of an object picture to be coded;
(d) determining a window of search data from a reference picture on the basis of the corresponding position of the selected data block and the local motion vector which corresponds to the selected data block; and (e) comparing the selected data block with the search data from the reference picture in order to determine a motion vector for the selected data block.
2. A method as claimed in claim 1, including repeating the steps (c), (d) and (e) for each data block in a group of adjacent data blocks in the object picture.
3. A method as claimed in claim 2, including a step (f) of generating a local motion vector from the motion vectors of the data blocks in said group of data blocks in the object picture.
4. A method as claimed in claim 1, 2 or 3, wherein each said group of adjacent data blocks comprises a row of macroblocks from the respective picture.
5. A method as claimed in claim 2, wherein the window of search data is of fixed size for a group of data blocks.
6. A method as claimed in claim 5, wherein the step (e) involves storing the window of search data in a cache memory, and wherein each successive iteration of the step (e) after an initial operation requires only a fraction of the size of the search data window to be cached.
7. A method as claimed in claim 3, including repeating the steps (c) through (f) for each data block and group of data blocks in the object picture.
8. A method as claimed in claim 7, including a step (g) of determining a maximum offset vector based on the maximum of absolute values of horizontal and vertical components of the local motion vectors for the object picture, and selecting a variable length coding table for coding the data block motion vectors for the object picture based on the maximum offset vector.
9. A method as claimed in claim 7, wherein the local motion vectors generated from the object picture are subsequently used for performing step (d) during coding of a successive picture in the sequence of pictures.
10. A method as claimed in claim 1, 2, 3 or 4, wherein the local motion vector for a group of data blocks comprises an average of the motion vectors for the data blocks in the group.
11. A method as claimed in claim 1, 2, 3 or 4, wherein the local motion vector for a group of data blocks comprises a median of the motion vectors for the data blocks in the group.
12. A method as claimed in claim 1, 2, 3 or 4, wherein the local motion vector for a group of data blocks comprises a most common value of the motion vectors for the data blocks in the group.
13. A moving pictures encoder for encoding a sequence of pictures, including: a motion vector detector coupled to receive picture data from the sequence cf pictures for determining respective motion vectors for data blocks in a picture of the picture sequence, wherein each motion vector is determined using a comparison of the data block with search window data 5 from a reference picture; a local motion estimator coupled to the motion vector detector for determining and storing a local motion vector based on motion vectors from a group of adjacent data blocks from a picture; and wherein the motion vector detector utilises stored local motion vectors from a previously coded picture to determine the search window data.
10 14. A moving pictures encoder as claimed in claim 13, wherein the reference picture is stored in a frame buffer memory the search window data in a search window cache memory, and wherein the motion vector detector is coupled to control transfer of data from the frame buffer memory to the search window cache memory on the basis of stored local motion vectors.
15
15. A moving pictures encoder as claimed in claim 14, wherein a group of adjacent data blocks utilised by the local motion estimator to determine a local motion vector comprises a row of macroblocks from a picture.
20 16. A moving pictures encoder as claimed in claim 13, further including a transform coder and a statistical coder which uses variable length code tables for encoding the motion vectors, and a maximum vector analyser for determining a maximum vector magnitude from the local motion vectors for a picture and selecting a variable length coding table for use by the statistical coder on the basis of the maximum vector magnitude.
25
17. A coding apparatus for encoding picture data in a sequence of pictures wherein data representing a picture in the sequence is arranged in a plurality of adjacent data blocks and motion vectors are generated for the data blocks of a picture in the sequence, the coding apparatus including:
30 a local motion estimator for generating respective local motion vectors for groups of adjacent data blocks in the object picture, wherein a local motion vector for each group of data blocks is generated according to individual motion vectors of the data blocks in the group; a local motion vector storage memory for storing the local motion vectors for the object picture; and a motion vector detector for generating motion vectors for data blocks in an object picture, the motion vector detector including a search window cache for caching a selected poπion of a reference picture and a search engine for comparing data blocks of the object picture with the search window cache contents, wherein the contents of the search window cache are selected according to a local motion vector retrieved from the local motion vector storage memory from a previously coded picture in the picture sequence.
18. A coding apparatus as claimed in claim 17, wherein the local motion estimator generates a local motion vector for each row of macroblocks in a picture.
19. A coding apparatus as claimed in claim 17, further including a transform coder and a statistical coder which uses variable length code tables for encoding the motion vectors, and a maximum vector analyser for determining a maximum vector magnitude from the local motion vectors for a picture and selecting a variable length coding table for use by the statistical coder on the basis of the maximum vector magnitude.
PCT/SG1998/000085 1998-10-13 1998-10-13 Motion vector detection with local motion estimator WO2000022833A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
PCT/SG1998/000085 WO2000022833A1 (en) 1998-10-13 1998-10-13 Motion vector detection with local motion estimator
EP98951906A EP1119975B1 (en) 1998-10-13 1998-10-13 Motion vector detection with local motion estimator
US09/807,497 US6876702B1 (en) 1998-10-13 1998-10-13 Motion vector detection with local motion estimator
DE69813911T DE69813911T2 (en) 1998-10-13 1998-10-13 METHOD FOR DETERMINING MOTION VECTOR FIELDS WITH LOCAL MOTION ESTIMATION

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/SG1998/000085 WO2000022833A1 (en) 1998-10-13 1998-10-13 Motion vector detection with local motion estimator

Publications (1)

Publication Number Publication Date
WO2000022833A1 true WO2000022833A1 (en) 2000-04-20

Family

ID=20429884

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SG1998/000085 WO2000022833A1 (en) 1998-10-13 1998-10-13 Motion vector detection with local motion estimator

Country Status (4)

Country Link
US (1) US6876702B1 (en)
EP (1) EP1119975B1 (en)
DE (1) DE69813911T2 (en)
WO (1) WO2000022833A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2386019A (en) * 2002-01-26 2003-09-03 Lg Electronics Inc Adaptively defining a motion vector search range using vector values of adjacent blocks
WO2006103984A1 (en) * 2005-03-25 2006-10-05 Matsushita Electric Industrial Co., Ltd. Motion detection device

Families Citing this family (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1430724B1 (en) * 2001-09-12 2008-08-06 Nxp B.V. Motion estimation and/or compensation
SG111093A1 (en) * 2002-11-18 2005-05-30 St Microelectronics Asia Motion vector selection based on a preferred point
US7571287B2 (en) 2003-03-13 2009-08-04 Marvell World Trade Ltd. Multiport memory architecture, devices and systems including the same, and methods of using the same
US20040223084A1 (en) * 2003-05-08 2004-11-11 Stanger Leon J. Methods and apparatus for border processing in video distribution systems
DE602004002455T2 (en) * 2004-04-30 2007-01-11 Matsushita Electric Industrial Co., Ltd., Kadoma Motion vector estimation through adaptive temporal prediction
US20050259734A1 (en) * 2004-05-21 2005-11-24 Timothy Hellman Motion vector generator for macroblock adaptive field/frame coded video data
KR100605105B1 (en) * 2004-05-28 2006-07-26 삼성전자주식회사 Apparatus of Image Interpolation
TWI370691B (en) * 2005-01-19 2012-08-11 Via Tech Inc A method and system for hierarchical search with cache
JP2007088922A (en) * 2005-09-22 2007-04-05 Sanyo Electric Co Ltd Encoding apparatus and encoding method
KR100775104B1 (en) * 2006-02-27 2007-11-08 삼성전자주식회사 Image stabilizer and system having the same and method thereof
US20070242749A1 (en) * 2006-04-14 2007-10-18 Sung Chih-Ta S Image frame compression of video stream with fast random accessing and decompressing
KR100763917B1 (en) * 2006-06-21 2007-10-05 삼성전자주식회사 The method and apparatus for fast motion estimation
JP4763549B2 (en) * 2006-08-18 2011-08-31 富士通セミコンダクター株式会社 Inter-frame prediction processing apparatus, image encoding apparatus, and image decoding apparatus
US20080126278A1 (en) * 2006-11-29 2008-05-29 Alexander Bronstein Parallel processing motion estimation for H.264 video codec
WO2008139489A1 (en) * 2007-05-10 2008-11-20 Allgo Embedded Systems Private Limited Dynamic motion vector analysis method
US8127104B1 (en) 2007-08-06 2012-02-28 Marvell International Ltd. Alignment matrix memory copy
US8688947B1 (en) 2007-11-21 2014-04-01 Marvell International Ltd. Aligned data access
US20090207915A1 (en) * 2008-02-15 2009-08-20 Freescale Semiconductor, Inc. Scalable motion search ranges in multiple resolution motion estimation for video compression
US20090238268A1 (en) * 2008-03-20 2009-09-24 Mediatek Inc. Method for video coding
US8131915B1 (en) 2008-04-11 2012-03-06 Marvell Intentional Ltd. Modifying or overwriting data stored in flash memory
US8683085B1 (en) 2008-05-06 2014-03-25 Marvell International Ltd. USB interface configurable for host or device mode
US8055025B2 (en) * 2008-06-14 2011-11-08 City University Of Hong Kong Motion estimation method
US8423710B1 (en) 2009-03-23 2013-04-16 Marvell International Ltd. Sequential writes to flash memory
US8295551B2 (en) * 2009-04-08 2012-10-23 Samsung Electronics Co., Ltd. System and method of adaptive vertical search range tracking for motion estimation in digital video
US8213236B1 (en) 2009-04-21 2012-07-03 Marvell International Ltd. Flash memory
RU2486692C1 (en) * 2009-06-18 2013-06-27 Кабусики Кайся Тосиба Device of video coding and device of video decoding
AU2014201817B2 (en) * 2009-06-18 2015-12-10 Kabushiki Kaisha Toshiba Dynamic Image Decoding Device
US9628794B2 (en) 2009-06-18 2017-04-18 Kabushiki Kaisha Toshiba Video encoding apparatus and a video decoding apparatus
US20110109794A1 (en) * 2009-11-06 2011-05-12 Paul Wiercienski Caching structure and apparatus for use in block based video
TW201117135A (en) * 2009-11-12 2011-05-16 Nat Univ Chung Cheng Motion prediction method for multimedia image
US8756394B1 (en) 2010-07-07 2014-06-17 Marvell International Ltd. Multi-dimension memory timing tuner
JP5841540B2 (en) * 2010-11-26 2016-01-13 パナソニック インテレクチュアル プロパティ コーポレーション オブアメリカPanasonic Intellectual Property Corporation of America Image encoding method, image decoding method, image encoding device, image decoding device, program, and integrated circuit
BR112014010539A2 (en) 2011-11-04 2017-04-18 Motorola Mobility Llc motion vector scaling for non-uniform motion vector network
US11317101B2 (en) 2012-06-12 2022-04-26 Google Inc. Inter frame candidate selection for a video encoder
US9485515B2 (en) 2013-08-23 2016-11-01 Google Inc. Video coding using reference motion vectors
US9503746B2 (en) 2012-10-08 2016-11-22 Google Inc. Determine reference motion vectors
JP6502485B2 (en) * 2015-05-21 2019-04-17 オリンパス株式会社 Imaging device
US11461919B2 (en) * 2016-04-21 2022-10-04 Ramot At Tel Aviv University Ltd. Cascaded neural network
US10715818B2 (en) * 2016-08-04 2020-07-14 Intel Corporation Techniques for hardware video encoding
US10602174B2 (en) 2016-08-04 2020-03-24 Intel Corporation Lossless pixel compression for random video memory access
US11025913B2 (en) 2019-03-01 2021-06-01 Intel Corporation Encoding video using palette prediction and intra-block copy
US10855983B2 (en) 2019-06-13 2020-12-01 Intel Corporation Encoding video using two-stage intra search

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0652678A2 (en) * 1993-11-04 1995-05-10 AT&T Corp. Method and apparatus for improving motion compensation in digital video coding
JPH0955941A (en) * 1995-08-16 1997-02-25 Sony Corp Picture encoding method/device and recording device for picture encoding data
JPH09163380A (en) * 1995-12-05 1997-06-20 Sony Corp Image coding device and motion vector detection method
US5717441A (en) * 1995-05-02 1998-02-10 Matsushita Electric Ind. Picture data memory with high access efficiency in detecting motion vectors, a motion vector detection circuit provided with the picture data memory, and an address conversion circuit
EP0831642A2 (en) * 1996-09-20 1998-03-25 Nec Corporation Apparatus and method for motion vector estimation with high speed
US5786860A (en) * 1994-07-30 1998-07-28 Korean Advanced Institute Of Science And Technology High speed block matching for bi-directional motion vector estimation
EP0866619A2 (en) * 1997-03-19 1998-09-23 Sony Corporation Devices and methods for image coding

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2283385B (en) * 1993-10-26 1998-04-01 Sony Uk Ltd Motion compensated video signal processing
KR0148154B1 (en) 1994-06-15 1998-09-15 김광호 Coding method and apparatus with motion dimensions
US5801778A (en) * 1996-05-23 1998-09-01 C-Cube Microsystems, Inc. Video encoding with multi-stage projection motion estimation
JP3774954B2 (en) * 1996-10-30 2006-05-17 株式会社日立製作所 Video encoding method
JP3052893B2 (en) * 1997-05-16 2000-06-19 日本電気株式会社 Video encoding device
US6014181A (en) * 1997-10-13 2000-01-11 Sharp Laboratories Of America, Inc. Adaptive step-size motion estimation based on statistical sum of absolute differences
US6295377B1 (en) * 1998-07-13 2001-09-25 Compaq Computer Corporation Combined spline and block based motion estimation for coding a sequence of video images

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0652678A2 (en) * 1993-11-04 1995-05-10 AT&T Corp. Method and apparatus for improving motion compensation in digital video coding
US5786860A (en) * 1994-07-30 1998-07-28 Korean Advanced Institute Of Science And Technology High speed block matching for bi-directional motion vector estimation
US5717441A (en) * 1995-05-02 1998-02-10 Matsushita Electric Ind. Picture data memory with high access efficiency in detecting motion vectors, a motion vector detection circuit provided with the picture data memory, and an address conversion circuit
JPH0955941A (en) * 1995-08-16 1997-02-25 Sony Corp Picture encoding method/device and recording device for picture encoding data
US5883674A (en) * 1995-08-16 1999-03-16 Sony Corporation Method and apparatus for setting a search range for detecting motion vectors utilized for encoding picture data
JPH09163380A (en) * 1995-12-05 1997-06-20 Sony Corp Image coding device and motion vector detection method
US5872604A (en) * 1995-12-05 1999-02-16 Sony Corporation Methods and apparatus for detection of motion vectors
EP0831642A2 (en) * 1996-09-20 1998-03-25 Nec Corporation Apparatus and method for motion vector estimation with high speed
EP0866619A2 (en) * 1997-03-19 1998-09-23 Sony Corporation Devices and methods for image coding

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2386019A (en) * 2002-01-26 2003-09-03 Lg Electronics Inc Adaptively defining a motion vector search range using vector values of adjacent blocks
GB2386019B (en) * 2002-01-26 2004-04-07 Lg Electronics Inc Adaptive decision method
WO2006103984A1 (en) * 2005-03-25 2006-10-05 Matsushita Electric Industrial Co., Ltd. Motion detection device

Also Published As

Publication number Publication date
US6876702B1 (en) 2005-04-05
EP1119975A1 (en) 2001-08-01
DE69813911T2 (en) 2004-02-05
EP1119975B1 (en) 2003-04-23
DE69813911D1 (en) 2003-05-28

Similar Documents

Publication Publication Date Title
US6876702B1 (en) Motion vector detection with local motion estimator
US7551673B1 (en) Adaptive motion estimator
KR100294999B1 (en) Efficient, flexible motion estimation architecture for real time mpeg2 compliant encoding
US5731850A (en) Hybrid hierarchial/full-search MPEG encoder motion estimation
US6519005B2 (en) Method of concurrent multiple-mode motion estimation for digital video
US6438168B2 (en) Bandwidth scaling of a compressed video stream
US7362808B2 (en) Device for and method of estimating motion in video encoder
EP0560577B1 (en) Video compression coding and decoding with automatic sub-pixel frame/field motion compensation
EP1262073B1 (en) Methods and apparatus for motion estimation using neighboring macroblocks
US7602849B2 (en) Adaptive reference picture selection based on inter-picture motion measurement
US5650829A (en) Motion video coding systems with motion vector detection
WO2001049039A1 (en) Methods and apparatus for reduction of prediction modes in motion estimation
US20070092007A1 (en) Methods and systems for video data processing employing frame/field region predictions in motion estimation
WO2001049029A1 (en) Methods and apparatus for motion estimation in compressed domain
KR20040089163A (en) Method and Apparatus for Determining Search Range for Adaptive Motion Vector for Use in Video Encoder
US20070076964A1 (en) Method of and an apparatus for predicting DC coefficient in transform domain
US6480629B1 (en) Motion estimation method using orthogonal-sum block matching
KR100229803B1 (en) Method and apparatus for detecting motion vectors
US7113543B2 (en) Video encoding
US20020159526A1 (en) Video encoder and video recording apparatus provided with such a video encoder
JP4126044B2 (en) Video encoding apparatus and method
KR100240620B1 (en) Method and apparatus to form symmetric search windows for bidirectional half pel motion estimation
KR19990058255A (en) Video compression method using 1/4 pixel motion vector.
Teixeira et al. Block matching algorithms in MPEG video coding
JP2002539685A (en) Block matching motion estimation

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): JP SG US

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 1998951906

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 09807497

Country of ref document: US

WWP Wipo information: published in national office

Ref document number: 1998951906

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 1998951906

Country of ref document: EP