US20040240560A1 - Video decoder architecture and method for using same - Google Patents

Video decoder architecture and method for using same Download PDF

Info

Publication number
US20040240560A1
US20040240560A1 US10/869,092 US86909204A US2004240560A1 US 20040240560 A1 US20040240560 A1 US 20040240560A1 US 86909204 A US86909204 A US 86909204A US 2004240560 A1 US2004240560 A1 US 2004240560A1
Authority
US
United States
Prior art keywords
sub
frame
picture
coefficients
frames
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US10/869,092
Inventor
Marta Karczewicz
Ragip Kurceren
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US10/869,092 priority Critical patent/US20040240560A1/en
Publication of US20040240560A1 publication Critical patent/US20040240560A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/438Interfacing the downstream path of the transmission network originating from a server, e.g. retrieving MPEG packets from an IP network
    • H04N21/4383Accessing a communication channel
    • H04N21/4384Accessing a communication channel involving operations to reduce the access time, e.g. fast-tuning for reducing channel switching latency
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/164Feedback from the receiver or from the transmission channel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/39Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability involving multiple description coding [MDC], i.e. with separate layers being structured as independently decodable descriptions of input picture data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/58Motion compensation with long-term prediction, i.e. the reference frame for a current frame not being the temporally closest one
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/65Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/89Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/23424Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving splicing one content stream with another content stream, e.g. for inserting or substituting an advertisement
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44016Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving splicing one content stream with another content stream, e.g. for substituting a video clip
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/647Control signaling between network components and server or clients; Network processes for video distribution between server and clients, e.g. controlling the quality of the video stream, by dropping packets, protecting content from unauthorised alteration within the network, monitoring of network load, bridging between two different networks, e.g. between IP and wireless
    • H04N21/64784Data processing by the network
    • H04N21/64792Controlling the complexity of the content stream, e.g. by dropping packets

Definitions

  • This invention relates generally to the field of the multimedia applications. More particularly, this invention relates to a new frame type, apparatus and method for using same to provide for access of a video stream.
  • Multimedia applications that include audio and streaming video information have come into greater use.
  • multimedia groups have established and proposed standards for compressing/encoding and decompressing/decoding the audio and video information.
  • the examples are MPEG standards, established by the Motion Picture Expert Group and standards developed by ITU-Telecommunications Standardization.
  • Video sequences like ordinary motion pictures recorded on film, comprise a sequence of still images, and the illusion of motion is created by displaying consecutive images at a relatively fast rate.
  • the display rate are between five and thirty frames per second.
  • the images in consecutive frames tend to be similar.
  • a typical scene recorded by a camera comprises some stationary elements, such as, for example, background scenery and some moving parts.
  • the moving parts may take many different forms, for example, the face of a news reader, moving traffic, and so on.
  • the camera recording the scene may itself be moving, in which case all elements of the image have the same kind of motion. In many cases, this means that the overall change between one video frame and the next is rather small. Of course, this depends on the nature of the movement, the rate of the movement, i.e., the amount of change from one frame to the next.
  • the purpose of the video coding is to remove the redundancy in the image sequence so that the encoded data rate is commensurate with the available bandwidth to transport the video sequence while keeping the distortion between the original and reconstructed images as small as possible.
  • the redundancy in video sequences can be categorized into spatial and temporal redundancy. Spatial redundancy refers to the correlation between neighboring pixels in a frame while temporal redundancy refers to correlation between neighboring frames.
  • FIG. 1A depicts an Intra-frame or I-type frame 200 .
  • the I-type frame or picture is a frame of video data that is coded exploiting only the spatial correlation of the pixels within the frame without using information from the past or the future and is utilized as the basis for decoding/decompression of other type frames.
  • FIG. 1B is a representation of a Predictive-frame or P-type frame 210 .
  • the P-type frame or picture is a frame that is encoded/compressed using motion compensated prediction from I-type or P-type frames of its past, in this case, I.sub. 1 200 .
  • previous frames are used to encode/compress a present given frame of video data.
  • 205 a represents the motion compensated prediction information to create a P-type frame 210 . Since in a typical video sequence the adjacent frames in a sequence are highly correlated, higher compression efficiencies are achieved when using P-frames.
  • FIG. 1C depicts a Bi-directional-frame or B-type frame 220 .
  • the B-type frame or picture is a frame that is encoded/compressed using a motion compensated prediction derived from the I-type reference frame ( 200 in this example) or P-type reference frame in its past and the I-type reference frame or P-type reference frame ( 210 in this example) in its future or a combination of both.
  • B-type frames are usually inserted between I-type frames or P-type frames.
  • FIG. 2 represents a group of pictures in what is called display order I.sub. 1 B.sub. 2 B.sub. 3 P.sub. 4 B.sub. 5 P.sub. 6 .
  • FIG. 2 illustrates the B-type frames inserted between I-type and P-type frames and the direction which motion compensation information flows.
  • FIGS. 3 and 4 A system for P-frame encoding and decoding is provided and is shown in FIGS. 3 and 4.
  • a communication system comprising an encoder 300 of FIG. 3 and a decoder 400 of FIG. 4 is operable to communicate a multimedia sequence between a sequence generator and a sequence receiver.
  • Other elements of the video sequence generator and receiver are not shown for the purposes of simplicity.
  • the communication path between sequence generator and receiver may take various forms, including but not limited to a radio-link.
  • Encoder 300 is shown in FIG. 3 coupled to receive video input on line 301 in the form of a frame to be encoded I(x, y), called the current frame.
  • I(x, y) we denote location of the pixel within the frame.
  • the current frame I(x,y) is partitioned into rectangular regions of M ⁇ N pixels. These blocks are encoded using either only spatial correlation (intra coded blocks) or both spatial and temporal correlation (inter coded blocks). In what follows we concentrate on inter blocks.
  • Each of inter coded blocks is predicted using motion information from the previously coded and transmitted frame, called reference frame and denoted as R(x,y), which at given instant is available in the frame memory 350 of the encoder 300 .
  • the motion information of the block may be represented by two dimensional motion vector ( ⁇ x(x,y), ⁇ y(x,y)) where ⁇ x(x,y) is the horizontal and ⁇ y(x,y) is the vertical displacement, respectively of the pixel in location (x,y) between the current frame and the reference frame.
  • the motion vectors ( ⁇ x( ), ⁇ y( )) are calculated by the motion estimation and coding block 370 .
  • the input to the motion estimation and coding block 370 are current frame and reference frame.
  • the motion information is provided to a Motion Compensated (MC) prediction block 360 .
  • the MC prediction block is also coupled to a frame memory 350 to receive the reference frame.
  • the motion vectors for each inter block together with the reference frame are used to construct prediction frame P(x, y):
  • the prediction error E(x, y) i.e., the difference between the current frame and the prediction frame P(x, y) is calculated by:
  • weights c.sub.err(i,j), corresponding to the basis functions are called prediction error coefficients.
  • Coefficients c.sub.err(i,j) can be calculated by performing so called forward transform. These coefficients are quantized in quantization block 320 :
  • I.sub.err ( i, j ) Q ( c.sub.err ( i,j ), QP )
  • I.sub.err(i, j) are the quantized coefficients.
  • the operation of quantization introduces loss of information—the quantized coefficient can be represented with smaller number of bits.
  • the level of compression (loss of information) is controlled by adjusting the value of the quantization parameter (QP).
  • the quantization block 320 is coupled to both a multiplexer 380 and an inverse quantization block 330 and in turn an inverse transform block 340 .
  • Blocks 330 and 340 provide decoded prediction error E.sub.c(x, y) which is added to the MC predicted frame P(x, y) by adder 345 . These values can be further normalized and filtered and the result stored in frame memory 350 .
  • Video vectors and quantized coefficients are encoded using Variable Length Codes (VLC) which further reduce the number of bits needed for their representation. Encoded motion vectors and quantized coefficients as well as other additional information needed to represent each coded frame of the image sequence constitute a bitstream 415 which is transmitted to the decoder 400 of FIG. 4. Bitstream may be multiplexed 380 before transmission.
  • VLC Variable Length Codes
  • the special type of the inter coded blocks are copy coded blocks.
  • values of both motion vectors and quantized prediction error coefficients I.sub.err are equal to 0.
  • FIG. 4 shows the decoder 400 of the communication system.
  • Bitstream 415 is received from encoder 300 of FIG. 3.
  • Bitstream 415 is demultiplexed via demultiplexer 410 .
  • Dequantized coefficients d.sub.err(i,j) are calculated in the inverse quantization block 420 :
  • d.sub.err ( i, j ) Q ⁇ 1 ( I.sub.err ( i, j ), QP ).
  • the pixels of the current coded frame are reconstructed by finding the prediction pixels in the reference frame R(x,y) using the received motion vectors and then adding to the compressed prediction error in adder 435 :
  • I.sub.c ( x, y ) R ( x+ ⁇ x, y+ ⁇ ,y )+ E.sub.c ( x, y ).
  • the transform/inverse transform is performed for 4 ⁇ 4 blocks by performing defined above one dimensional transform/inverse transform both vertically and horizontally.
  • DC 0 ( DCC (0,0)+ DCC (1,0)+ DCC (0,1)+ DCC (1,1))/2
  • DC 1 ( DCC (0,0) ⁇ DCC (1,0)+ DCC (0,1) ⁇ DCC (1,1))/2
  • DC 2 ( DCC (0,0)+ DCC (1,0) ⁇ DCC (0,1) ⁇ DCC (1,1))/2
  • B(QP ⁇ 0, . . . ,31) [3881, 4351, 4890, 5481, 6154, 6914, 7761, 8718, 9781, 10987, 12339, 13828, 15523, 17435, 19561, 21873, 24552, 27656, 30847, 34870, 38807, 43747, 49103, 54683, 61694, 68745, 77615, 89113, 100253, 109366, 126635, 141533];
  • Video streaming has emerged as one of the essential applications over the fixed internet and—in the near future over 3G multimedia networks.
  • the server starts streaming the pre-encoded video bitstream to the receiver upon a request from the receiver which plays the stream as it receives with a small delay.
  • the problem with video streaming is that the best-effort nature of today's networks causes variations of the effective bandwidth available to a user due to the changing network conditions.
  • the server should then scale the bitrate of the compressed video to accommodate these variations. In case of conversational services that are characterized by real-time encoding and point-to-point delivery, this is achieved by adjusting, on the fly, the source encoding parameters, such as quantization parameter or frame rate, based on the network feedback. In typical streaming scenarios when already encoded video bitstream is to be streamed to the client, the above solution can not be applied.
  • the simplest way of achieving bandwidth scalability in case of pre-encoded sequences is by producing multiple and independent streams of different bandwidth and quality.
  • the server dynamically switches between the streams to accommodate variations of the bandwidth available to the client.
  • SP-picture uses motion compensated predictive coding to exploit temporal redundancy in the sequence.
  • the difference between SP and P-pictures is that using SP-pictures identical frames may be obtained even when different reference frames are used for prediction.
  • This property allows SP-pictures to replace I-pictures in numerous applications such as switching from one bitstream to another, random access, fast-forward, fast-backward.
  • SP-frames unlike I-frames are utilizing motion compensated predictive coding they require smaller number of bits than I-frames.
  • FIG. 1A is a diagram showing the encoding of an I-type frame or I-picture.
  • FIG. 1B is a diagram showing the encoding of a P-type frame or P-picture.
  • FIG. 1C is a diagram showing the encoding of a B-type frame or B-picture.
  • FIG. 2 is a diagram showing B-type frame inserted between I-type and P-type frames and the direction which motion compensation information flows.
  • FIG. 3 is a block diagram of a generic motion-compensated predictive video coding system (encoder).
  • FIG. 4 is a block diagram of a generic motion-compensated predictive video coding system (decoder).
  • FIG. 5 is an illustration showing switching between bitstreams 1 and 2 using SP-pictures.
  • FIG. 6 is a block diagram of a decoder in accordance with an embodiment of the invention.
  • FIG. 7 is an illustration of random access using SP-pictures.
  • FIG. 8 is an illustration of a fast-forward process using SP-pictures.
  • a new decoder architecture which has the property that identical frames may be obtained even when they are predicted using different reference frames.
  • the picture type obtained using this structure will be called SP-frame also may be referred to as picture.
  • This property allows SP-pictures to replace I-pictures in numerous applications such as switching from one bitstream to another, random access, fast-forward, fast-backward. Since unlike unlike l-frames SP-frames are using motion compensated prediction they require a lot less bits than the I-frames.
  • FIG. 5 shows two bitstreams corresponding to the same sequence encoded at different bitrates—bitstream 1 ( 510 ) and bitstream 2 ( 520 ).
  • SP-pictures should be placed at locations at which one wants to allow switching from one bitstream to another (pictures S.sub. 1 ( 513 ), and S.sub. 2 ( 523 ) in FIG. 5).
  • FIG. 5 picture S.sub. 12 ( 550 ) will be transmitted instead of S.sub. 2 ( 523 )).
  • Pictures S.sub. 2 ( 523 ) and S.sub. 12 ( 550 ) in FIG. 5 are represented by different bitstreams. However, their reconstructed values are identical.
  • SP-pictures Application of SP-pictures to enable random access is depicted in FIG. 7.
  • SP-pictures are placed at fixed intervals within bitstream 1 ( 720 ) (e.g. picture S.sub. 1 ( 730 )) which is being streamed to the client.
  • bitstream 1 720
  • bitstream 2 bitstream 2 ( 740 )
  • I-picture I.sub. 2 ( 750 ), at the temporal location preceding SP-picture.
  • SP-picture 710 S.sub. 2 , at the same temporal location as SP-picture.
  • Bitstream 1 ( 720 ) may then be accessed at a location corresponding to an I-picture in bitstream 2 ( 740 ). For example to access bitstream 1 at frame I.sub. 2 , first the pictures I.sub. 2 , S.sub. 2 from bitstream 2 are transmitted and then the following pictures from bitstream 1 are transmitted.
  • bitstream 2 will constitute of only SP-pictures predicted from each other placed in larger temporal intervals (e.g. each 1 sec) the structure presented in this figure can be used to obtain “Fast Forward” functionality. Due to the usage of SP-pictures “Fast Forward” can start at any bitstream location. In similar manner “Fast Backward” functionality can be obtained.
  • VRC Video Redundancy Coding
  • the principle of the VRC method is to divide the sequence of pictures into two or more threads in such a way that all camera pictures are assigned to one of the threads in a round-robin fashion. Each thread is coded independently. In regular intervals, all threads converge into a so-called sync frame. From this sync frame, a new thread series is started. If one of these threads is damaged because of a packet loss, the remaining threads stay intact and can be used to predict the next sync frame.
  • SP-frame comprises blocks encoded using spatial correlation among the pixels (intra blocks) and blocks encoded using both spatial and temporal correlation (inter or copy blocks). Inter and copy coded blocks are reconstructed differently than in P-type frames.
  • each pixel S(x,y) in the inter or copy coded block is decoded as a weighted sum of the basis functions f.sub.ij(x,y) where the weigh values d.sub.rec will be called dequantized reconstruction image coefficients.
  • the values of d.sub.rec are obtained by quantization and dequantization of reconstruction image coefficients c.sub.rec.
  • Reconstruction image coefficients c.sub.rec are formed using
  • Values S(x,y) can be further normalized and filtered.
  • QP 1 and QP 2 Two different values of quantization parameter QP: QP 1 and QP 2 can be used during quantization/dequantization process for these blocks. Values of QP 1 and QP 2 can be different when quantization/dequantization process is performed for luma component than when it is performed for chroma component.
  • alpha(QP) is a parameter dependent on method of quantization and used QP value.
  • Quantize c.sub.rec using quantization parameter QP QP 1 .
  • the quantized values will be referred to as quantized reconstructed image coefficients and denoted as I.sub.rec.
  • I.sub.rec ( c.sub.pred ⁇ A ( QP 1 )+ I.sub.err ⁇ F ( QP 1 , QP 2 )+0.5 ⁇ 2 20 )//2 20
  • I.sub.rec I.sub.pred +(beta( QP 2 ) ⁇ I.sub.err+ 0.5 ⁇ beta( QP 1 ))//beta( QP 1 ).
  • beta(QP) is a parameter dependent on method of quantization and used QP value.
  • TML-6 H.26L Test Model Long Term Number 6
  • VCEG-L45 ITU-T Video Coding Experts Group Meeting, Eibsee, Germany
  • B(QP) constant B(QP) is defined above in the section on quantization.
  • SP-frames there are two types, specifically, the SP-frames; placed within the bitstream, e.g., S.sub. 1 ( 513 ) and S.sub. 2 ( 523 ) in FIG. 5, and the SP-frames (S.sub. 12 in FIG. 5) that will be sent when there is a switch between bitstreams (from bitstream 1 to bitstream 2 ).
  • the encoding of S.sub. 2 ( 523 ) and S.sub. 12 ( 550 ) are such that their reconstructed frames are identical although they use different reference frames as described below.
  • [0096] Calculate motion vectors using same method as for P-pictures. After motion compensation calculate transform coefficients for predicted P(x,y) and current image I(x,y) by performing forward transform.
  • the transform coefficients for the current image are denoted as c.sub.orig and for the predicted image as c.sub.pred.
  • alpha( QP ) (2 20 +0.5 ⁇ A ( QP ))// A ( QP )
  • RDQ rate distortion constrained quantization
  • An example of use rate distortion constrained quantization is described in “H.26L Test Model Long Term Number 6 (TML-6) draft0”, document VCEG-L45, ITU-T Video Coding Experts Group Meeting, Eibsee, Germany, 09-12 Jan. 2001.
  • [0103] Form the predicted frame for S.sub. 12 by performing motion estimation with the reference frames being pictures preceding S.sub. 1 in bitstream 1 .
  • [0104] Calculate transform coefficients for predicted image by performing forward transform.
  • the transform coefficients for the predicted image are denoted as c.sub.pred.
  • dedcoder 600 comprises, inter alia, a demultiplexer 610 , inverse quantization block 620 ,m inverse transform block 630 , frame memory 640 , MC prediction block 650 , tranform block 660 , quantization block 670 .

Abstract

A decoder and method for using a new picture or frame type is provided. This type is referred to a an SP-picture. The temporal redundancies are not exploited in I-frames, compression efficiency of I-frame coding is significantly lower than the predictive coding. A method allows use of motion compensated predictive coding to exploit temporal redundancy in the sequence while still allowing perfect reconstruction of the frame using different reference frames. Methods using this new picture type provide for error resilience/recovery, bandwidth scalability, bitstream switching, processing scalability, random access and other functions.
The SP-type picture provides for, among other functions, switching between different bitstreams, random access, fast forward and fast error-recovery by replacing I-pictures to increase the coding efficiency. As will be demonstrated, SP-pictures have the property that identical SP-frames may be obtained even when they are predicted using different reference frames.

Description

    CROSS REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of U.S. patent application Ser. No. 09/827,796 filed on Apr. 6, 2001 entitled VIDEO DECODER ARCHITECTURE AND METHOD FOR USING SAME and claims priority from same and Provisional Application No. 60/259,529 filed on Jan. 3, 2001, both incorporated herein by reference.[0001]
  • BACKGROUND
  • This invention relates generally to the field of the multimedia applications. More particularly, this invention relates to a new frame type, apparatus and method for using same to provide for access of a video stream. [0002]
  • Multimedia applications that include audio and streaming video information have come into greater use. Several multimedia groups have established and proposed standards for compressing/encoding and decompressing/decoding the audio and video information. The examples are MPEG standards, established by the Motion Picture Expert Group and standards developed by ITU-Telecommunications Standardization. [0003]
  • The following are incorporated herein by reference: [0004]
  • G. Bjontegaard, “H.26L Test Model Long Term Number 6 (TML-6) draft0”, document VCEG-L45, ITU-T Video Coding Experts. Group Meeting, Eibsee, Germany, 09-12 Jan. 2001. Keiichi Hibi, “Report of the Ad Hoc Committee on H.26L Development”, document Q15-H-07, ITU-T Video Coding Experts Group (Question 15) Meeting, Berlin, 03-06 Aug., 1999. Gary S. Greenbaum, “Remarks on the H.26L Project: Streaming Video Requirements for Next Generation Video Compression Standards”, document Q15-G-11, ITU-T Video Coding Experts Group (Question 15) Meeting, Monterey, 16-19 Feb., 1999. G. Bjontegaard, “Recommended Simulation Conditions for H.26L”, document Q15-I-62, ITU-T Video Coding Experts Group (Question 15) Meeting, Red Bank, N.J., 19-22 Oct., 1999. ATM & MPEG-2 Integrating Digital Video into Broadband Networks by Michael Orzessek and Peter Sommer (Prentice Hall Upper Saddle River N.J.). [0005]
  • Video sequences, like ordinary motion pictures recorded on film, comprise a sequence of still images, and the illusion of motion is created by displaying consecutive images at a relatively fast rate. For example, the display rate are between five and thirty frames per second. Because of the relatively fast frame rate, the images in consecutive frames tend to be similar. A typical scene recorded by a camera comprises some stationary elements, such as, for example, background scenery and some moving parts. The moving parts may take many different forms, for example, the face of a news reader, moving traffic, and so on. Alternatively, the camera recording the scene may itself be moving, in which case all elements of the image have the same kind of motion. In many cases, this means that the overall change between one video frame and the next is rather small. Of course, this depends on the nature of the movement, the rate of the movement, i.e., the amount of change from one frame to the next. [0006]
  • The purpose of the video coding is to remove the redundancy in the image sequence so that the encoded data rate is commensurate with the available bandwidth to transport the video sequence while keeping the distortion between the original and reconstructed images as small as possible. The redundancy in video sequences can be categorized into spatial and temporal redundancy. Spatial redundancy refers to the correlation between neighboring pixels in a frame while temporal redundancy refers to correlation between neighboring frames. [0007]
  • FIGS. 1A-1C illustrate the type of encoded/compressed video frames that are commonly utilized for video standards. FIG. 1A depicts an Intra-frame or I-[0008] type frame 200. The I-type frame or picture is a frame of video data that is coded exploiting only the spatial correlation of the pixels within the frame without using information from the past or the future and is utilized as the basis for decoding/decompression of other type frames. FIG. 1B is a representation of a Predictive-frame or P-type frame 210. The P-type frame or picture is a frame that is encoded/compressed using motion compensated prediction from I-type or P-type frames of its past, in this case, I.sub.1 200. That is, previous frames are used to encode/compress a present given frame of video data. 205 a represents the motion compensated prediction information to create a P-type frame 210. Since in a typical video sequence the adjacent frames in a sequence are highly correlated, higher compression efficiencies are achieved when using P-frames.
  • FIG. 1C depicts a Bi-directional-frame or B-[0009] type frame 220. The B-type frame or picture is a frame that is encoded/compressed using a motion compensated prediction derived from the I-type reference frame (200 in this example) or P-type reference frame in its past and the I-type reference frame or P-type reference frame (210 in this example) in its future or a combination of both. B-type frames are usually inserted between I-type frames or P-type frames. FIG. 2 represents a group of pictures in what is called display order I.sub.1 B.sub.2 B.sub.3 P.sub.4 B.sub.5 P.sub.6. FIG. 2 illustrates the B-type frames inserted between I-type and P-type frames and the direction which motion compensation information flows.
  • A system for P-frame encoding and decoding is provided and is shown in FIGS. 3 and 4. Referring to FIGS. 3 and 4, a communication system comprising an [0010] encoder 300 of FIG. 3 and a decoder 400 of FIG. 4 is operable to communicate a multimedia sequence between a sequence generator and a sequence receiver. Other elements of the video sequence generator and receiver are not shown for the purposes of simplicity. The communication path between sequence generator and receiver may take various forms, including but not limited to a radio-link.
  • [0011] Encoder 300 is shown in FIG. 3 coupled to receive video input on line 301 in the form of a frame to be encoded I(x, y), called the current frame. By (x, y) we denote location of the pixel within the frame. In the encoder the current frame I(x,y) is partitioned into rectangular regions of M×N pixels. These blocks are encoded using either only spatial correlation (intra coded blocks) or both spatial and temporal correlation (inter coded blocks). In what follows we concentrate on inter blocks.
  • Each of inter coded blocks is predicted using motion information from the previously coded and transmitted frame, called reference frame and denoted as R(x,y), which at given instant is available in the [0012] frame memory 350 of the encoder 300. The motion information of the block may be represented by two dimensional motion vector (Δx(x,y), Δy(x,y)) where Δx(x,y) is the horizontal and Δy(x,y) is the vertical displacement, respectively of the pixel in location (x,y) between the current frame and the reference frame. The motion vectors (Δx( ), Δy( )) are calculated by the motion estimation and coding block 370. The input to the motion estimation and coding block 370 are current frame and reference frame. The motion information is provided to a Motion Compensated (MC) prediction block 360. The MC prediction block is also coupled to a frame memory 350 to receive the reference frame. In the MC block 360, the motion vectors for each inter block together with the reference frame are used to construct prediction frame P(x, y):
  • P(x, y)=R(x+Δx(x,y), y+Δy(x,y)).
  • Notice that values of the prediction frame are calculated only for inter blocks. For some pixels (x,y) which belong to intra blocks these values will not be calculated. It is also possible to use more than one reference frame. In such case different blocks may use different reference frames. [0013]
  • Subsequently, the prediction error E(x, y), i.e., the difference between the current frame and the prediction frame P(x, y) is calculated by: [0014]
  • E(x, y)=I(x, y)−P(x, y).
  • In [0015] transform block 310, the prediction error for each K×L block is represented as weighted sum of a transform basis functions f.sub.ij(x, y), E ( x , y ) = i = 1 K j = 1 L c · sub · err ( i , j ) f · sub · ij ( x , y ) .
    Figure US20040240560A1-20041202-M00001
  • The weights c.sub.err(i,j), corresponding to the basis functions are called prediction error coefficients. Coefficients c.sub.err(i,j) can be calculated by performing so called forward transform. These coefficients are quantized in quantization block [0016] 320:
  • I.sub.err(i, j)=Q(c.sub.err(i,j),QP)
  • where I.sub.err(i, j) are the quantized coefficients. The operation of quantization introduces loss of information—the quantized coefficient can be represented with smaller number of bits. The level of compression (loss of information) is controlled by adjusting the value of the quantization parameter (QP). [0017]
  • The [0018] quantization block 320 is coupled to both a multiplexer 380 and an inverse quantization block 330 and in turn an inverse transform block 340. Blocks 330 and 340 provide decoded prediction error E.sub.c(x, y) which is added to the MC predicted frame P(x, y) by adder 345. These values can be further normalized and filtered and the result stored in frame memory 350.
  • Motion vectors and quantized coefficients are encoded using Variable Length Codes (VLC) which further reduce the number of bits needed for their representation. Encoded motion vectors and quantized coefficients as well as other additional information needed to represent each coded frame of the image sequence constitute a [0019] bitstream 415 which is transmitted to the decoder 400 of FIG. 4. Bitstream may be multiplexed 380 before transmission.
  • The special type of the inter coded blocks are copy coded blocks. For copy coded blocks values of both motion vectors and quantized prediction error coefficients I.sub.err are equal to 0. [0020]
  • FIG. 4 shows the [0021] decoder 400 of the communication system. Bitstream 415 is received from encoder 300 of FIG. 3. Bitstream 415 is demultiplexed via demultiplexer 410. Dequantized coefficients d.sub.err(i,j) are calculated in the inverse quantization block 420:
  • d.sub.err(i, j)=Q −1(I.sub.err(i, j), QP).
  • In [0022] inverse transform block 430, the dequantized coefficients are used to obtain compressed prediction error by performing inverse transform: E · sub · c ( x , y ) = i = 1 K j = 1 L d · sub · err ( i . j ) f · sub · ij ( x , y ) .
    Figure US20040240560A1-20041202-M00002
  • The pixels of the current coded frame are reconstructed by finding the prediction pixels in the reference frame R(x,y) using the received motion vectors and then adding to the compressed prediction error in adder [0023] 435:
  • I.sub.c(x, y)=R(x+Δx, y+Δ,y)+E.sub.c(x, y).
  • To obtain reconstructed image these values can be further normalized and filtered. [0024]
  • An example of a forward transform is provided by “H.26L Test Model Long Term Number 6 (TML-6) draft0”, document VCEG-L45, ITU-T Video Coding Experts Group Meeting, Eibsee, Germany, 09-12 January 2001. The forward transformation of some pixels a, b, c, d into 4 transform coefficients A, B, C, D is defined by: [0025]
  • A=13a+13b+13c+13d
  • B=17a+7b−7c−17d
  • C=13a−13b−13c+13d
  • D=7a−17b+17c−7d
  • The inverse transformation of transform coefficients A, B, C, D into 4 pixels a′, b′,c′, d′ is defined by: [0026]
  • a′=13A+17B+13C+7D
  • b′=13A+7B−13C−17D
  • c′=13A−7B−13C+17D
  • d′=13A−17B+13C−7D
  • The transform/inverse transform is performed for 4×4 blocks by performing defined above one dimensional transform/inverse transform both vertically and horizontally. [0027]
  • In “H.26L Test Model Long Term Number 6 (TML-6) draft0”, document VCEG-L45, ITU-T Video Coding Experts Group Meeting, Eibsee, Germany, 09-12 Jan. 2001 for chroma component, an additional 2×2 transform for the DC coefficients is performed. [0028]
  • DCC(0,0)=(DC0+DC1+DC2+DC3)/2
  • DCC(1,0)=(DC0−DC1+DC2−DC3)/2
  • DCC(0,1)=(DC0+DC1−DC2−DC3)/2
  • DCC(1,1)=(DC0−DC1−DC2+DC3)/2
  • Definition of the corresponding inverse transform: [0029]
  • DC0=(DCC(0,0)+DCC(1,0)+DCC(0,1)+DCC(1,1))/2
  • DC1=(DCC(0,0)−DCC(1,0)+DCC(0,1)−DCC(1,1))/2
  • DC2=(DCC(0,0)+DCC(1,0)−DCC(0,1)−DCC(1,1))/2
  • DC3=(DCC(0,0)−DCC(1,0)−DCC(0,1)+DCC(1,1))/2
  • In “H.26L Test Model Long Term Number 6 (TML-6) draft0”, document VCEG-L45, ITU-T Video Coding Experts Group Meeting, Eibsee, Germany, 09-12 Jan. 2001 to obtain values of reconstructed image the results of the inverse transform are normalized by shifting by 20 bits (with rounding). [0030]
  • An example of quantization/dequantization is provided by “H.26L Test Model Long Term Number 6 (TML-6) draft0”, document VCEG-L45, ITU-T Video Coding Experts Group Meeting, Eibsee, Germany, 09-12 Jan. 2001. A coefficient c is quantized in the following way: [0031]
  • I=(c×A(QP)+220)//220
  • where f may be in the range (0-0.5) and f may have the same sign as c. By // division with truncation is denoted. The dequantized coefficient is calculated as follows: [0032]
  • d=I×B(QP)
  • Values of A(QP) and B(QP) are given below: [0033]
  • A(QP=0, . . . ,31)=[620, 553, 492, 439, 391, 348, 310, 276, 246, 219, 195, 174, 155, 138, 123, 110, 98, 87, 78, 69, 62, 55, 49, 44, 39, 35, 31, 27, 24, 22, 19, 17]; [0034]
  • B(QP═0, . . . ,31)=[3881, 4351, 4890, 5481, 6154, 6914, 7761, 8718, 9781, 10987, 12339, 13828, 15523, 17435, 19561, 21873, 24552, 27656, 30847, 34870, 38807, 43747, 49103, 54683, 61694, 68745, 77615, 89113, 100253, 109366, 126635, 141533]; [0035]
  • Video streaming has emerged as one of the essential applications over the fixed internet and—in the near future over 3G multimedia networks. In streaming applications, the server starts streaming the pre-encoded video bitstream to the receiver upon a request from the receiver which plays the stream as it receives with a small delay. The problem with video streaming is that the best-effort nature of today's networks causes variations of the effective bandwidth available to a user due to the changing network conditions. The server should then scale the bitrate of the compressed video to accommodate these variations. In case of conversational services that are characterized by real-time encoding and point-to-point delivery, this is achieved by adjusting, on the fly, the source encoding parameters, such as quantization parameter or frame rate, based on the network feedback. In typical streaming scenarios when already encoded video bitstream is to be streamed to the client, the above solution can not be applied. [0036]
  • The simplest way of achieving bandwidth scalability in case of pre-encoded sequences is by producing multiple and independent streams of different bandwidth and quality. The server dynamically switches between the streams to accommodate variations of the bandwidth available to the client. [0037]
  • Now assume that we have multiple bitstreams generated independently with different encoding parameters, such as quantization parameter, corresponding to the same video sequence. Since encoding parameters are different for each bitstream, the reconstructed frames of different bitstreams at the same time instant will not be the same. Therefore when switching between bitstreams, i.e., starting to decode a bitstream, at arbitrary locations would lead to visual artifacts due to the mismatch between the reference frames used to obtain predicted frame. Furthermore, the visual artifacts will not only be confined to the switched frame but will further propagate in time due to motion compensated coding. [0038]
  • In the current video encoding standards, perfect (mismatch-free) switching between bitstreams is achieved possible only at the positions where the future frames/regions do not use any information previous to the current switching location, i.e., at I-frames. Furthermore, by placing I-frames at fixed (e.g. 1 sec) intervals, VCR functionalities, such as random access or “Fast Forward” and “Fast Backward” (increased playback rate) for streaming video content, are achieved. User may skip a portion of video and restart playing at any I-frame location. Similarly, increased playback rate can be achieved by transmitting only I-pictures. The drawback of using I-frames in these applications is that since I-frames are not allowed to utilize temporal redundancy they require much larger number of bits than P-frames. [0039]
  • The above-mentioned references are exemplary only and are not meant to be limiting in respect to the resources and/or technologies available to those skilled in the art. [0040]
  • SUMMARY
  • A new picture or frame type and method of using same is provided. This type of novel frame type is referred to as a SP-picture. SP-picture uses motion compensated predictive coding to exploit temporal redundancy in the sequence. The difference between SP and P-pictures is that using SP-pictures identical frames may be obtained even when different reference frames are used for prediction. This property allows SP-pictures to replace I-pictures in numerous applications such as switching from one bitstream to another, random access, fast-forward, fast-backward. At the same time since SP-frames unlike I-frames are utilizing motion compensated predictive coding they require smaller number of bits than I-frames. [0041]
  • These and other features, aspects, and advantages of embodiments of the present invention will become apparent with reference to the following description in conjunction with the accompanying drawings. It is to be understood, however, that the drawings are designed solely for the purposes of illustration and not as a definition of the limits of the invention, for which reference should be made to the appended claims. [0042]
  • BRIEF DESCRIPTIONS OF THE DRAWINGS
  • FIG. 1A is a diagram showing the encoding of an I-type frame or I-picture. [0043]
  • FIG. 1B is a diagram showing the encoding of a P-type frame or P-picture. [0044]
  • FIG. 1C is a diagram showing the encoding of a B-type frame or B-picture. [0045]
  • FIG. 2 is a diagram showing B-type frame inserted between I-type and P-type frames and the direction which motion compensation information flows. [0046]
  • FIG. 3 is a block diagram of a generic motion-compensated predictive video coding system (encoder). [0047]
  • FIG. 4 is a block diagram of a generic motion-compensated predictive video coding system (decoder). [0048]
  • FIG. 5 is an illustration showing switching between [0049] bitstreams 1 and 2 using SP-pictures.
  • FIG. 6 is a block diagram of a decoder in accordance with an embodiment of the invention. [0050]
  • FIG. 7 is an illustration of random access using SP-pictures. [0051]
  • FIG. 8 is an illustration of a fast-forward process using SP-pictures. [0052]
  • DETAILED DESCRIPTION
  • A new decoder architecture is provided which has the property that identical frames may be obtained even when they are predicted using different reference frames. The picture type obtained using this structure will be called SP-frame also may be referred to as picture. This property allows SP-pictures to replace I-pictures in numerous applications such as switching from one bitstream to another, random access, fast-forward, fast-backward. Since unlike l-frames SP-frames are using motion compensated prediction they require a lot less bits than the I-frames. [0053]
  • Some of possible applications of SP-frames are described below: [0054]
  • Bitstream Switching: [0055]
  • An example of how to utilize SP-frames to switch between different bitstreams is illustrated in the FIG. 5. FIG. 5 shows two bitstreams corresponding to the same sequence encoded at different bitrates—bitstream [0056] 1 (510) and bitstream 2 (520). Within each encoded bitstream, SP-pictures should be placed at locations at which one wants to allow switching from one bitstream to another (pictures S.sub.1 (513), and S.sub.2 (523) in FIG. 5). When switching from bitstream 1 (510) to bitstream 2 (520), another picture of this type will be transmitted (in FIG. 5 picture S.sub.12 (550) will be transmitted instead of S.sub.2 (523)). Pictures S.sub.2 (523) and S.sub.12 (550) in FIG. 5 are represented by different bitstreams. However, their reconstructed values are identical.
  • Random Access: [0057]
  • Application of SP-pictures to enable random access is depicted in FIG. 7. SP-pictures are placed at fixed intervals within bitstream [0058] 1 (720) (e.g. picture S.sub.1 (730)) which is being streamed to the client. To each one of these SP-pictures there is a corresponding pair of pictures generated and stored as another bitstream (bitstream 2 (740)):
  • I-picture, I.sub.[0059] 2 (750), at the temporal location preceding SP-picture.
  • SP-[0060] picture 710, S.sub.2, at the same temporal location as SP-picture.
  • Pictures stored in bitstream [0061] 2 (740) are only used when random access is requested by a client. Bitstream 1 (720) may then be accessed at a location corresponding to an I-picture in bitstream 2 (740). For example to access bitstream 1 at frame I.sub.2, first the pictures I.sub.2, S.sub.2 from bitstream 2 are transmitted and then the following pictures from bitstream 1 are transmitted.
  • Fast-Forward: [0062]
  • If in FIG. 7 [0063] bitstream 2 will constitute of only SP-pictures predicted from each other placed in larger temporal intervals (e.g. each 1 sec) the structure presented in this figure can be used to obtain “Fast Forward” functionality. Due to the usage of SP-pictures “Fast Forward” can start at any bitstream location. In similar manner “Fast Backward” functionality can be obtained.
  • Video Redundancy Coding: [0064]
  • SP-pictures have other uses in applications in which they do not act as replacements of I-pictures. Video Redundancy Coding can be given as an example (VRC). “The principle of the VRC method is to divide the sequence of pictures into two or more threads in such a way that all camera pictures are assigned to one of the threads in a round-robin fashion. Each thread is coded independently. In regular intervals, all threads converge into a so-called sync frame. From this sync frame, a new thread series is started. If one of these threads is damaged because of a packet loss, the remaining threads stay intact and can be used to predict the next sync frame. It is possible to continue the decoding of the damaged thread, which leads to slight picture degradation, or to stop its decoding which leads to a drop of the frame rate. Sync frames are always predicted out of one of the undamaged threads. This means that the number of transmitted I-pictures can be kept small, because there is no need for complete re-synchronization.” For the sync frame, more than one representation (P-picture) is sent, each one using a reference picture from a different thread. Due to the usage of P-pictures these representations are not identical. Therefore, mismatch is introduced when some of the representations cannot be decoded and their counterparts are used when decoding the following threads. Usage of SP-pictures as sync frames eliminates this problem. [0065]
  • Error Resiliency/Recovery: [0066]
  • Multiple representations of a single frame in the form of SP-frames predicted from different reference pictures, e.g., the immediate previously reconstructed frames and a reconstructed frame further back in time, can be used to increase error resilience. Consider the case when an already encoded bitstream is being streamed and there has been a packet loss leading to a frame loss. The client signals the lost frame(s) to the sender which responds by sending the next SP-frame in the representation that uses frames that have been already received by the client. [0067]
  • In the following, we provide a detailed description of SP-picture encoding/decoding within the context of H.26L. [0068]
  • SP-frame comprises blocks encoded using spatial correlation among the pixels (intra blocks) and blocks encoded using both spatial and temporal correlation (inter or copy blocks). Inter and copy coded blocks are reconstructed differently than in P-type frames. [0069]
  • Value of each pixel S(x,y) in the inter or copy coded block is decoded as a weighted sum of the basis functions f.sub.ij(x,y) where the weigh values d.sub.rec will be called dequantized reconstruction image coefficients. The values of d.sub.rec are obtained by quantization and dequantization of reconstruction image coefficients c.sub.rec. Reconstruction image coefficients c.sub.rec are formed using [0070]
  • Motion compensation prediction of this block constructed using previously transmitted and decoded frames and received motion vectors. [0071]
  • Received quantized prediction error coefficients I.sub.err. [0072]
  • Values S(x,y) can be further normalized and filtered. [0073]
  • The invention is described in view of certain embodiments. Variations and modification are deemed to be within the spirit and scope of the invention. The changes required in H.26L Test Model in order to implement this embodiment of the present invention are also described. [0074]
  • SP-picture Decoding [0075]
  • The decoding of inter and copy coded blocks in SP-picture is described. Two different values of quantization parameter QP: QP[0076] 1 and QP2 can be used during quantization/dequantization process for these blocks. Values of QP1 and QP2 can be different when quantization/dequantization process is performed for luma component than when it is performed for chroma component.
  • The reconstructed values for inter and copy coded blocks are calculated as follows: [0077]
  • 1. Form prediction P(x,y) of current block using received motion vectors and the reference frame. Calculate transform coefficients c.sub.pred for P(x,y). These coefficients can be calculated by performing forward transform on P(x,y). [0078]
  • 2. Calculate reconstruction image coefficients [0079]
  • c.sub.rec=c.sub.pred+alpha(QP 2I.sub.err.
  • where alpha(QP) is a parameter dependent on method of quantization and used QP value. Quantize c.sub.rec using quantization parameter QP=QP[0080] 1. The quantized values will be referred to as quantized reconstructed image coefficients and denoted as I.sub.rec.
  • When implementing this step in “H.26L Test Model Long Term Number 6 (TML-6) draft0”, document VCEG-L45, ITU-T Video Coding Experts Group Meeting, Eibsee, Germany, 09-12 Jan. 2001 to reduce computational complexity calculation of c.sub.rec and their quantization are combined: [0081]
  • I.sub.rec=(c.sub.pred×A(QP 1)+I.sub.err×F( QP 1,QP 2)+0.5×220)//220
  • where F(QP[0082] 1,QP2)═(2 20×A(QP1)+0.5×A(QP2)//A(QP2). Constant A(QP) is defined above in the section on quantization.
  • 3. Dequantize I.sub.rec using QP=QP[0083] 1. The dequantized coefficients are equal to d.sub.rec.
  • 4. Inverse transform is performed for d.sub.rec. These values can be further normalized and filtered. [0084]
  • Another embodiment for SP-Picture decoding [0085]
  • The blocks with type inter and copy are reconstructed as follows: [0086]
  • 1. Form prediction P(x,y) of current block using received motion vectors and the reference frame. [0087]
  • 2. Calculate transform coefficients c.sub.pred for P(x,y). These coefficients can be calculated by performing forward transform for P(x,y). [0088]
  • 3. Quantize coefficients c.sub.pred using quantization parameter QP=QP[0089] 1. The quantized values will be referred to as quantized prediction image coefficients and denoted as I.sub.pred. Obtain quantized reconstruction image coefficients I.sub.rec by adding the received quantized coefficients for the prediction error I.sub.err to I.sub.pred, i.e.,
  • I.sub.rec=I.sub.pred+(beta(QP 2I.sub.err+0.5×beta(QP 1))//beta(QP 1).
  • where beta(QP) is a parameter dependent on method of quantization and used QP value. In case of quantization used in “H.26L Test Model Long Term Number 6 (TML-6) draft0”, document VCEG-L45, ITU-T Video Coding Experts Group Meeting, Eibsee, Germany, 09-12 Jan. 2001, parameter beta(QP)=B(QP) where constant B(QP) is defined above in the section on quantization. [0090]
  • 4. Dequantize I.sub.rec using QP=QP[0091] 1. The dequantized coefficients are equal to d.sub.rec.
  • 5. Inverse transform is performed for d.sub.rec. These values can be further normalized and filtered. [0092]
  • In the following, we describe the encoding of SP-frames for the decoder structure described as the preferred embodiment of the invention. [0093]
  • As can be observed from FIG. 5, there are two types of SP-frames, specifically, the SP-frames; placed within the bitstream, e.g., S.sub.[0094] 1 (513) and S.sub.2 (523) in FIG. 5, and the SP-frames (S.sub.12 in FIG. 5) that will be sent when there is a switch between bitstreams (from bitstream 1 to bitstream 2). The encoding of S.sub.2 (523) and S.sub.12 (550) are such that their reconstructed frames are identical although they use different reference frames as described below.
  • When encoding an SP-picture placed within a bitstream (S.sub.[0095] 1 (513) and S.sub.2 (523) in FIG. 5), the encoding of inter and copy coded blocks is performed as follows:
  • 1. Calculate motion vectors using same method as for P-pictures. After motion compensation calculate transform coefficients for predicted P(x,y) and current image I(x,y) by performing forward transform. The transform coefficients for the current image are denoted as c.sub.orig and for the predicted image as c.sub.pred. [0096]
  • 2. Transform coefficients for the predicted blocks are quantized using QP=QP[0097] 1. Obtained levels are denoted as I.sub.pred.
  • 3. The prediction error coefficients are obtained by c.sub.err=c.sub.orig−I.sub.pred×alpha(QP[0098] 1) where alpha(QP) is a parameter dependent on method of quantization and used QP value.
  • When SP-frames are used in “H.26L Test Model Long Term Number 6 (TML-6) draft0”, document VCEG-L45, ITU-T Video Coding Experts Group Meeting, Eibsee, Germany, 09-12 Jan. 2001 [0099]
  • alpha(QP)=(220+0.5×A(QP))//A(QP)
  • where constant A(QP) is defined above in the section on quantization. [0100]
  • 4. The prediction error coefficients are quantized using QP=QP[0101] 2. It is recommended to use rate distortion constrained quantization (RDQ) in which more than one coefficient are considered together in the quantization process. An example of use rate distortion constrained quantization is described in “H.26L Test Model Long Term Number 6 (TML-6) draft0”, document VCEG-L45, ITU-T Video Coding Experts Group Meeting, Eibsee, Germany, 09-12 Jan. 2001.
  • Let as assume that we want to encode the SP-picture, denoted as S.sub.[0102] 12 in FIG. 5, to switch from bitstream 1 to bitstream 2. The reconstructed values of this picture have to be identical to the reconstructed values of SP-picture in bitstream 2, denoted as S.sub.2 in FIG. 5, to which we are switching. The bitstream of the Intra macroblocks in frame S.sub.2 are copied to S.sub.12. The encoding of inter macroblocks is performed as follows:
  • 1. Form the predicted frame for S.sub.[0103] 12 by performing motion estimation with the reference frames being pictures preceding S.sub.1 in bitstream 1.
  • 2. Calculate transform coefficients for predicted image by performing forward transform. The transform coefficients for the predicted image are denoted as c.sub.pred. [0104]
  • 3. Quantize the obtained coefficients c.sub.pred using QP=QP[0105] 1 and subtract the quantized coefficient levels I.sub.pred from the corresponding I.sub.rec of S.sub.2-picture. The resulting levels are the levels of the prediction error which will be transmitted to the decoder.
  • An embodiment of a [0106] decoder 600 in accordance with an embodiment of the invention is illustrated in FIG. 6. Referring to FIG. 6, dedcoder 600 comprises, inter alia, a demultiplexer 610, inverse quantization block 620,m inverse transform block 630, frame memory 640, MC prediction block 650, tranform block 660, quantization block 670.
  • The invention is described in view of certain embodiments. Variations and modification are deemed to be within the spirit and scope of the invention. For instance, data from the demultiplexer may be normalized before proceeding to adder and inverse quantization. It will be obvious to those skilled in the art after reading the specification including the appended claims that various changes in form and detail may be made therein without departing from the spirit and scope of the invention. [0107]

Claims (5)

1 to 18 cancelled.
19. A method of coding a switching picture S.sub.12 for switching from a first bitstream having a SP-picture S.sub.1 to a second bitstream having a second SP-picture S.sub.2, said method comprising the steps of; copying the bitstream of the intra macroblocks in second SP-picture S.sub.2 to switching picture S.sub.12; and encoding said copied macroblocks comprising the steps of: forming a predicted frame for S.sub.12 by performing motion estimation with a plurality of reference pictures which are preceding S.sub.1 in said first bitstream; calculating a set of transform coefficients for predicted image c.sub.pred by performing a forward transform; Quantizing the obtained transform coefficients to resulting in quantized coefficient levels; and subtracting said quantized coefficient levels from a set of corresponding coefficient levels of said second SP-picture S.sub.2.
20. A method of reconstructing blocks of encoded data comprising the steps of: forming a prediction P(x,y) of a current block using a plurality of received motion vectors and a reference frame; performing a forward transform on P(x,y) to obtain a set of transform coefficients c.sub.pred for P(x,y); quantizing said set of transform coefficients c.sub.pred, resulting in quantized prediction image coefficients I.sub.pred. adding said received quantized coefficients for the prediction error I.sub.err to I.sub.pred resulting in quantized reconstruction image coefficients I.sub.rec using: I.sub.rec=I.sub.pred+(beta(QP2).times.I.sub.err+0.5.time-s.beta(QP1))//beta(OP1). wherein beta(QP) is a parameter dependent on method of quantization and used QP value; dequantizing I.sub.rec resulting in dequantized coefficients d.sub.rec; and performing inverse transform for d.sub.rec.
21. The method of claim 20, further comprising the step of normalizing said plurality of inverse transformed dequantized coefficients.
22. The method of claim 21, further comprising the step of filtering said plurality of normalized inverse transformed dequantized coefficients.
US10/869,092 2001-01-03 2004-06-16 Video decoder architecture and method for using same Abandoned US20040240560A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US10/869,092 US20040240560A1 (en) 2001-01-03 2004-06-16 Video decoder architecture and method for using same

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US25952901P 2001-01-03 2001-01-03
US09/827,796 US20020122491A1 (en) 2001-01-03 2001-04-06 Video decoder architecture and method for using same
US10/869,092 US20040240560A1 (en) 2001-01-03 2004-06-16 Video decoder architecture and method for using same

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US09/827,796 Continuation US20020122491A1 (en) 2001-01-03 2001-04-06 Video decoder architecture and method for using same

Publications (1)

Publication Number Publication Date
US20040240560A1 true US20040240560A1 (en) 2004-12-02

Family

ID=34078815

Family Applications (3)

Application Number Title Priority Date Filing Date
US09/827,796 Abandoned US20020122491A1 (en) 2001-01-03 2001-04-06 Video decoder architecture and method for using same
US10/250,838 Active 2024-06-09 US7706447B2 (en) 2001-01-03 2002-01-03 Switching between bit-streams in video transmission
US10/869,092 Abandoned US20040240560A1 (en) 2001-01-03 2004-06-16 Video decoder architecture and method for using same

Family Applications Before (2)

Application Number Title Priority Date Filing Date
US09/827,796 Abandoned US20020122491A1 (en) 2001-01-03 2001-04-06 Video decoder architecture and method for using same
US10/250,838 Active 2024-06-09 US7706447B2 (en) 2001-01-03 2002-01-03 Switching between bit-streams in video transmission

Country Status (2)

Country Link
US (3) US20020122491A1 (en)
ZA (1) ZA200304086B (en)

Families Citing this family (69)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004515163A (en) * 2000-11-29 2004-05-20 ブリティッシュ・テレコミュニケーションズ・パブリック・リミテッド・カンパニー Transmission and reception of real-time data
KR100425676B1 (en) * 2001-03-15 2004-04-03 엘지전자 주식회사 Error recovery method for video transmission system
US8209429B2 (en) * 2001-07-19 2012-06-26 British Telecommunications Public Limited Company Video stream switching
JP2005503722A (en) * 2001-09-21 2005-02-03 ブリティッシュ・テレコミュニケーションズ・パブリック・リミテッド・カンパニー Data communication method and system using reception of buffer size to calculate transmission rate for congestion control
AU2002365821A1 (en) * 2001-11-30 2003-06-17 British Telecommunications Public Limited Company Data transmission
US7020203B1 (en) * 2001-12-21 2006-03-28 Polycom, Inc. Dynamic intra-coded macroblock refresh interval for video error concealment
JP3923898B2 (en) * 2002-01-18 2007-06-06 株式会社東芝 Image coding method and apparatus
US20030151753A1 (en) * 2002-02-08 2003-08-14 Shipeng Li Methods and apparatuses for use in switching between streaming video bitstreams
ATE490649T1 (en) * 2002-03-27 2010-12-15 British Telecomm VIDEO CODING AND TRANSMISSION
EP1359722A1 (en) * 2002-03-27 2003-11-05 BRITISH TELECOMMUNICATIONS public limited company Data streaming system and method
KR100917743B1 (en) * 2002-03-27 2009-09-15 브리티쉬 텔리커뮤니케이션즈 파블릭 리미티드 캄퍼니 Data structure for data streaming system
JP4355156B2 (en) 2002-04-16 2009-10-28 パナソニック株式会社 Image decoding method and image decoding apparatus
WO2004023819A2 (en) * 2002-09-06 2004-03-18 Koninklijke Philips Electronics N.V. Content-adaptive multiple description motion compensation for improved efficiency and error resilience
GB0306296D0 (en) * 2003-03-19 2003-04-23 British Telecomm Data transmission
US7471724B2 (en) * 2003-06-23 2008-12-30 Vichip Corp. Limited Method and apparatus for adaptive multiple-dimensional signal sequences encoding/decoding
US8868772B2 (en) 2004-04-30 2014-10-21 Echostar Technologies L.L.C. Apparatus, system, and method for adaptive-rate shifting of streaming content
US20070230574A1 (en) * 2004-05-25 2007-10-04 Koninklijke Philips Electronics N.C. Method and Device for Encoding Digital Video Data
JP4559811B2 (en) * 2004-09-30 2010-10-13 株式会社東芝 Information processing apparatus and information processing method
US7609765B2 (en) 2004-12-02 2009-10-27 Intel Corporation Fast multi-frame motion estimation with adaptive search strategies
US20060140591A1 (en) * 2004-12-28 2006-06-29 Texas Instruments Incorporated Systems and methods for load balancing audio/video streams
EP1839445A2 (en) * 2005-01-18 2007-10-03 Thomson Licensing Method and apparatus for estimating channel induced distortion
US9661376B2 (en) * 2005-07-13 2017-05-23 Polycom, Inc. Video error concealment method
EP1911285A4 (en) * 2005-07-22 2009-12-02 Empirix Inc Method of transmitting pre-encoded video
DE102005049017B4 (en) * 2005-10-11 2010-09-23 Carl Zeiss Imaging Solutions Gmbh Method for segmentation in an n-dimensional feature space and method for classification based on geometric properties of segmented objects in an n-dimensional data space
US20070098274A1 (en) * 2005-10-28 2007-05-03 Honeywell International Inc. System and method for processing compressed video data
US20070116077A1 (en) * 2005-11-22 2007-05-24 Nlight Photonics Corporation Vertically displaced stack of multi-mode single emitter laser diodes
US20070115617A1 (en) * 2005-11-22 2007-05-24 Nlight Photonics Corporation Modular assembly utilizing laser diode subassemblies with winged mounting blocks
WO2007061509A2 (en) * 2005-11-22 2007-05-31 Nlight Photonics Corporation Modular diode laser assembly
US20070116071A1 (en) * 2005-11-22 2007-05-24 Nlight Photonics Corporation Modular diode laser assembly
WO2007111473A1 (en) * 2006-03-27 2007-10-04 Electronics And Telecommunications Research Institute Scalable video encoding and decoding method using switching pictures and apparatus thereof
US8358693B2 (en) * 2006-07-14 2013-01-22 Microsoft Corporation Encoding visual data with computation scheduling and allocation
US8311102B2 (en) * 2006-07-26 2012-11-13 Microsoft Corporation Bitstream switching in multiple bit-rate video streaming environments
US8340193B2 (en) * 2006-08-04 2012-12-25 Microsoft Corporation Wyner-Ziv and wavelet video coding
WO2008020792A1 (en) * 2006-08-17 2008-02-21 Telefonaktiebolaget Lm Ericsson (Publ) Error recovery for rich media
US9094686B2 (en) * 2006-09-06 2015-07-28 Broadcom Corporation Systems and methods for faster throughput for compressed video data decoding
CN101523908A (en) * 2006-10-02 2009-09-02 艾利森电话股份有限公司 Multimedia management
US7388521B2 (en) * 2006-10-02 2008-06-17 Microsoft Corporation Request bits estimation for a Wyner-Ziv codec
US8375304B2 (en) * 2006-11-01 2013-02-12 Skyfire Labs, Inc. Maintaining state of a web page
US9247260B1 (en) 2006-11-01 2016-01-26 Opera Software Ireland Limited Hybrid bitmap-mode encoding
US8711929B2 (en) * 2006-11-01 2014-04-29 Skyfire Labs, Inc. Network-based dynamic encoding
US8443398B2 (en) * 2006-11-01 2013-05-14 Skyfire Labs, Inc. Architecture for delivery of video content responsive to remote interaction
WO2008056923A1 (en) * 2006-11-07 2008-05-15 Samsung Electronics Co, . Ltd. Method and apparatus for encoding and decoding based on intra prediction
KR100846512B1 (en) * 2006-12-28 2008-07-17 삼성전자주식회사 Method and apparatus for video encoding and decoding
US8630512B2 (en) * 2007-01-25 2014-01-14 Skyfire Labs, Inc. Dynamic client-server video tiling streaming
US8340192B2 (en) * 2007-05-25 2012-12-25 Microsoft Corporation Wyner-Ziv coding with multiple side information
EP2383920B1 (en) 2007-12-20 2014-07-30 Optis Wireless Technology, LLC Control channel signaling using a common signaling field for transport format and redundancy version
US8325800B2 (en) 2008-05-07 2012-12-04 Microsoft Corporation Encoding streaming media as a high bit rate layer, a low bit rate layer, and one or more intermediate bit rate layers
US8379851B2 (en) * 2008-05-12 2013-02-19 Microsoft Corporation Optimized client side rate control and indexed file layout for streaming media
US7949775B2 (en) 2008-05-30 2011-05-24 Microsoft Corporation Stream selection for enhanced media streaming
US8265140B2 (en) 2008-09-30 2012-09-11 Microsoft Corporation Fine-grained client-side control of scalable media delivery
US20110090965A1 (en) * 2009-10-21 2011-04-21 Hong Kong Applied Science and Technology Research Institute Company Limited Generation of Synchronized Bidirectional Frames and Uses Thereof
EP2458861A1 (en) * 2010-11-25 2012-05-30 ST-Ericsson SA Bit rate regulation module and method for regulating bit rate
KR101187530B1 (en) * 2011-03-02 2012-10-02 한국과학기술원 Rendering strategy for monoscopic, stereoscopic and multi-view computer generated imagery, system using the same and recording medium for the same
US9635374B2 (en) 2011-08-01 2017-04-25 Apple Inc. Systems and methods for coding video data using switchable encoders and decoders
US20130083845A1 (en) 2011-09-30 2013-04-04 Research In Motion Limited Methods and devices for data compression using a non-uniform reconstruction space
EP2595382B1 (en) 2011-11-21 2019-01-09 BlackBerry Limited Methods and devices for encoding and decoding transform domain filters
WO2014055826A2 (en) * 2012-10-05 2014-04-10 Huawei Technologies Co., Ltd. Improved architecture for hybrid video codec
KR20150056811A (en) 2012-11-13 2015-05-27 인텔 코포레이션 Content adaptive transform coding for next generation video
US9794569B2 (en) 2013-01-30 2017-10-17 Intel Corporation Content adaptive partitioning for prediction and coding for next generation video
EP2804374A1 (en) 2013-02-22 2014-11-19 Thomson Licensing Coding and decoding methods of a picture block, corresponding devices and data stream
EP2804375A1 (en) 2013-02-22 2014-11-19 Thomson Licensing Coding and decoding methods of a picture block, corresponding devices and data stream
US8881213B2 (en) * 2013-03-13 2014-11-04 Verizon Patent And Licensing Inc. Alignment of video frames
JP6225446B2 (en) * 2013-03-26 2017-11-08 富士通株式会社 Moving image data distribution apparatus, method, program, and system
US9609336B2 (en) * 2013-04-16 2017-03-28 Fastvdo Llc Adaptive coding, transmission and efficient display of multimedia (acted)
US9462306B2 (en) * 2013-07-16 2016-10-04 The Hong Kong University Of Science And Technology Stream-switching in a content distribution system
US10271062B2 (en) * 2016-03-18 2019-04-23 Google Llc Motion vector prediction through scaling
US10484701B1 (en) * 2016-11-08 2019-11-19 Amazon Technologies, Inc. Rendition switch indicator
US11095907B2 (en) 2017-03-27 2021-08-17 Nokia Technologies Oy Apparatus, a method and a computer program for video coding and decoding
CN115460466A (en) * 2022-08-23 2022-12-09 北京泰豪智能工程有限公司 Video picture customization method and system in video communication

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6393055B1 (en) * 1997-05-27 2002-05-21 Koninklijke Philips Electronics N.V. Method of switching video sequences and corresponding switching device and decoding system
US6920175B2 (en) * 2001-01-03 2005-07-19 Nokia Corporation Video coding architecture and methods for using same

Family Cites Families (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69126525T2 (en) 1990-03-15 1997-09-25 Thomson Multimedia Sa DIGITAL IMAGE PROCESSING WITH FILTERING THE BLOCK EDGES
JPH05137131A (en) * 1991-11-13 1993-06-01 Sony Corp Inter-frame motion predicting method
EP0731614B1 (en) 1995-03-10 2002-02-06 Kabushiki Kaisha Toshiba Video coding/decoding apparatus
ES2148768T3 (en) * 1995-07-19 2000-10-16 Koninkl Philips Electronics Nv PROCEDURE AND DEVICE TO DECODE BINARY NUMERICAL TRAINS OF VIDEO AND RECEIVING EQUIPMENT THAT INCLUDES SUCH DEVICE.
EP0856228B1 (en) * 1995-10-20 1999-07-28 Nokia Mobile Phones Ltd. Motion vector field coding
JP2827997B2 (en) * 1995-12-28 1998-11-25 日本電気株式会社 Image signal Hadamard transform encoding device and decoding device
IT1285258B1 (en) 1996-02-26 1998-06-03 Cselt Centro Studi Lab Telecom HANDLING DEVICE FOR COMPRESSED VIDEO SEQUENCES.
US5708732A (en) * 1996-03-06 1998-01-13 Hewlett-Packard Company Fast DCT domain downsampling and inverse motion compensation
US6212235B1 (en) * 1996-04-19 2001-04-03 Nokia Mobile Phones Ltd. Video encoder and decoder using motion-based segmentation and merging
EP0843484B1 (en) * 1996-05-28 2002-01-02 Matsushita Electric Industrial Co., Ltd. Device predicting and decoding image
US6137834A (en) * 1996-05-29 2000-10-24 Sarnoff Corporation Method and apparatus for splicing compressed information streams
GB2318246B (en) 1996-10-09 2000-11-15 Sony Uk Ltd Processing digitally encoded signals
US6480541B1 (en) * 1996-11-27 2002-11-12 Realnetworks, Inc. Method and apparatus for providing scalable pre-compressed digital video with reduced quantization based artifacts
EP0945023B1 (en) 1996-12-10 2002-09-18 BRITISH TELECOMMUNICATIONS public limited company Video coding
EP0862332B1 (en) 1997-02-28 2003-05-07 Matsushita Electric Industrial Co., Ltd. Motion picture signal converting apparatus
US6052384A (en) * 1997-03-21 2000-04-18 Scientific-Atlanta, Inc. Using a receiver model to multiplex variable-rate bit streams having timing constraints
SG65064A1 (en) 1997-04-09 1999-05-25 Matsushita Electric Ind Co Ltd Image predictive decoding method image predictive decoding apparatus image predictive coding method image predictive coding apparatus and data storage media
US6012091A (en) * 1997-06-30 2000-01-04 At&T Corporation Video telecommunications server and method of providing video fast forward and reverse
US6501798B1 (en) 1998-01-22 2002-12-31 International Business Machines Corporation Device for generating multiple quality level bit-rates in a video encoder
US6611624B1 (en) 1998-03-13 2003-08-26 Cisco Systems, Inc. System and method for frame accurate splicing of compressed bitstreams
BR9904912A (en) * 1998-03-31 2000-06-20 Koninkl Philips Electronics Nv Process for modifying data in an encrypted data stream, and video processing device
JPH11341435A (en) * 1998-05-22 1999-12-10 Sony Corp Edit method and edit device
FR2782437B1 (en) 1998-08-14 2000-10-13 Thomson Multimedia Sa MPEG STREAM SWITCHING METHOD
JP2000115783A (en) * 1998-10-06 2000-04-21 Canon Inc Decoder and its method
US6434195B1 (en) 1998-11-20 2002-08-13 General Instrument Corporaiton Splicing of video data in progressively refreshed video streams
US7046910B2 (en) 1998-11-20 2006-05-16 General Instrument Corporation Methods and apparatus for transcoding progressive I-slice refreshed MPEG data streams to enable trick play mode features on a television appliance
JP3855522B2 (en) 1999-02-23 2006-12-13 松下電器産業株式会社 Video converter
US6658056B1 (en) * 1999-03-30 2003-12-02 Sony Corporation Digital video decoding, buffering and frame-rate converting method and apparatus
GB9908809D0 (en) 1999-04-16 1999-06-09 Sony Uk Ltd Signal processor
FR2795272B1 (en) 1999-06-18 2001-07-20 Thomson Multimedia Sa MPEG STREAM SWITCHING METHOD
US6735249B1 (en) 1999-08-11 2004-05-11 Nokia Corporation Apparatus, and associated method, for forming a compressed motion vector field utilizing predictive motion coding
GB2353655B (en) 1999-08-26 2003-07-23 Sony Uk Ltd Signal processor
GB2353653B (en) 1999-08-26 2003-12-31 Sony Uk Ltd Signal processor
US6765963B2 (en) 2001-01-03 2004-07-20 Nokia Corporation Video decoder architecture and method for using same
US6804301B2 (en) 2001-08-15 2004-10-12 General Instrument Corporation First pass encoding of I and P-frame complexity for compressed digital video
US6956600B1 (en) 2001-09-19 2005-10-18 Bellsouth Intellectual Property Corporation Minimal decoding method for spatially multiplexing digital video pictures
US6996173B2 (en) 2002-01-25 2006-02-07 Microsoft Corporation Seamless switching of scalable video bitstreams
CN100380980C (en) 2002-04-23 2008-04-09 诺基亚有限公司 Method and device for indicating quantizer parameters in a video coding system
CN100387043C (en) 2003-01-28 2008-05-07 汤姆森特许公司 Robust mode staggercasting

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6393055B1 (en) * 1997-05-27 2002-05-21 Koninklijke Philips Electronics N.V. Method of switching video sequences and corresponding switching device and decoding system
US6920175B2 (en) * 2001-01-03 2005-07-19 Nokia Corporation Video coding architecture and methods for using same

Also Published As

Publication number Publication date
US7706447B2 (en) 2010-04-27
ZA200304086B (en) 2004-07-15
US20020122491A1 (en) 2002-09-05
US20040114684A1 (en) 2004-06-17

Similar Documents

Publication Publication Date Title
US6765963B2 (en) Video decoder architecture and method for using same
US6920175B2 (en) Video coding architecture and methods for using same
US20040240560A1 (en) Video decoder architecture and method for using same
US7693220B2 (en) Transmission of video information
US9088776B2 (en) Scalability techniques based on content information
RU2322770C2 (en) Method and device for indication of quantizer parameters in video encoding system
US6961377B2 (en) Transcoder system for compressed digital video bitstreams
Sun et al. Adaptive error concealment algorithm for MPEG compressed video
WO2005098755A1 (en) Method and apparatus for encoding a picture sequence using predicted and non-predicted pictures which each include multiple macroblocks
EP1575294A1 (en) Method and apparatus for improving the average image refresh rate in a compressed video bitstream
WO2009103942A1 (en) Video streaming
Kurceren et al. Synchronization-predictive coding for video compression: The SP frames design for JVT/H. 26L
Aramvith et al. MPEG-1 and MPEG-2 video standards
KR100626419B1 (en) Switching between bit-streams in video transmission
EP1719343A1 (en) Transmission of video information
KR101371507B1 (en) System and method for low-delay video telecommunication
JP2001148852A (en) Image information converter and image information conversion method
Xin Improved standard-conforming video transcoding techniques
EP1739970A1 (en) Method for encoding and transmission of real-time video conference data
KR100923961B1 (en) System and method for low-delay video telecommunication
JP3590976B2 (en) Video compression device
Bharamgouda Rate control for region of interest video coding in H. 264
Richardson et al. Temporal filtering of coded video
Mandal et al. Digital video compression techniques
JP2001148855A (en) Image information converter and image information conversion method

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION