WO2001054392A2 - Method and apparatus for visual lossless image syntactic encoding - Google Patents

Method and apparatus for visual lossless image syntactic encoding Download PDF

Info

Publication number
WO2001054392A2
WO2001054392A2 PCT/IL2001/000065 IL0100065W WO0154392A2 WO 2001054392 A2 WO2001054392 A2 WO 2001054392A2 IL 0100065 W IL0100065 W IL 0100065W WO 0154392 A2 WO0154392 A2 WO 0154392A2
Authority
WO
WIPO (PCT)
Prior art keywords
frame
pixel
threshold
detail
pixels
Prior art date
Application number
PCT/IL2001/000065
Other languages
French (fr)
Other versions
WO2001054392A3 (en
Inventor
Semion Sheraizin
Vitaly Sheraizin
Original Assignee
Vls Com Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=11073740&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=WO2001054392(A2) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Vls Com Ltd. filed Critical Vls Com Ltd.
Priority to EP01942829A priority Critical patent/EP1260094A4/en
Priority to AU2001228771A priority patent/AU2001228771A1/en
Publication of WO2001054392A2 publication Critical patent/WO2001054392A2/en
Publication of WO2001054392A3 publication Critical patent/WO2001054392A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/154Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/80Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation

Definitions

  • the present invention relates generally to processing of video images and, in particular, to syntactic encoding of images for later compression by standard compression techniques.
  • TV digital broadcast television
  • video conferencing video conferencing
  • interactive TV etc.
  • All of these signals, in their digital form, are divided into frames, each of which consists of many pixels (image elements), each of which requires 8 - 24 bits to describe them.
  • the result is megabits of data per frame.
  • JPEG JPEG
  • MPEG H-compression
  • These compression standards use video signal transforms and intra- and inter-frame coding which exploit spatial and temporal correlations among pixels of a frame and across frames.
  • An object of the present invention is to provide a method and apparatus for video compression which is generally lossless vis-a-vis what the human eye perceives.
  • a visual lossless encoder for processing a video frame prior to compression by a video encoder.
  • the encoder includes a threshold determination unit, a filter unit, an association unit and an altering unit.
  • the threshold determination unit identifies a plurality of visual perception threshold levels to be associated with the pixels of the video frame, wherein the threshold levels define contrast levels above which a human eye can distinguish a pixel from among its neighboring pixels of the frame.
  • the filter unit divides the video frame into portions having different detail dimensions.
  • the association unit utilizes the threshold levels and the detail dimensions to associate the pixels of the video frame into subclasses.
  • Each subclass includes pixels related to the same detail and which generally cannot be distinguished from each other.
  • the altering unit alters the intensity of each pixel of the video frame according to its subclass.
  • the altering unit includes an inter-frame processor and an intra-frame processor.
  • the intra-frame processor includes a controllable filter bank having a plurality of different filters and a filter selector which selects one of the filters for each pixel according to its subclass.
  • the inter-frame processor includes a low pass filter and a high pass filter operative on a difference frame between a current frame and a previous frame, large and small detail threshold elements for thresholding the filtered difference frame with a large detail threshold level and a small detail threshold level, respectively, and a summer which sums the outputs of the two filters as amended by the threshold elements.
  • the threshold unit includes a unit for generating a plurality of parameters describing at least one of the following parameters: the volume of information in the frame, the per pixel color and the cross-frame change of intensity, and unit for generating the visual perception threshold from at least one of the parameters.
  • a method of visual lossless encoding of frames of a video signal includes steps of spatially and temporally separating and analyzing details of the frames, estimating parameters of the details, defining a visual perception threshold for each of the details in accordance with the estimated detail parameters, classifying the frame picture details into subclasses in accordance with the visual perception thresholds and the detail parameters and transforming each the frame detail in accordance with its associated subclass.
  • the step of separating and analyzing also includes the step of spatial high pass filtering of small dimension details and temporal filtering for detail motion analysis. Moreover, in accordance with a preferred embodiment of the present invention, the step of separating and analyzing also includes the step of spatial high pass filtering of small dimension details and temporal filtering for detail motion analysis. Moreover, in accordance with a preferred embodiment of the present invention
  • the step of estimating comprises at least one of the following steps:
  • NIGOP a normalized volume of inter-frame changes for a group of pictures from the output of the previous step of generating; evaluating a signal-to-noise ratio SNR by high pass filtering a difference
  • the step of defining includes the step of producing the visual perception thresholds, per-pixel, from a minimum threshold value and at least one of the parameters.
  • the step of defining includes the step of producing the visual perception thresholds, per-pixel, according to the following equation,
  • THD ; THD Procedure 1 + NAi + NI ⁇ y + NIr + NI G0P + NY i + Pi + (l-R i (h i ))+
  • the step of classifying includes the steps of comparing multiple spatial high frequency levels of a pixel against its associated visual perception threshold and processing the comparison results to associate the pixel with one of the subclasses.
  • the step of transforming includes the step of filtering each subclass with an associated two-dimensional low pass filter.
  • the step of transforming includes the steps of generating a difference frame between the current frame and a previous transformed frame, low and high pass filtering of the difference frame, comparing the filtered frames with a large detail threshold and a small detail threshold and summing those portions of the filtered frames which are greater than the thresholds.
  • the large detail threshold is 2 to 5 percent.
  • the method includes the step of rounding the output of the step of transforming.
  • the intensity can be a luminance value or a chrominance value.
  • Fig. 1 is an example of a video frame
  • Fig. 2 is a block diagram illustration of a video compression system
  • Fig. 3 is a block diagram illustration of the details of the visual lossless
  • Fig. 4 is a graphical illustration of the transfer functions for a number of
  • Figs. 5A and 5B are block diagram illustrations of alternative
  • Fig. 6 is a graphical illustration of the transfer functions for a number of
  • Fig. 7 is a graphical illustration of the transfer function for a non-linear
  • Figs. 8A, 8B and 8C are block diagram illustrations of alternative
  • Fig. 9 is a block diagram illustration of a spatial-temporal analyzer forming
  • Figs. 10A and 10B are detail illustrations of the analyzer of Fig. 9; and Fig. 11 is a detail illustration of a frame analyzer forming part of the syntactic encoder of Fig. 3.
  • Applicants have realized that there are different levels of image detail in an image and that the human eye perceives these details in different ways.
  • Applicants have realized the following: 1. Picture details whose detection mainly depends on the level of noise in the image occupy approximately 50 - 80% of an image.
  • a visual perception detection threshold for image details does not depend on the shape of the details in the image.
  • a visual perception threshold THD depends on a number of picture parameters, including the general brightness of the image. It does not depend on the noise spectrum.
  • the present invention is a method for describing, and then encoding, images based on which details in the image can be distinguished by the human eye and which ones can only be detected by it.
  • Fig. 1 is a grey-scale image of a plurality of shapes of a bird in flight, ranging from a photograph of one (labeled 10) to a very stylized version of one (labeled 12).
  • the background of the image is very dark at the top of the image and very light at the bottom of the image.
  • the human eye can distinguish most of the birds of the image. However, there is at least one bird, labeled 14, which the eye can detect but cannot determine all of its relative contrast details. Furthermore, there are large swaths of the image (in the background) which have no details in them.
  • the present invention is a method and system for syntactic encoding of video frames before they are sent to a standard video compression unit.
  • the present invention separates the details of a frame into two different types, those that can only be detected (for which only one bit will suffice to describe each of their pixels) and those which can be distinguished (for which at least three bits are needed to describe the intensity of each of their pixels).
  • Fig. 2 shows a visual lossless (VLS) encoder 20 connected to a standard video transmitter 22 which includes a video compression encoder 24, such as a standard MPEG encoder, and a modulator 26.
  • VLS encoder 20 transforms an incoming video signal such that video compression encoder 24 can compress the video signal two to five times more than encoder 24 can do on its own, resulting in a significantly reduced volume bit stream to be transmitted.
  • Modulator 26 modulates the reduced volume bit stream and transmits it to a receiver 30, which, as in the prior art, includes a demodulator 32 and a decoder 34.
  • Demodulator 32 demodulates the transmitted signal and decoder 34 decodes and decompresses the demodulated signal. The result is provided to a monitor 36 for display.
  • encoder 20 attempts to quantify each frame of the video signal according to which sections of the frame are more or less distinguished by the human eye. For the less-distinguished sections, encoder 20 either provides pixels of a minimum bit volume, thus reducing the overall bit volume of the frame or smoothes the data of the sections such that video compression encoder 24 will later significantly compress these sections, thus resulting in a smaller bit volume in the compressed frame. Since the human eye does not distinguish these sections, the reproduced frame is not perceived significantly differently than the original frame, despite its smaller bit volume.
  • Fig. 3 details the elements of VLS encoder 20.
  • Encoder 20 comprises an input frame memory 40, a frame analyzer 42, an intra-frame processor 44, an output frame memory 46 and an inter-frame processor 48.
  • Analyzer 42 analyzes each frame to separate it into subclasses, where subclasses define areas whose pixels cannot be distinguished from each other.
  • Intra-frame processor 44 spatially filters each pixel of the frame according to its subclass and, optionally, also provides each pixel of the frame with the appropriate number of bits.
  • Inter-frame processor 48 provides temporal filtering (i.e. inter-frame filtering) and updates output frame memory 46 with the elements of the current frame which are different than those of the previous frame.
  • frames are composed of pixels, each having luminance Y and two chrominance C r and C b components, each of which is typically defined by eight bits.
  • VLS encoder 20 generally separately processes the three components.
  • the bandwidth of the chrominance signals is half as wide as that of the luminance signal.
  • the filters (in the x direction of the frame) for chrominance have a narrower bandwidth.
  • the following discussion shows the filters for the luminance signal Y.
  • Frame analyzer 42 comprises a spatial-temporal analyzer 50, a parameter estimator 52, a visual perception threshold determiner 54 and a subclass determiner 56. Details of these elements are provided in Figs. 9 - 11 , discussed hereinbelow.
  • spatial-temporal analyzer 50 generates a plurality of filtered frames from the current frame, each filtered through a different high pass filter (HPF), where each high pass filter retains a different range of frequencies therein.
  • HPF high pass filter
  • Fig. 4 is an amplitude vs. frequency graph illustrating the transfer functions of an exemplary set of high pass filters for frames in a non-interlacing scan format.
  • Four graphs are shown. It can be seen that the curve labeled HPF-R3 has a cutoff frequency of 1MHz and thus, retains portions of the frame with information above 1 MHz.
  • curve HPF-R2 has a cutoff frequency of 2 MHz
  • HPF-C2 has a cutoff frequency of 3MHz
  • HPF-R1 and HPF-C1 have a cutoff frequency of 4MHz.
  • the terminology “Rx” refers to operations on a row of pixels while the terminology “Cx” refers to operations on a column of pixels.
  • the filters of Fig. 4 implement the following finite impulse response (FIR) filters on either a row of pixels (the x direction of the frame) or a column of pixels (the y direction of the frame), where the number of pixels used in the filter defines the power of the cosine.
  • FIR finite impulse response
  • a filter implementing cos 10 x takes 10 pixels around the pixel of interest, five to one side and five to the other side of the pixel of interest.
  • HPF-R3 1 - cos 10 x
  • HPF-R2 1-cos 6 x
  • HPF-C1 1-cos 2 y
  • the high pass filters can also be considered as digital equivalents of optical apertures. The higher the cut-off frequency, the smaller the aperture. Thus, filters HPF-R1 and HPF-C1 retain only very small details in the frame (of 1 - 4 pixels in size) while filter HPF-R3 retains much larger details (of up to 11 pixels).
  • the filtered frames will be labeled by the type of filter (HPF-X) used to create them.
  • analyzer 50 also generates difference frames between the current frame and another, earlier frame.
  • the previous frame is typically at most 15 frames earlier.
  • a "group" of pictures or frames (GOP) is a series of frames for which difference frames are generated.
  • Parameter estimator 52 takes the current frame and the filtered and difference frames and generates a set of parameters that describe the information content of the current frame. The parameters are determined on a pixel-by-pixel basis or on a per frame basis, as relevant. It is noted that the parameters do not have to be calculated to great accuracy as they are used in combination to determine a per pixel, visual perception threshold THDj.
  • SNR Signal to noise ratio
  • can be 46dB, equivalent to a
  • Normalized volume of intraframe change Nl ⁇ Y this measures the volume of change in a frame l ⁇ (or how much detail there is in a frame), normalized by the maximum possible amount of information MAXINF O within a frame (i.e. 8 bits per pixel x N pixels per frame). Since the highest frequency range indicates the amount of change in a frame, the volume of change l ⁇ is a sum of the intensities in the filtered frame having the highest frequency range, such as filtered frame HPF-R1.
  • Normalized volume of interframe changes Nip: this measures the volume of changes IF between the current frame and its previous frame, normalized by the maximum possible amount of information MAXINFO within a frame.
  • the volume of interframe changes I F is the sum of the intensities in the difference frame.
  • NIQOP Normalized volume of change within a group of frames NIQOP: this measures the volume of changes I GO P over a group of frames, where the group is from 2 to 15 frames, as selected by the user. It is normalized by the maximum possible amount of information MAXINF O within a frame and by the number of frames in the group.
  • Normalized luminance level NYi is the luminance level of a pixel in the current frame. It is normalized by the maximum intensity IMAX possible for the pixel.
  • Color saturation pi this is the color saturation level of the ith pixel and it is
  • C r and C b , ⁇ are the chrominance levels of the ith pixel.
  • Hue hi this is the general hue of the ith pixel and is determined
  • hue hi can be determined by interpolating
  • Visual perception threshold determiner 54 determines the visual perception threshold THD
  • THD I THD buffer 1 + N ⁇ ,. + NI xr +NI F +NI GOP + NY i +p i + (l-R l (h i ))+ SNR
  • Subclass determiner 56 compares each pixel i of each high pass filtered frame HPF-X to its associated threshold THDj to determine whether or not that pixel is significantly present in each filtered frame, where "significantly present” is defined by the threshold level and by the "detail dimension" (i.e. the size of the object or detail in the image of which the pixel forms a part). Subclass determiner 56 then defines the subclass to which the pixel belongs.
  • the pixel if the pixel is not present in any of the filtered frames, the pixel must belong to an object of large size or the detail is only detected but not distinguished. If the pixel is only found in the filtered frame of HPF-C2 or in both frames HPF-C1 and HPF-C2, it must be a horizontal edge (an edge in the Y direction of the frame). If it is found in filtered frames HPF-R3 and HPF-C2, it is a single small detail. If the pixel is found only in filtered frames HPF-R1 , HPF-R2 and HPF-R3, it is a very small vertical edge. If, in addition, it is also found in filtered frame HPF-C2, then the pixel is a very small, single detail.
  • the output of subclass determiner 56 is an indication of the subclass to which each pixel of the current frame belongs.
  • Intra-frame processor 44 performs spatial filtering of the frame, where the type of filter utilized varies in accordance with the subclass to which the pixel belongs.
  • intra-frame processor 44 filters each subclass of the frame differently and according to the information content of the subclass. The filtering limits the bandwidth of each subclass which is equivalent to sampling the data at different frequencies. Subclasses with a lot of content are sampled at a high frequency while subclasses with little content, such as a plain background area, are sampled at a low frequency.
  • intra-frame processor 44 changes the intensity of the pixel by an amount less than the visual distinguishing threshold for that pixel. Pixels whose contrast is lower than the threshold (i.e. details which were detected only) are transformed with non-linear filters. If desired, the data size of the detected only pixels can be reduced from 8 bits to 1 or 2 bits, depending on the visual threshold level and the detail dimension for the pixel. For the other pixels (i.e. the distinguished ones), 3 or 4 bits is sufficient.
  • Intra-frame processor 44 comprises a controllable filter bank 60 and a filter selector 62.
  • Controllable filter bank 60 comprises a set of low pass and non-linear filters, shown in Figs. 5A and 5B to which reference is now made, which filter selector 62 activates, based on the subclass to which the pixel
  • Selector 62 can activate more than one filter, as necessary.
  • Figs. 5A and 5B are two, alternative embodiments of controllable filter bank 60. Both comprise two sections 64 and 66 which operate on columns (i.e.
  • SW-X where X is one of C1 , C2, R1 , R2, R3 (selecting one of the low pass filters
  • Filter selector 62 switches the relevant switch, thereby activating the relevant filter. It is noted that
  • the non-linear filters NLF-R and NLF-C are activated by switches R3 and C2,
  • Controllable filter bank 60 also includes time aligners (TA) which add any necessary delays to ensure that the pixel currently being processed remains at its
  • the low pass filters are associated with the high pass filters used in
  • the low pass filters thus pass that which their associated high pass filters ignore.
  • Low pass filter LPF-R3 has a
  • LPF-R2 has a cutoff frequency of 1 MHz
  • filter LPF-C2 has a cutoff frequency of 1.25MHz
  • filters LPF-C1 and LPF-R1 have a cutoff frequency of about 2MHz.
  • filters LPF-Cx operate on the columns of the frame and filters LPF-Rx operate on the rows of the frame.
  • Fig. 7 illustrates an exemplary transfer function for the non-linear filters (NLF) which models the response of the eye when detecting a detail.
  • the transfer function defines an output value Vout normalized by the threshold level THDj as a function of an input value Vin also normalized by the threshold level THDj.
  • the input - output relationship is described by a polynomial of high order. A typical order might be six, though lower orders, of power two or three, are also feasible.
  • Table 3 lists the type of filters activated per subclass, where the header for the column indicates both the type of filter and the label of the switch SW-X of Figs. 5A and 5B.
  • Fig. 5B includes rounding elements RND which reduce the number of bits of a pixel from eight to three or four bits, depending on the subclass to which the pixel belongs.
  • Table 4 illustrates the logic for the example presented hereinabove, where the items which are not active for the subclass are indicated by "N/A". Table 4
  • the output of intra-frame processor 44 is a processed version of the current frame which uses fewer bits to describe the frame than the original version.
  • inter-frame processor 48 which provides temporal filtering (i.e. inter-frame filtering) to further process the current frame. Since the present invention provides a full frame as output, inter-frame processor 48 determines which pixels have changed significantly from the previous frame and amends those only, storing the new version in the appropriate location in frame memory 46.
  • Figs. 8A and 8B are open loop versions (i.e. the previous frame is the frame previously input into inter-frame processor 48) while the embodiment of Fig. 8C is a closed loop version (i.e. the previous frame is the frame previously produced by inter-frame processor 48). All of the embodiments comprise a summer 68, a low pass filter (LPF) 70, a high pass filter (HPF) 72, two comparators 74 and 76, two switches 78 and 80, controlled by the results of comparators 74 and 76, respectively, and a summer 82.
  • Figs. 8A and 8B additionally include an intermediate memory 84 for storing the output of intra-frame processor 44.
  • Summer 68 takes the difference of the processed current frame, produced by processor 44, and the previous frame, stored in either in intermediate memory 84 (Figs. 8A and 8B) or in frame memory 46 (Fig. 8C). The difference frame is then processed in two parallel tracks.
  • the low pass filter is used in the first track.
  • Each pixel of the filtered frame is compared to a general, large detail, threshold THD-LF which is typically set to 5% of the maximum expected intensity for the frame.
  • THD-LF typically set to 5% of the maximum expected intensity for the frame.
  • the difference frame is high pass filtered. Since high pass filtering retains the small details, each pixel of the high pass filtered frame is compared to the particular threshold THDj for that pixel, as produced by threshold determiner 54. If the difference pixel has an intensity above the threshold THDj (i.e. the change in the pixel is significant for detailed visual perception), it is allowed through (i.e. switch 80 is set to pass the pixel).
  • Summer 82 adds the filtered difference pixels passed by switches 78 and/or 80 with the pixel of the previous frame to "produce the new pixel". If switches 78 and 80 did not pass anything, the new pixel is the same as the previous pixel. Otherwise, the new pixel is the sum of the previous pixel and the low and high frequency components of the difference pixel.
  • ML indicates a memory line of the current frame
  • MP indicates a memory pixel of the current
  • ⁇ MF indicates a memory frame
  • VD indicates the vertical drive signal
  • ⁇ TA" indicates a time alignment, e.g. a delay, and CNT indicates a counter.
  • Fig. 9 generally illustrates the operation of spatial-temporal analyzer 50
  • Figs. 10A and 10B provide one detailed embodiment for the spatial analysis
  • estimator 52 threshold determiner 54 and subclass determiner 56. As these
  • FPGA programmable gate array

Abstract

A visual lossless encoder (20) for processing a video frame prior to compression by a video encoder includes a threshold unit (54), a filter unit, an association unit and an altering unit. The threshold unit (54) identifies a plurality of visual perception threshold levels to be associated with the pixels of the video frame, wherein the threshold levels define contrast levels above which a human eye can distinguish a pixel from among its neighboring pixels of the frame. The filter unit divides the video frame into portions having different detail dimensions. The association unit utilizes the threshold levels and the detail dimensions to associate the pixels of the video frame into subclasses. Each subclass includes pixels related to the same detail and which generally cannot be distinguished from each other. The altering unit alters the intensity of each pixel of the video frame according to its subclass.

Description

METHOD AND APPARATUS FOR VISUAL LOSSLESS IMAGE
SYNTACTIC ENCODING
FIELD OF THE INVENTION
The present invention relates generally to processing of video images and, in particular, to syntactic encoding of images for later compression by standard compression techniques.
BACKGROUND OF THE INVENTION
There are many types of video signals, such as digital broadcast television (TV), video conferencing, interactive TV, etc. All of these signals, in their digital form, are divided into frames, each of which consists of many pixels (image elements), each of which requires 8 - 24 bits to describe them. The result is megabits of data per frame.
Before storing and / or transmitting these signals, they typically are compressed, using one of many standard video compression techniques, such as
JPEG, MPEG, H-compression, etc. These compression standards use video signal transforms and intra- and inter-frame coding which exploit spatial and temporal correlations among pixels of a frame and across frames.
However, these compression techniques create a number of well-known, undesirable and unacceptable artifacts, such as blockiness, low resolution and wiggles, among others. These are particularly problematic for broadcast TV
(satellite TV, cable TV, etc.) or for systems with very low bit rates (video conferencing, videophone). Much research has been performed to try and improve the standard compression techniques. The following patents and articles discuss various prior art methods to do so:
US Patents 5,870,501 , 5,847,766, 5,845,012, 5,796,864, 5,774,593, 5,586,200, 5,491 ,519, 5,341 ,442;
Raj Malluri et al, "A Robust, Scalable, Object-Based Video Compression Technique for Very Low Bit-Rate Coding," IEEE Transactions of Circuit and Systems for Video Technology, vol. 7, No. 1 , Feb. 1997;
AwadKh. Al-Asmari, "An Adaptive Hybrid Coding Scheme for HDTV and Digital Sequences," IEEE Transactions on Consumer Electronics, vol. 42, No. 3, pp. 926-936, Aug. 1995;
Kwok-tung Lo and Jian Feng, "Predictive Mean Search Algorithms for Fast VQ Encoding of Images," IEEE Transactions On Consumer Electronics, vol. 41, No. 2, pp. 327-331, May 1995; James Goel et al. "Pre-processing for MPEG Compression Using
Adaptive Spatial Filtering", IEEE Transactions On Consumer Electronics, " vol. 41 , No. 3, pp. 687-698, Aug. 1995;
Jian Feng et al. "Motion Adaptive Classified Vector Quantization for ATM Video Coding", IEEE Transactions on Consumer Electronics, vol. 41 , No. 2, p. 322-326, May 1995;
Austin Y. Lan et al., "Scene-Context Dependent Reference - Frame Placement for MPEG Video Coding," IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, No. 3, pp. 478-489, Apr. 1999; Kuo-Chin Fan, Kou-Sou Kan, "An Active Scene Analysis-Based approach for Pseudoconstant Bit-Rate Video Coding", IEEE Transactions on Circuits and Systems for Video Technology, vol. 8 No. 2, pp. 159-170, Apr. 1998;
Takashi Ida and Yoko Sambansugi, "Image Segmentation and Contour Detection Using Fractal Coding", IEEE Transactions on Circuits and Systems for Video Technology, vol. 8, No. 8, pp. 968-975, Dec. 1998;
Liang Shen and Rangaraj M. Rangayyan, "A Segmentation-Based Lossless Image Coding Method for High-Resolution Medical Image Compression," IEEE Transactions on Medical Imaging, vol. 16, No. 3, pp. 301-316, June 1997;
Adrian Munteanu et al., "Wavelet-Based Lossless Compression of Coronary Angiographic Images", IEEE Transactions on Medical Imaging, vol. 18, No. 3, p. 272-281 , March 1999; and
Akira Okumura et al., "Signal Analysis and Compression Performance Evaluation of Pathological Microscopic Images," IEEE Transactions on Medical Imaging, vol. 16, No. 6, pp. 701-710, Dec. 1997.
SUMMARY OF THE INVENTION
An object of the present invention is to provide a method and apparatus for video compression which is generally lossless vis-a-vis what the human eye perceives. There is therefore provided, in accordance with a preferred embodiment of the present invention, a visual lossless encoder for processing a video frame prior to compression by a video encoder. The encoder includes a threshold determination unit, a filter unit, an association unit and an altering unit. The threshold determination unit identifies a plurality of visual perception threshold levels to be associated with the pixels of the video frame, wherein the threshold levels define contrast levels above which a human eye can distinguish a pixel from among its neighboring pixels of the frame. The filter unit divides the video frame into portions having different detail dimensions. The association unit utilizes the threshold levels and the detail dimensions to associate the pixels of the video frame into subclasses. Each subclass includes pixels related to the same detail and which generally cannot be distinguished from each other. The altering unit alters the intensity of each pixel of the video frame according to its subclass.
Moreover, in accordance with a preferred embodiment of the present invention, the altering unit includes an inter-frame processor and an intra-frame processor.
Furthermore, in accordance with a preferred embodiment of the present invention, the intra-frame processor includes a controllable filter bank having a plurality of different filters and a filter selector which selects one of the filters for each pixel according to its subclass. Further, in accordance with a preferred embodiment of the present invention, the inter-frame processor includes a low pass filter and a high pass filter operative on a difference frame between a current frame and a previous frame, large and small detail threshold elements for thresholding the filtered difference frame with a large detail threshold level and a small detail threshold level, respectively, and a summer which sums the outputs of the two filters as amended by the threshold elements.
Still further, in accordance with a preferred embodiment of the present invention, the threshold unit includes a unit for generating a plurality of parameters describing at least one of the following parameters: the volume of information in the frame, the per pixel color and the cross-frame change of intensity, and unit for generating the visual perception threshold from at least one of the parameters.
There is also provided, in accordance with a preferred embodiment of the present invention, a method of visual lossless encoding of frames of a video signal. The method includes steps of spatially and temporally separating and analyzing details of the frames, estimating parameters of the details, defining a visual perception threshold for each of the details in accordance with the estimated detail parameters, classifying the frame picture details into subclasses in accordance with the visual perception thresholds and the detail parameters and transforming each the frame detail in accordance with its associated subclass.
Additionally, in accordance with a preferred embodiment of the present invention, the step of separating and analyzing also includes the step of spatial high pass filtering of small dimension details and temporal filtering for detail motion analysis. Moreover, in accordance with a preferred embodiment of the present
invention, the step of estimating comprises at least one of the following steps:
determining NΔi, a per-pixel signal intensity change between a current
frame and a previous frame, normalized by a maximum intensity;
determining a Nlχγ, a normalized volume of intraframe change by high
frequency filtering of the frame, summing the intensities of the filtered frame and
normalizing the sum by the maximum possible amount of information within a
frame; generating NIF, a volume of inter-frame changes between a current frame
and its previous frame normalized by a maximum possible amount of information
volume within a frame;
generating NIGOP, a normalized volume of inter-frame changes for a group of pictures from the output of the previous step of generating; evaluating a signal-to-noise ratio SNR by high pass filtering a difference
frame between the current frame and the previous frame by selecting those
intensities of the difference frame lower than a threshold defined as three times a
noise level under which noise intensities are not perceptible to the human eye,
summing the intensities of the pixels in the filtered frame and normalizing the sum
by the maximum intensity and the total number of pixels in the frame;
generating NY|, a normalized intensity value per-pixel;
generating a per-pixel color saturation level pi; generating a per-pixel hue value hi; and determining a per-pixel response R|(h|) to the hue value. Further, in accordance with a preferred embodiment of the present invention, the step of defining includes the step of producing the visual perception thresholds, per-pixel, from a minimum threshold value and at least one of the parameters.
Still further, in accordance with a preferred embodiment of the present invention, the step of defining includes the step of producing the visual perception thresholds, per-pixel, according to the following equation,
200 ^
THD; = THD„ 1 + NAi + NIχy + NIr + NIG0P + NYi + Pi + (l-Ri(hi))+
SNR,
wherein THDmjn is a minimum threshold level. Moreover, in accordance with a preferred embodiment of the present invention, the step of classifying includes the steps of comparing multiple spatial high frequency levels of a pixel against its associated visual perception threshold and processing the comparison results to associate the pixel with one of the subclasses. Further, in accordance with a preferred embodiment of the present invention, the step of transforming includes the step of filtering each subclass with an associated two-dimensional low pass filter.
Still further, in accordance with a preferred embodiment of the present invention, the step of transforming includes the steps of generating a difference frame between the current frame and a previous transformed frame, low and high pass filtering of the difference frame, comparing the filtered frames with a large detail threshold and a small detail threshold and summing those portions of the filtered frames which are greater than the thresholds. Additionally, in accordance with a preferred embodiment of the present invention, the large detail threshold is 2 to 5 percent.
Moreover, in accordance with a preferred embodiment of the present invention, the method includes the step of rounding the output of the step of transforming.
Finally, the intensity can be a luminance value or a chrominance value.
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention will be understood and appreciated more fully from
the following detailed description taken in conjunction with the appended drawings
in which:
Fig. 1 is an example of a video frame;
Fig. 2 is a block diagram illustration of a video compression system
having a visual lossless syntactic encoder, constructed and operative in
accordance with a preferred embodiment of the present invention;
Fig. 3 is a block diagram illustration of the details of the visual lossless
syntactic encoder of Fig. 2;
Fig. 4 is a graphical illustration of the transfer functions for a number of
high pass filters useful in the syntactic encoder of Fig. 3;
Figs. 5A and 5B are block diagram illustrations of alternative
embodiments of a controllable filter bank forming part of the syntactic encoder of
Fig. 3;
Fig. 6 is a graphical illustration of the transfer functions for a number of
low pass filters useful in the controllable filter bank of Figs. 5A and 5B;
Fig. 7 is a graphical illustration of the transfer function for a non-linear
filter useful in the controllable filter bank of Figs. 5A and 5B;
Figs. 8A, 8B and 8C are block diagram illustrations of alternative
embodiments of an inter-frame processor forming a controlled filter portion of the
syntactic encoder of Fig. 3;
Fig. 9 is a block diagram illustration of a spatial-temporal analyzer forming
part of the syntactic encoder of Fig. 3; Figs. 10A and 10B are detail illustrations of the analyzer of Fig. 9; and Fig. 11 is a detail illustration of a frame analyzer forming part of the syntactic encoder of Fig. 3.
DETAILED DESCRIPTION OF THE PRESENT INVENTION
Applicants have realized that there are different levels of image detail in an image and that the human eye perceives these details in different ways. In particular, Applicants have realized the following: 1. Picture details whose detection mainly depends on the level of noise in the image occupy approximately 50 - 80% of an image.
2. A visual perception detection threshold for image details does not depend on the shape of the details in the image.
3. A visual perception threshold THD depends on a number of picture parameters, including the general brightness of the image. It does not depend on the noise spectrum. The present invention is a method for describing, and then encoding, images based on which details in the image can be distinguished by the human eye and which ones can only be detected by it. Reference is now made to Fig. 1 , which is a grey-scale image of a plurality of shapes of a bird in flight, ranging from a photograph of one (labeled 10) to a very stylized version of one (labeled 12). The background of the image is very dark at the top of the image and very light at the bottom of the image.
The human eye can distinguish most of the birds of the image. However, there is at least one bird, labeled 14, which the eye can detect but cannot determine all of its relative contrast details. Furthermore, there are large swaths of the image (in the background) which have no details in them.
The present invention is a method and system for syntactic encoding of video frames before they are sent to a standard video compression unit. The present invention separates the details of a frame into two different types, those that can only be detected (for which only one bit will suffice to describe each of their pixels) and those which can be distinguished (for which at least three bits are needed to describe the intensity of each of their pixels). Reference is now made to Fig. 2, which illustrates the present invention within an image transmission system. Thus, Fig. 2 shows a visual lossless (VLS) encoder 20 connected to a standard video transmitter 22 which includes a video compression encoder 24, such as a standard MPEG encoder, and a modulator 26. VLS encoder 20 transforms an incoming video signal such that video compression encoder 24 can compress the video signal two to five times more than encoder 24 can do on its own, resulting in a significantly reduced volume bit stream to be transmitted.
Modulator 26 modulates the reduced volume bit stream and transmits it to a receiver 30, which, as in the prior art, includes a demodulator 32 and a decoder 34. Demodulator 32 demodulates the transmitted signal and decoder 34 decodes and decompresses the demodulated signal. The result is provided to a monitor 36 for display.
It will be appreciated that, although the compression ratios are high in the present invention, the resultant video displayed on monitor 36 is not visually degraded. This is because encoder 20 attempts to quantify each frame of the video signal according to which sections of the frame are more or less distinguished by the human eye. For the less-distinguished sections, encoder 20 either provides pixels of a minimum bit volume, thus reducing the overall bit volume of the frame or smoothes the data of the sections such that video compression encoder 24 will later significantly compress these sections, thus resulting in a smaller bit volume in the compressed frame. Since the human eye does not distinguish these sections, the reproduced frame is not perceived significantly differently than the original frame, despite its smaller bit volume. Reference is now made to Fig. 3, which details the elements of VLS encoder 20. Encoder 20 comprises an input frame memory 40, a frame analyzer 42, an intra-frame processor 44, an output frame memory 46 and an inter-frame processor 48. Analyzer 42 analyzes each frame to separate it into subclasses, where subclasses define areas whose pixels cannot be distinguished from each other. Intra-frame processor 44 spatially filters each pixel of the frame according to its subclass and, optionally, also provides each pixel of the frame with the appropriate number of bits. Inter-frame processor 48 provides temporal filtering (i.e. inter-frame filtering) and updates output frame memory 46 with the elements of the current frame which are different than those of the previous frame. It is noted that frames are composed of pixels, each having luminance Y and two chrominance Cr and Cb components, each of which is typically defined by eight bits. VLS encoder 20 generally separately processes the three components. However, the bandwidth of the chrominance signals is half as wide as that of the luminance signal. Thus, the filters (in the x direction of the frame) for chrominance have a narrower bandwidth. The following discussion shows the filters for the luminance signal Y.
Frame analyzer 42 comprises a spatial-temporal analyzer 50, a parameter estimator 52, a visual perception threshold determiner 54 and a subclass determiner 56. Details of these elements are provided in Figs. 9 - 11 , discussed hereinbelow.
As discussed hereinabove, details which the human eye distinguishes are ones of high contrast and ones whose details have small dimensions. Areas of high contrast are areas with a lot of high frequency content. Thus, spatial-temporal analyzer 50 generates a plurality of filtered frames from the current frame, each filtered through a different high pass filter (HPF), where each high pass filter retains a different range of frequencies therein.
Fig. 4, to which reference is now briefly made, is an amplitude vs. frequency graph illustrating the transfer functions of an exemplary set of high pass filters for frames in a non-interlacing scan format. Four graphs are shown. It can be seen that the curve labeled HPF-R3 has a cutoff frequency of 1MHz and thus, retains portions of the frame with information above 1 MHz. Similarly, curve HPF-R2 has a cutoff frequency of 2 MHz, HPF-C2 has a cutoff frequency of 3MHz and HPF-R1 and HPF-C1 have a cutoff frequency of 4MHz. As will be discussed hereinbelow, the terminology "Rx" refers to operations on a row of pixels while the terminology "Cx" refers to operations on a column of pixels.
In particular, the filters of Fig. 4 implement the following finite impulse response (FIR) filters on either a row of pixels (the x direction of the frame) or a column of pixels (the y direction of the frame), where the number of pixels used in the filter defines the power of the cosine. For example, a filter implementing cos10x takes 10 pixels around the pixel of interest, five to one side and five to the other side of the pixel of interest.
HPF-R3: 1 - cos10x HPF-R2: 1-cos6x
HPF-R1 : 1-cos2x
HPF-C2: 1-cos4y
HPF-C1 : 1-cos2y The high pass filters can also be considered as digital equivalents of optical apertures. The higher the cut-off frequency, the smaller the aperture. Thus, filters HPF-R1 and HPF-C1 retain only very small details in the frame (of 1 - 4 pixels in size) while filter HPF-R3 retains much larger details (of up to 11 pixels).
In the following, the filtered frames will be labeled by the type of filter (HPF-X) used to create them.
Returning to Fig. 3, analyzer 50 also generates difference frames between the current frame and another, earlier frame. The previous frame is typically at most 15 frames earlier. A "group" of pictures or frames (GOP) is a series of frames for which difference frames are generated. Parameter estimator 52 takes the current frame and the filtered and difference frames and generates a set of parameters that describe the information content of the current frame. The parameters are determined on a pixel-by-pixel basis or on a per frame basis, as relevant. It is noted that the parameters do not have to be calculated to great accuracy as they are used in combination to determine a per pixel, visual perception threshold THDj.
At least some of the following parameters are determined:
Signal to noise ratio (SNR): this parameter can be determined by generating a difference frame between the current frame and the frame before it, high pass filtering of the difference frame, summing the intensities of the pixels in the filtered frame, normalized by both the number of pixels N in a frame and the maximum intensity IMAX possible for the pixel. If the frame is a television frame, the maximum intensity is 255 quanta (8 bits). The high frequency filter selects only
those intensities lower than 3σ, where σ indicates a level less than which the
human eye cannot perceive noise. For example, σ can be 46dB, equivalent to a
reduction in signal strength of a factor of 200.
Normalized NΔJ: this measures the change Δ|, per pixel i, from the current
frame to its previous frame. This value is then normalized by the maximum intensity IMAX possible for the pixel. Normalized volume of intraframe change NlχY: this measures the volume of change in a frame lχγ (or how much detail there is in a frame), normalized by the maximum possible amount of information MAXINFO within a frame (i.e. 8 bits per pixel x N pixels per frame). Since the highest frequency range indicates the amount of change in a frame, the volume of change lχγ is a sum of the intensities in the filtered frame having the highest frequency range, such as filtered frame HPF-R1.
Normalized volume of interframe changes Nip: this measures the volume of changes IF between the current frame and its previous frame, normalized by the maximum possible amount of information MAXINFO within a frame. The volume of interframe changes IF is the sum of the intensities in the difference frame.
Normalized volume of change within a group of frames NIQOP: this measures the volume of changes IGOP over a group of frames, where the group is from 2 to 15 frames, as selected by the user. It is normalized by the maximum possible amount of information MAXINFO within a frame and by the number of frames in the group.
Normalized luminance level NYi: Y| is the luminance level of a pixel in the current frame. It is normalized by the maximum intensity IMAX possible for the pixel.
Color saturation pi: this is the color saturation level of the ith pixel and it is
determined by: where Cr and Cb,ι are the
Figure imgf000018_0001
chrominance levels of the ith pixel.
Hue hi: this is the general hue of the ith pixel and is determined
by: Alternatively, hue hi can be determined by interpolating
Figure imgf000018_0002
Table 1 , below.
Response to hue Rι(h|): this is the human vision response to a given hue and is given by Table 1 , below. Interpolation is typically used to produce a specific value of the response R(h) for a specific value of hue h.
Table 1
Figure imgf000018_0003
Visual perception threshold determiner 54 determines the visual perception threshold THD| per pixel as follows: 200
THDI = THD„ 1 + NΔ,. + NIxr +NIF +NIGOP + NYi +pi + (l-Rl(hi))+ SNR
Subclass determiner 56 compares each pixel i of each high pass filtered frame HPF-X to its associated threshold THDj to determine whether or not that pixel is significantly present in each filtered frame, where "significantly present" is defined by the threshold level and by the "detail dimension" (i.e. the size of the object or detail in the image of which the pixel forms a part). Subclass determiner 56 then defines the subclass to which the pixel belongs.
For the example provided above, if the pixel is not present in any of the filtered frames, the pixel must belong to an object of large size or the detail is only detected but not distinguished. If the pixel is only found in the filtered frame of HPF-C2 or in both frames HPF-C1 and HPF-C2, it must be a horizontal edge (an edge in the Y direction of the frame). If it is found in filtered frames HPF-R3 and HPF-C2, it is a single small detail. If the pixel is found only in filtered frames HPF-R1 , HPF-R2 and HPF-R3, it is a very small vertical edge. If, in addition, it is also found in filtered frame HPF-C2, then the pixel is a very small, single detail.
The above logic is summarized and expanded in Table 2.
Table 2
Figure imgf000019_0001
The output of subclass determiner 56 is an indication of the subclass to which each pixel of the current frame belongs. Intra-frame processor 44 performs spatial filtering of the frame, where the type of filter utilized varies in accordance with the subclass to which the pixel belongs. In accordance with a preferred embodiment of the present invention, intra-frame processor 44 filters each subclass of the frame differently and according to the information content of the subclass. The filtering limits the bandwidth of each subclass which is equivalent to sampling the data at different frequencies. Subclasses with a lot of content are sampled at a high frequency while subclasses with little content, such as a plain background area, are sampled at a low frequency.
Another way to consider the operation of the filters is that they smooth the data of the subclass, removing "noisiness" in the picture that the human eye does not perceive. Thus, intra-frame processor 44 changes the intensity of the pixel by an amount less than the visual distinguishing threshold for that pixel. Pixels whose contrast is lower than the threshold (i.e. details which were detected only) are transformed with non-linear filters. If desired, the data size of the detected only pixels can be reduced from 8 bits to 1 or 2 bits, depending on the visual threshold level and the detail dimension for the pixel. For the other pixels (i.e. the distinguished ones), 3 or 4 bits is sufficient.
Intra-frame processor 44 comprises a controllable filter bank 60 and a filter selector 62. Controllable filter bank 60 comprises a set of low pass and non-linear filters, shown in Figs. 5A and 5B to which reference is now made, which filter selector 62 activates, based on the subclass to which the pixel
belongs. Selector 62 can activate more than one filter, as necessary.
Figs. 5A and 5B are two, alternative embodiments of controllable filter bank 60. Both comprise two sections 64 and 66 which operate on columns (i.e.
line to line) and on rows (i.e. within a line), respectively. In each section 64 and
66, there is a choice of filters, each controlled by an appropriate switch, labeled
SW-X, where X is one of C1 , C2, R1 , R2, R3 (selecting one of the low pass filters
(LPF)), D-C, D-R (selecting to pass the relevant pixel directly). Filter selector 62 switches the relevant switch, thereby activating the relevant filter. It is noted that
the non-linear filters NLF-R and NLF-C are activated by switches R3 and C2,
respectively. Thus, the outputs of non-linear filters NLF-R and NLF-C are added to
the outputs of low pass filters LPF-R3 and LPF-C2, respectively.
Controllable filter bank 60 also includes time aligners (TA) which add any necessary delays to ensure that the pixel currently being processed remains at its
appropriate location within the frame.
The low pass filters (LPF) are associated with the high pass filters used in
analyzer 50. Thus, the cutoff frequencies of the low pass filters are close to those
of the high pass filters. The low pass filters thus pass that which their associated high pass filters ignore.
Fig. 6, to which reference is now briefly made, illustrates exemplary low
pass filters for the example provided hereinabove. Low pass filter LPF-R3 has a
cutoff frequency of 0.5MHz and thus, generally does not retain anything which its associated high pass filter HPF-R3 (with a cutoff frequency of 1 MHz) retains. Filter
LPF-R2 has a cutoff frequency of 1 MHz, filter LPF-C2 has a cutoff frequency of 1.25MHz and filters LPF-C1 and LPF-R1 have a cutoff frequency of about 2MHz. As for the high frequency filters, filters LPF-Cx operate on the columns of the frame and filters LPF-Rx operate on the rows of the frame.
Fig. 7, to which reference is now briefly made, illustrates an exemplary transfer function for the non-linear filters (NLF) which models the response of the eye when detecting a detail. The transfer function defines an output value Vout normalized by the threshold level THDj as a function of an input value Vin also normalized by the threshold level THDj. As can be seen in the figure, the input - output relationship is described by a polynomial of high order. A typical order might be six, though lower orders, of power two or three, are also feasible.
Table 3 lists the type of filters activated per subclass, where the header for the column indicates both the type of filter and the label of the switch SW-X of Figs. 5A and 5B.
Table 3
Figure imgf000022_0001
Fig. 5B includes rounding elements RND which reduce the number of bits of a pixel from eight to three or four bits, depending on the subclass to which the pixel belongs. Table 4 illustrates the logic for the example presented hereinabove, where the items which are not active for the subclass are indicated by "N/A". Table 4
Figure imgf000023_0001
The output of intra-frame processor 44 is a processed version of the current frame which uses fewer bits to describe the frame than the original version.
Reference is now made to Figs. 8A, 8B and 8C, which illustrate three alternative embodiments for inter-frame processor 48 which provides temporal filtering (i.e. inter-frame filtering) to further process the current frame. Since the present invention provides a full frame as output, inter-frame processor 48 determines which pixels have changed significantly from the previous frame and amends those only, storing the new version in the appropriate location in frame memory 46.
The embodiments of Figs. 8A and 8B are open loop versions (i.e. the previous frame is the frame previously input into inter-frame processor 48) while the embodiment of Fig. 8C is a closed loop version (i.e. the previous frame is the frame previously produced by inter-frame processor 48). All of the embodiments comprise a summer 68, a low pass filter (LPF) 70, a high pass filter (HPF) 72, two comparators 74 and 76, two switches 78 and 80, controlled by the results of comparators 74 and 76, respectively, and a summer 82. Figs. 8A and 8B additionally include an intermediate memory 84 for storing the output of intra-frame processor 44.
Summer 68 takes the difference of the processed current frame, produced by processor 44, and the previous frame, stored in either in intermediate memory 84 (Figs. 8A and 8B) or in frame memory 46 (Fig. 8C). The difference frame is then processed in two parallel tracks.
In the first track, the low pass filter is used . Each pixel of the filtered frame is compared to a general, large detail, threshold THD-LF which is typically set to 5% of the maximum expected intensity for the frame. Thus, the pixels which are kept are only those which changed by more than 5% (i.e. those whose changes can be "seen" by the human eye).
In the second track, the difference frame is high pass filtered. Since high pass filtering retains the small details, each pixel of the high pass filtered frame is compared to the particular threshold THDj for that pixel, as produced by threshold determiner 54. If the difference pixel has an intensity above the threshold THDj (i.e. the change in the pixel is significant for detailed visual perception), it is allowed through (i.e. switch 80 is set to pass the pixel).
Summer 82 adds the filtered difference pixels passed by switches 78 and/or 80 with the pixel of the previous frame to "produce the new pixel". If switches 78 and 80 did not pass anything, the new pixel is the same as the previous pixel. Otherwise, the new pixel is the sum of the previous pixel and the low and high frequency components of the difference pixel.
Reference is now briefly made to Figs. 9, 10A, 10B and 11 which detail elements of frame analyzer 42. In these figures, the term "ML" indicates a memory line of the current frame, "MP" indicates a memory pixel of the current
frame, λMF" indicates a memory frame, "VD" indicates the vertical drive signal,
λλTA" indicates a time alignment, e.g. a delay, and CNT indicates a counter.
Fig. 9 generally illustrates the operation of spatial-temporal analyzer 50
and Figs. 10A and 10B provide one detailed embodiment for the spatial analysis
and temporal analysis portions 51 and 53, respectively. Fig. 11 details parameter
estimator 52, threshold determiner 54 and subclass determiner 56. As these
figures are deemed to be self-explanatory, no further explanation will be included
here.
It is noted that the present invention can be implemented with a field
programmable gate array (FPGA) and the frame memory can be implemented
with SRAM or SDRAM.
The methods and apparatus disclosed herein have been described
without reference to specific hardware or software. Rather, the methods and apparatus have been described in a manner sufficient to enable persons of
ordinary skill in the art to readily adapt commercially available hardware and
software as may be needed to reduce any of the embodiments of the present
invention to practice without undue experimentation and using conventional techniques.
It will be appreciated by persons skilled in the art that the present
invention is not limited by what has been particularly shown and described herein
above. Rather the scope of the invention is defined by the claims that follow:

Claims

1. A visual lossless encoder for processing a video frame prior to compression by a video encoder, the encoder comprising: threshold means for identifying a plurality of visual perception threshold levels to be associated with the pixels of said video frame, wherein said threshold levels define contrast levels above which a human eye can distinguish a pixel from among its neighboring pixels of said frame; filter means for dividing said video frame into portions having different detail dimensions; means, utilizing said threshold levels and said detail dimensions, for associating said pixels of said video frame into subclasses, wherein each subclass includes pixels related to the same detail and which generally cannot be distinguished from each other; and means for altering the intensity of each pixel of said video frame according to its subclass.
2. An encoder according to claim 1 wherein said means for altering comprises an inter-frame processor and an intra-frame processor.
3. An encoder according to claim 2 and wherein said intra-frame processor comprises a controllable filter bank having a plurality of different filters and a filter selector which selects one of said filters for each pixel according to its subclass.
4. An encoder according to claim 2 and wherein said inter-frame processor comprises a low pass filter and a high pass filter operative on a difference frame between a current frame and a previous frame, large and small detail threshold elements for thresholding the filtered difference frame with a large detail threshold level and a small detail threshold level, respectively, and a summer which sums the outputs of said two filters as amended by said threshold elements.
5. An encoder according to claim 1 and wherein said threshold means comprises means for generating a plurality of picture parameters describing at least one of the following parameters: the volume of information in said frame, the per pixel color and the cross-frame change of intensity and means for generating said visual perception threshold from said at least one parameter.
6. A method of visual lossless encoding of frames of a video signal, the
method comprising steps of:
spatially and temporally separating and analyzing details of said frames;
estimating parameters of said details;
defining a visual perception threshold for each of said details
in accordance with said estimated detail parameters; classifying said frame picture details into subclasses in
accordance with said visual perception thresholds and said detail parameters; and
transforming each said frame detail in accordance with its associated subclass.
7. The method according to claim 6 and wherein said step of separating and analyzing also includes the step of spatial high pass filtering of small dimension details and temporal filtering for detail motion analysis.
8. The method according to claim 6 and wherein the step of estimating
comprises at least one of the following steps:
determining NΔ|, a per-pixel signal intensity change between
a current frame and a previous frame, normalized by a maximum
intensity;
determinining a Nl, a normalized volume of intraframe change by high frequency filtering of said frame, summing the
intensities of said filtered frame and normalizing said sum by the
maximum possible amount of information within a frame;
generating Nip, a volume of inter-frame changes between a current frame and its previous frame normalized by a maximum.
possible amount of information volume within a frame;
generating NIGOP, a normalized volume of inter-frame
changes for a group of pictures from the output of said previous step
of generating; evaluating a signal-to-noise ratio SNR by high pass filtering a difference frame between said current frame and said previous
frame by selecting those intensities of said difference frame lower
than a threshold defined as three times a noise level under which
noise intensities are not perceptible to the human eye, summing the intensities of the pixels in the filtered frame and normalizing said sum by said maximum intensity and by the total number of pixels in a frame; generating NYi, a normalized intensity value per-pixel; generating a per-pixel color saturation level pi; generating a per-pixel hue value hi; and determining a per-pixel response R|(h|) to said hue value.
9. The method according to claim 8, wherein said step of defining includes the step of producing said visual perception thresholds, per-pixel, from a minimum threshold value and at least one of said parameters.
10. The method according to claim 8, wherein said step of defining includes the step of producing said visual perception thresholds, per-pixel, according to the following equation,
200 ^
THD: = THD, mm l + NAi + NIxy + NIF +NIG0P + NYi + Pi + (l-Ri(hi))+-
SNR,
wherein THDmin is a minimum threshold level.
11. The method according to claim 6, wherein said step of classifying includes the steps of comparing multiple spatial high frequency levels of a pixel against its associated visual perception threshold and processing said comparison results to associate said pixel with one of said subclasses.
12. The method according to claim 6, wherein said step of transforming includes the step of filtering each subclass with an associated two dimensional low pass filter.
13. The method according to claim 12, wherein said step of transforming includes the steps of generating a difference frame between said current frame and a previous transformed frame, low and high pass filtering of said difference frame, comparing said filtered frames with a large detail threshold and a small detail threshold and summing those portions of said filtered frames which are greater than said thresholds.
14. The method according to claim 13 and wherein said large detail threshold is 2 to 5 percent.
15. The method according to claim 6 and also including the step of rounding the output of said step of transforming.
16. The method according to claim 6, and wherein said intensity is a luminance value.
17. The method according to claim 6, and wherein said intensity is a chrominance value.
PCT/IL2001/000065 2000-01-23 2001-01-23 Method and apparatus for visual lossless image syntactic encoding WO2001054392A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP01942829A EP1260094A4 (en) 2000-01-23 2001-01-23 Method and apparatus for visual lossless image syntactic encoding
AU2001228771A AU2001228771A1 (en) 2000-01-23 2001-01-23 Method and apparatus for visual lossless image syntactic encoding

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
IL134182 2000-01-23
IL134182A IL134182A (en) 2000-01-23 2000-01-23 Method and apparatus for visual lossless pre-processing
US09/524,618 US6473532B1 (en) 2000-01-23 2000-03-14 Method and apparatus for visual lossless image syntactic encoding
US09/524,618 2000-03-14

Publications (2)

Publication Number Publication Date
WO2001054392A2 true WO2001054392A2 (en) 2001-07-26
WO2001054392A3 WO2001054392A3 (en) 2002-01-24

Family

ID=11073740

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IL2001/000065 WO2001054392A2 (en) 2000-01-23 2001-01-23 Method and apparatus for visual lossless image syntactic encoding

Country Status (5)

Country Link
US (4) US6473532B1 (en)
EP (1) EP1260094A4 (en)
AU (1) AU2001228771A1 (en)
IL (1) IL134182A (en)
WO (1) WO2001054392A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2372918A (en) * 2000-10-27 2002-09-04 Dolby Lab Licensing Corp Encoding of signal components
CN100358366C (en) * 2002-07-11 2007-12-26 松下电器产业株式会社 Filtering intensity decision method, moving picture encoding method, and moving picture decoding method

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL134182A (en) 2000-01-23 2006-08-01 Vls Com Ltd Method and apparatus for visual lossless pre-processing
US6753929B1 (en) 2000-06-28 2004-06-22 Vls Com Ltd. Method and system for real time motion picture segmentation and superposition
US7453468B2 (en) * 2000-11-29 2008-11-18 Xerox Corporation Intelligent color to texture converter
US6744818B2 (en) * 2000-12-27 2004-06-01 Vls Com Ltd. Method and apparatus for visual perception encoding
US7277587B2 (en) * 2002-04-26 2007-10-02 Sharp Laboratories Of America, Inc. System and method for lossless video coding
US20040131117A1 (en) * 2003-01-07 2004-07-08 Sheraizin Vitaly S. Method and apparatus for improving MPEG picture compression
US7786988B2 (en) 2003-07-16 2010-08-31 Honeywood Technologies, Llc Window information preservation for spatially varying power conservation
US7602388B2 (en) * 2003-07-16 2009-10-13 Honeywood Technologies, Llc Edge preservation for spatially varying power conservation
US7663597B2 (en) * 2003-07-16 2010-02-16 Honeywood Technologies, Llc LCD plateau power conservation
US7714831B2 (en) 2003-07-16 2010-05-11 Honeywood Technologies, Llc Background plateau manipulation for display device power conservation
US7580033B2 (en) 2003-07-16 2009-08-25 Honeywood Technologies, Llc Spatial-based power savings
US7583260B2 (en) 2003-07-16 2009-09-01 Honeywood Technologies, Llc Color preservation for spatially varying power conservation
US20060020906A1 (en) * 2003-07-16 2006-01-26 Plut William J Graphics preservation for spatially varying display device power conversation
US7636488B2 (en) * 2003-12-18 2009-12-22 Itt Manufacturing Enterprises, Inc. User adjustable image enhancement filtering
US7903902B2 (en) 2004-07-26 2011-03-08 Sheraizin Semion M Adaptive image improvement
US7639892B2 (en) * 2004-07-26 2009-12-29 Sheraizin Semion M Adaptive image improvement
KR100697516B1 (en) * 2004-10-27 2007-03-20 엘지전자 주식회사 Moving picture coding method based on 3D wavelet transformation
US8780957B2 (en) 2005-01-14 2014-07-15 Qualcomm Incorporated Optimal weights for MMSE space-time equalizer of multicode CDMA system
US7526142B2 (en) * 2005-02-22 2009-04-28 Sheraizin Vitaly S Enhancement of decompressed video
US9197912B2 (en) * 2005-03-10 2015-11-24 Qualcomm Incorporated Content classification for multimedia processing
US7169920B2 (en) * 2005-04-22 2007-01-30 Xerox Corporation Photoreceptors
US7760210B2 (en) 2005-05-04 2010-07-20 Honeywood Technologies, Llc White-based power savings
US8755446B2 (en) * 2005-05-04 2014-06-17 Intel Corporation Varying sharpness based on motion in video sequences
US7602408B2 (en) 2005-05-04 2009-10-13 Honeywood Technologies, Llc Luminance suppression power conservation
US8879635B2 (en) 2005-09-27 2014-11-04 Qualcomm Incorporated Methods and device for data alignment with time domain boundary
US8948260B2 (en) 2005-10-17 2015-02-03 Qualcomm Incorporated Adaptive GOP structure in video streaming
US9131164B2 (en) 2006-04-04 2015-09-08 Qualcomm Incorporated Preprocessor method and apparatus
EP1924097A1 (en) * 2006-11-14 2008-05-21 Sony Deutschland Gmbh Motion and scene change detection using color components
TW201001334A (en) * 2008-06-20 2010-01-01 Altek Corp Adjustment method of color tone for digital image and electronic apparatus thereof
US8818744B2 (en) * 2008-10-16 2014-08-26 Tektronix, Inc. Test and measurement instrument and method of switching waveform display styles
US9842410B2 (en) 2015-06-18 2017-12-12 Samsung Electronics Co., Ltd. Image compression and decompression with noise separation

Family Cites Families (128)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US2697758A (en) 1950-08-01 1954-12-21 Rca Corp Gamma correcting circuit
CA1032264A (en) 1974-02-19 1978-05-30 James A. Mendrala Luminance key amplifier
US3961133A (en) 1974-05-24 1976-06-01 The Singer Company Method and apparatus for combining video images with proper occlusion
JPS5571363A (en) 1978-11-24 1980-05-29 Sony Corp Video synthesizing unit
FI842333A (en) 1984-06-08 1985-12-09 Valtion Teknillinen Tutkimuskeskus FOERFARANDE FOER IDENTIFIERING AV DE MEST FOERAENDRADE BILDOMRAODENA I LEVANDE VIDEOSIGNAL.
JP2605780B2 (en) 1988-02-13 1997-04-30 ソニー株式会社 Gamma correction circuit for luminance signal
US6147198A (en) 1988-09-15 2000-11-14 New York University Methods and compositions for the manipulation and characterization of individual nucleic acid molecules
US4947255A (en) 1988-09-19 1990-08-07 The Grass Valley Group, Inc. Video luminance self keyer
US5691777A (en) 1988-10-17 1997-11-25 Kassatly; Lord Samuel Anthony Method and apparatus for simultaneous compression of video, audio and data signals
US5012333A (en) 1989-01-05 1991-04-30 Eastman Kodak Company Interactive dynamic range adjustment system for printing digital images
US6610256B2 (en) 1989-04-05 2003-08-26 Wisconsin Alumni Research Foundation Image processing and analysis of individual nucleic acid molecules
JP2827328B2 (en) 1989-09-28 1998-11-25 ソニー株式会社 Video signal processing device
US5542008A (en) * 1990-02-28 1996-07-30 Victor Company Of Japan, Ltd. Method of and apparatus for compressing image representing signals
US5555557A (en) 1990-04-23 1996-09-10 Xerox Corporation Bit-map image resolution converter with controlled compensation for write-white xerographic laser printing
US5339171A (en) 1990-04-24 1994-08-16 Ricoh Company, Ltd. Image processing apparatus especially suitable for producing smooth-edged output multi-level tone data having fewer levels than input multi-level tone data
JPH0483480A (en) 1990-07-26 1992-03-17 Nippon Hoso Kyokai <Nhk> Polarizing key type image synthesizer
JPH04172066A (en) 1990-11-06 1992-06-19 Hitachi Ltd Video camera
GB2250886B (en) 1990-12-13 1995-06-14 Rank Cintel Ltd Noise reduction in video signals
JP2934036B2 (en) * 1991-03-07 1999-08-16 松下電器産業株式会社 Motion detection method and noise reduction device
JPH04294466A (en) * 1991-03-22 1992-10-19 Ricoh Co Ltd Image processor
US5799111A (en) 1991-06-14 1998-08-25 D.V.P. Technologies, Ltd. Apparatus and methods for smoothing images
DE69214229T2 (en) 1991-08-14 1997-04-30 Agfa Gevaert Nv Method and device for improving the contrast of images
GB9119964D0 (en) 1991-09-18 1991-10-30 Sarnoff David Res Center Pattern-key video insertion
DE4142650B4 (en) 1991-12-21 2006-03-16 Bts Holding International Bv Method and arrangement for deriving a control signal for the insertion of a background signal into parts of a foreground signal
WO1993014600A1 (en) 1992-01-21 1993-07-22 Supermac Technology Method and apparatus for compression and decompression of color image data
US5428398A (en) 1992-04-10 1995-06-27 Faroudja; Yves C. Method and apparatus for producing from a standard-bandwidth television signal a signal which when reproduced provides a high-definition-like video image relatively free of artifacts
US5408542A (en) 1992-05-12 1995-04-18 Apple Computer, Inc. Method and apparatus for real-time lossless compression and decompression of image data
JP2611607B2 (en) 1992-06-29 1997-05-21 日本ビクター株式会社 Scene change detection device
JPH0678320A (en) 1992-08-25 1994-03-18 Matsushita Electric Ind Co Ltd Color adjustment device
JPH06133221A (en) 1992-10-14 1994-05-13 Sony Corp Image pickup device
US5481275A (en) 1992-11-02 1996-01-02 The 3Do Company Resolution enhancement for video display using multi-line interpolation
US5491514A (en) 1993-01-28 1996-02-13 Matsushita Electric Industrial Co., Ltd. Coding apparatus, decoding apparatus, coding-decoding apparatus for video signals, and optical disks conforming thereto
US5565921A (en) 1993-03-16 1996-10-15 Olympus Optical Co., Ltd. Motion-adaptive image signal processing system
US5510824A (en) 1993-07-26 1996-04-23 Texas Instruments, Inc. Spatial light modulator array
GB2282293B (en) 1993-09-10 1997-08-27 Sony Uk Ltd A method of and apparatus for deriving a key signal from a digital video signal
KR0134325B1 (en) 1993-12-16 1998-04-23 배순훈 Preprocessing filter for image data
JPH07203428A (en) * 1993-12-28 1995-08-04 Canon Inc Image processing method and its device
US5586200A (en) 1994-01-07 1996-12-17 Panasonic Technologies, Inc. Segmentation based image compression system
KR960012475B1 (en) 1994-01-18 1996-09-20 대우전자 주식회사 Digital audio coder of channel bit
US5592226A (en) 1994-01-26 1997-01-07 Btg Usa Inc. Method and apparatus for video data compression using temporally adaptive motion interpolation
IL108957A (en) 1994-03-14 1998-09-24 Scidel Technologies Ltd System for implanting an image into a video stream
US5488675A (en) 1994-03-31 1996-01-30 David Sarnoff Research Center, Inc. Stabilizing estimate of location of target region inferred from tracked multiple landmark regions of a video image
KR970010087B1 (en) 1994-04-30 1997-06-21 Daewoo Electronics Co Ltd Postprocessing method for digital image
WO1995030208A2 (en) 1994-05-03 1995-11-09 Philips Electronics N.V. Better contrast/noise by residue image
KR100307618B1 (en) 1994-05-31 2001-11-30 윤종용 Device and method for encoding image
AU3544995A (en) 1994-09-20 1996-04-09 Neopath, Inc. Apparatus for identification and integration of multiple cell patterns
US5648801A (en) 1994-12-16 1997-07-15 International Business Machines Corporation Grayscale printing system
EP0720316B1 (en) 1994-12-30 1999-12-08 Daewoo Electronics Co., Ltd Adaptive digital audio encoding apparatus and a bit allocation method thereof
EP0721286A3 (en) 1995-01-09 2000-07-26 Matsushita Electric Industrial Co., Ltd. Video signal decoding apparatus with artifact reduction
EP0721257B1 (en) 1995-01-09 2005-03-30 Daewoo Electronics Corporation Bit allocation for multichannel audio coder based on perceptual entropy
JPH08191440A (en) 1995-01-10 1996-07-23 Fukuda Denshi Co Ltd Method and device for correcting endoscope image
US5982926A (en) 1995-01-17 1999-11-09 At & T Ipm Corp. Real-time image enhancement techniques
JP3823333B2 (en) 1995-02-21 2006-09-20 株式会社日立製作所 Moving image change point detection method, moving image change point detection apparatus, moving image change point detection system
KR0159370B1 (en) 1995-03-20 1999-01-15 배순훈 Method and apparatus for encoding a video signals using a boundary of an object
US5852475A (en) 1995-06-06 1998-12-22 Compression Labs, Inc. Transform artifact reduction process
US5774593A (en) 1995-07-24 1998-06-30 University Of Washington Automatic scene decomposition and optimization of MPEG compressed video
US5717463A (en) 1995-07-24 1998-02-10 Motorola, Inc. Method and system for estimating motion within a video sequence
US5653234A (en) 1995-09-29 1997-08-05 Siemens Medical Systems, Inc. Method and apparatus for adaptive spatial image filtering
US6463173B1 (en) 1995-10-30 2002-10-08 Hewlett-Packard Company System and method for histogram-based image contrast enhancement
US5850294A (en) 1995-12-18 1998-12-15 Lucent Technologies Inc. Method and apparatus for post-processing images
US5787203A (en) * 1996-01-19 1998-07-28 Microsoft Corporation Method and system for filtering compressed video images
US6957350B1 (en) 1996-01-30 2005-10-18 Dolby Laboratories Licensing Corporation Encrypted and watermarked temporal and resolution layering in advanced television
US5901178A (en) * 1996-02-26 1999-05-04 Solana Technology Development Corporation Post-compression hidden data transport for video
KR100242636B1 (en) 1996-03-23 2000-02-01 윤종용 Signal adaptive post processing system for reducing blocking effect and ringing noise
US5974159A (en) * 1996-03-29 1999-10-26 Sarnoff Corporation Method and apparatus for assessing the visibility of differences between two image sequences
US5844607A (en) 1996-04-03 1998-12-01 International Business Machines Corporation Method and apparatus for scene change detection in digital video compression
GB9607668D0 (en) 1996-04-12 1996-06-12 Snell & Wilcox Ltd Video noise reducer
KR0176601B1 (en) 1996-05-21 1999-05-01 김광호 Picture quality improving method & circuit using low-filtering and histogram equalization
KR100209132B1 (en) 1996-07-11 1999-07-15 전주범 Method for coding contour in block based object coding system
US6037986A (en) 1996-07-16 2000-03-14 Divicom Inc. Video preprocessing method and apparatus with selective filtering based on motion detection
AU727503B2 (en) 1996-07-31 2000-12-14 Canon Kabushiki Kaisha Image filtering method and apparatus
US5914748A (en) 1996-08-30 1999-06-22 Eastman Kodak Company Method and apparatus for generating a composite image using the difference of two images
US6282299B1 (en) * 1996-08-30 2001-08-28 Regents Of The University Of Minnesota Method and apparatus for video watermarking using perceptual masks
US5847772A (en) 1996-09-11 1998-12-08 Wells; Aaron Adaptive filter for video processing applications
JP3806211B2 (en) 1997-01-08 2006-08-09 株式会社リコー Imaging signal processing method and imaging signal processing apparatus
US6005626A (en) 1997-01-09 1999-12-21 Sun Microsystems, Inc. Digital video signal encoder and encoding method
US6522425B2 (en) 1997-02-04 2003-02-18 Fuji Photo Film Co., Ltd. Method of predicting and processing image fine structures
KR100239308B1 (en) 1997-02-18 2000-01-15 전주범 Method and apparatus for adaptively coding contour of image signals
US6055340A (en) 1997-02-28 2000-04-25 Fuji Photo Film Co., Ltd. Method and apparatus for processing digital images to suppress their noise and enhancing their sharpness
US6014172A (en) 1997-03-21 2000-01-11 Trw Inc. Optimized video compression from a single process step
FR2764156B1 (en) * 1997-05-27 1999-11-05 Thomson Broadcast Systems PRETREATMENT DEVICE FOR MPEG II CODING
US6385647B1 (en) 1997-08-18 2002-05-07 Mci Communications Corporations System for selectively routing data via either a network that supports Internet protocol or via satellite transmission network based on size of the data
US6466912B1 (en) 1997-09-25 2002-10-15 At&T Corp. Perceptual coding of audio signals employing envelope uncertainty
US6097848A (en) 1997-11-03 2000-08-01 Welch Allyn, Inc. Noise reduction apparatus for electronic edge enhancement
JP3082724B2 (en) 1997-11-10 2000-08-28 日本電気株式会社 Piezoelectric transformer and method of manufacturing the same
US6130723A (en) 1998-01-15 2000-10-10 Innovision Corporation Method and system for improving image quality on an interlaced video display
DE19805030C2 (en) 1998-02-09 2003-03-27 Sig Combibloc Gmbh Resealable pouring element and flat gable composite package provided with it
US5991464A (en) 1998-04-03 1999-11-23 Odyssey Technologies Method and system for adaptive video image resolution enhancement
US6643398B2 (en) 1998-08-05 2003-11-04 Minolta Co., Ltd. Image correction device, image correction method and computer program product in memory for image correction
US6236751B1 (en) 1998-09-23 2001-05-22 Xerox Corporation Automatic method for determining piecewise linear transformation from an image histogram
EP1119979B1 (en) 1998-09-29 2013-01-23 General Instrument Corporation Method and apparatus for detecting scene changes and adjusting picture coding type in a high definition television encoder
US6559826B1 (en) 1998-11-06 2003-05-06 Silicon Graphics, Inc. Method for modeling and updating a colorimetric reference profile for a flat panel display
US6707487B1 (en) 1998-11-20 2004-03-16 In The Play, Inc. Method for representing real-time motion
US6567116B1 (en) 1998-11-20 2003-05-20 James A. Aman Multiple object tracking system
US6366705B1 (en) 1999-01-28 2002-04-02 Lucent Technologies Inc. Perceptual preprocessing techniques to reduce complexity of video coders
US6404460B1 (en) 1999-02-19 2002-06-11 Omnivision Technologies, Inc. Edge enhancement with background noise reduction in video image processing
US6393148B1 (en) 1999-05-13 2002-05-21 Hewlett-Packard Company Contrast enhancement of an image using luminance and RGB statistical metrics
US6775408B1 (en) 1999-06-25 2004-08-10 Minolta Co., Ltd. Image processor
US6757449B1 (en) 1999-11-17 2004-06-29 Xerox Corporation Methods and systems for processing anti-aliased images
KR100335055B1 (en) 1999-12-08 2002-05-02 구자홍 Method of removal block effect and ringing effect of compressed video signal
IL134182A (en) 2000-01-23 2006-08-01 Vls Com Ltd Method and apparatus for visual lossless pre-processing
US6940545B1 (en) 2000-02-28 2005-09-06 Eastman Kodak Company Face detecting camera and method
US6633654B2 (en) 2000-06-19 2003-10-14 Digimarc Corporation Perceptual modeling of media signals based on local contrast and directional edges
US6782287B2 (en) 2000-06-27 2004-08-24 The Board Of Trustees Of The Leland Stanford Junior University Method and apparatus for tracking a medical instrument based on image registration
US6753929B1 (en) 2000-06-28 2004-06-22 Vls Com Ltd. Method and system for real time motion picture segmentation and superposition
JP2002152772A (en) 2000-08-28 2002-05-24 Fuji Photo Film Co Ltd White balance correcting device, white balance correcting method, density correcting method and recording medium with program for executing the method recorded thereon
US6873442B1 (en) 2000-11-07 2005-03-29 Eastman Kodak Company Method and system for generating a low resolution image from a sparsely sampled extended dynamic range image sensing device
US6744818B2 (en) 2000-12-27 2004-06-01 Vls Com Ltd. Method and apparatus for visual perception encoding
ATE275091T1 (en) 2001-01-24 2004-09-15 Lindberg & Jensen Aps DOSING DEVICE FOR A CONTAINER
US7087021B2 (en) 2001-02-20 2006-08-08 Giovanni Paternostro Methods of screening for genes and agents affecting cardiac function
US7133451B2 (en) 2001-03-05 2006-11-07 Intervideo, Inc. Systems and methods for refreshing macroblocks
US6671324B2 (en) 2001-04-16 2003-12-30 Mitsubishi Electric Research Laboratories, Inc. Estimating total average distortion in a video with variable frameskip
US7075993B2 (en) 2001-06-12 2006-07-11 Digital Interactive Streams, Inc. Correction system and method for enhancing digital video
US7003174B2 (en) 2001-07-02 2006-02-21 Corel Corporation Removal of block encoding artifacts
US6845181B2 (en) 2001-07-12 2005-01-18 Eastman Kodak Company Method for processing a digital image to adjust brightness
JP3867774B2 (en) 2001-10-25 2007-01-10 独立行政法人 宇宙航空研究開発機構 Method for detecting line image in planar image
JP4051196B2 (en) 2001-11-08 2008-02-20 オリンパス株式会社 Noise reduction system, noise reduction method, noise reduction program, and electronic camera
US6894666B2 (en) 2001-12-12 2005-05-17 Samsung Sdi Co., Ltd. Contrast correcting circuit
US7221805B1 (en) 2001-12-21 2007-05-22 Cognex Technology And Investment Corporation Method for generating a focused image of an object
JP2003304549A (en) 2002-04-11 2003-10-24 Olympus Optical Co Ltd Camera and image signal processing system
US7184071B2 (en) 2002-08-23 2007-02-27 University Of Maryland Method of three-dimensional object reconstruction from a video sequence using a generic model
US6835693B2 (en) 2002-11-12 2004-12-28 Eastman Kodak Company Composite positioning imaging element
JP4167097B2 (en) 2003-03-17 2008-10-15 株式会社沖データ Image processing method and image processing apparatus
US7359572B2 (en) 2003-03-26 2008-04-15 Microsoft Corporation Automatic analysis and adjustment of digital images with exposure problems
KR100579883B1 (en) 2004-05-21 2006-05-15 삼성전자주식회사 Gamma Correction apparatus and method capable of preventing noise boost-up
US20060013503A1 (en) 2004-07-16 2006-01-19 Samsung Electronics Co., Ltd. Methods of preventing noise boost in image contrast enhancement
US7639892B2 (en) 2004-07-26 2009-12-29 Sheraizin Semion M Adaptive image improvement
US7526142B2 (en) 2005-02-22 2009-04-28 Sheraizin Vitaly S Enhancement of decompressed video

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
JAMES GOEL ET AL.: 'Pre-processing for MPEG compression using adaptive spatial filtering' IEEE TRANSACTIONS ON CONSUMER ELECTRONICS vol. 41, no. 3, August 1995, pages 687 - 689, XP002943693 *
LAN ET AL.: 'Scene-context-dependent reference-frame placement for MPEG video coding' IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY vol. 9, no. 3, April 1999, pages 478 - 489, XP002943694 *
See also references of EP1260094A2 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2372918A (en) * 2000-10-27 2002-09-04 Dolby Lab Licensing Corp Encoding of signal components
CN100358366C (en) * 2002-07-11 2007-12-26 松下电器产业株式会社 Filtering intensity decision method, moving picture encoding method, and moving picture decoding method

Also Published As

Publication number Publication date
US6473532B1 (en) 2002-10-29
US20050123208A1 (en) 2005-06-09
EP1260094A2 (en) 2002-11-27
WO2001054392A3 (en) 2002-01-24
IL134182A (en) 2006-08-01
AU2001228771A1 (en) 2001-07-31
IL134182A0 (en) 2001-04-30
US7095903B2 (en) 2006-08-22
US6952500B2 (en) 2005-10-04
USRE42148E1 (en) 2011-02-15
US20030067982A1 (en) 2003-04-10
EP1260094A4 (en) 2004-04-07

Similar Documents

Publication Publication Date Title
US6473532B1 (en) Method and apparatus for visual lossless image syntactic encoding
US7920628B2 (en) Noise filter for video compression
US8139883B2 (en) System and method for image and video encoding artifacts reduction and quality improvement
US7957467B2 (en) Content-adaptive block artifact removal in spatial domain
US6862372B2 (en) System for and method of sharpness enhancement using coding information and local spatial features
EP2308232B1 (en) Encoder optimization of stereoscopic video delivery systems
US5150432A (en) Apparatus for encoding/decoding video signals to improve quality of a specific region
US6281942B1 (en) Spatial and temporal filtering mechanism for digital motion video signals
US6466624B1 (en) Video decoder with bit stream based enhancements
US6058210A (en) Using encoding cost data for segmentation of compressed image sequences
US6983078B2 (en) System and method for improving image quality in processed images
EP1506525B1 (en) System for and method of sharpness enhancement for coded digital video
EP1277344A2 (en) Method and apparatus for transcoding an object-based coded picture signal into a block-based coded picture signal
US6873657B2 (en) Method of and system for improving temporal consistency in sharpness enhancement for a video signal
JPH09187035A (en) Preprocessor for digital video data stream
US7574060B2 (en) Deblocker for postprocess deblocking
JPH11331867A (en) Method and device for extracting color difference signal shape information for video of interlaced scanning system
Ishida et al. Motion-JPEG2000 codec compensated for interlaced scanning videos
WO1999059342A1 (en) Method and system for mpeg-2 encoding with frame partitioning
Patel et al. IMAGE AND VIDEO DENOISING USING ADAPTIVE FILTER

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

WWE Wipo information: entry into national phase

Ref document number: 2001942829

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2001942829

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

NENP Non-entry into the national phase

Ref country code: JP

WWR Wipo information: refused in national office

Ref document number: 2001942829

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2001942829

Country of ref document: EP