throbber
12/26/23, 8:20 AM
`
`How video compression works - EE Times
`
`Advertisement
`
` 
`
`< https://www.eetimes.com/designline/signal-
`processing-designline/>
`
`DESIGNLINES | SIGNAL PROCESSING
`DESIGNLINE
`How video compression works
`
`By BDTI 08.06.2007
`
` 0
`
`
`< http://www.bdti.com>
`
`[For a closer look at H.264, see H.264: the codec to watch <
`http://www.dspdesignline.com/howto/showArticle.jhtml?
`articleId=187002904> ]
`
`Digital video compression and decompression algorithms (codecs) are at the heart of many
`modern video products, from DVD players to multimedia <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=multimedia&x=&y=>
`jukeboxes to video-capable cell phones. Understanding the operation of video compression <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=compression&x=&y=>
`algorithms is essential for developers of the systems, processors, and tools that target video
`applications. In this article, we explain the operation and characteristics of video <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=video&x=&y=> codecs
`and the demands codecs make on processors. We also explain how codecs differ from one
`another and the significance of these differences.
`
`Starting with stills
`Because video clips are made up of sequences of individual images, or “frames,” video
`compression algorithms share many concepts and techniques with still-image compression
`algorithms. Therefore, we begin our exploration of video compression by discussing the inner
`workings of transform-based still image compression algorithms such as JPEG, which are
`illustrated in Figure 1.
`
`<H
`
`TTPS://WWW.EETIMES.COM/CATEGORY/SPONSORED-
`CONTENT/>
`
`PARTNER
`CONTENT
`
`VIEW
`ALL
`
`
`
`https://www.eetimes.com/how-video-compression-works/
`
`1/13
`
`1
`
`SAMSUNG-1022
`
`

`

`How video compression works - EE Times
`High-Performance Motor Control Chip with Multi-Core Architecture <
`https://www.eetimes.com/high-performance-motor-control-chip-with-multi-core-
`architecture/>
`ttps://ww
`By Bi Chao, Fortior Technology (Shenzhen) Co. Ltd 12.19.2023
`w.eetimes
`.com/high
`
`< h
`
`12/26/23, 8:20 AM
`
`Solomon Systech Driving the Next-Generation Display Technologies <
`https://www.eetimes.com/solomon-systech-driving-the-next-generation-
`erforma
`display-technologies/>
`nce-
`ttps://ww
`By Stephen Las Marias 12.13.2023
`motor-
`w.eetimes
`control-
`.com/solo
`chip-with-
`Embarking on a New Overseas Journey: G Camp Startup Team Showcases
`mon-
`multi-
`Taiwan’s Technological Innovation Strength at Web Summit <
`systech-
`core-
`https://www.eetimes.com/embarking-on-a-new-overseas-journey-g-camp-
`driving-
`ttps://ww
`architectu
`By EE Times Taiwan 12.13.2023
`the-next-
`w.eetimes
`re/>
`generatio
`.com/emb
`n-display-
`arking-on-
`technolog
`a-new-
`ies/>
`overseas-
`journey-g-
`camp-
`startup-
`team-
`showcase
`s-taiwans-
`technolog
`ical-
`innovatio
`n-
`strength-
`at-web-
`summit/>
`
`Click to enlarge.
`
`The image compression techniques used in JPEG <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=JPEG&x=&y=> and in
`most video compression algorithms are “lossy.” That is, the original uncompressed image can't be
`perfectly reconstructed from the compressed data, so some information from the original image is
`lost. The goal of using lossy compression is to minimize the number of bits that are consumed by
`the image while making sure that the differences between the original (uncompressed) image and
`the reconstructed image are not perceptible—or at least not objectionable—to the human eye.
`
`Switching to frequency
`The first step in JPEG and similar image compression algorithms is to divide the image into small
`blocks and transform each block into a frequency-domain representation. Typically, this step uses a
`discrete cosine transform (DCT) on blocks that are eight pixels wide by eight pixels high. Thus, the
`
`https://www.eetimes.com/how-video-compression-works/
`
`2/13
`
`- p
`
`< h
`
`< h
`
`2
`
`

`

`How video compression works - EE Times
`12/26/23, 8:20 AM
`DCT operates on 64 input pixels and yields 64 frequency-domain coefficients, as shown in Figure
`2. (Transforms other than DCT and block sizes other than eight by eight pixels are used in some
`algorithms. For simplicity, we discuss only the 8×8 DCT in this article.)
`
`Click to enlarge.
`
`The DCT itself is not lossy; that is, an inverse DCT (IDCT) could be used to perfectly reconstruct
`the original 64 pixels from the DCT coefficients. The transform is used to facilitate frequency-based
`compression techniques. The human eye is more sensitive to the information contained in low
`frequencies (corresponding to large features in the image) than to the information contained in high
`frequencies (corresponding to small features). Therefore, the DCT helps separate the more
`perceptually significant information from less perceptually significant information. After the DCT, the
`compression algorithm encodes the low-frequency DCT coefficients with high precision, but uses
`fewer bits to encode < http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?
`term=encode&x=&y=> the high-frequency coefficients. In the decoding algorithm, an IDCT
`transforms the imperfectly coded coefficients back into an 8×8 block of pixels.
`
`A single two-dimensional eight-by-eight DCT or IDCT requires a few hundred instruction cycles on
`a typical DSP such as the Texas Instruments TMS320C55x. Video compression algorithms often
`perform a vast number of DCTs and/or IDCTs per second. For example, an MPEG-4 video decoder
`operating at VGA < http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?
`term=VGA&x=&y=> (640×480) resolution <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=resolution&x=&y=> and
`a frame < http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?
`term=frame&x=&y=> rate of 30 frames per second (fps) would require roughly 216,000 8×8 IDCTs
`per second, depending on the video content. In older video codecs these IDCT computations could
`consume as many as 30% of the processor cycles. In newer, more demanding codec <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=codec&x=&y=>
`algorithms such as H.264, however, the inverse transform (which is often a different transform than
`the IDCT) takes only a few percent of the decoder cycles.
`
`Because the DCT and other transforms operate on small image blocks, the memory requirements
`of these functions are typically negligible compared to the size of frame buffers and other data in
`image and video compression applications.
`Choosing the bits: quantization and coding
`After the block transform is performed, the transform coefficients for each block are compressed
`
`https://www.eetimes.com/how-video-compression-works/
`
`3/13
`
`3
`
`

`

`How video compression works - EE Times
`12/26/23, 8:20 AM
`using quantization and coding. Quantization reduces the precision of the transform coefficients in a
`biased manner: more bits are used for low-frequency coefficients and fewer bits for high-frequency
`coefficients. This takes advantage of the fact, as noted above, that human vision is more sensitive
`to low-frequency information, so the high-frequency information can be more approximate. Many
`bits are discarded in this step. For example, a 12-bit coefficient may be rounded to the nearest of
`32 predetermined values. Each of these 32 values can be represented with a five-bit symbol. In the
`decompression algorithm, the coefficients are “dequantized”; i.e., the five-bit symbol is converted
`back to the 12-bit predetermined value used in the encoder. As illustrated in Figure 3, the
`dequantized coefficients are not equal to the original coefficients, but are close enough so that after
`the inverse transform is applied, the resulting image contains few or no visible artifacts.
`
`Click to enlarge.
`
`In older video algorithms, such as MPEG-2, dequantization can require anywhere from about 3%
`up to about 15% of the processor cycles spent in a video decoding application. Cycles spent on
`dequantization in modern video algorithms (such as H.264) are negligible, as are the memory
`requirements.
`
`Statistically Speaking
`Next, the number of bits used to represent the quantized DCT coefficients is reduced by “coding,”
`which takes advantage of some of the statistical properties of the coefficients. After quantization,
`many of the DCT coefficients—often, the vast majority of the high-frequency coefficients—are zero.
`A technique called “run-length coding” takes advantage of this fact by grouping consecutive zero-
`valued coefficients (a “run”) and encoding the number of coefficients (the “length”) instead of
`encoding the individual zero-valued coefficients.
`
`Run-length coding is typically followed by variable-length coding (VLC). In variable-length coding,
`commonly occurring symbols (representing quantized DCT coefficients or runs of zero-valued
`quantized coefficients) are represented using code words that contain only a few bits, while less
`common symbols are represented with longer code words. By using fewer bits for the most
`common symbols, VLC reduces the average number of bits required to encode a symbol thereby
`reducing the number of bits required to encode the entire image.
`
`On the decompression side, variable-length decoding (VLD) reverses the steps performed by the
`VLC block in the compression algorithm. Variable-length decoding is much more computationally
`demanding than variable-length coding. VLC performs one table lookup per symbol (where a
`symbol is encoded using multiple bits); in contrast, the most straightforward implementation of VLD
`requires a table lookup and some simple decision making to be applied for each bit. VLD requires
`an average of about 11 operations per input bit. Thus, the processing requirements of VLD are
`proportional to the video codec's selected bit <
`
`https://www.eetimes.com/how-video-compression-works/
`
`4/13
`
`4
`
`

`

`How video compression works - EE Times
`12/26/23, 8:20 AM
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=bit&x=&y=> rate. VLD
`can consume as much as 25% of the cycles spent in a video decoder implementation.
`
`In a typical video decompression algorithm, the straightforward VLD implementation described
`above (which operates on one bit at a time) requires several kilobytes of lookup table memory. It is
`possible to improve the performance of the VLD by operating on multiple bits at a time, but this
`optimization requires the use of much larger lookup tables.
`
`Some of the newer standards (such as H.264) replace or augment the run-length coding and VLC
`techniques described above to achieve greater compression. For example, H.264 supports both
`CAVLC (context-adaptive VLC) and CABAC (context-adaptive arithmetic coding). CAVLC
`augments VLC by adapting the coding scheme based on previously-coded coefficients. CABAC
`replaces VLC entirely, using instead a more efficient—but also more computationally demanding—
`scheme of arithmetic coding. CABAC can consume as many as 50% of the cycles in an H.264
`decoder.
`
`Looking at a bigger picture
`All of the techniques described so far operate on each 8×8 block independently from any other
`block. Since images typically contain features that are much larger than an 8×8 block, more
`efficient compression can be achieved by taking into account the similarities between adjacent
`blocks in the image.
`
`To take advantage of inter-block similarities, a prediction step is often added prior to quantization of
`the transform coefficients. In this step, codecs attempt to predict the image information within a
`block using the information from the surrounding blocks. Some codecs (such as MPEG-4) perform
`this step in the frequency domain, by predicting DCT coefficients. Other codecs (such as H.264) do
`this step in the spatial domain, and predict pixels directly. The latter approach is called “intra
`prediction.”
`
`In this step, the encoder attempts to predict the values of some of the DCT coefficients (if done in
`the frequency domain) or pixel <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=pixel&x=&y=> values (if
`done in the spatial domain) in each block based on the coefficients or pixels in the surrounding
`blocks. The encoder then computes the difference between the actual value and the predicted
`value and encodes the difference rather than the actual value. At the decoder, the coefficients are
`reconstructed by performing the same prediction and then adding the difference transmitted by the
`encoder. Because the difference tends to be small compared to the actual coefficient values, this
`technique reduces the number of bits required to represent the DCT coefficients.
`
`In predicting the DCT coefficient or pixel values of a particular block, the decoder has access only
`to the values of surrounding blocks that have already been decoded. Therefore, the encoder must
`predict the DCT coefficients or pixel values of each block based only on the values from previously
`encoded surrounding blocks.
`
`JPEG uses a very rudimentary DCT coefficient prediction scheme, in which only the lowest-
`frequency coefficient (the “DC coefficient”) is predicted using simple differential coding. MPEG-4
`video uses a more sophisticated scheme that attempts to predict the first DCT coefficient in each
`row and each column < http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?
`term=column&x=&y=> of the 8×8 block. This scheme is referred to as “AC-DC prediction.”
`
`AC-DC prediction can require a substantial amount of processing power, and some
`implementations require large data arrays. However, AC-DC prediction is typically only used a
`
`https://www.eetimes.com/how-video-compression-works/
`
`5/13
`
`5
`
`

`

`How video compression works - EE Times
`12/26/23, 8:20 AM
`small fraction of the time—when motion compensation (discussed later) is not used—and usually
`has a negligible impact on average processor load.
`
`As mentioned earlier, in H.264 the prediction is done on pixels directly, and the DCT-like integer
`transform always processes a residual—either from motion estimation or from intra-prediction. In
`H.264, the pixel values are never transformed directly as they are in JPEG or MPEG-4 I-frames. As
`a result, the decoder has to decode <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=decode&x=&y=> the
`transform coefficients and perform the inverse transform in order to obtain the residual, which is
`added to the predicted pixels.
`Working with color
`Color images are typically represented using several “color planes.” For example, an RGB color
`image contains a red color plane, a green color plane, and a blue color plane. When overlaid and
`mixed, the three planes make up the full color image. To compress a color image, the still-image
`compression techniques described earlier can be applied to each color plane in turn.
`
`Imaging and video applications often use a color scheme in which the color planes do not
`correspond to specific colors. Instead, one color plane contains luminance information (the overall
`brightness of each pixel in the color image) and two more color planes contain color (chrominance)
`information that when combined with luminance can be used to derive the specific levels of the red,
`green, and blue components of each image pixel.
`
`Such a color scheme is convenient because the human eye is more sensitive to luminance than to
`color, so the chrominance planes can often be stored and/or encoded at a lower image resolution
`than the luminance information. Specifically, video compression algorithms typically encode the
`chrominance planes with half the horizontal resolution and half the vertical resolution of the
`luminance plane. Thus, for every 16-pixel by 16-pixel region in the luminance plane, each
`chrominance plane contains one 8-pixel by 8-pixel block. In typical video compression algorithms,
`a “macro block” is a 16×16 region in the video frame that contains four 8×8 luminance blocks and
`the two corresponding 8×8 chrominance blocks.
`
`Adding Motion to the Mix
`Video compression algorithms share many of the compression techniques used in still-
`image compression. A key difference, however, is that video compression can take
`advantage of the similarities between successive video frames to achieve even better
`compression ratios. Using the techniques described earlier, still-image compression
`algorithms such as JPEG can achieve good image quality at a compression ratio of about
`10:1. The most advanced still-image codecs may achieve good image quality at
`compression ratios as high as 30:1. In contrast, video compression algorithms can provide
`good video quality at ratios of up to 200:1. This increased efficiency is accomplished with
`the addition of video-specific compression techniques such as motion estimation and
`motion compensation.
`
`For each macro block in the current frame, motion estimation attempts to find a region in a
`previously encoded frame (called a “reference frame”) that is a close match. The spatial
`offset between the current block and selected block from the reference frame is called a
`“motion vector,” as shown in Figure 4. The encoder computes the pixel-by-pixel difference
`between the selected block from the reference frame and the current block and transmits
`this “prediction error” along with the motion vector. Most video compression standards
`allow motion-based prediction to be bypassed if the encoder fails to find a good match for
`the macro block. In this case, the macro block itself is encoded instead of the prediction
`error.
`https://www.eetimes.com/how-video-compression-works/
`
`6/13
`
`6
`
`

`

`12/26/23, 8:20 AM
`
`How video compression works - EE Times
`
`Click to enlarge.
`
`Note that the reference frame isn't always the previously displayed frame in the sequence of video
`frames. Video compression algorithms commonly encode frames in a different order from the order
`in which they are displayed. The encoder may skip several frames ahead and encode a future
`video frame, then skip backward and encode the next frame in the display sequence. This is done
`so that motion estimation can be performed backward in time, using the encoded future frame as a
`reference frame. Video compression algorithms also commonly allow the use of two reference
`frames—one previously displayed frame and one previously encoded future frame.
`
`Video compression algorithms periodically encode one video frame using still-image coding
`techniques only, without relying on previously encoded frames. These frames are called “intra
`frames” or “I-frames.” If a frame in the compressed bit stream is corrupted by errors, the video
`decoder can “restart” at the next I-frame, which doesn't require a reference frame for
`reconstruction.
`
`As shown in Figure 5, frames that are encoded using only a previously displayed reference frame
`are called “P-frames,” and frames that are encoded using both future and previously displayed
`reference frames are called “B-frames.” A typical sequence of frames is illustrated in Figure 5[d].
`
`Click to enlarge.
`
`One factor that complicates motion estimation is that the displacement of an object from the
`reference frame to the current frame may be a non-integer number of pixels. For example,
`suppose that an object has moved 22.5 pixels to the right and 17.25 pixels upward. To handle such
`situations, modern video compression standards allow motion vectors to have non-integer values;
`motion vector resolutions of one-half or one-quarter of a pixel are common. To support searching
`for block matches at partial-pixel displacements, the encoder must use interpolation to estimate the
`reference frame's pixel values at non-integer locations.
`
`https://www.eetimes.com/how-video-compression-works/
`
`7/13
`
`7
`
`

`

`How video compression works - EE Times
`12/26/23, 8:20 AM
`The simplest and most thorough way to perform motion estimation is to evaluate every possible
`16×16 region in the search area, and select the best match. Typically, a “sum of absolute
`differences” (SAD) or “sum of squared differences” (SSD) computation is used to determine how
`closely a candidate 16×16 region matches a macro block. The SAD or SSD is often computed for
`the luminance plane only, but can also include the chrominance planes. But this approach can be
`overly demanding on processors: exhaustively searching an area of 48×24 pixels requires over 8
`billion arithmetic operations per second at QVGA (640×480) video resolution and a frame rate of 30
`frames per second.
`
`Because of this high computational load, practical implementations of motion estimation do not use
`an exhaustive search. Instead, motion estimation algorithms use various methods to select a
`limited number of promising candidate motion vectors (roughly 10 to 100 vectors in most cases)
`and evaluate only the 16×16 regions corresponding to these candidate vectors. One approach is to
`select the candidate motion vectors in several stages. For example, five initial candidate vectors
`may be selected and evaluated. The results are used to eliminate unlikely portions of the search
`area and zero in on the most promising portion of the search area. Five new vectors are selected
`and the process is repeated. After a few such stages, the best motion vector found so far is
`selected.
`
`Another approach analyzes the motion vectors previously selected for surrounding macro blocks in
`the current and previous frames in an effort to predict the motion in the current macro block. A
`handful of candidate motion vectors are selected based on this analysis, and only these vectors
`are evaluated.
`
`By selecting a small number of candidate vectors instead of scanning the search area exhaustively,
`the computational demand of motion estimation can be reduced considerably—sometimes by over
`two orders of magnitude. But there is a tradeoff between processing load and image quality or
`compression efficiency: in general, searching a larger number of candidate motion vectors allows
`the encoder to find a block in the reference frame that better matches each block in the current
`frame, thus reducing the prediction error. The lower the predication error, the fewer bits that are
`needed to encode the image. So increasing the number of candidate vectors allows a reduction in
`compressed bit rate, at the cost of performing more SAD (or SSD) computations. Or, alternatively,
`increasing the number of candidate vectors while holding the compressed bit rate constant allows
`the prediction error to be encoded with higher precision, improving image quality.
`
`Some codecs (including H.264) allow a 16×16 macroblock to be subdivided into smaller blocks
`(e.g., various combinations of 8×8, 4×8, 8×4, and 4×4 blocks) to lower the prediction error. Each of
`these smaller blocks can have its own motion vector. The motion estimation search for such a
`scheme begins by finding a good position for the entire 16×16 block. If the match is close enough,
`there's no need to subdivide further. But if the match is poor, then the algorithm starts at the best
`position found so far, and further subdivides the original block into 8×8 blocks. For each 8×8 block,
`the algorithm searches for the best position near the position selected by the 16×16 search.
`Depending on how quickly a good match is found, the algorithm can continue the process using
`smaller blocks of 8×4, 4×8, etc.
`
`Even with drastic reductions in the number of candidate motion vectors, motion estimation is the
`most computationally demanding task in video compression applications. The inclusion of motion
`estimation makes video encoding much more computationally demanding than decoding. Motion
`estimation can require as much as 80% of the processor cycles spent in the video encoder.
`Therefore, many processors targeting multimedia applications provide a specialized instruction to
`accelerate SAD computations or a dedicated SAD coprocessor to offload this task from the CPU.
`
`https://www.eetimes.com/how-video-compression-works/
`
`8/13
`
`8
`
`

`

`How video compression works - EE Times
`12/26/23, 8:20 AM
`For example, ARM's ARM11 core provides an instruction to accelerate SAD computation, and
`some members of Texas Instruments' TMS320C55x family of DSPs provide an SAD coprocessor.
`
`Note that in order to perform motion estimation, the encoder must keep one or two reference
`frames in memory in addition to the current frame. The required frame buffers are very often larger
`than the available on-chip memory, requiring additional memory chips in many applications.
`Keeping reference frames in off-chip memory results in very high external memory bandwidth in
`the encoder, although large on-chip caches can help reduce the required bandwidth <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=bandwidth&x=&y=>
`considerably.
`
`Secret Encoders
`In addition to the two approaches described above, many other methods for selecting appropriate
`candidate motion vectors exist, including a wide variety of proprietary solutions. Most video
`compression standards specify only the format of the compressed video bit stream and the
`decoding steps and leave the encoding process undefined so that encoders can employ a variety
`of approaches to motion estimation. The approach to motion estimation is the largest differentiator
`among video encoder implementations that comply with a common standard. The choice of motion
`estimation technique significantly affects computational requirements and video quality; therefore,
`details of the approach to motion estimation in commercially available encoders are often closely
`guarded trade secrets.
`
`Motion Compensation
`In the video decoder, motion compensation uses the motion vectors encoded in the compressed bit
`stream to predict the pixels in each macro block. If the horizontal and vertical components of the
`motion vector are both integer values, then the predicted macro block is simply a copy of the 16-
`pixel by 16-pixel region of the reference frame. If either component of the motion vector has a non-
`integer value, interpolation is used to estimate the image at non-integer pixel locations. Next, the
`prediction error is decoded and added to the predicted macro block in order to reconstruct the
`actual macro block pixels. As mentioned earlier, the 16×16 macroblock may be subdivided into
`smaller sections with independent motion vectors.
`
`Compared to motion estimation, motion compensation is much less computationally demanding.
`While motion estimation must perform SAD or SSD computation on a number of 16-pixel by 16-
`pixel regions in an attempt to find the best match for each macro block, motion compensation
`simply copies or interpolates one such region for each macro block. Still, motion compensation can
`consume as much as 40% of the processor cycles in a video decoder, though this number varies
`greatly depending on the content of a video sequence, the video compression standard, and the
`decoder implementation. For example, the motion compensation workload can comprise as little as
`5% of the processor cycles spent in the decoder for a frame that makes little use of interpolation.
`
`Like motion estimation, motion compensation requires the video decoder to keep one or two
`reference frames in memory, often requiring external memory chips for this purpose. However,
`motion compensation makes fewer accesses to reference frame buffers than does motion
`estimation. Therefore, memory bandwidth requirements are less stringent for motion compensation
`compared to motion estimation, although high memory bandwidth is still desirable for best
`processor performance.
`Polishing the image: deblocking and deringing
`Ideally, lossy image and video compression algorithms discard only perceptually insignificant
`information, so that to the human eye the reconstructed image or video sequence appears identical
`to the original uncompressed image or video. In practice, however, some artifacts may be visible.
`This can happen due to a poor encoder implementation, video content that is particularly
`https://www.eetimes.com/how-video-compression-works/
`
`9/13
`
`9
`
`

`

`How video compression works - EE Times
`12/26/23, 8:20 AM
`challenging to encode, or a selected bit rate that is too low for the video sequence, resolution, and
`frame rate. The latter case is particularly common, since many applications trade off video quality
`for a reduction in storage and/or bandwidth requirements.
`
`Two types of artifacts, “blocking” and “ringing,” are common in video compression applications.
`Blocking artifacts are due to the fact that compression algorithms divide each frame into 8×8
`blocks. Each block is reconstructed with some small errors, and the errors at the edges of a block
`often contrast with the errors at the edges of neighboring blocks, making block boundaries visible.
`In contrast, ringing artifacts appear as distortions around the edges of image features. Ringing
`artifacts are due to the encoder discarding too much information in quantizing the high-frequency
`DCT coefficients.
`
`Video compression applications often employ filters following decompression to reduce blocking
`and ringing artifacts. These filtering steps are known as “deblocking” and “deringing,” respectively.
`Both deblocking and deringing use low-pass FIR (finite impulse response) filters to hide these
`visible artifacts.
`
`Deblocking and deringing filters are fairly computationally demanding. Combined, these filters can
`easily consume more processor cycles than the video decoder itself. For example, an MPEG-4
`simple-profile, level 1 (176×144 pixel, 15 fps) decoder optimized for the ARM9E general-purpose
`processor core requires that the processor be run at an instruction cycle rate of about 14 MHz <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=MHz&x=&y=> when
`decoding a moderately complex video stream. If deblocking is added, the processor must be run at
`33 MHz. If deringing and deblocking are both added, the processor must be run at about 39 MHz—
`nearly three times the clock < http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?
`term=clock&x=&y=> rate required for the video decompression algorithm alone.
`
`Post-processing or in-line?
`Deblocking and deringing filters can be applied to video frames as a separate post-processing step
`that is independent of video decompression. This approach provides system designers the
`flexibility to select the best deblocking and/or deringing filters for their application or to forego these
`filters entirely in order to reduce computational demands. With this approach, the video decoder
`uses each unfiltered reconstructed frame as a reference frame for decoding future video frames,
`and an additional frame buffer <
`http://www.dspdesignline.com/encyclopedia/defineterm.jhtml?term=buffer&x=&y=> is
`required for the final filtered video output.
`
`Alternatively, deblocking and/or deringing can be integrated into the video decompression
`algorithm. This approach, sometimes referred to as “loop filtering,” uses the filtered reconstructed
`frame as the reference frame for decoding future video frames. This approach requires the video
`encoder to perform the same deblocking and/or deringing filtering steps as the decoder in order to
`keep each reference frame used in encoding identical to that used in decoding. The need to
`perform filtering in the encoder increases processor performance requirements for encoding, but
`can improve image quality, especially for very low bit rates. In addition, the extra frame buffer that
`is required when deblocking and/or deringing are implemented as a separate post-processing step
`is not needed when deblocking and deringing are integrated into the decompression algorithm.
`Figure 6 illustrates deblocking and deringing applied “in-loop” or as a post-processing step. H.264,
`for example, includes an “in-loop” deblocking filter, sometimes referred to as the “loop filter.”
`
`https://www.eetimes.com/how-video-compression-works/
`
`10/13
`
`10
`
`

`

`12/26/23, 8:20 AM
`
`How video compression works - EE Times
`
`Color Space Conversion
`One complication of compressing video streams is that the way color is represented in codecs is
`different from the way it is represented by video cameras and displays. As noted above, video
`compression algorithms typically represent color images using luminance and chrominance planes.
`In contrast, video cameras and displays typically mix red, green, and blue light to represent
`different colors. Therefore, the red, green, and blue pixels captured by a camera must be
`converted into luminance and chrominance values for video encoding, and the luminance and
`chrominance pixels output by the video decoder must be converted to specific levels of red, green,
`and blue for display. The algorithm for this conversion requires about 12 arithmetic operations per
`image pixel, not including the interpolation needed to compensate for the fact that the chrominance
`planes have a lower resolution than the luminance pla

This document is available on Docket Alarm but you must sign up to view it.


Or .

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge
throbber

Still Working On It

This document is taking longer than usual to download. This can happen if we need to contact the court directly to obtain the document and their servers are running slowly.

Give it another minute or two to complete, and then try the refresh button.

throbber

A few More Minutes ... Still Working

It can take up to 5 minutes for us to download a document if the court servers are running slowly.

Thank you for your continued patience.

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

  • Up-to-date information for this case.
  • Email alerts whenever there is an update.
  • Full text search for other cases.
  • Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.


Access Government Site

We are redirecting you
to a mobile optimized page.





Document Unreadable or Corrupt

Refresh this Document
Go to the Docket

We are unable to display this document.

Refresh this Document
Go to the Docket