`
INTERNATIONAL STANDARD
`
`ISO/IEC
`
11172-2
`
`First edition
`1993-08-01
`
`Information technology — Coding of
`moving pictures and associated audio for
`digital storage media at up to about
`1,5 Mbit/s —
`
`Part 2:
`
`Video
`
Technologies de l'information - Codage de l'image animée et du son
associé pour les supports de stockage numérique jusqu'à environ
1,5 Mbit/s -

Partie 2: Vidéo
`
`
`
`Reference number
ISO/IEC 11172-2:1993(E)
`
`PAGE 1 OF 124
`
`I.M.L. SLU'S EXHIBIT 1006
`
`
`
`ISO/IEC 11172-2: 1993 (E)
`
Contents

Foreword
Introduction

Section 1: General
1.1 Scope
1.2 Normative references

Section 2: Technical elements
2.1 Definitions
2.2 Symbols and abbreviations
2.3 Method of describing bitstream syntax
2.4 Requirements

Annexes
A 8 by 8 Inverse discrete cosine transform
B Variable length code tables
C Video buffering verifier
D Guide to encoding video
E Bibliography
F List of patent holders
`
`© ISO/IEC 1993
`All rights reserved. No part of this publication may be reproduced or utilized in any form or by
`any means, electronic or mechanical, including photocopying and microfilm, without
`permission in writing from the publisher.
`
ISO/IEC Copyright Office - Case Postale 56 - CH-1211 Genève 20 - Switzerland
`
`Printed in Switzerland.
`
`Foreword
`
ISO (the International Organization for Standardization) and IEC (the International
Electrotechnical Commission) form the specialized system for worldwide standardization.
National bodies that are members of ISO or IEC participate in the development of
International Standards through technical committees established by the respective
organization to deal with particular fields of technical activity. ISO and IEC technical
committees collaborate in fields of mutual interest. Other international organizations,
governmental and non-governmental, in liaison with ISO and IEC, also take part in the work.

In the field of information technology, ISO and IEC have established a joint technical
committee, ISO/IEC JTC 1. Draft International Standards adopted by the joint technical
committee are circulated to national bodies for voting. Publication as an International
Standard requires approval by at least 75 % of the national bodies casting a vote.

International Standard ISO/IEC 11172-2 was prepared by Joint Technical Committee
ISO/IEC JTC 1, Information technology, Sub-Committee SC 29, Coded representation of
audio, picture, multimedia and hypermedia information.

ISO/IEC 11172 consists of the following parts, under the general title Information
technology - Coding of moving pictures and associated audio for digital storage media
at up to about 1,5 Mbit/s:
`
`— Part 1: Systems
`
`— Part 2: Video
`
`— Part 3: Audio
`
`— Part 4: Compliance testing
`
Annexes A, B and C form an integral part of this part of ISO/IEC 11172.
`Annexes D, E and F are for information only.
`
`Introduction
`
Note - Readers interested in an overview of the MPEG Video layer should read this Introduction and then
proceed to annex D, before returning to clauses 1 and 2.
`
0.1 Purpose
`
`This part of ISO/IEC 11172 was developed in response to the growing need for a common format for
`representing compressed video on various digital storage media such as CDs, DATs, Winchester disks and
`optical drives. This part of ISO/IEC 11172 specifies a coded representation that can be used for
`compressing video sequences to bitrates around 1,5 Mbit/s. The use of this part of ISO/IEC 11172 means
that motion video can be manipulated as a form of computer data and can be transmitted and received over
`existing and future networks. The coded representation can be used with both 625-line and 525-line
`television and provides flexibility for use with workstation and personal computer displays.
`
`This part of ISO/IEC 11172 was developed to operate principally from storage media offering a continuous
`transfer rate of about 1,5 Mbit/s. Nevertheless it can be used more widely than this because the approach
`taken is generic.
`
`0.1.1 Coding parameters
`
`The intention in developing this part of ISO/IEC 11172 has been to define a source coding algorithm with a
`large degree of flexibility that can be used in many different applications. To achieve this goal, a number of
`the parameters defining the characteristics of coded bitstreams and decoders are contained in the bitstream
itself. This allows, for example, the algorithm to be used for pictures with a variety of sizes and aspect
ratios and on channels or devices operating at a wide range of bitrates.
`
`Because of the large range of the characteristics of bitstreams that can be represented by this part of ISO/IEC
11172, a subset of these coding parameters known as the "Constrained Parameters" has been defined. The
`aim in defining the constrained parameters is to offer guidance about a widely useful range of parameters.
`Conforming to this set of constraints is not a requirement of this part of ISO/IEC 11172. A flag in the
`bitstream indicates whether or not it is a Constrained Parameters bitstream.
`
Summary of the Constrained Parameters:

Horizontal picture size
Vertical picture size
Picture area
Pel rate
Picture rate
Motion vector range                Less than -64 to +63,5 pels (using half-pel vectors)
                                   [backward_f_code and forward_f_code <= 4 (see table D.7)]
Input buffer size (in VBV model)   Less than or equal to 327 680 bits
Bitrate                            Less than or equal to 1 856 000 bits/s (constant bitrate)
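As an illustration only, the limits that carry numeric values in the summary above (the f_code bound on the motion vector range, the input buffer size and the bitrate) can be checked mechanically. The sketch below is not part of this part of ISO/IEC 11172: the class `StreamParams`, its field names and the function `exceeded_limits` are hypothetical, and the picture size, picture area, pel rate and picture rate limits are not checked.

```python
from dataclasses import dataclass

# Limits quoted in the summary of the Constrained Parameters.
MAX_F_CODE = 4
MAX_VBV_BUFFER_BITS = 327_680
MAX_BITRATE_BITS_PER_S = 1_856_000


@dataclass
class StreamParams:
    """Hypothetical container; the field names are illustrative only."""
    forward_f_code: int
    backward_f_code: int
    vbv_buffer_bits: int
    bitrate_bits_per_s: int


def exceeded_limits(p):
    """Return the names of the checked limits that the stream exceeds."""
    checks = [
        ("forward_f_code", p.forward_f_code, MAX_F_CODE),
        ("backward_f_code", p.backward_f_code, MAX_F_CODE),
        ("vbv_buffer_bits", p.vbv_buffer_bits, MAX_VBV_BUFFER_BITS),
        ("bitrate_bits_per_s", p.bitrate_bits_per_s, MAX_BITRATE_BITS_PER_S),
    ]
    return [name for name, value, limit in checks if value > limit]
```

An empty result means only that none of the four checked limits is exceeded; it does not by itself establish that a bitstream is a Constrained Parameters bitstream.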
`
`
`
`
`
`
`
`
`
`
`
`
`0.2 Overview of the algorithm
`
`The coded representation defined in this part of ISO/IEC 11172 achieves a high compression ratio while
`preserving good picture quality. The algorithm is not lossless as the exact pel values are not preserved
`during coding. The choice of the techniques is based on the need to balance a high picture quality and
`compression ratio with the requirement to make random access to the coded bitstream. Obtaining good
`picture quality at the bitrates of interest demands a very high compression ratio, which is not achievable
with intraframe coding alone. The need for random access, however, is best satisfied with pure intraframe
`coding. This requires a careful balance between intra- and interframe coding and between recursive and non-
`recursive temporal redundancy reduction.
`
`A number of techniques are used to achieve a high compression ratio. The first, which is almost
`independent from this part of ISO/IEC 11172, is to select an appropriate spatial resolution for the signal.
`The algorithm then uses block-based motion compensation to reduce the temporal redundancy. Motion
`compensation is used for causal prediction of the current picture from a previous picture, for non-causal
`prediction of the current picture from a future picture, or for interpolative prediction from past and future
`pictures. Motion vectors are defined for each 16-pel by 16-line region of the picture. The difference signal,
`the prediction error, is further compressed using the discrete cosine transform (DCT) to remove spatial
`correlation before it is quantized in an irreversible process that discards the less important information.
`Finally, the motion vectors are combined with the DCT information, and coded using variable length codes.
`
`0.2.1 Temporal processing
`
`Because of the conflicting requirements of random access and highly efficient compression, three main
`picture types are defined. Intra-coded pictures (I-Pictures) are coded without reference to other pictures.
`They provide access points to the coded sequence where decoding can begin, but are coded with only a
`moderate compression ratio. Predictive coded pictures (P-Pictures) are coded more efficiently using motion
`compensated prediction from a past intra or predictive coded picture and are generally used as a reference for
further prediction. Bidirectionally-predictive coded pictures (B-Pictures) provide the highest degree of
`compression but require both past and future reference pictures for motion compensation. Bidirectionally-
`predictive coded pictures are never used as references for prediction. The organisation of the three picture
`types in a sequence is very flexible. The choice is left to the encoder and will depend on the requirements of
`the application. Figure 1 illustrates the relationship between the three different picture types.
`
Figure 1 - Example of temporal picture structure
`
The fourth picture type defined in this part of ISO/IEC 11172, the D-picture, is provided to allow a simple,
but limited quality, fast-forward playback mode.
`
`0.2.2 Motion representation - macroblocks
`
The choice of 16 by 16 macroblocks for the motion-compensation unit is a result of the trade-off between
increasing the coding efficiency provided by using motion information and the overhead needed to store it.
Each macroblock can be one of a number of different types. For example, intra-coded, forward-predictive-
coded, backward-predictive-coded, and bidirectionally-predictive-coded macroblocks are permitted in
bidirectionally-predictive coded pictures. Depending on the type of the macroblock, motion vector
information and other side information are stored with the compressed prediction error signal in each
macroblock. The motion vectors are encoded differentially with respect to the last coded motion vector,
using variable-length codes. The maximum length of the vectors that may be represented can be
programmed, on a picture-by-picture basis, so that the most demanding applications can be met without
compromising the performance of the system in more normal situations.
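The idea of coding each motion vector as a difference from the last coded vector can be sketched as follows. This is a simplified illustration, not the procedure of this part of ISO/IEC 11172: the function names are hypothetical, the starting value of zero is an assumption, and the sketch ignores both the rules for resetting the reference vector and the range limit that is programmed per picture.

```python
def to_differences(vectors, start=(0, 0)):
    """Replace each (x, y) vector by its difference from the previous one.

    `start` is an assumed initial reference vector.
    """
    previous = start
    differences = []
    for x, y in vectors:
        differences.append((x - previous[0], y - previous[1]))
        previous = (x, y)
    return differences


def from_differences(differences, start=(0, 0)):
    """Rebuild the vectors by accumulating the differences."""
    previous = start
    vectors = []
    for dx, dy in differences:
        previous = (previous[0] + dx, previous[1] + dy)
        vectors.append(previous)
    return vectors
```

When neighbouring macroblocks move in a similar way the differences are small, which is what makes variable-length codes effective for them.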
`
`It is the responsibility of the encoder to calculate appropriate motion vectors. This part of ISO/IEC 11172
`does not specify how this should be done.
`
0.2.3 Spatial redundancy reduction
`
Both original pictures and prediction error signals have high spatial redundancy. This part of ISO/IEC
11172 uses a block-based DCT method with visually weighted quantization and run-length coding. Each 8
by 8 block of the original picture for intra-coded macroblocks or of the prediction error for predictive-coded
macroblocks is transformed into the DCT domain where it is scaled before being quantized. After
quantization many of the coefficients are zero in value and so two-dimensional run-length and variable
length coding is used to encode the remaining coefficients efficiently.
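A minimal sketch of the transform step, assuming the usual orthonormal form of the 8 by 8 forward DCT and, as a deliberate simplification, one uniform quantizer step for every coefficient. The visually weighted quantization described above is not modelled, and the function names are hypothetical.

```python
import math

N = 8  # blocks are 8 by 8


def _scale(k):
    # Normalisation factor of the orthonormal DCT: 1/sqrt(2) for the zero frequency.
    return 1 / math.sqrt(2) if k == 0 else 1.0


def dct_8x8(block):
    """Forward 8 by 8 DCT of a block given as a list of 8 rows of 8 values."""
    coefficients = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            total = 0.0
            for x in range(N):
                for y in range(N):
                    total += (block[x][y]
                              * math.cos((2 * x + 1) * u * math.pi / 16)
                              * math.cos((2 * y + 1) * v * math.pi / 16))
            coefficients[u][v] = 0.25 * _scale(u) * _scale(v) * total
    return coefficients


def quantize(coefficients, step):
    """Uniform quantization (an assumption; not the weighted scheme of the standard)."""
    return [[int(round(c / step)) for c in row] for row in coefficients]
```

For a flat block every coefficient except the zero-frequency one vanishes, which shows why smooth picture areas leave mostly zeros after quantization.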
`
`0.3 Encoding
`
This part of ISO/IEC 11172 does not specify an encoding process. It specifies the syntax and semantics of
the bitstream and the signal processing in the decoder. As a result, many options are left open to encoders
to trade off cost and speed against picture quality and coding efficiency. This clause is a brief description of
the functions that need to be performed by an encoder. Figure 2 shows the main functional blocks.
`
[Figure 2: block diagram; recoverable labels: Source input pictures, Motion Estimator]

where

DCT is discrete cosine transform
DCT^-1 is inverse discrete cosine transform
Q is quantization
Q^-1 is dequantization
VLC is variable length coding

Figure 2 - Simplified video encoder block diagram
`
The input video signal must be digitized and represented as a luminance and two colour difference signals
(Y, Cb, Cr). This may be followed by preprocessing and format conversion to select an appropriate
window, resolution and input format. This part of ISO/IEC 11172 requires that the colour difference
signals (Cb and Cr) are subsampled with respect to the luminance by 2:1 in both vertical and horizontal
directions and are reformatted, if necessary, as a non-interlaced signal.
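The 2:1 subsampling of a colour difference plane in both directions can be sketched as below. Averaging each 2 by 2 group of samples is only one possible choice and is an assumption of this sketch; the text above does not prescribe a filter. The function name is hypothetical, and the plane is assumed to have even width and height.

```python
def subsample_2x2(plane):
    """Halve a plane in both directions by averaging each 2 by 2 group.

    `plane` is a list of rows of integer samples with even width and height.
    The "+ 2" rounds the integer average to the nearest value.
    """
    height, width = len(plane), len(plane[0])
    return [
        [
            (plane[y][x] + plane[y][x + 1]
             + plane[y + 1][x] + plane[y + 1][x + 1] + 2) // 4
            for x in range(0, width - 1, 2)
        ]
        for y in range(0, height - 1, 2)
    ]
```

The result has half as many rows and half as many columns as the input, so each colour difference plane carries one quarter as many samples as the luminance plane.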
`
The encoder must choose which picture type to use for each picture. Having defined the picture types, the
encoder estimates motion vectors for each 16 by 16 macroblock in the picture. In P-Pictures one vector is
needed for each non-intra macroblock and in B-Pictures one or two vectors are needed.
`
`If B-Pictures are used, some reordering of the picture sequence is necessary before encoding. Because B-
`Pictures are coded using bidirectional motion compensated prediction, they can only be decoded after the
subsequent reference picture (an I or P-Picture) has been decoded. Therefore the pictures are reordered by the
`encoder so that the pictures arrive at the decoder in the order for decoding. The correct display order is
`recovered by the decoder.
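The reordering described above can be sketched as follows: each B-Picture is held back until the reference picture that follows it in display order has been emitted. The representation of pictures as strings such as "I0" or "B1" (type letter followed by display index) and the function name are assumptions made for the illustration.

```python
def display_to_coded(display_order):
    """Reorder pictures from display order to coded order.

    Each picture is a string whose first character is its type: I, P or B.
    """
    coded = []
    pending_b = []  # B-Pictures waiting for their future reference picture
    for picture in display_order:
        if picture[0] == "B":
            pending_b.append(picture)
        else:
            coded.append(picture)    # the reference picture goes first
            coded.extend(pending_b)  # then the B-Pictures that depend on it
            pending_b = []
    # Trailing B-Pictures with no later reference are appended unchanged.
    coded.extend(pending_b)
    return coded
```

For the display order I0 B1 B2 P3 B4 B5 P6 this yields the coded order I0 P3 B1 B2 P6 B4 B5.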
`
`The basic unit of coding within a picture is the macroblock. Within each picture, macroblocks are encoded
`in sequence, left to right, top to bottom. Each macroblock consists of six 8 by 8 blocks: four blocks of
`luminance, one block of Cb chrominance, and one block of Cr chrominance. See figure 3. Note that the
`picture area covered by the four blocks of luminance is the same as the area covered by each of the
chrominance blocks. This is due to subsampling of the chrominance information to match the sensitivity of
`the human visual system.
`
Figure 3 - Macroblock structure (block labels: Y, Cb, Cr)
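The macroblock structure leads to simple arithmetic for the number of macroblocks and blocks in a picture. The sketch below assumes that a picture whose size is not a multiple of 16 is covered by rounding the macroblock count up; the function names are hypothetical.

```python
MACROBLOCK_SIZE = 16       # luminance pels (and lines) per macroblock side
BLOCKS_PER_MACROBLOCK = 6  # four luminance blocks, one Cb block, one Cr block


def macroblock_grid(width, height):
    """Return (columns, rows) of macroblocks covering the picture.

    Rounding up for sizes that are not multiples of 16 is an assumption
    of this sketch.
    """
    columns = (width + MACROBLOCK_SIZE - 1) // MACROBLOCK_SIZE
    rows = (height + MACROBLOCK_SIZE - 1) // MACROBLOCK_SIZE
    return columns, rows


def block_count(width, height):
    """Number of 8 by 8 blocks (luminance and chrominance) in the picture."""
    columns, rows = macroblock_grid(width, height)
    return columns * rows * BLOCKS_PER_MACROBLOCK
```

For example, a picture of 352 pels by 288 lines is covered by 22 by 18 macroblocks, that is 396 macroblocks and 2 376 blocks.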
`
`Firstly, for a given macroblock, the coding mode is chosen. It depends on the picture type, the
`effectiveness of motion compensated prediction in that local region, and the nature of the signal within the
`block. Secondly, depending on the coding mode, a motion compensated prediction of the contents of the
`block based on past and/or future reference pictures is formed. This prediction is subtracted from the actual
`data in the current macroblock to form an error signal. Thirdly, this error signal is separated into 8 by 8
blocks (4 luminance and 2 chrominance blocks in each macroblock) and a discrete cosine transform is
performed on each block. Each resulting 8 by 8 block of DCT coefficients is quantized and the two-
dimensional block is scanned in a zig-zag order to convert it into a one-dimensional string of quantized DCT
coefficients. Fourthly, the side-information for the macroblock (mode, motion vectors etc) and the
quantized coefficient data are encoded. For maximum efficiency, a number of variable length code tables are
defined for the different data elements. Run-length coding is used for the quantized coefficient data.
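The zig-zag scan and the run-length step can be sketched as follows. The scan order is generated by walking the anti-diagonals of the block in alternating directions. The (run, level) pair representation and the function names are assumptions of the sketch; the variable length code tables themselves are not modelled.

```python
N = 8  # blocks are 8 by 8


def zigzag_order(n=N):
    """Return (row, column) positions in zig-zag order from the top-left corner.

    Positions are grouped by anti-diagonal (row + column); odd diagonals are
    walked with the row increasing, even diagonals with the row decreasing.
    """
    return sorted(
        ((r, c) for r in range(n) for c in range(n)),
        key=lambda rc: (rc[0] + rc[1],
                        rc[0] if (rc[0] + rc[1]) % 2 else -rc[0]),
    )


def run_level_pairs(block):
    """Convert a quantized block to (run of zeros, non-zero level) pairs."""
    pairs = []
    run = 0
    for r, c in zigzag_order(len(block)):
        level = block[r][c]
        if level == 0:
            run += 1
        else:
            pairs.append((run, level))
            run = 0
    return pairs  # trailing zeros are simply dropped in this sketch
```

Because the non-zero coefficients tend to cluster near the top-left corner after quantization, the scan produces a short list of pairs followed by a long run of zeros.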
`
`A consequence of using different picture types and variable length coding is that the overall data rate is
variable. In applications that involve a fixed-rate channel, a FIFO buffer may be used to match the encoder
output to the channel. The status of this buffer may be monitored to control the number of bits generated
by the encoder. Controlling the quantization process is the most direct way of controlling the bitrate. This
part of ISO/IEC 11172 specifies an abstract model of the buffering system (the Video Buffering Verifier) in
order to constrain the maximum variability in the number of bits that are used for a given picture. This
ensures that a bitstream can be decoded with a buffer of known size.
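The effect of a fixed-rate channel on a buffer of known size can be illustrated with a toy simulation. This is not the Video Buffering Verifier defined by this part of ISO/IEC 11172. The model (a decoder-side buffer that gains a fixed number of bits per picture period and loses the bits of one picture at a time), the function name and the parameter names are all assumptions of the sketch.

```python
def simulate_buffer(picture_bits, bits_per_picture_period, buffer_size,
                    initial_fullness):
    """Track buffer fullness (in bits) after each picture period.

    Raises ValueError if a picture is not yet completely in the buffer
    (underflow) or if the channel delivers more than the buffer can hold
    (overflow).
    """
    fullness = initial_fullness
    trace = []
    for bits in picture_bits:
        if bits > fullness:
            raise ValueError("underflow: picture not completely in the buffer")
        fullness -= bits                     # the picture is removed for decoding
        fullness += bits_per_picture_period  # the channel keeps filling the buffer
        if fullness > buffer_size:
            raise ValueError("overflow: buffer too small for this bitstream")
        trace.append(fullness)
    return trace
```

A large picture drains the buffer and small pictures let it refill, so an encoder that monitors such a trace can adjust the quantization before either limit is reached.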
`
`At this stage, the coded representation of the picture has been generated. The final step in the encoder is to
regenerate I-Pictures and P-Pictures by decoding the data so that they can be used as reference pictures for
subsequent encoding. The quantized coefficients are dequantized and an inverse 8 by 8 DCT is performed on
`each block. The prediction error signal produced is then added back to the prediction signal and limited to
`the required range to give a decoded reference picture.
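The last step, adding the prediction error back to the prediction and limiting the result, can be sketched as below. The range 0 to 255 is an assumption made for 8-bit samples, and the function name is hypothetical.

```python
def reconstruct(prediction, error, low=0, high=255):
    """Add the prediction error to the prediction and clamp each sample.

    `prediction` and `error` are lists of rows of integers of equal size.
    The default limits assume 8-bit samples.
    """
    return [
        [min(high, max(low, p + e)) for p, e in zip(prediction_row, error_row)]
        for prediction_row, error_row in zip(prediction, error)
    ]
```

The clamp matters because quantization noise in the error signal can push a sum slightly outside the range of valid sample values.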
`
`0.4 Decoding
`
`Decoding is the inverse of the encoding operation. It is considerably simpler than encoding as there is no
`need to perform motion estimation and there are many fewer options. The decoding process is defined by
this part of ISO/IEC 11172. The description that follows is a very brief overview of one possible way of
`decoding a bitstream. Other decoders with different architectures are possible. Figure 4 shows the main
`functional blocks.
`
[Figure 4: block diagram; recoverable labels: Quantizer step size, Picture store and Predictor,
Reconstructed output pictures]

where

DCT^-1 is inverse discrete cosine transform
Q^-1 is dequantization
MUX^-1 is demultiplexing
VLD is variable length decoding

Figure 4 - Basic video decoder block diagram
`
For fixed-rate applications, the channel fills a FIFO buffer at a constant rate with the coded bitstream. The
`decoder reads this buffer and decodes the data elements in the bitstream according to the defined syntax.
`
`As the decoder reads the bitstream, it identifies the start of a coded picture and then the type of the picture.
`It decodes each macroblock in the picture in turn. The macroblock type and the motion vectors, if present,
`are used to construct a prediction of the current macroblock based on past and future reference pictures that
`have been stored in the decoder. The coefficient data are decoded and dequantized. Each 8 by 8 block of
coefficient data is transformed by an inverse DCT (specified in annex A), and the result is added to the
`prediction signal and limited to the defined range.
`
After all the macroblocks in the picture have been processed, the picture has been reconstructed. If it is an
I-picture or a P-picture it is a reference picture for subsequent pictures and is stored, replacing the oldest stored
`reference picture. Before the pictures are displayed they may need to be re-ordered from the coded order to
`their natural display order. After reordering, the pictures are available, in digital form, for post-processing
`and display in any manner that the application chooses.
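The reordering from coded order back to display order can be sketched as follows: a B-picture is output as soon as it is decoded, while a reference picture is held until the next reference picture arrives. Pictures are represented as strings such as "I0" or "B1" (type letter followed by display index); this representation and the function name are assumptions made for the illustration.

```python
def coded_to_display(coded_order):
    """Reorder pictures from coded order to display order.

    Each picture is a string whose first character is its type: I, P or B.
    """
    display = []
    held_reference = None  # most recent I- or P-picture, not yet displayed
    for picture in coded_order:
        if picture[0] == "B":
            display.append(picture)  # B-pictures are displayed immediately
        else:
            if held_reference is not None:
                display.append(held_reference)
            held_reference = picture
    if held_reference is not None:
        display.append(held_reference)  # flush the last reference picture
    return display
```

For the coded order I0 P3 B1 B2 P6 B4 B5 this recovers the display order I0 B1 B2 P3 B4 B5 P6.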
`
0.5 Structure of the coded video bitstream

This part of ISO/IEC 11172 specifies a syntax for a coded video bitstream. This syntax contains six layers,
each of which supports either a signal processing or a system function:

Layers of the syntax        Function
Sequence layer              Random access unit: context
Group of pictures layer     Random access unit: video
Picture layer               Primary coding unit
Slice layer                 Resynchronization unit
Macroblock layer            Motion compensation unit
Block layer                 DCT unit

0.6 Features supported by the algorithm

Applications using compressed video on digital storage media need to be able to perform a number of
operations in addition to normal forward playback of the sequence. The coded bitstream has been designed
to support a number of these operations.
`
`0.6.1 Random access
`
`Random access is an essential feature for video on a storage medium. Random access requires that any
`picture can be decoded in a limited amount of time. It implies the existence of access points in the
bitstream, that is, segments of information that are identifiable and can be decoded without reference to other
segments of data. A spacing of two random access points (Intra-Pictures) per second can be achieved
`without significant loss of picture quality.
`
`0.6.2 Fast search
`
`Depending on the storage medium, it is possible to scan the access points in a coded bitstream (with the
help of an application-specific directory or other knowledge beyond the scope of this part of ISO/IEC
11172) to obtain a fast-forward and fast-reverse playback effect.
`
`0.6.3 Reverse playback
`
`Some applications may require the video signal to be played in reverse order. This can be achieved in a
`decoder by using memory to store entire groups of pictures after they have been decoded before being
`displayed in reverse order. An encoder can make this feature easier by reducing the length of groups of
`pictures.
`
`0.6.4 Error robustness
`
`Most digital storage media and communication channels are not error-free. Appropriate channel coding
`schemes should be used and are beyond the scope of this part of ISO/IEC 11172. Nevertheless the
`compression scheme defined in this part of ISO/IEC 11172 is robust to residual errors. The slice structure
`allows a decoder to recover after a data error and to resynchronize its decoding. Therefore, bit errors in the
`compressed data will cause errors in the decoded pictures to be limited in area. Decoders may be able to use
`concealment strategies to disguise these errors.
`
`0.6.5 Editing
`
There is a conflict between the requirement for high coding efficiency and easy editing. The coding structure
`and syntax have not been designed with the primary aim of simplifying editing at any picture. Nevertheless
`a number of features have been included that enable editing of coded data.
`
INTERNATIONAL STANDARD    ISO/IEC 11172-2: 1993 (E)

Information technology - Coding of moving
pictures and associated audio for digital storage
media at up to about 1,5 Mbit/s -
`
`Part 2:
`
`Video
`
`Section 1: General
`
1.1 Scope
`
`This part of ISO/IEC 11172 specifies the coded representation of video for digital storage media and
`specifies the decoding process. The representation supports normal speed forward playback, as well as
`special functions such as random access, fast forward playback, fast reverse playback, normal speed reverse
`playback, pause and still pictures. This part of ISO/IEC 11172 is compatible with standard 525- and 625-
`line television formats, and it provides flexibility for use with personal computer and workstation displays.
`
`ISO/IEC 11172 is primarily applicable to digital storage media supporting a continuous transfer rate up to
`about 1,5 Mbit/s, such as Compact Disc, Digital Audio Tape, and magnetic hard disks. Nevertheless it can
`be used more widely than this because of the generic approach taken. The storage media may be directly
connected to the decoder, or via communications means such as busses, LANs, or telecommunications
links. This part of ISO/IEC 11172 is intended for non-interlaced video formats having approximately 288
`lines of 352 pels and picture rates around 24 Hz to 30 Hz.
`
`1.2 Normative references
`
`The following International Standards contain provisions which, through reference in this text, constitute
`provisions of this part of ISO/IEC 11172. At the time of publication, the editions indicated were valid.
`All standards are subject to revision, and parties to agreements based on this part of ISO/IEC 11172 are
`encouraged to investigate the possibility of applying the most recent editions of the standards indicated
below. Members of IEC and ISO maintain registers of currently valid International Standards.
`
`ISO/IEC 11172-1:1993 Information technology — Coding of moving pictures and associated audio for digital
`storage media at up to about 1,5 Mbit/s - Part 1: Systems.
`
`ISO/IEC 11172-3:1993 Information technology — Coding of moving pictures and associated audio for digital
storage media at up to about 1,5 Mbit/s - Part 3: Audio.
`
`CCIR Recommendation 601-2 Encoding parameters of digital television for studios.
`
`CCIR Report 624-4 Characteristics of systems for monochrome and colour television.
`
`CCIR Recommendation 648 Recording of audio signals.
`
CCIR Report 955-2 Sound broadcasting by satellite for portable and mobile receivers, including Annex IV
Summary description of Advanced Digital System II.

CCITT Recommendation J.17 Pre-emphasis used on Sound-Programme Circuits.
`
IEEE Draft Standard P1180/D2 1990 Specification for the implementation of 8 x 8 inverse discrete cosine
transform.

IEC publication 908:1987 CD Digital Audio System.
`
`Section 2: Technical elements
`
`2.1 Definitions
`
`For the purposes of ISO/IEC 11172, the following definitions apply. If specific to a part, this is noted in
`square brackets.
`
`2.1.1 ac coefficient [video]: Any DCT coefficient for which the frequency in one or both dimensions
is non-zero.
`
2.1.2 access unit [system]: In the case of compressed audio an access unit is an audio access unit. In
the case of compressed video an access unit is the coded representation of a picture.
`
`2.1.3 adaptive segmentation [audio]: A subdivision of the digital representation of an audio signal
`in variable segments of time.
`
`2.1.4 adaptive bit allocation [audio]: The assignment of bits to subbands in a time and frequency
`varying fashion according to a psychoacoustic model.
`
`2.1.5 adaptive noise allocation [audio]: The assignment of coding noise to frequency bands in a
`time and frequency varying fashion according to a psychoacoustic model.
`
`2.1.6 alias [audio]: Mirrored signal component resulting from sub-Nyquist sampling.
`
`2.1.7 analysis filterbank [audio]: Filterbank in the encoder that transforms a broadband PCM audio
`signal into a set of subsampled subband samples.
`
2.1.8 audio access unit [audio]: For Layers I and II an audio access unit is defined as the smallest
part of the encoded bitstream which can be decoded by itself, where decoded means "fully reconstructed
`sound". For Layer III an audio access unit is part of the bitstream that is decodable with the use of
`previously acquired main information.
`
`2.1.9 audio buffer [audio]: A buffer in the system target decoder for storage of compressed audio data.
`
`2.1.10 audio sequence [audio]: A non-interrupted series of audio frames in which the following
`parameters are not changed:
`- ID
`- Layer
`- Sampling Frequency
- For Layer I and II: Bitrate index
`
`2.1.11 backward motion vector [video]: A motion vector that is used for motion compensation
`from a reference picture at a later time in display order.
`
2.1.12 Bark [audio]: Unit of critical band rate. The Bark scale is a non-linear mapping of the frequency
`scale over the audio range closely corresponding with the frequency selectivity of the human ear across the
`band.
`
`2.1.13 bidirectionally predictive-coded picture; B-picture [video]: A picture that is coded
`using motion compensated prediction from a past and/or future reference picture.
`
`2.1.14 bitrate: The rate at which the compressed bitstream is delivered from the storage medium to the
`input of a decoder.
`
`2.1.15 block companding [audio]: Normalizing of the digital representation of an audio signal
`within a certain time period.
`
2.1.16 block [video]: An 8-row by 8-column orthogonal block of pels.
`
`2.1.17 bound [audio]: The lowest subband in which intensity stereo coding is used.
`
2.1.18 byte aligned: A bit in a coded bitstream is byte-aligned if its position is a multiple of 8 bits
from the first bit in the stream.
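Read literally, the definition amounts to a test on the bit position counted from the first bit of the stream, with the first bit at position 0 (an assumption of this sketch). Both function names are hypothetical.

```python
def is_byte_aligned(bit_position):
    """True if the position is a multiple of 8 bits from the first bit."""
    return bit_position % 8 == 0


def bits_to_next_byte_boundary(bit_position):
    """Number of bits to skip to reach the next byte-aligned position.

    Returns 0 when the position is already byte-aligned.
    """
    return (-bit_position) % 8
```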
`
2.1.19 byte: Sequence of 8 bits.
`
`2.1.20 channel: A digital medium that stores or transports an ISO/IEC 11172 stream.
`
2.1.21 channel [audio]: The left and right channels of a stereo signal.
`
`2.1.22 chrominance (component) [video]: A matrix, block or single pel representing one of the
`two colour difference signals related to the primary colours in the manner defined in CCIR Rec 601. The
`symbols used for the colour difference signals are Cr and Cb.
`
`2.1.23 coded audio bitstream [audio]: A coded representation of an audio signal as specified in
`ISO/IEC 11172-3.
`
`2.1.24 coded video bitstream [video]: A coded representation of a series of one or more pictures as
`specified in this part of ISO/IEC 11172.
`
`2.1.25 coded order [video]: The order in which the pictures are stored and decoded. This order is not
`necessarily the same as the display order.
`
`2.1.26 coded representation: A data element as represented in its encoded form.
`
`2.1.27 coding parameters [video]: The set of user-definable parameters that characterize a coded video
`bitstream. Bitstreams are characterised by coding parameters. Decoders are characterised by the bitstreams
`that they are capable of decoding.
`
`2.1.28 component [video]: A matrix, block or single pel from one of the three matrices (luminance
`and two chrominance) that make up a picture.
`
`2.1.29 compression: Reduction in the number of bits used to represent an item of data.
`
`2.1.30 constant bitrate coded video [video]: A compressed video bitstream with a constant
`average bitrate.
`
`2.1.31 constant bitrate: Operation where the bitrate is constant from start to finish of the compressed
`bitstream.
`
`2.1.32 constrained parameters [video]: The values of the set of coding parameters defined in
`2.4.3.2.
`
`2.1.33 constrained system parameter stream (CSPS) [system]: An ISO/IEC 11172
`multiplexed stream for which the constraints defined in 2.4.6 of ISO/IEC 11172-1 apply.
`
`2.1.34 CRC: Cyclic redundancy code.
`
`2.1.35 critical band rate [audio]: Psychoacoustic function of frequency. At a given audible
`frequency it is proportional to the number of critical bands below that frequency. The units of the critical
`band rate scale are Barks.
`
`2.1.36 critical band [audio]: Psychoacoustic measure in the spectral domain which corresponds to the
`frequency selectivity of the human ear. This selectivity is expressed in Bark.
`
`2.1.37 data element: An item of data as represented before encoding and after decoding.
`
`2.1.38 dc-coefficient [video]: The DCT coefficient for which the frequency is zero in both
`dimensions.
`
`2.1.39 dc-coded picture; D-picture [video]: A picture that is coded using only information from
`itself. Of the DCT coefficients in the coded representation, only the dc-coefficients are present.
`
`2.1.40 DCT coefficient: The amplitude of a specific cosine basis function.
`
`2.1.41 decoded stream: The decoded reconstruction of a compressed bitstream.
`
2.1.42 decoder input buffer [video]: The first-in first-out (FIFO) buffer specified in the video
buffering verifier.
`
`2.1.43 decoder input rate [video]: The data rate specified in the video buffering verifier and encoded
`in the coded video bitstream.
`
`2.1.44 decoder: An embodiment of a decoding process.
`
`2.1.45 decoding (process): The process defined in ISO/IEC 11172 that reads an input coded bitstream
`and produces decoded pictures or audio samples.
`
`2.1.46 decoding time-stamp; DTS [system]: A field that may be present in a packet header that
`indicates the time that an access unit is decoded in the system target decoder.
`
`2.1.47 de-emphasis [audio]: Filtering applied to an audio signal after storage or transmission to undo
`a linear distortion due to emphasis.
`
`2.1.48 dequantization [video]: The process of rescaling the quantized DCT coefficients after their
representation in the bitstream has been decoded and before they are present