`STANDARD
`
`ISO/IEC
`11172-2
`
`First edition
`1993-08-01
`
`
`
`Information technology — Coding of
`moving pictures and associated audio for
`digital storage media at up to about
`1,5 Mbit/s —
`
`Part 2:
`Video
`
`Technologies de |'information — Codage de image animée et du son
`associé pour les supports de stockage numénque jusqu'a environ
`1,5 Mbit —
`Partie 2: Vidéo
`
` Tz
`
`erence number
`ISGnEe1 172-2: 1993(E)
`
`PAGE 1 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 1 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`ISOHEC 17172-2: 1993 (E)
`
`Contents
`
`Page
`
`Tvteedhocttion 2s poesia PL eeua eee oesA
`
`Section >pememrad 3.2 coc ave deat eerie de ee pe ere eed
`
`L.t
`
`SIGOPS eo tic cuties cces en cee waive eas aaa wet b
`
`a anew ewe on een ddaicwedinewereien verde see Revered aoe
`
`L.29Mroramative reference ic .c cess cece cee ee asses eeeeeee casi caasnennerterssneerercereneeisseesead
`
`Section 2:
`
`‘Techical cherietiti. oc ccenneecenacagreccenceesawnencansasesdenennmerecees coeescenpenteind 3
`
`2.1
`
`DeEB esses eee peeecta gaat see ceed hte heel eens ener
`
`2.2
`
`Symbols and abbreviations... 00.2202... ccc cece cn ener eeeennaeeeeeeeeeeeeeeee LT
`
`2.3 Method of describing bitstream synlan..ol cuscenensseseeneeseneeeeeees 13
`
`2.4
`
`Peecquiremiieinbeyoe ae eeEe ieee
`
`Annexes
`
`A
`
`B
`
`c
`
`EB
`
`F
`
`& by @ Inverse discrete cosine transform .....0...0..2222-2. ccs aseeee scsi sseeeeeeee eee oD
`
`Variable length code tables... .....0cc00cis cscs ee cccevacasaaasecensinensaaeeerseeretereeee ss 40
`
`Video buffering: Verifier ; ....0.00.c00ceredceeeavenscseeriereaddedeuteebee beets deeeeetetereen oh
`
`BDOgrapyry csisciwce ce ewe pe pieagawadas os ec ee ee ecasaveseee ges sine eeeeeny ceenrEegertac LOB
`
`List) of patent bolder... .ccc.cecesseenirnenns ce ey eee ces eg poner teed seens saeeeraerasenaens 109
`
`®ISOWIEC 1999
`All rights reserved. No part of this publication may be reproduced or utilized im any fonm or by
`any means, electronic or mechanical, including photocopying and microfilm, without
`permission in writing from the publisher.
`
`ISQVIEC Copyright Office = Case Postale 56 *CH1211 Genéve 20 + Switnerland
`
`Printed in Switzerland.
`
`PAGE 2 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 2 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`® ISONEG
`
`ISO/NEC 11172-2: 1993 (E)
`
`Foreword
`
`ISO (the International Organization for Standardization) and IEC (the Inter-
`national Electrotechnical Commission) form the specialized system for
`worldwide standardization. National bodies that are members of ISO or
`IEC participate in the development of
`International Standards through
`technical committees established by the respective organization to deal
`with particular fields of technical activity,
`ISQ and IEC technical cam-
`mittees collaborate in fields of mutual interest. Other international organ-
`izations, governmental and non-goverme:ntal, in liaison with ISO and IEC,
`also take part in the work.
`
`In the field of information technology, [SO and IEC have established a joint
`technical committee, ISO/IEC JTC 1. Draft International Standards adopted
`by the joint technical committee are circulated to national bodies for wot-
`ing. Publication as an International Standard requires approval by at least
`75% of the national bodies casting a vote.
`
`International Standard ISO/IEC 11172-2 was prepared by Joint Technical
`Committee ISOJEC JTC 1, Information technology, Sub-Committee SC 29,
`Coded representation of audio, picture, multimedia and hypermedia infor-
`mation,
`
`ISOVIEC 11172 consists of the following parts, under the general title in-
`formation technology — Coding of moving pictures and associated audio
`for digital storage media at up to about 1,5 Mbit/s:
`
`— Part 1: Systems
`
`— Pat 2: Video
`
`— Part 3: Audio
`
`— Pant 4: Compliance testing
`
`Annexes 4, B and C form an integral part of this part of ISOWIEC 11172.
`Annexes D, E and F are for information only,
`
`PAGE 3 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 3 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`ISOVIEC 11172-2: 1993 (E)
`
`© ISO/IEC
`
`Introduction
`
`Note -- Readers interested in an overview of the MPEG Video layer should read this Introduction and then
`
`proceed to annex D, before returning to clauses 1 and 2.
`
`0.1
`
`Purpose
`
`This pant of ISQVIEC 11172 was developed im response to the growing need for a Common format for
`representing Compressed video on various digital storage media such as CDs, DATs, Winchester disks and
`optical drives. This part of [SO/DEC 11172 specifies a coded representation that can be used for
`compressing video sequences to bitrates around 1.5 Mbit's. The use of this part of ISO/IEC 11172 means
`that motion video can be manipulated as a fonn of computer data and can be transmitted and received over
`existing and future networks. The coded represcntation can be used with both 625-line and 525-line
`television and provides flexibility for use with workstation and personal computer displays.
`
`This pant of ISO/TEC 11172 was developed to operate principally from storage media offering a continuous
`transfer rate of about 1,5 Mbit/s. Nevertheless it can be used morc widely than this because the approach
`taken is. generic.
`
`0.1.1 Coding parameters
`
`The intention in developing this part of ISO/IEC 11172 has been to define a source coding algorithm with a
`large degree of flexibility that can be used in many different applications. To achieve this goal, a numberof
`the parameters defining the characteristics of coded bitstreams and decoders are contained in the bitstream.
`itself, This allows for example, the algorithm to be used for pictures with a variety of sizes and aspect
`ratios and on channels or devices operating at a wide mnge of bitrates.
`
`Because of the large range of the characteristics of bitstreams that can be represented by this part of ISO/IEC
`11172, a sub-set of these coding parameters known as the "Constrained Parameters” has been defined. The
`aim in defining the constrained parameters is to offer guidance about a widely useful range of parameters.
`Conforming to this set of constraints is not a requirement of this part of ISOVIEC 11172. A flag in the
`bitstream indicates whether or not it is a Constrained Parameters bitstream.
`
`Summary of the Constrained Parameters:
`
`Less than or
`
`equal to 768 pels
`
`
`
`f_code<=4 (s
`
`Mation vector range
`
`Less than -64 to +63,5 pels (using half-pel vectors)
`
`0.2 Overview of the algorithm
`
`The coded representation defined in this part of ISO/IEC 11172 achieves a high compression ratio while
`preserving good picture quality. The algorithm is not lossless as the cxact pel values are not preserved
`during coding. The choice of the techniques is based on the need to balance a high picture quality and
`compression ratio with the requirement to make random access to the coded bitstream. Obtaining good
`picture quality at the bitrates of interest demands a very high compression ratio, which is not achievable
`with inttaframe coding alone. “The need for random access, however, is best satisfied with pure intraframe
`coding. This requires a carcful balance between imtra- and interframe coding and between recursive and non-
`recursive temporal redundancy reduction,
`
`PAGE 4 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 4 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`@ ISONEC
`
`ISONEG 11172-2: 1993 (E)
`
`A number of techniques ane used to achieve a high compression ratio, The first, which is almost
`independent from this part of ISQOVIEC 11172,is to select an appropriate spatial resolution for the signal.
`The algorithm then uses block-based motion compensation to reduce the temporal redundancy. Motion
`compensation is used for causal prediction ofthe current picture froma previous picture, for non-causal
`prediction of the current picture from a future picture, or for interpolative prediction from past and future
`pictures. Motion vectors are defined for each 16-pel by 16-line region of the picture. The difference signal,
`the prediction error, is further compressed using the discrete cosine transform (DCT) to remove spatial
`correlation before it is quantized in an irreversible process that discards the less important information.
`Finally, the motion vectors are combined with the DCT information, and coded using variable length codes.
`
`0.2.1 Temporal processing
`
`Because of the conflicting requirements of random access and highly efficient compression, three main
`picture types are defined. Intra-coded pictures (1-Pictures) are coded without reference to other pictures:
`They provide access points to the coded sequence where decoding can begin, but are coded with only a
`moderate compression ratio. Predictive coded pictures (P-Pictures) are coded more efficiently using motion
`COmpensaled prediction from a past intra or predictive coded picture and are penerally used as a reference for
`further prediction. Bidirectionally-predictive coded pictures (B-Pictures) provide the highest degree of
`compression but require both past and future reference pictures for motion compensation. Bidirectionally-
`predictive coded pictures are never used as references for prediction. The organisation of the three picture
`types in a sequence is very flexible. The choiceis left to the encoder and will depend on the requirements of
`the application. Figure 1 illustrates the relationship between the three different picture types.
`
`Bi-directional
`
`Prediction Prediction
`
`Figure 1
`
`-- Example of temporal picture structure
`
`The fourth picture type defined in this part of ISO/IEC 11172, the D-picture, is provided to allow a simple,
`but limited quality, fast-forward playback mode,
`
`0.2.2 Motion representation - macroblocks
`
`The choice of 16 by 16 macroblocks for the motion-compensation unit is a result of the tmde-off between
`increasing the coding efficiency provided by using motion information and the overhead needed to store it,
`Each macroblock can be one of a numberof different types. For example, intra-coded, forward-predictive-
`coded, backward-predictive coded, and bidirectionally-predictive-coded macroblocks are permitted in
`bidircctionally-predictive coded pictures. Depending on the type of the macroblock, motion vector
`information and other side information are stored with the compressed prediction error signal in each
`macroblock, The motion vectors are encoded differentially with respect to the last coded motion vector,
`using variable-length codes, The maximum length of the vectors that may be represented can be
`programmed, on a picture-by-picture basis, so that the most demanding applications can be met without
`compromising the performance of the system in more nonmal situations,
`
`It is the responsibility of the encoder to calculate appropriate motion vectors, This part of ISO/IEC 11172
`dots not specify bow this should be done.
`
`PAGE 5 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 5 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`ISOVIEC 11172-2: 1993 (E)
`
`© ISOAEC
`
`0.2.3 Spatial redundancy reduction
`
`Both original pictures and prediction error signals have high spatial redundancy. This pant of ISO/TEC
`11172 uses a block-based DCT method with visually weighted quantization and nm-length coding. Each &
`by 8 block of the original picture for intra-coded macroblocks or of the prediction error for predictive-coded
`macroblocks is transformed into the DCT domain where it is scaled before being quantized. After
`quantization many of the coefficients are zero in value and so two-dimensional run-length and variable
`length coding is used to encode the remaining coefficients efficiently.
`
`0.3 Encoding
`
`It specifies the syntax and semantics of
`This pant af ISO/TEC 11172 does not specify an encoding process.
`the bitstream and the signal processing in the decoder. As aresult, many options are left open to encoders
`to trade-off cost and speed against picture quality and coding efficiency. This clause is a bnef description of
`
`Source input pictures
`
`the functions that need to be perfonmed by an encoder. Figure 2 shows the main functional blocks.
`
`
`STO}S3AWOT)OPY
`
`
`where
`
`DXxCT is discrete cosine transform
`DCT-! is inverse discrete cosine wansform
`Q is quantization
`Q°! is dequantization
`VLC is variable length coding
`
`Figure 2 -- Simplified video encoder block diagram
`
`The input video signal must be digilized and represented as a luminance and two colour difference signals
`CY, Cp, Cy}. This may be followed by preprocessing and format conversionto select an appropriate
`window, resolution and input format. This part of ISOVIEC 11172 requires that the colour difference
`signals (Cp and C,) are subsampled with respect to the luminance by 2:1 in both vertical and horizontal
`directions and are reformatied, if necessary, as 4 non-interlaced signal.
`
`The encoder must choose which picture type to use for each picture. Having defined the picture types, the
`encoder estimates motion vectors for each 16 by 16 macroblock in the picture.
`In P-Pictures one vector is
`needed for cach non-intra macroblock and in B-Pictures one or two vectors are needed.
`
`If B-Pictures are used, some reordering of the picture sequence is necessary before encoding, Because B-
`Pictures are coded using bidirectional motion compensated prediction, they can only be decoded after the
`subsequent reference picture (an Tor P-Picture) has been decoded, Therefore the pictures are reordered by the
`
`wi
`
`PAGE 6 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 6 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`© ISONEG
`
`ISOMEG 11172-2: 1993 (E)
`
`encoder so that the pictures arrive at the decoder in the order for decoding. The correct display order is
`
`The basic unit of coding within a picture is the macroblock, Within each picture, macroblocks are encoded
`in sequence, left to right, top to bottom. Each macroblock consists of six & by & blocks: four blocks of
`luminanec, one block of Ch chrominance, and one block of Cr chrominance. See figure 3. Note that the
`Picture area covered by the four blocks of luminance is the same as the area covered by cach of the
`chrominance blocks. This is due to subsampling of the chrominance information to match the sensitivity of
`the human visual system,
`
`fo fiyjf]
`
`¥
`
`Cb
`
`Cr
`
`Figure 3 -- Macroblock structure
`
`[t depends on the picture type, the
`Firstly, fora given macroblock, the coding mode is chosen.
`effectivencss of motion compensated prediction in that local region, and the nature of the signal within the
`block. Secondly, depending on the coding mode, a motion compensated prediction of the contents of the
`block based on past and/or future reference pictures is fonmed. This prediction is subtracted from the actual
`data in the current macroblock to fonm an error signal. Thirdly, this error signal is separated imto & by 8
`blocks (4 luminance and 2 chrominance blocks in each macroblock) and a discrete cosine wansform is
`performed on each block. Each resulting § by 8 block of DCT coefficients is quantized and the two-
`dimensional block is scanned in a zig-zag order to convert it into a one-dimensional string of quantized DCT
`coefficients, Fourthly, the side-infonmation for the macroblock (mode, motion vectors elc) and the
`quantized coefficient data are encoded. Por maximum efficiency, a number of variable length code tables are
`defined) for the different daw elements. Rum-length coding is used for tre quantized coefficient data.
`
`A consequence of using different picture types and variable length coding is that the overall data rate is
`variable, In applications that involve a fixed-rate channel, a FIFO butter may be used to match the encoder
`output to the channel. The staws of this buffer may be monitored to control the number of bits generated
`by the encoder. Controlling the quantization process is the most direct way of controlling the bitrate. This
`part of ISOVIEC 11172 specifies an abstract model of the buffering system (the Video Buffering Verifier) in
`order to constrain the maximum variability in the number of bits that are used for a given picture, This
`ensures that a bitstream can be decoded with a buffer of known size.
`
`At this stage, the coded representation of the picture has been generated. The final step in the encoder is to
`regenerate [-Pictures and P-Pictures by decoding the data so that they can be used as reference pictures for
`subsequent encoding. The quantized coefficients are dequantized and an inverse 8 by 8 DCT is performed on
`each block. The prediction error signal produced is then added back to the prediction signal and limited to
`the required range to give a decoded reference picture.
`
`0.4 Decoding
`
`It is considerably simpler than encoding as there is no
`Decoding ts the inverse of the encoding operation.
`need to perfonn motion estimation and there are many fewer options. The decoding process is defined by
`this part of ISO/IEC 11172. The deseription that follows is a very brief overview of one possible way of
`decoding a bitstream. Other decoders with different architectures are possible. Figure 4 shows the main
`functional blocks.
`
`PAGE 7 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`wii
`
`PAGE 7 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`ISOMEG 11172-2: 1993 (E)
`
`© ISO/IEC
`
`e
`
`Coded wicleo:
`bitstream
`
`
`
`ete 1
`pox}
`e[vis {ot
`
`Motion Vectors
`
`ee
`
`Picture
`
`@
`
`Picture store:
`
`anid
`Predictor
`
`
`‘
`
`
`Reconstructed
`cunt put pictures
`
`Where
`
`DCT-|
`ql
`MUX-!
`VLD
`
`is inverse discrete cosine wansform
`is dequantization
`is demultiplexing
`is variable length decoding
`
`Figure 4 -- Basic video decoder block diagram
`
`For fixed-rate applications, the channel fills a FIFO buffer at a constant rate with the coded bitstream.
`decoder reads this butter and decodes the data elements in the bitstream acconling to the defined syntax.
`
`‘The
`
`As the decoder reads the bitstream, it identifies the start of a coded picture and then the type of the picture,
`It decodes each macroblock in the picture in tum. The macroblock type and the motion vectors, if present,
`are used to construct a prediction of the current macroblock based on past and future reference pictures that
`have been stored in the decoder. The coefficient data are decoded and dequantized. Each § by 8 block of
`coefficient data is transformed by an inverse DCT (specified in annex A), and the result is added to the
`prediction signal and limited to the defined range.
`
`After all the macroblocks in the picture have been processed. the picture has been reconstructed. If it is an I-
`picture or a P-picture il is a reference picture for subsequent pictures and is stored, replacing the oldest stored
`reference picture. Before the pictures are displayed they may need to be re-ordered from the coded order to
`their natural display order. After reordering, the pictures are available, in digital form, for post-processing
`and display in any manner that the application chooses.
`
`0.5
`
`Structure of the coded video bitstream
`
`This part of ISQVIEC 11172 specifies a syntax fora coded video bitsream. This syntax contains six layers,
`each of which either supports a signal processing ora system function:
`
`
`
`
`Layers of the syntax [|Function___
`SEESS———————
`Sequence layer
`Random access unit! COmlEXL
`
`
`
`Group of pictures layer
`Random access unit: video
`
`
`
`Picture layer
`Primary coding unit
`
`
`Kesynchronization unit
`Slice layer
`
`
`Macroblock layer
`Motion compensation unit Block layer
`
`
`CxlST unit
`
`
`
`
`0.6
`
`Features supported by the algorithm
`
`Applications using compressed video on digital storage media need vo be able to perform a number of
`operations in addition to normal forward playback of the sequence, The coded bitstream has been designed
`to SUppOrt a number of these operations.
`
`vill
`
`PAGE 8 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 8 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`© ISONEC
`
`ISO/IEC 11172-2: 1993 (E)
`
`0.6.1 Random access
`
`Random access is an essential feature for video on a storage medium. Random access requires that any
`picture can be decoded in a limited amount of time.
`[t implies the existence of access points in the
`biistream - that is segments of information that are identifiable and can be decoded without reference to other
`segments of data. A spacing of bwo random access points (Intra-Pictures) per second can be achieved
`without significant loss of picture quality.
`
`0.6.2 Fast search
`
`Depending on the storage medium, it is possible to scan the access points in a ceded bitstream (with the
`help of an application-specific directory or other knowledge beyond the scope of this part of ISO/IEC
`11172) to obtain a fast-forward and fast-reverse playback effect.
`
`0.6.3 Reverse playback
`
`Some applications may require the video signal to be played in reverse onder. This can be achieved in a
`decoder by using Memory to store entire groups of pictures atter they have heen decoded before being
`displayed in reverse onder, An encoder can make this feature easier by reducing the length of groups of
`Pictures.
`
`0.6.4 Error robustness
`
`Mostdigital storage media and communication channels are not error-free. Appropriate channel coding
`schemes should be used and are beyond the scope of this part of ISOVIEC 11172. Nevertheless the
`compression scheme defined in this part of ISQO/TEC 11172 is robust to residual errors. The slice structure
`allows a decoder to recover after a data error and to resynchronize its decoding. Therefore, bit errors in the
`compressed data will cause errors in the decoded pictures to be limited in area. Decoders may be able to use
`concealment strategies to disguise these errors.
`
`0.6.5 Editing
`
`There is a conflict between the requirement for high coding efficiency and easy editing. The coxling structure
`and syntax have not been designed with the primary aim of simplifying editing at any picture, Nevertheless
`a number of features have been included that enable editing of coded data.
`
`PAGE 9 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 9 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`This page intentionally left blank
`
`PAGE 10 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 10 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`
`
`ISOVIEG 11172-2: 1993 (E}
`INTERNATIONAL STANDARD © ISONEC
`
`
`Information technology — Coding of moving
`pictures and associated audio for digital storage
`media at up to about 1,5 Mbit/s —
`
`Part 2:
`Video
`
`Section 1: General
`
`1.1
`
`Scope
`
`This part of ISO/IEC 11172 specifies the coded representation of video for digital storage media and
`specifies the decoding process. The representation supports wormal speed forward playback, as well as
`special functions such as random access, fast forward playback, fast reverse playback, nonnal speed reverse
`playback, pause and still pictures. This part of ISO/IEC 11172 is compatible with standard 525- and 625-
`line television formats, and it provides flextbility for use with personal computer and workstation displays.
`
`ISQ/TEC 11172 is primarily applicable to digital storage media supporting a continuous transfer rate up to
`about 1,5 Mbit/s, such as Compact Disc, Digital Audio Tape, and magnetic hard disks. Nevertheless it can
`be used more widely than this because of the generic approach taken. The storage media may be directly
`conmected to the decoder, or via communications means such as busses, LANs, or telecommunications
`links. This part of ISO/IEC 11172 is intended for non-interlaced video formats having approximately 288
`lines of 352 pels and picture mies around 24 Hz to 30 Hz.
`
`1.2 Normative references
`
`The following Imemational Simdarnds contain provisions which, through reference in this text, constitute
`provisions of this part of ISQVIEC 11172. At the time of publication, the editions indicated were valid.
`All standards are subject to revision, and parties to agreements based on this part of ISO/IEC 11172 are
`encouraged to investigate the possibility of applying the most recent editions of the standards indicated
`below. Members of IEC and [SO maintain registers of currently valid International Standards.
`
`ISOVIEC 11172-1:1993 Information technology - Coding ofmoving pictures and associated audiofor digital
`Storage media at up fo about /5 Mibip's - Part J: Systems.
`
`ISOMEC 11172-3:1993 Information technology - Coding ofmoving pictures and associated audiofor digital
`storage media at up to about I,5 Mbit/s - Part 3 Audio.
`
`CCIR Recommendation 601-2 Encoding parameters afdigital televisionfor studios,
`
`CCIR Report 624-4 Characteristics of systemsfor monochrome and colour television,
`
`CCIR Recommendation 645 Recording of audia signals,
`
`CCIR. Report 955-2 Sound broadcasting by satellitefor portable and mobile receivers, including Annex IV
`Sununary description of Advanced Digital System If,
`
`CCITT Recommendation J.17 Pre-emphasis used on Sound-Programme Circuits,
`
`PAGE 11 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 11 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`ISQVIEG 11172-2: 1993 (E)
`
`© ISO/MEC
`
`TREE Draft Standard P1IS0/D? 1990 Specification for the implementation of 8x & inverse discrete cosine
`transform”,
`
`IEC publication 908:1987 CD Disital Auwaio System.
`
`PAGE 12 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 12 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`© ISO/IEC
`
`ISONEC 11172-2: 1993 (E)
`
`Section 2: Technical elements
`
`2.1 Definitions
`
`For the purposes of ISO/IEC 11172, the following definitions apply. If specific to a part, this is noted in
`square brackets.
`
`2.1.1 ac coefficient [video]: Any DCT coefficient for which the frequency in one or both dimensions
`is Non-7ero.
`
`In the case of compressed audio an access unit is an audio access unit.
`2.1.2 access unit [system]:
`the case of compressed video an access unit is the coded representation of a picture,
`
`In
`
`2.1.3 adaptive segmentation [audio]: A subdivision of the digital representation of an audio signal
`in variable segments of time,
`
`2.1.4 adaptive bit allocation [audio]; The assignment of bits to subbands in a time and frequency
`varying fashion according to a psychoacoustic model.
`
`2.1.5 adaptive noise allocation [audio]: The assignment of coding noise to frequency bands in a
`ime and frequency varying fashion according to a psychoacoustic model.
`
`2.1.6 alias [audio]: Mirrored signal component resulting from sub-NWyquist sampling.
`
`2.1.7 analysis filterbank [audio]: Filterbank in the encoder that transforms a broadband PCM audio
`signal into a set of subsampled subband samples.
`
`2.1.8 audio access unit [audio]: For Layers I and Hf an audio access unit is defined as the smallest
`part of the encoded bitstream which can be decoded by itself, where decoded means "fully reconstructed
`sound". For Layer II an audio access unit is part of the bitstream that is decodable with the use of
`previously acquired main infomation.
`
`2.1.9 audio buffer [audio]; A buffer in the system target decoder for storage of compressed audio data.
`
`2.1.10 audio sequence [audio]: A non-interrupted series of audio frames in which the following
`Parameters are not changed:
`- 1D
`< Layer
`- Sampling Frequency
`- For Layer T and Tl: Bitrate index
`
`2.1.11 backward motion vector [video]: A motion vector thal is used for motion compensation
`from a reference picture at a later ime in display order.
`
`2.1.12 Bark [audio]; Unit of critical band rate. The Bark scale is a non-linear mapping of the frequency
`scale over the audio range closely corresponding with the frequency selectivity of the human éar across the
`bared.
`
`2.1.13 bidirectionally predictive-coded picture; B-picture [video]: A picture that is coded
`using Motion compensated prediction from a pase and/or tune reference picture.
`
`‘The rate at which the compressed bitstream is delivered from the storage medium bo the
`2.1.14 bitrate:
`input of a decoder.
`
`2.1.15 block companding [audio]; Nonnalizing of the digital representation of an audio signal
`within a certain ime period.
`
`2.1.16 block [video]: An 8-row by 8-column orthogonal block of pels.
`
`2.1.17 bound [audio]: The lowest subband in which intensity stereo coding is used.
`
`PAGE 13 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 13 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`ISQIEC 11172-2; 1993 (E)
`
`® SGiEc
`
`2.1.18 byte aligned: A bit in a coded bitstream is byte-aligned if its position is a multiple of 8-bits
`from the first bil in the stream.
`
`2.1.19 byte: Sequence of 8-bits.
`
`2.1.20 channel: A digital medium that stores or transports an ISO/IEC 11172 stream,
`
`2.1.21 channel [audio]: The left and right channels of a stereo signal
`
`2.1.22 chrominance (component) [video]; A matrix, block or single pel representing one of the
`two colour difference signals related to the primary colours in the manner defined in CCIR Rec 601, The
`symbols used for the colour difference signals are Cr and Ch.
`
`2.1.23 coded audio bitstream [audio]: A coded representation of an audio signal as specified in
`ISCWIEC 11172-3.
`
`2.1.24 coded video bitstream [video]: A coded representation of a series of one of More pictures as
`specified in this part of ISQVIEC 11172.
`
`2.1.25 coded order [video]: The order in which the pictures are stored and decoded, This order is not
`necessarily the same as the display order.
`
`2.1.26 coded representation: A data element as represented in its encoded form,
`
`2.1.27 coding parameters [video]; The set of user-definable parameters that characterize a coded video
`bitstream, Bitstreams are characterised by coding parameters. Decoders are characterised by the bitstreams
`that they are capable of decoding.
`
`2.1.28 component [video]: A matrix, block or single pel from one of the three matrices (luminance
`and two chrominance) that make up a picuurc.
`
`2.1.29 compression: Reduction in the number of bits used to represent an item of cata.
`
`2.1.30 constant bitrate coded video [video]: A compressed video bitstream with a constant
`average bitrate.
`
`2.1.31 constant bitrate: Operation where the bitrate is constant from start to finish of the compressed
`bitstream.
`
`2.1.32 constrained parameters |video|:
`2.4.3.2.
`
`‘The values of the set of coding parameters defined in
`
`2.1.33 constrained system parameter stream (CSPS) [system]: An [ISO/IEC 11172
`multiplexed stream for which the constraints defined in 2.4.6 of ISO/IEC 11172-1 apply.
`
`2.1.34 CRC: Cyclic redundancy code.
`
`2.1.35 critical band rate [audio]: Psychoacoustic function of frequency. At a given audiblic
`frequencyit is proportional to the number of critical bands below that frequency. The units of the critical
`band rate scale are Barks.
`
`2.1.36 critical band [audio]: Psychoacoustic measure in the spectral domain which corresponds to the
`frequency selectivity of the human ear. This selectivity is expressed in Bark.
`
`2.1.37 data element; An item of data as represented before encoding and after decoding.
`
`2.1.38 de-coefficient [video]: The DCT coefficient for which the frequency is zero in both
`dimensions,
`
`PAGE 14 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`PAGE 14 of 124
`
`PETITIONERS' EXHIBIT 1019
`
`
`
`© ISOMEG
`
`ISOMEG 11172-2: 1993 (E)
`
`2.1.39 de-coded picture; D-picture [video]: A picturc that is coded using only information from
`itself. Of the DCT coefficients in the coded representation, only the dc-coefficients are present.
`
`2.1.40 DCTcoefficient: The amplitude of a specific cosine basis function.
`
`2.1.41 decoded stream: The decoded reconstruction of a compressed bitstream.
`
`2.1.42 decoder input buffer [video]: The first-im first-out
`buffering verifier.
`
`(FIFO) buffer specified in the video
`
`2.1.43 decoder input rate [wideo|: The data rate specified in the video buffering verifier and encoded
`in the coded video bitstream.
`
`2.1.44 decoder: An embodiment of a decoding process.
`
`2.1.45 decoding (process): The process defined in ISO/IEC 11172 that reads an input coded bitstream
`and produces decoded pictures or audio samples.
`
`2.1.46 decoding time-stamp; DTS [system]: A field that may be present in a packet header that
`indicates the time that an access unit is decoded in the system target decoder.
`
`2.1.47 de-emphasis [audio]: Filtering applicd to an audio signal after storage or transmission to und
`a linear distortion due to emphasis.
`
`2.1.45 dequantization [video]: The process of rescaling the quantized DCT coefficients after their
`representation in the bitstream has been decoded and before they are presented to the inverse LaCT,
`
`2.1.49 digital storage media; DSM: A digital storage or transmission device or system.
`
`2.1.50 discrete cosine transform; [CT [video]: Either the forward discrete cosine tansform or the
`inverse discrete cosine transform, The DCTis an invertible, discrete orthogonal transfonnation, The
`inverse DWCT is defined in annex A.
`
`2.1.51 display order [video]: The order in which the decoded pictures should be displayed. Nonmally
`this is Ue same onder in which they were presented al the inpul of the encoder,
`
`2.1.52 dual channel mode [audio]: A mode, where two audio channels with independent programme
`contents (e.g. bilingual) are encoded within one bitstream. The coding process is the same as for the stereo
`mode.
`
`2.1.53 editing: The process by which one or more compressed bitstreams are manipulated to produce a
`new compressed hitstreaam. Contonni