`STANDARD
`
`ISO/IEC
`11172-2
`
`First edition
`1993-08-91
`
`Information technology — Coding of
`moving pictures and associated audio for
`digital storage media at up to about
`1,5 Mbit/s —
`
`Part 2:
`Video
`
`Technologies de liniormation —- Coaage de l'image animée et du son
`associé pour les supports de stcckage numérique jusqu’3 environ
`15 Mbit/s —
`Partie 2° Vidéo
`
`
`
`PUMAExhibit 2005
`Apple v. PUMA, IPR2016-01135
`cterends ofled
`ISO/IEC 11172-2 1£93]E}
`
`Apple Exhibit 1009
`_ Page 1 of 124
`
`Apple Exhibit 1009
`Page 1 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`1 of 124
`
`
`
`
`
`ISO/IEC 11172-2: 1993 (E}
`
`Contents
`
`Page
`
`Poseword «2.0200 .22c eeeee cere cere eree nee See ered re ee ee tee ee enn sane tate Sine nee pen gees nce eeeii
`
`EmtcOGUCHION. ees cccecevee se tent eter ect e eee eee tee eeeesee ce ee sneceeeanapeeccesurestnegureacsemneecneanee]¥
`
`Sectien 1: General 00.0.2... clee ce ce ce ene cee ec te cence eee etna eee econ en eeeee sreetnee wanna 1
`
`Leb
`
`SEOpe nee c eee eece cee ce nc cee cee concer ee cea ceee ca cete ene aeeaceeaerseseeeneeeeecreeeetctenereed 1
`
`1.2
`
`Normative references...0..00. 020.2 eee eee een cere eenee nee etecereareaecsetesereemerr ed
`
`Section 2: Technical elements. 2.20...e cece cence cence eee eee ee tone ener nem ent cavern ens
`
`2.)
`
`Definitions 22...eee ee ee edie ence ce cere nee cere cone seen aee nce t erence eee etteneetetseae need
`
`2.2
`
`Symbols -and ABDTEVIALIODS 0.2 -ce eee esses teececteetcree ence mee iter caeeeene EL
`
`2.3 Method of describing bitstream syntax... eee cece cece renee nee 13
`
`2.4
`
`Require ments... 22...cece ccc cee ce ene nce cern ces c eee esa cance ene ees teres sseeteceteeeeeee 1S
`
`Annexes
`
`A
`
`B
`
`Cc
`
`dD
`E
`
`8 by § Inverse discrete cosine transform ....... 2.0... eee cee te ec cee eee cee cee enee tees
`
`id io)
`
`Variable length code tables ....0.20.2.<cc:c0ccsececsvuecsseeteececcseeqrerieeesereeneee 40
`
`Video buffering VeUiRCLscccesscesesecctvccesesssseucvesuseautiissssatecssns49
`
`Guide to encoding video ............ 20022000 ec ee cee cee ce eee ee cece eee ee teeteeneneree ns DL
`BibWography sossccessccsssoceceecserececesteesesueeaueeinresescal8
`
`F
`
`List cf patent Holders... ccc sense rere rcnerenes se eeareasvenarnegeenaesaina sas 109
`
`© ISOAEC 1993
`All rights reserved. No part of this publication may be reproduced o7 utilized in any ferm or by
`any means, electronic or mechanical, including photocopying and microfilm, without
`pemnission in writing from the publisher.
`
`PUMAExhibit 2005
`Apple v. PUMA, IPR2016-01135
`ISOAEC Copyright Office + Case Postale $6 + CH1211 Gentve 20 « Switzerland
`2 of 124
`Printed in Switzerland.
`
`li
`
`Apple Exhibit 1009
`Page 2 of 124
`
`
`
`
`
`
`
`Apple Exhibit 1009
`Page 2 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`2 of 124
`
`
`
`© ISCIIEC
`
`ISO/IEC 11172-2: 1993 (E)
`
`Foreword
`
`ISO tthe International O-ganization for Standardization) and IEC {the Inter-
`nationa. Electrotechnical Commission) form the specialized system for
`werldwide standardization. National bodies that are members of
`!SO or
`IEC participate in the development of Imematicnal Stencards through
`techhical committees esteblished by the respective orgenization ‘to deal
`with particular fields of techincal activity.
`[SO and IEC technical com-
`mittees collaborate in fields of mutual interest. Other international organ-
`izaticns, governmental and nor-gevernmental,in liaison with ISO ard IEC,
`also take part n the work.
`
`In the field of information technology, ISO and IEC have established a joint
`technical committee, ‘SO/IEC JTC 1. Daft International Standards adopted
`by the joint technical commictee are circulated to national bodies for vot-
`ing. Publication as an Internetional Standard requires app’oval by at least
`75% o* the national bodies casting 4 vote.
`
`Intemational Standarc iSO/IEC 11172-2 was orepared by Joint Technical
`_ Cemmittee 'SO/IEC JTC 1, information technology, Sub-Committee SC 29,
`Coded representationof audio, picture, multimediaand hypermedia infcr-
`mation.
`iISO/EC 11172 consists of the followingparts, under thegeneral title in-
`formation technology —Coding of moving pictures and associated audio
`for cigital storage media af up to about 1,5 Mbivs:
`— Part 7: Systems
`
`— Part 2: Video
`
`— Part3: Audio
`
`
`
`— Part4: Compliance testing
`Annexes A, B and C fom‘an integral part of this part of ISOAEC 11172.
`Annexes D, E andFare fcrinformation only.
`
`PUMAExhibit 2005
`Apple v. PUMA,IPR2016-01135
`3 of 124
`
`iti
`Apple Exhibit 1009
`Page 3 of 124
`
`Apple Exhibit 1009
`Page 3 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`3 of 124
`
`
`
`
`
`ISO/IEC 11172-2: 1993 (5)
`
`© ISOMEC
`
`Introduction
`
`Note -- Readers interested iu an overview of the MPEG Video layer should read this Introduction and then
`
`peoceed to annex D, before returning to clauses 1 and 2.
`
`0.i
`
`Purpose
`
`This part ofISO/IEC 11172 was developed in response to the growing need fora common format for
`representing compressed video on various digital storage media such as CDs, DATs, Winchester disks and
`optical drives. This part of ISO/TEC 11172 specifies a coded representation that can be used for
`compressing video sequences to bitratcs around 1,5 Mbit/s. The use of this part of ISO/IEC 11172 means
`thatmotion video can be manipulated a3 afonm ofcomputer data and can be transmitted and received over
`existing and future networks. The coded representation can be used with both 625-line and 5254ime
`television and provides flexibility for use with workstation and personal computer displays.
`This part ofESO/EC 11172 was developed to operate principally from storage media offering a continuous
`transfer rate of about 1,5 Mbit/s. Nevertheless it can be used more widely than this because the approach
`taken is generic.
`
`G.1.1 Coding parameters
`
`Theintention indeveloping this part of ISO/IEC 11172 has been to define a source coding algoritim witha
`large degree of flexibility that can be used in many different applications. To achieve this goal, a number of
`the parameters defining the characteristics ofcoded bitstreams and decoders are contained inthe bitstream
`itself, This allows for example, the algorithm to be used for pictures with a variety of sizes and aspect
`ratios and on channels or devices operating at a wide range ofbitrates.
`
`Because of the large range of the characteristics of bitstreams that can be represented bythis part of ISO/IEC
`11172, a sub-set of these coding parameters known as the "Constrained Paruneters” has been defined. The
`aim in defining the constrained parametersis to offer guidance about a widely useful range ofparameters.
`Conforming to this set of constraints is not a requirementof this part of ISO/IEC 11172. A flag in the
`bitstream indicates whether or not itis a Constrained Parameters bitstream.
`
`Summary of the Constrained Parameters:
`
`
`
`
`
`Vertical picture size
`
`
`
`
`
`
`
`
`
`
`Less than or equal to 768 pels
`Lass than or equal tc 576 lines
`Less than or equal to 396 macroblocks
`Less than or equal to 396x255 macrohlocks/s
`Less than or equa! to 30 Hz
`Less than -64 to +63,5 pels (using half-pel vectors}
`ackward_£code and forward_fcodes <= 4 (see table D.7
`
`
`
`Less than or equalto 1 856 000 bits/s (constantbitrate
`
`
`
`0.2 Overview of the algorithm
`
`The coded representation defined in this part of ISOAEC 11172 achieves a high compression ratio while
`preserving good picture quality. The algorithm is not lossiess as the exact pel values are not preserved
`during coding. The choice of the techniques is based on the need !o balance a high picture quality and
`compression ratio with the requirement to make random access to the coded bitstream. Obtaining good
`_.
`Picture quality at the bitrates of interest demands a very bigh compression ratio, which is notachievable
`with intraframe coding alone. The need for random access, however, is best satisfied with pure inRAgdids Exhibit 2005
`coding. This requires a careful balance between intra- and interframe coding and betyspsierecuPivhand AR2016-01135
`recursive temporal redundancy reduction.
`4 of 124
`
`iv
`
`Apple Exhibit 1009
`Page 4 of 124
`
`Apple Exhibit 1009
`Page 4 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`4 of 124
`
`
`
`
`
`© ISOAEC
`
`ISO/IEC 11172-2: 1993 (E)
`
`A bumber oftechniques are used to achieve a high compression ratic. The first, which is almost
`independent from this part ofISOMEC 11172,is to select an appropriate spatial resclution for the signal.
`The algorithm then uses block-hased motion compensation to reduce che temporal redundancy, Motion
`compensation is used for causal prediction ofthe current picture from a previouspicture, for non-causal
`prediciioa of the current picture from a future picture, or for iterpolative prediction from past and future
`pictures. Motion vectors are defined for each 16-pel by 16-line region of the picture. ‘The difference signal,
`the prediction exror, is furthercompressed using the discrete cosine transform (DCT) to remove spatial
`conelation before it is quantized in an irreversibie process that ciscards theless important information.
`Finally, the motion vectors are combined with the DCT information, and coded using variable length codes.
`
`0.2.1 Temporal processing
`
`Because ofthe conflicting requirements of randam access and highly efficient compression, three main
`picturetypes are defined. Intra-coded pictures (1-Pictures) are coded without reference to other pictures.
`‘They provide access points to the coded sequence where decoding can begin, but are coded with only a
`moderate compression ratio. Predictive coded pictures (-Pictures) are coded moreefficiently using motion
`compensated prediciioa from a pastintra or predictive coded picture and are generally used as a reference for
`furtherprediction. Bidirectionally-predictive coded pictures (B-Pictures) providethe highest degree of
`compression but require both past and future reference pictures for motion compensation. Bidirectionally-
`predictive coded pictures are never used as references for prediction. The organisation of the three picture
`types inva sequenceis very flexible. The choice ts left to the encoder and will depend on the requirements of
`the application. Figure I illustrates the relationship between the three different picture types,
`
`Prediction
`
`Bi-directional
`
`Prediction
`
`Figure 1 -- Example of temporal picture structure
`
`The fourth picture type defined in this part ofISO/IEC 11172, the D-picture,is provided to allowa simple,
`but limited quality, fast-forward playback mode.
`
`0.2.2 Motion tepresentation - macroblocks
`
`‘The choice of 16 by 16 macroblocks for lhe motion-compensation unitis a result of the trade-off between
`increasing the coding efficiency provided by using motion information and :he overhead neadedto store it.
`Each macroblock can be one of anumberofdifferent types. For example, intra-coded, forward-predictive-
`ceded, backward-predictive coded, and bidirectionally-prediciive-coded macroblocks are permitted in
`bidirectionally-predictive coded pictures. Depending on the type of the macroblock, motion vector
`infomation and other side information are stored with the compressed prediction error signal in each
`macroblock. ‘Themotion vectors are encoded differentially with respectto the last coded motion vector,
`using variable-length codes. The maximum length of the vectors that may be represented can be
`programmed, cn a piciure-by-picture basis, so that the most demanding applications can be met without
`compromising the performance of the system in more normal situations.
`Itis the responsibility of the encoder to calculate appropriate motion vectors. This partofSOgEEiehibit 2005
`does not specify how this should be done.
`Apple v. PUMA, IPR2016-01135
`5 of 124
`
`Vv
`
`Apple Exhibit 1009
`Page 5 of 124
`
`Apple Exhibit 1009
`Page 5 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`5 of 124
`
`
`
`_ SOAEC 11172-2: 1993 (E)
`
`,
`
`© ISOAEC
`
`6.2.3 Spatial redundancy reduction
`
`Both original pictures and prediction error signals have high spatial reduadancy. This part of ISO/IEC
`11172 usesa block-based DCT method with visually weighted quantization and run-length coding. Each 8
`by 8 block of the original picture for intra-coded macroblocks or of the prediction errorfor predictive-coded
`macroblocks is transformed into the DCT domain whereit is scaled before being quantized. After
`quantization manyof the ccefficients are zero in value and so two-dimensional nm-length and variable
`Jength coding is used to encode the remaining coefficients efficiently.
`
`0.3. Encoding
`
`This part of ISO/IEC 11172 does not specify an encoding process. It specifies the syntax and semantics of
`the bitstream and the signal processing in the decoder. Asa result, many optorns are left open to encoders
`to trade-off cost and speed against picture quality and coding efficiency. This clause is a brief description of
`the functions that need to -be perfomed by an encoder. Figure 2 shows the main functional blocks.
`
`Motion
`
`Source input pictues
`
`stimator
`
`where
`
`DCT is discrete cosine transform
`DCT-1 is inverse discrete cosine tansform
`Qis quantization
`Q'! is dequantization
`VLC is variable length coding
`
`Figure 2 -- Simplified video encoder block diagram
`
`The input video signal must be digitized and represented as a luminance and two colour difference signals
`(Y, Ch, Cy). This may be followed by preprocessing and format conversion to select an appropriate
`window, resolution and input format. This part of ISO/IEC 11172 requires that the colour difference
`signals (Cy and Cy) are subsampled with respect to the luminance by 2:1 in both vertical and horizontal
`dhrections and are ieformatied, if necessary, as a non-interlaced signal.
`The encoder must choose which picture type to ase for each picture. Having defined the picturetypes, the
`encoderestimates motion vectors for each 16 by 16 macroblock im the picture.
`In P-Pictures one vector is
`needed for cach noa-intra macroblock and in B-Fictures one or two vectors are needed,
`
`if B-Pictures are used, some reordering of the picture sequence is necessary before encoding, BecalieWA Exhibit 2005
`Pictures are coded using bidirectional motion compensated prediction, they can oalApptiecedslaNdrthdPR2016-01135
`subsequent reference picture (an I or P-Picture) has been decoded. Therefore the pictures are reordered by the
`6 of 124
`
`vi
`
`Apple Exhibit 1009
`Page 6 of 124
`
`Apple Exhibit 1009
`Page 6 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`6 of 124
`
`
`
`© ISOAEG
`
`ISOMEC 11172-2: 1993 (E)
`
`encoder sc that the pictures artive at the decoder in the order fordecoding. The correct display order is
`recovered by the decoder:
`
`The basic unit of coding within a pictureis the macroblock. Within each picture, macroblocks are encoded
`in sequence, left to right, top to bottom. Each macroblock consists of six 8 by 8 blocks: four blocks of
`Tuminance, one block of Cb chreminance, and one block of Cr chrominance. See figure 3. Note that the
`picture area covered by the four blocks of luminance is the same as the area coyered by each of the
`cbrominance blocks. This is due to subsampliag of the chrominance information to match the sensitivity of
`the buman visual system.
`-
`
`co
`
`Cb
`
`Cr
`
`Figure 3 -- Macroblock structure
`
`Fustly, fora given macroblock, the coding mede is chosen. It depends on the picture type, the
`effectiveness of Inction compensated prediction in that local region, and the nature of the signal withn the
`block. Secondly, depending on ihe coding mode, a motion compensated prediction of the contents ofthe
`block based on past and/or future reference pictures is formed. This prediction is subtracted from the actual
`data in the current macroblock to form an error signal. Thirdly, this error signal is separated into 8 by 8
`blocks (4 luminance and 2 chrominance blocks in each macroblock) and a discrete cosine transform is
`performed on each block. Each resulting 8 by 8 block of DCT coefficients is quantized and the iwo-
`dimensional block is scanned in a zig-zag order to convert itinto a one-dimensional string of quantized DCT
`coefficients. Fourthly, the side-informaton for the macrebleck (mode, motion vectors etc) and the
`quantized coefficient data are encoded. For maximum efficiency, a number of variable length code tables are
`defined forthe different data elements. Run-leagth codingis used for the quantized coefficient data.
`
`A consequence of using different picture types and variable Jength coding is that the overall data rate is
`variable. In applications that involve a fixed-rate channel, a FIFO buffer may be used tc match the encoder
`output i the channel. The status of this buffer may be monitored to control the number of bits generated
`by the encoder. Centrolling the quantization process is the most direct way pf controllingthe bitrate, This
`part of ISO/IEC 11172 specifies an abstract model of the buffering system (the Video Buffering Verifier} in
`order to constrain the maximum variability im the number ofbits that are used fora given picture. This
`ensues that a bitstreain can be decoded with a buffer of knownsize.
`
`At this stage, the coded representationofthe picture has been generated. The final step in the encoder is to
`regeverate I-Pictures and P-Pictures by decoding the data so that they can be used as reference pictures for-
`subsequent encoding. The quantized coefficients are dequantized and an inverse 8 by 8 DCT is performed on
`each block, The prediction error signal produced is then added back io the prediction signal and limited to
`the required range to give a decoded reference picture.
`
`0.4 Decoding
`
`It is considerably simpler than encoding as there is no
`Decoding is the inverse of ihe encoding operation.
`need to perform motion estimation and there are many fewer options. The decoding process is defined by
`this part of ISOMEC 11172. The description that follows is a very brief overview of one possible way of
`decoding abilstream. Other decoders with different architectures are possible. Figure 4 shows the main
`functional blocks.
`
`'
`
`PUMAExhibit 2005
`Apple v. PUMA, IPR2016-01135
`7 of 124
`
`vii
`
`Apple Exhibit 1009
`Page 7 of 124
`
`
`
`
`
`
`
`
`
`Apple Exhibit 1009
`Page 7 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`7 of 124
`
`
`
`{SO/MEC 11172-2: 1993 (E)
`
`@ISO}EC
`
`re ded video
`tstream
`
`Where
`
`Motion Vectors
`
`
`
`Reconstructed
`ouiput pictures
`
`is inverse discrete cosine transform
`DCT!
`isdequantization
`Qt
`MUX"! is demultiplexing
`VLD
`is variable length decoding
`
`Figure 4 -- Basic video decoder block diagram
`
`Forfixed-rate applications, the chame! fills a FIFO buffer at a constant rate with the coded bitsircam. The
`decoder reads this buffer and decodes the data elements in the bisteam according to the defined syntax.
`
`As the decoder readsthe bitstream, -it identifies the start of a coded pictere and then the type of the picture.
`It decodes each macrobiock in the picture in tum. The macroblock type and the motion vectors, if present,
`are used {o constucta prediction of the currentmacroblock based on past and future reference pictures that
`-have been stored in the decoder. The coefficient data are decoded and dequantized. Each 8 by 8 block of
`coefficient data is transformed by an inverse DCT(specified in annex A), and the result is added to the
`prediction signal and limited to the defined range.
`
`After all the macroblocks in the picture have been processed. the picture has been reconstructed, Hf itis an {-
`picture or a P-picture it is a reference picture for subsequent pictures and is stored, replacing the oldest stored
`reference picture. Before the pictures are displayed they may need 10 be re-ordered from the coded order to ~
`their natural display order. Afterreordering, the pictures are availabk, in digital form, for post-processing
`and display in any manner thai the application chooses.
`
`0.5
`
`Structure of the coded video bitstream
`
`This part of ISOMEC 11172 specifies a syntax for a coded video bitstream. This syntax comtains six layers,
`each of which either supports a signal processing or a system function:
`
`Layers of the
`
`syntax
`
`DCT unit
`
`Sequence layer
`Groupofpictures layer
`Picture layer
`Slice layer
`Macrobicck layer
`Block layer
`
`Random access unit: coniext
`Random access unit: video
`Primary coding unit
`Resynchrorization unit
`Motion compensation unit
`
`0.6 Features supported by the algorithm
`
`Applications using compressed video on digital storage media need to be able to perform 4 number of
`operations in addition to nonnal forward playbackof the sequence. The coded bitstream has been designed
`fo support a number of these operations.
`Pee
`Pe
`PUMAExhibit 2005
`Apple v. PUMA, IPR2016-01135
`8 of 124
`
`viii
`
`Apple Exhibit 1009
`Page 8 of 124
`
`Apple Exhibit 1009
`Page 8 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`8 of 124
`
`
`
`© ISO/IEC
`
`ISOAEC 11172-2: 1993 (E)
`
`G@.6.1 Random access
`
`Random access is an esseniial feature for video ona storage medium. Random access requires that any
`picture can be decoded in a limited amountof time. It implies Lhe existence of access points in the
`bitstream - that is segments of information that are identifiable and can be decoded without reference to other
`segments of data. A spacing of two random access points (Intra-Pictares) per second can be achieved
`without significant loss of picture quality.
`
`0.6.2 Fast search
`
`Depending on the storage medium, itis possible to scan the access points in a coded bitstream (with the
`help of an application-specific directory or other knowledge beyond the scope ofthis partof ISOMEC
`11172) to obtain a fast-forward and fast-reverse playback effect.
`
`0.6.3 Reverse playback
`
`Some applications may require the video signal to be played in reverse order. This canbe achieved ina
`decoder by using memory 6 store entire groups ofpictures after they have been decoded before being
`displayed in reverse order. An encoder can make this feature easier by reducing the Jength of groups of
`pictures.
`
`0.6.4 Error robustness
`
`Most digital stomge mecia and communication chanvuels are not error-free, Appropriate channel coding
`schemes should be used and are beyond the scope of this part of ISQ/TEC 11172. Nevertheless the
`compression scheme defined in this part of ISO/TEC-11172 is robust to residual errors. The slice structure
`allows a decoder to recover after a data error and to resynchronize its decoding. Therefore, bit errors in the
`compressed data will cause errors in the decoded pictures to be limited m area. Decoders may be able to use
`concealmentstrategies to disguise these errors.
`
`0.6.5 Editing
`
`There is a conflict between the requirement for high coding efficiency and easy editing. The coding structarz
`and syntax have not been designed witb the primary aim of simplifying editing at any picture. Nevertheless
`a number of features have been included that enable editing of coded data.
`
`
`
`
`
`PUMAExhibit 2005
`Apple v. PUMA, IPR2016-01135
`9 of 124
`
`ix
`Apple Exhibit 1009
`Page 9 of 124
`
`Apple Exhibit 1009
`Page 9 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`9 of 124
`
`
`
`This page intentionally ieft blank
`
`PUMAExhibit 2005
`Apple v. PUMA, IPR2016-01135
`10 of 124
`
`Apple Exhibit 1009
`Page 10 of 124
`
`Apple Exhibit 1009
`Page 10 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`10 of 124
`
`
`
`
`
`
`
`INTERNATIONAL STANDARD © ISO/IEC ISOMEC 11172-2: 1993 (E}
`
`Information technology — Coding of moving
`pictures and associated audio for digital storage
`media at up to about 1,5 Mbit/s —
`
`Part 2:
`Video
`
`Section 1: General
`
`1.1
`
`Scope
`
`This part of ISOTEC 11172 specifizs the coded representation of video for digital storage media and
`specifies the decoding process. The representation supports normal speed forward playback, as well as
`special functions such as random access,fast forward playback, fast reverse playback, nonnal speed reverse
`playback, pause and still pictures. This part ofISOMEC 11172 is compatible with standard 525- and 625-
`line television formats, and it provides flexibility for use with personal computer and workstation displays.
`
`ISOMEC 11172 is primarily applicable to digital storage media supporting a continuous transfer rate up to
`about 1,5 Mbit/s, such as Compact Disc, Digital Audio Tape, aad magnetic hard disks. Nevertheiess itcan
`be used more widely than this because of the generic approach taken. The storage media may be directly
`comnected lo the decoder, or yia communications means such as busses, LANs, or telecommunications
`links. This partofISO/IEC 11172 is intended for non-interlaced video formats having approximately 288
`lines of 352 pels and picture rates around 24 Hz to 30 Hz.
`
`1.2 Normative references
`
`The following International Standards contain provisions which, through reference in this test, constitute
`provisions of this part of ISO/TEC 11172. At the time of publication, the editions indicated were valid.
`Ali standards are subject to revision, and parties to agreements based on this part of ISO/IEC 11172 are
`encouraged to investigate the possibility of applying the mest recenteditions of thestandards indicated
`below. Members of IEC and ISO maintain registers of currently valid Internaticnal Standards.
`
`ISOMEC 11172-1:1993 Injormation technclozy- Coding ofmoving pictures and associated audiofor digital
`storaze media at up io about 1,5 Mbit/s - Part 1: Systeits.
`
`. ESO/MEC 11172-3:1993 Injorination technology - Coding ofmoving pictures and associated audio for digital
`storaze media at up to about 1,5 Mbit/s - Part 3 Audio.
`
`CCIR Recommendation 601-2 Encoding parameters ofdigital television for studios.
`
`CCIR Report 624-4 Charecteristics of sysiemsJor monochrome aud colour television.
`
`CCIR Recommendation 648 Recording of audio sigaals.
`
`CCIR Report955-2 Sound broadcasting by satelliteforportable andmobile receivers, includingAnnex iV
`Sunuaary description ofAdvanced Digital System i,
`A Exhibit 2005
`Apple v. PUMA,IPR2016-01135
`CCITT Recommendation J.17 Pre-emphasis used on Sound-Programme Circuits.
`11 of 124
`
`1
`
`Apple Exhibit 1009
`Page 11 of 124
`
`Apple Exhibit 1009
`Page 11 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`11 of 124
`
`
`
`
`
`ISO/IEC 11172-2: 1993 (B)
`
`© ISO/IEC
`
`IEEEDrafi Standard P11 80/D2 199€ Specificauon for ihe implementation of 8x & inverse discrete cosine
`iransform”.
`
`TEC publication 908:1987 CD Digital Audio Sysiem.
`
`
`
`PUMAExhibit 2005
`Apple v. PUMA, IPR2016-01135
`12 of 124
`
`Apple Exhibit 1009
`Page 12 of 124
`
`Apple Exhibit 1009
`Page 12 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`12 of 124
`
`
`
`
`
`
`
`i
`
`
`
`© [SOAEC
`
`ISO/MEG 11172-2: 1993 (E}
`
`Section 2: Technical elaments
`
`2.1 Definitions
`
`Forthe purposes of SOMTEC 11172, the following definitions apply. If specific to a part, this is noted in
`square brackets.
`
`2.1.1 ac coefficient |video]: Any DCT coefficient for which the frequency in one or both dimensions
`is non-zero,
`
`In the case of compressed audio an access unit is an andio access unit.
`21.2 access unit jsystem]:
`the case of compressed video an access unit is the coded representation of a picture,
`
`In
`
`2.1.3 adaptive segmentation [audio]: A subdivision of the digital representation of an audio signal
`in variable segments of time.
`
`2.1.4 adaptive bit allocation [audio]: The assignment of bits to sebbands in a time and frequency
`varying fashion according to a psychoacoustic model.
`:
`
`2.1.5 adaptive noise allocation [audio]: The assigument of coding noise to frequency bands in a
`time and frequency varying fashion according toa psychoacoustic model.
`
`41.6 alias [audiok Mirrored signal component resulting from sub-Nyquist sampling.
`
`2.1.7 analysis filterbank [audio]: Filterbank in the encoder that transforms a broadband PCM aadio
`signal inio a set of subsampled subband samples.
`
`2.4.8 audio access unit [awdio}: For Layers 1 and HUan audio access unit is defined as the smellest
`part of the encoded bitstream which can be decoded byitself, where decoded means "fully reconstructed
`sound*. For Layer MJ an audio access unit is part of the bitstream that is decodable with the use of
`previously acquired main information.
`
`2.1.9 audic buffer [audiol: A buffer in the system target decoder for storage of compressed audio data.
`2110 audio sequence [audio]: A 10n-intermpted series of audic frames in which the following
`parameters are not changed:
`
`- Layer
`- Sampling Frequency
`- For Layer Land II: Bitrate index
`
`2.1.11 backward motion vector [video]: A motion vector that is used for motion compesation
`bom a reference picture at a later time in display oder.
`
`2.1.12 Bark [audio]: Unit of critical band rate. The Bark scale is a non-linear mapping of the frequency
`scale over the audio range closely corresponding with the frequency selectivity of the human ear across the
`band.
`
`2113 bidirectionally predictive-coded picture; B-picture [video]: A picture that is coded
`using moticn compensated prediction from a past and/or future reference picture.
`
`2.1.14 bitrate: The rate at which the compressed bitstream is delivered from the storage medium to the
`input of 2 decoder.
`
`2.1.15 block companding [audio]: Normalizing of the digital representation of an aucio signal
`within a certain time period.
`21.16 block [video]: An 8-row by &column orthogonal block of pels.
`
`PUMAExhibit 2005
`Apple v. PUMA, IPR2016-01135
`2.1.17 bound [audio]: The lowest subband in which intensity stereo coding is used.
`13 of 124
`
`3
`Apple Exhibit 1009
`Page 13 of 124
`
`Apple Exhibit 1009
`Page 13 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`13 of 124
`
`
`
`ISO/IEG 11172-2 1993 (E}
`
`GISOIEC
`
`4.1.18 byte aligned: A bit in a coded bitstream is byte-aligned if its position is a multiple of 8-bits
`from the first bit in the stream.
`
`2.119 byte: Sequence of 8-bits.
`
`2.1.20 channel: A digital medium that stores or transports an ISOEC 11172 stream.
`
`2.1.21 channel [aucioj: The left and right channels of a stereo signal
`
`2.1.22 chrominance (component) [video]: A matrix, block cr single pel representing one of the
`twe colour differerce signals related to the primarycolours in the manner defined im CCIR Rec 501. The
`symbols used for the colour difference signals are Cr and Cb.
`
`2.1.23 coded audio bitstream {audio]: A coded representation of an audio signal as specified im
`[SO/IEC 11172-3.
`
`2.1.24 coded video bitstream [video]: A coded representation of a series of one or more. pictures as
`specified in this part of ISO/IEC 13172..
`
`2.1.25 coded order [video]: The order in which the pictures are stored and decoded. This order is not
`necessasily the same as the display order.
`
`2.1.26 coded representation: A data element as represented in its enccded form.
`
`2.1.27 coding parameters [video]: The set of user-definable parameters that characterize a coded video
`bitstream. Bitstreams are characterised by coding parameters. Decoders are characterised by the bitstreams
`that they are capable of decoding.
`
`2.1.28 component [video]: A matrix, block or single pel. from one of the three matrices (luminance
`and two chrominance) thal make up a picture.
`
`2.1.29 compression Reduction in the number of bits used to represent ar item of data,
`
`2.1.30 constant bitrate coded video [video]: A compressed video bitstream wilh a constant
`average bitrate.
`
`2.1.31 constant bitrate: Operation where the bitrate is constant {rom start to finish of the compressed
`bitstream.
`
`2.1.32 constrained parameters [video The values of the set of coding parameters defined in
`2,4,3,2,
`
`2.1.33 constrained system parameter stream (CSPS) [system]: An ISOAEC 11172
`taultiplexed stream for which the coistraints defined in 2.4.6 of ISGOMEC 11172-1 apply.
`
`2.1.34 CRC; Cyclic redundancy code.
`
`2.1.35 critical banc rate {audio]: Psychoacoustic function of frequency. Ata given audible
`frequency it is proportional to the numberofcritical bands below that frequency. The units of the critical
`band rate scale are Barks.
`
`2.1.36 critical band [audio]: Psychoacoustic measure in the spectral domam which corresponds to the
`frequency selectivity of the human ear. This selectivity is expressed in Bark.
`
`2.1.37 data element: An item of data as represented before encoding and after decoding.
`
`2.1.38 de-coefficient [video]: The DCT coefficient for which the frequency is zero in bowUPUMA Exhibit 2005
`dimensiors.
`Apple v. PUMA,IPR2016-01135
`14 of 124
`
`.
`
`Apple Exhibit 1009
`Page 14 of 124
`
`
`
`Apple Exhibit 1009
`Page 14 of 124
`
`PUMA Exhibit 2005
`Apple v. PUMA, IPR2016-01135
`14 of 124
`
`
`
`©ISONEC
`
`ISO/IEC 11172-2: 1993 (E)
`
`2.1.39 de-coded picture; D-picture [video]: A picture that is coded using only information from
`itself. OF the DCT coefficients in the coded representation, only the de-coefficients are present
`
`2.1.40 DCT coefficient: The amplitude of a specific cosine basis function,
`
`2.1.41 decoded stream: The decoded reconstraction of a compressed bitstreain.
`
`2.1.42 decoder input buffer [video]: