`STANDARD
`
`ISO/IEC
`11172-3
`
`Information technology —— Coding of
`moving pictures and associated audio for
`digital storage media at up to about
`1,5 Mbit/s —
`
`Part 3:
`
`Audio
`
`Technologies do I ‘information — Codage de /’/‘mags animée at du son
`assoc/‘é pour /95 supports de stockage numérique /'usqu'é environ
`7,5 Mbit/s —
`
`Partie 3: Audio
`
`
`
`ISO/IEC 11172-3:1993(E)
` .L. SLU'S EXHIBIT 1007
`
`Reference number
`
`PAGE 1 OF 158
`
`PAGE 1 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`ISO/IEC 11172-3: 1993 (E)
`
`Contents
`
`I
`
`Page
`
`Introduction .............................................................................................. ..v
`
`Section 1: General ...................................................................................... ..1
`
`1.1
`
`Scope ....................... 4." ......................................................... .: .......... ..l
`
`1.2
`
`Normative references .........................................................
`
`............... ..l
`
`Section 2: Technical elements ........................................................................ ..2
`
`2. l
`
`Definitions ...............................................................
`
`..................... ..2
`
`2.2
`
`Symbols and abbreviations ................
`
`............................................. .. 10
`
`2.3 Method of describing bitstream syntax ................ ., ................................ ..l2
`
`2.4
`
`Requirements ..............................................
`
`............................... .. 14
`
`Annexes
`
`A
`
`B
`
`C
`
`D
`
`8
`
`Diagrams .....................................................................................
`
`Tables .......................................................................................... ..45
`
`The encoding process ....................................................................... .. 66
`
`Psychoacoustic models .... ..: ............................................................. ..l09
`
`Bit
`
`sensitivity to errors ...................................
`
`......................... ..l40
`
`© ISO/IEC 1993
`All rights reserved. No part of this publication may be reproduced or utilized in any form or by
`any means, electronic or mechanical, including photocopying and microfilm, without
`permission in writing from the publisher.
`
`ISO/IEC Copyright Office - Case Postale 56 - Cl-I 1211 Geneve 20 - Switzerland
`
`Printed in Switzerland.
`
`ii
`
`PAGE 2 OF 158 L. :SLU'S EXHIBIT 1007
`
`PAGE 2 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`© ISO/IEC
`
`ISO/IEC 11172-3: 1993 (E)
`
`F
`
`G
`
`H
`
`Error
`
`concealment ........................................................................... ..142
`
`Joint stereo coding ......................................................................... ..l43
`
`List of patent holders ....................................................................... ..l47
`
`PAGE 3 OF 158
`
`,
`
`7
`
`~
`
`_
`
`_
`
`-
`
`r
`
`«V
`
`.
`
`iii
`
`A
`
`»
`
`”|;M.L. SLU'S EXHIBIT 1007
`
`PAGE 3 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`ISO/IEC 11172-a: 1993 (E)
`
`© ISO/lEC
`
`Foreword
`
`lSO (the international Organization for Standardization) and IEC (the inter-
`national Eiectrotechnical Commission) form the specialized system for
`‘worldwide standardization. National bodies that are members of lSO or
`IEC participate in the development of International Standards through
`technical committees established by the respective organization to deal
`with particular fields of technical activity.
`ISO and IEC technical com-
`mittees collaborate in fields of mutual interest. Other international organ-
`izations, governmental and non-governmental, in liaison with ISO and IEC,
`also take part in the work.
`
`in the field of information technology, lSO and IEC have established a joint
`technical committee, ISO/iEC JTC 1. Draft international Standards adopted
`by the joint technical committee are circulated to national bodies for vot-
`ing. Publication as an International Standard requires approval by at least
`75 % of the national bodies casting a vote.
`
`international Standard ISO/IEC 11172-3 was prepared by Joint Technical
`Committee lSO/I EC JTC 1, Information technology, Sub-Committee SC 29,
`Coded representation of audio, picture, multimedia and hypermedia infor-
`mation.
`
`ISO/iEC 11172 consists of the following parts, under the general title In-
`formation technology — Coding of moving pictures and associated audio
`for digital storage media at up to about 1,5 Mbit/s:
`
`— Part 1: Systems
`
`—— Part2: Video
`
`-— Part 3: Audio
`
`—— Part 4: Compliance testing
`
`Annexes A and B form an integral part of this part of ISO/IEC 11172. An-
`nexes C, D, E, F, G and H are for information only.
`
`
`
`.M.L. SLU'S EXHIBIT 1007
`
`iv
`
`PAGE 4 OF 158
`
`PAGE 4 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`o ISO/IEC
`
`Introduction
`
`ISO/IEC 11172-3: 1993 (E)
`
`Note: Readers interested in an overview of MPEG Audio should read this Introduction and then proceed to
`
`annex A (Diagrams) and annex C (The encoding process) before reading the normative clauses 1 and 2.
`
`To aid in the understanding of the specification of the stored compressed bitstream and its decoding, a
`sequence of encoding, storage and decoding is described.
`
`0.1
`
`Encoding
`
`The encoder processes the digital audio signal and produces the compressed bitstream for storage. The
`encoder algorithm is not standardized, and may use various means for encoding such as estimation of the
`auditory masking threshold, quantization, and scaling. However, the encoder .output must be such that a
`decoder conforming to the specifications of clause 2.4 will produce audio suitable for the intended
`application.
`
`PCM
`audio samples
`32 44,1 48 kHz
`
`mapping
`
`quamizer
`and
`coding
`
`frame
`k.
`pac mg
`
`e_”°°ded
`bitstream
`
`encoder
`
`ISOIIEC 11172-3
`
`Figure 1 -- Sketch of the basic structure of an encoder
`
`ancillary data
`
`Figure 1 illustrates the basic structure of a audio encoder. Input audio samples are fed into the encoder. The
`mapping creates a filtered and subsainpled representation of the input audio stream. The mapped samples
`may be called either subband samples (as in Layer I or II, see below) or transformed subband samples (as in
`Layer III). A psychoacoustic model creates a set of data to control the quantizer and coding. These data are
`different depending on the actual coder implementation. One possibility is to use an estimation of the
`masking threshold to do this quantizer control. The quantizer and coding block creates a set of coding
`symbols from the mapped input samples. Again, this block can depend on the encoding system. The block
`‘frame packing‘ assembles the actual bitstream from the output data of the other blocks, and adds other
`information (e.g. error correction) if necessary.
`
`There are four different modes possible, single chzuinel, dual channel (two independent audio signals coded
`within one bitstreain), stereo (left and right signals of a stereo pair coded within one bitstream), and Joint
`Stereo (left zuid right signals of a stereo pair coded within one bitstreain with the stereo irrelevancy and
`redundancy exploited).
`
`PAGE 5 OF 158
`
`_
`
`_
`
`V
`,.
`.
`|l.M.L. SLU'S EXHIBIT 1007
`
`PAGE 5 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`ISO/IEC 11172-3: 1993 (E)
`
`© tso/use
`
`0.2
`
`Layers
`
`Depending on the application, different layers of the coding system with increasing encoder complexity and
`performance can be used. An ISO/IEC 11172-3 Audio Layer N decoder is able to decode bitstream data
`which has been encoded in Layer N and all layers below N.
`
`Layer I
`
`This layer contains the basic mapping of the digital audio input into 32 subbands, fixed segmentation to
`format the data into blocks, a psychoacoustic model to determine the adaptive bit allocation, and
`quantization using block companding and formatting. The theoretical minimum encoding/decoding delay for
`Layer I is about 19 ms.
`
`Layer II
`
`This layer provides additional coding of bit allocation, scalefactors and samples. Different framing is used.
`The theoretical minimum encoding/decoding delay for Layer II is about 35 ms.
`
`Layer III
`
`This layer introduces increased frequency resolution based on a hybrid filterbank. It adds a different
`(nonuniform) quantizer. adrwtptive segmentation and entropy coding of the quantized values. The theoretical
`minimum encoding/decoding delay for Layer III is about 59 ms.
`
`Joint Stereo coding cmt be added as an additional feature to any of the layers.
`
`0 . 3
`
`S to ra 99
`
`Various streams of encoded video, encoded audio, synchronization data, systems data and auxiliary data may
`be stored together on a storage medium. Editing of the audio will be easier if the edit point is constrained to
`coincide with an addressable point.
`
`Access to storage may involve remote access over a communication system. Access is assumed to be
`controlled by a functional unit other than the audio decoder itself. This control unit accepts user commands,
`reads and interprets data base structure infomnation, reads the stored information from the media,
`demultiplexes non-audio information and passes the stored audio bitstream to the audio decoder at the
`requiredrate.
`
`0.4 Decoding
`
`The decoder accepts the compressed audio bitstream in the syntax defined in 2.4.1, decodes the data elements
`according to 2.4.2, and uses the information to produce digital audio output according to 2.4.3.
`
`encoded
`bitstfeam
`
`frame
`
`unpacking
`
`ISO/IEC 11172-3 decoder
`
`reconstruction
`
`inverse
`
`mapping
`
`PCM
`
`audio samples
`
`ancillary data
`
`Figure 2 -- Sketch of the basic structure of a decoder
`
`Figure 2 illustrates the basic structure of a audio decoder. Bitstream data is fed into the decoder. The
`bitstream unpacking and decoding block does error detection if error~checl< is applied in the encoder (see
`2.4.2.4). The bitstream data are unpacked to recover the various pieces of information. The
`reconstruction block reconstructs the quantized version of the set of mapped samples. The inverse
`mapping transforms these mapped samples back into unifonn PCM.
`
` “PAGE 6 OF 158
`
`‘I lVl.L. SLU'S EXHIBIT 1007
`
`PAGE 6 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`
`
`
`
`INTERNATIONAL STANDARD © '30/‘EC ISO/IEC 11172—3: 1993 (El
`
`Information technology —- Coding of moving
`pictures and associated audio for digital storage
`media at up to about 1,5 Mbit/s -—
`'
`
`Part 3:
`
`4
`
`Audio
`
`Section 1: General
`
`1.1
`
`Scope
`
`This part of ISO/IEC 1 1172 specifies the coded representation of high quality audio for storage media and
`the method for decoding of high quality audio signals. The input of the encoder and the output of the decoder
`are compatible with existing PCM standards such as standard Compact Disc and Digital Audio Tape.
`
`This part of the ISO/IEC 11172 is intended for application to digital storage media providing a total
`continuous transfer rate of about 1,5 Mbits/sec for both audio and video bitstreams, such as CD, DAT and
`magnetic hard disc. The storage media may either be connected directly to the decoder. or via other means
`such as communication lines and the ISO/IEC 11172 multiplexed stream defined in ISO/IEC 11172-1.
`This part of ISO/IEC 11172 is intended for sampling rates of 32 kHz, 44,1 kHz, and 48 kHz.
`
`1.2 Normative references
`
`The following International Standards contain provisions which, through reference in this text, constitute
`provisions of this part of ISO/IEC 11172. At the time of publication, the editions indicated were valid.
`All standards are subject to revision, and parties to agreements based on this part of ISO/IEC 11172 are
`encouraged to investigate the possibility of applying the most recent editions of the standards indicated
`below. Members of IEC turd ISO maintain registers of currently valid International Standards.
`
`ISO/IEC 11172-1:1993 Information technology - Coding ofmoving pictures and associated audio for digital
`storage media at up to about 1,5 Mbit/s - Part 1 Systems.
`
`ISO/IEC 11172-2:1993 Information technology - Coding ofmoving pictures and associated audio for digital
`storage media at up to about 1.5 Mbit/s - Part 2: Video.
`
`CCIR Recommendation 601-2 Encoding parameters ofdig ital television for studios.
`
`CCIR Report 624-4 Characteristics of systems for monochrome and colour television.
`
`CCIR Recommendation 648 Recording of audio signals.
`
`CCIR Report 955-2 Sound broadcasting by satellite for portable and mobile receivers, including Annex IV
`Summary description of Advanced Digital System 11.
`
`CCITI‘ Recommendation J .17 Pre-emphasis used on Sound-Programme Circuits.
`
`IEEE Draft Standard P1180/D2 1990 Specification for the implementation of 8x 8 inverse discrete cosine
`tranvonn".
`
`IEC publication 908:1987 CD Digital Audio System.
`
`PAGE 7 OF 158
`
`|.M.L. SLU'S EXHIBIT 1007
`
`PAGE 7 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`ISO/[EC 11172-8: 1993 (E)
`
`© [SQ/[EC
`
`Section 2: Technical elements
`
`2.1 Definitions
`
`For the purposes of ISO/IEC 11172, the following definitions apply. If specific to a part, this is noted in
`square brackets.
`
`2.1.1 ac coefficient [video]: Any DCT coefficient for which the frequency in one or both dimensions
`is non-zero.
`
`In the case of compressed audio an access unit is an audio access unit.
`2.1.2 access unit [system]:
`the case of compressed video an access unit is the coded representation of a picture.
`
`In
`
`2.1.3 adaptive segmentation [audio]: A subdivision of the digital representation of an audio signal
`in variable segments of time.
`
`2.1.4 adaptive bit allocation [audio]: The assignment of bits to subbands in a time and frequency
`varying fashion according to a psychoacoustic model.
`
`2.1.5 adaptive noise allocation [audio]: The assignment of coding noise to frequency bands in a
`time and frequency varying fashion according to a psychoacoustic model.
`
`2.1.6 alias [audio]: Mirrored signal component resulting from sub-Nyquist sampling.
`
`2.1.7 analysis filterbank [audio]: Filterbank in the encoder that transfonns a broadband PCM audio
`signal into a set of subsampled subband samples.
`
`2.1.8 audio access unit [audio]: For Layers I and II an audio access unit is defined as the smallest
`part of the encoded bitstream which can be decoded by itself, where decoded means "fully reconstructed
`sound". For Layer III an audio access unit is part of the bitstream that is decodable with the use of
`previously acquired main information.
`‘
`
`2.1.9 audio buffer [audio]: A buffer in the system target decoder for storage of compressed audio data.
`
`2.1.10 audio sequence [audio]: A non—inten'upted series of audio frames in which the following
`parameters are not changed:
`- ID
`
`- Layer
`- Sampling Frequency
`- For Layer I and II: Bitrate index
`
`2.1.11 backward motion vector [video]: A motion vector that is used for motion compensation
`from a reference picture at a later time in display order.
`
`2.1.12 Bark [audio]: Unit of critical band rate. The Bark scale is a non-linear mapping of the frequency
`scale over the audio range closely corresponding with the frequency selectivity of the human ear across the
`band.
`
`2.1.13 bidirectionally predictive-coded picture; B-picture [video]: A picture that is coded
`using motion compensated prediction from a past and/or future reference picture.
`
`2.1.14 bitrate: The rate at which the compressed bitstream is delivered from the storage medium to the
`input of a decoder.
`
`2.1.15 block companding [audio]: Normalizing of the digital representation of rum audio signal
`within a certain time period.
`
`2.1.16 block [video]: An 8-row by 8-column orthogonal block of pels.
`
`2.1.17 bound [audio]: The lowest subband in which intensity stereo coding is used.
`
` 2 PAGE 8 OF 158 ‘.”‘M.L. SLU'S EXHIBIT 1007
`
`
`
`PAGE 8 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`© ISO/IEC
`
`ISO/IEC 11172-3: 1993 (E)
`
`2.1.18 byte aligned: A bit in a coded bitstream is byte-aligned if its position is a multiple of 8-bits
`from the first bit in the stream.
`
`2.1.19 byte: Sequence of 8-bits.
`
`2.1.20 channel: A digital medium that stores or transports an ISO/DEC 11172 stream.
`
`2.1.21 channel [audio]: The left and right channels of a stereo signal
`
`2.1.22 chrominance (component) [video]: A matrix, block or single pel representing one of the
`two colour difference signals related to the primary colours in the manner defined in CCIR Rec 601. The
`symbols used for the colour difference signals are Cr and Cb.
`
`2.1.23 coded audio bitstream [audio]: A coded representation of an audio signal as specified in this
`part of ISO/IEC 11172.
`2.1.24 coded video bitstream [video]: A coded representation of a series of one or more pictures as
`specified in ISO/IEC 11172-2.
`
`2.1.25 coded order [video]: The order in which the pictures are stored and decoded. This order is not
`necessarily the same as the display order.
`
`2.1.26 coded representation: A data element as represented in its encoded form.
`
`2.1.27 coding parameters [video]: The set of user-definable parameters that characterize a coded video
`bitstream. Bitstreams are characterised by coding parameters. Decoders are characterised by the bitsueams
`that they are capable of decoding.
`
`2.1.28 component [video]: A matrix, block or single pel from one of the three matrices (luminance
`and two chrominance) that make up a picture.
`
`2.1.29 compression: Reduction in the number of bits used to represent an item of data.
`
`2.1.30 constant bitrate coded video [video]: A compressed video bitstream with a constant
`average bitmte.
`
`2.1.31 constant bitr-ate: Operation where the bitrate is constant from start to finish of the compressed
`bitstream.
`
`2.1.32 constrained parameters [video]: The values of the set of coding parameters defined in
`2.4.3.2 of ISO/IEC 11172-2.
`
`2.1.33 constrained system parameter stream (CSPS) [system]: An ISO/IEC 11172
`multiplexed stream for which the constraints defined in 2.4.6 of ISO/IEC 11172-1 apply.
`
`;
`
`2.1.34 CRC: Cyclic redundancy code.
`
`2.1.35 critical band rate [audio]: Psychoacoustic function of frequency. At a given audible
`frequency it is proportional to the number of critical bands below that frequency. The units of the critical
`band rate scale are Barks.
`
`2.1.36 critical band [audio]: Psychoacoustic measure in the spectral domain which corresponds to the
`frequency selectivity of the human ear. This selectivity is expressed in Bark.
`
`2.1.37 data element: An item of data as represented before encoding and after decoding.
`
`2.1.38 dc-coefficient [video]: The DCT coefficient for which the frequency is zero in both
`dimensions.
`
`PAGE9OF’|58
`
`_‘
`
`"
`
`‘
`
`if
`
`I
`
`‘|.M.L.SLU'S EXH|B|T1007
`
`PAGE 9 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`ISO/IEC 11172-8: 1993 (E)
`
`© ISO/[EC
`
`2.1.39 dc-coded picture; D-picture [video]: A picture that is coded using only information from
`itself. Of the DCI‘ coefficients in the coded representation, only the dc-coefficients are present.
`
`2.1.40 DCT coefficient: The amplitude of a specific cosine basis function.
`
`2.1.41 decoded stream: The decoded reconstruction of a compressed bitstream.
`
`2.1.42 decoder input buffer [video]: The first-in first-out
`buffering verifier.
`
`(FIFO) buffer specified in the video
`
`2.1.43 decoder input rate [video]: The data rate specified in the video buffering verifier and encoded
`in the coded video bitstream.
`
`2.1.44 decoder: An embodiment of a decoding process.
`
`2.1.45 decoding (process): The process defined in ISO/IEC 11172 that reads an input coded bitstream
`and produces decoded pictures or audio samples.
`
`2.1.46 decoding time-stamp; DTS [system]: A field that may be present in a packet header that
`indicates the time that an access unit is decoded in the system target decoder.
`
`2.1.47 de-emphasis [audio]: Filtering applied to an audio signal after storage or transmission to undo
`a linear distortion due to emphasis.
`
`2.1.48 dequantization [video]: The process of rescaling the quantized DCT coefficients after their
`representation in the bitstream has been decoded and before they are presented to the inverse DCT.
`
`2.1.49 digital storage media; DSM: A digital storage or trzmsmission device or system.
`
`2.1.50 discrete cosine transform; DCT [video]: Either the forward discrete cosine transfonn or the
`inverse discrete cosine transform. The DCT is rm invertible, discrete orthogonal transformation. The
`inverse DCT is defined in annex A of ISO/IEC 11172-2.
`
`2.1.51 display order [video]: The order in which the decoded pictures should be displayed. Normally
`this is the same order in which they were presented at the input of the encoder.
`
`2.1.52 dual channel mode [audio]: A mode, where two audio channels with independent programme
`contents (e.g. bilingual) are encoded within one bitstream. The coding process is the same as for the stereo
`mode.
`
`2.1.53 editing: The process by which one or more compressed bitstreams are manipulated to produce a
`new compressed bitstream. Conforming edited bitstrezuns must meet the requirements defined in this
`ISO/IEC 11172.
`
`2.1.54 elementary stream [system]: A generic term for one of the coded video, coded audio or other
`coded bitstreams.
`
`2.1.55 emphasis [audio]: Filtering applied to an audio signal before storage or transmission to
`improve the signal-to-noise ratio at high frequencies.
`
`2.1.56 encoder: An embodiment of an encoding process.
`
`2.1.57 encoding (process): A process, not specified in ISO/IEC 11172, that reads a strezun of input
`pictures or audio samples and produces a valid coded bitstrezun as defined in ISO/IEC 11172.
`
`2.1.58 entropy coding: Variable length lossless coding of the digital representation of a signal to
`reduce redundancy.
`
`2.1.59 fast forward playback [video]: The process of displaying a sequence. Of Parts Of a sequence.
`of pictures in display-order faster than real-time.
`
`lV|.L. SLU'S EXHIBIT 1007 4 PAGE1OOF158
`
`3<"«E“é‘2}
`
`PAGE 10 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`© ISO/IEO
`
`-
`
`ISO/IEC 11172-3: 1993 (E)
`
`2.1.60 FFT: Fast Fourier Transformation. A fast algorithm for perfonning a discrete Fourier transform
`(an orthogonal transform).
`.
`
`2.1.61 filterbank [audio]: A set of band-pass filters covering the entire audio frequency range.
`
`2.1.62 fixed segmentation [audio]: A subdivision of the digital representation of an audio signal
`into fixed segments of time.
`
`2.1.63 forbidden: The tenn " forbidden“ when used in the clauses defining the coded bitstream indicates
`that the value shall never be used. This is usually to avoid emulation of start codes.
`
`2.1.64 forced updating [video]: The process by which rnacroblocks are intra—coded from time-to-time
`to ensure that mismatch errors between the inverse DCT processes in encoders and decoders cannot build up
`excessively.
`
`2.1.65 forward motion vector [video]: A motion vector that is used for motion compensation from
`a reference picture at an earlier time in display order.
`
`2.1.66 frame [audio]: A part of the audio signal that corresponds to audio PCM samples from an
`Audio Access Unit.
`
`2.1.67 free format [audio]: Any bitrate other than the defined bitrates that is less than the maximum
`valid bitrate for each layer.
`
`2.1.68 future reference picture [video]: The future reference picture is the reference picture that
`occurs at a later time than the current picture in display order.
`
`2.1.69 granules [Layer 11] [audio]: The set of 3 consecutive subband samples from all 32 subbands
`that are considered together before quantization. They correspond to 96 PCM samples.
`
`2.1.70 granules [Layer III] [audio]: 576 frequency lines that carry their own side information.
`
`2.1.71 group of pictures [video]: A series of one or more coded pictures intended to assist random
`access. The group of pictures is one of the layers in the coding syntax defined in ISO/IEC 11172-2.
`
`2.1.72 Hann window [audio]: A time function applied sample-by-sample to a block of audio samples
`before Fourier transformation.
`
`2.1.73 Huffman coding: A specific method for entropy coding.
`
`2.1.74 hybrid filterbank [audio]: A serial combination of subband ftlterbank and MDCT.
`
`2.1.75 IMDCT [audio]: Inverse Modified Discrete Cosine Transform.
`
`2.1.76 intensity stereo [audio]: A method of exploiting stereo irrelevance or redundancy in
`stereophonic audio prograrrunes based on retaining at high frequencies only the energy envelope of the right
`and left channels.
`
`2.1.77 interlace [video]: The property of conventional television pictures where alternating lines of
`the picture represent different instances in time.
`
`2.1.78 intra coding [video]: Coding of a macroblock or picture that uses infonnation only from that
`macroblock or picture.
`
`2.1.79 intra-coded picture; I-picture [video]: A picture coded using information only from itself.
`
`2.1.80 ISO/IEC 11172 (multiplexed) stream [system]: A bitstream composed of zero or more
`elementary streams combined in the manner defined in ISO/IEC 11172-1.
`
`PAGE 11 OF 158
`
`A
`
`V‘|V.M.L. SLU'S EXH|B|'|§1007
`
`PAGE 11 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`[SO/IEC 11172-3: 1993 (E)
`
`-
`
`‘
`
`@130/[EC
`
`2.1.81 joint stereo coding [audio]: Any method that exploits stereophonic irrelevance or
`stereophonic redundancy.
`
`2.1.82 joint stereo mode [audio]: A mode of the-audio coding algorithm using joint stereo coding.
`
`2.1.83 layer [audio]: One of the levels in the coding hierarchy of the audio system defined in this part
`of ISO/IEC 11172.
`
`2.1.84 layer [video and systems]: One of the levels in the data hierarchy of the video and system
`specifications defined in ISO/IEC 11172-1 and ISO/[EC 11172-2.
`
`2.1.85 luminance (component) [video]: A matrix, block or single pel representing a monochrome
`representation of the signal and related to the primary colours in the manner defined in CCIR Rec 601. The
`symbol used for luminance is Y.
`
`2.1.86 macroblock [video]: The four 8 by 8 blocks of luminance data and the two corresponding 8 by
`8 blocks of chrominance data coming from a 16 by 16 section of the luminance component of the picture.
`Macroblock is sometimes used to refer to the pel data and sometimes to the coded representation of the pel
`values and other data elements defined in the macroblock layer of the syntax defined in ISO/IEC 11172-2.
`The usage is clear from the context.
`
`2.1.87 mapping [audio]: Conversion of an audio signal from time to frequency domain by subband
`filtering and/or by MDCT.
`
`2.1.88 masking [audio]: A property of the human auditory system by which an audio signal cannot be
`perceived in the presence of another audio signal .
`
`2.1.89 masking threshold [audio]: A function in frequency and time below which an audio signal
`cannot be perceived by the human auditory system.
`
`2.1.90 MDCT [audio]: Modified Discrete Cosine Transfonn.
`
`2.1.91 motion compensation [video]: The use of motion vectors to improve the efficiency of the
`prediction of pel values. The prediction uses motion vectors to provide offsets into the past and/or future
`reference pictures containing previously decoded pel values that are used to form the prediction error signal.
`
`2.1.92 motion estimation [video]: The process of estimating motion vectors during the encoding
`process.
`
`2.1.93 motion vector [video]: A two-dimensional vector used for motion compensation that provides
`an offset from the coordinate position in the current picture to the coordinates in a reference picture.
`
`2.1.94 MS stereo [audio]: A method of exploiting stereo irrelevance or redundancy in stereophonic
`audio programmes based on coding the sum and difference signal instead of the left and right channels.
`
`2.1.95 non-intra coding [video]: Coding of a macroblock or picture that uses information both from
`itself and from macroblocks zmd pictures occuning at other times.
`
`2.1.96 non-tonal component [audio]: A noise-like component of an audio signal.
`
`2.1.97 Nyquist sampling: Sampling at or above twice the maximum bandwidth of a signal.
`
`2.1.98 pack [system]: A pack consists of a pack header followed by one or more packets.
`in the system coding syntax described in ISO/IEC 11172-1.
`
`It is a layer
`
`2.1.99 packet data [system]: Contiguous bytes of data from an elementary stream present in a packet.
`
`2.1.100 packet header [system]: The data structure used to convey information about the elementary
`stream data contained in the packet data.
`
` 5 PAGE 12 OF 158
`
`ML. SLU'S EXHIBIT 1007
`
`PAGE 12 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`© ISO/IEC -
`
`ISO/[EC 11172-3: 1993 (E)
`
`2.1.101 packet [system]: A packet consists of a header followed by a number of contiguous bytes
`from an elementary data stream. It is a layer in the system coding syntax described in ISO/IEC 11172-1.
`
`2.1.102 padding [audio]:_ A method to adjust the average length in time of an audio frame to the
`duration of the corresponding PCM samples, by conditionally adding a slot to the audio frame.
`
`2.1.103 past reference picture [video]: The past reference picture is the reference picture that occurs
`at an earlier time than the current picture in display order.
`
`2.1.104 pel aspect ratio [video]: The ratio of the nominal vertical height of pel on the display to its
`nominal horizontal width.
`
`2.1.105 pel [video]: Picture element.
`
`2.1.106 picture period [video]: The reciprocal of the picture rate.
`
`2.1.107 picture rate [video]: The nominal rate at which pictures should be output from the decoding
`process.
`
`2.1.108 picture [video]: Source, coded or reconstructed image data. A source or reconstructed picture
`consists of three rectangular matrices of 8-bit numbers representing the luminance and two chroininance
`signals. The Picture layer is one of the layers in the coding syntax defined in ISO/IEC 11172-2. Note that
`the term "picture" is always used in ISO/IEC 11172 in preference to the terms field or frame.
`
`2.1.109 polyphase filterbank [audio]: A set of equal bandwidth filters with special phase
`interrelationships, allowing for an efficient implementation of the filterbank.
`
`2.1.110 prediction [video]: The use of a predictor to provide an estimate of the pel value or data
`element currently being decoded.
`
`2.1.111 predictive-coded picture; P-picture [video]: A picture that is coded using motion
`compensated prediction from the past reference picture.
`
`2.1.112 prediction error [video]: The difference between the actual value of a pet or data element and
`its predictor.
`
`2.1.113 predictor [video]: A linear combination of previously decoded pel values or data elements.
`
`2.1.114 presentation time-stamp; PTS [system]: A field that may be present in a packet header
`that indicates the time that a presentation unit is presented in the system target decoder.
`
`2.1.115 presentation unit; PU [system]: A decoded audio access unit or a decoded picture.
`
`2.1.116 psychoacoustic model [audio]: A mathematical model of the masking behaviour of the
`human auditory system.
`
`2.1.117 quantization matrix [video]: A set of sixty-four 8-bit values used by the dequantizer.
`
`2.1.118 quantized DCT coefficients [video]: DCT coefficients before dequantization. A variable
`length coded representation of quantized DCT coefficients is stored as part of the compressed video
`bitstream.
`
`2.1.119 quantizer scalefactor [video]: A data element represented in the bitstream and used by the
`decoding process to scale the deq uantization.
`
`2.1.120 random access: The process of beginning to read and decode the coded bitstream at an arbitrary
`point.
`
`PAGE13OF158
`
`
`-
`'
`4'
`I.M.L.SLU'S EXH|B|T1007
`
`PAGE 13 OF 158
`
`I.M.L. SLU'S EXHIBIT 1007
`
`
`
`ISO/[EC 11172-3: 1993 (E)
`
`'
`
`© 130,150
`
`2.1.121 reference picture [video]: Reference pictures are the nearest adjacent 1- or P-pictures to the
`current picture in display order.
`
`2.1.122 reorder buffer [video]: A buffer in the system target decoder for storage of a reconstructed 1-
`picture or a reconstructed P-picture.
`
`2.1.123 requantization [audio]: Decoding of coded subband samples in order to recover the original
`quantized values.
`
`2.1.124 reserved: The term "reserved" when used in the clauses defining the coded bitstream indicates
`that the value may be used in the future for ISO/IEC defined extensions.
`
`2.1.125 reverse playback [video]: The process of displaying the picture sequence in the reverse of
`display order.
`
`2.1.126 scalefactor band [audio]: A set of frequency lines in Layer III which are scaled by one
`scalefactor.
`
`2.1.127 scalefactor index [audio]: A numerical code for a scalefactor.
`
`2.1.128 scalefactor [audio]: Factor by which a set of values is scaled before quantization.
`
`2.1.129 sequence header [video]: A block of data in the coded bitstream containing the coded
`representation of a number of data elements.
`
`2.1.130 side information: Information in the bitstream necessary for controlling the decoder.
`
`2.1.131 skipped macroblock [video]: A macroblock for which no data are stored.
`
`2.1.132 slice [video]: A series of macroblocks.
`ISO/[EC 11172-2.
`
`It is one of the layers of the coding syntax defined in
`
`2.1.133 slot [audio]: A slot is an elementary part in the bitstream. In Layer I a slot equals four bytes,
`in Layers I1 and III one byte.
`
`2.1.134 source stream: A single non-multiplexed stream of samples before compression coding.
`
`2.1.135 spreading function [audio]: A fu