INTERNATIONAL STANDARD
`
`ISO/IEC
`11172-3
`
`First edition
`1993-08-01
`
`
`
`
`Information technology — Coding of
`moving pictures and associated audio for
`digital storage media at up to about
`1,5 Mbit/s —
`
`Part 3:
`Audio
`
Technologies de l'information — Codage de l'image animée et du son
associé pour les supports de stockage numérique jusqu'à environ
1,5 Mbit/s —
Partie 3: Audio
`
`
`
`
`
`
`
`Reference number
ISO/IEC 11172-3:1993(E)
`
`PAGE 1 of 158
`
`PETITIONERS' EXHIBIT 1020
`
`
`
`
`
Contents

Introduction

Section 1: General

1.1 Scope
1.2 Normative references

Section 2: Technical elements

2.1 Definitions
2.2 Symbols and abbreviations
2.3 Method of describing bitstream syntax
2.4 Requirements

Annexes

A Diagrams
B Tables
C The encoding process
D Psychoacoustic models
E Bit sensitivity to errors
F Error concealment
G Joint stereo coding
H List of patent holders

© ISO/IEC 1993
All rights reserved. No part of this publication may be reproduced or utilized in any form or by
any means, electronic or mechanical, including photocopying and microfilm, without
permission in writing from the publisher.

ISO/IEC Copyright Office, Case Postale 56, CH-1211 Genève 20, Switzerland

Printed in Switzerland
`
`
`Foreword
`
ISO (the International Organization for Standardization) and IEC (the International
Electrotechnical Commission) form the specialized system for worldwide standardization.
National bodies that are members of ISO or IEC participate in the development of
International Standards through technical committees established by the respective
organization to deal with particular fields of technical activity. ISO and IEC technical
committees collaborate in fields of mutual interest. Other international organizations,
governmental and non-governmental, in liaison with ISO and IEC, also take part in the work.
`
In the field of information technology, ISO and IEC have established a joint
technical committee, ISO/IEC JTC 1. Draft International Standards adopted
by the joint technical committee are circulated to national bodies for voting.
Publication as an International Standard requires approval by at least
75% of the national bodies casting a vote.
`
International Standard ISO/IEC 11172-3 was prepared by Joint Technical
Committee ISO/IEC JTC 1, Information technology, Sub-Committee SC 29,
Coded representation of audio, picture, multimedia and hypermedia information.
`
ISO/IEC 11172 consists of the following parts, under the general title
Information technology — Coding of moving pictures and associated audio
for digital storage media at up to about 1,5 Mbit/s:

— Part 1: Systems

— Part 2: Video

— Part 3: Audio

— Part 4: Compliance testing
`
Annexes A and B form an integral part of this part of ISO/IEC 11172.
Annexes C, D, E, F, G and H are for information only.
`
`
`Introduction
`
`
Note: Readers interested in an overview of MPEG Audio should read this Introduction and then proceed to
annex A (Diagrams) and annex C (The encoding process) before reading the normative clauses 1 and 2.
`
To aid in the understanding of the specification of the stored compressed bitstream and its decoding, a
sequence of encoding, storage and decoding is described.
`
0.1 Encoding
`
`The encoder processes the digital audio signal and produces the compressed bitstream for storage. The
`encoder algorithm is not standardized, and may use various means for encoding such as estimation of the
`auditory masking threshold, quantization, and scaling. However, the encoder output must be such that a
`decoder conforming to the specifications of clause 2.4 will produce audio suitable for the intended
`application.
`
[Figure 1: block diagram of the ISO/IEC 11172-3 encoder. Input: PCM audio samples (32, 44,1 or 48 kHz).
Blocks: mapping; psychoacoustic model; quantizer and coding; frame packing. Additional input: ancillary
data. Output: encoded bitstream.]

Figure 1 -- Sketch of the basic structure of an encoder
`
Figure 1 illustrates the basic structure of an audio encoder. Input audio samples are fed into the encoder. The
mapping creates a filtered and subsampled representation of the input audio stream. The mapped samples
may be called either subband samples (as in Layer I or II, see below) or transformed subband samples (as in
Layer III). A psychoacoustic model creates a set of data to control the quantizer and coding. These data are
different depending on the actual coder implementation. One possibility is to use an estimation of the
masking threshold to do this quantizer control. The quantizer and coding block creates a set of coding
symbols from the mapped input samples. Again, this block can depend on the encoding system. The block
'frame packing' assembles the actual bitstream from the output data of the other blocks, and adds other
information (e.g. error correction) if necessary.
`
There are four different modes possible: single channel, dual channel (two independent audio signals coded
within one bitstream), stereo (left and right signals of a stereo pair coded within one bitstream), and Joint
Stereo (left and right signals of a stereo pair coded within one bitstream with the stereo irrelevancy and
redundancy exploited).
`
`
0.2 Layers
`
`Depending on the application, different layers of the coding system with increasing encoder complexity and
performance can be used. An ISO/IEC 11172-3 Audio Layer N decoder is able to decode bitstream data
`which has been encoded in Layer N and all layers below N.
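
The compatibility rule above can be sketched as a simple check. The function name and the integer encoding of the layers (1 to 3) are assumptions made for illustration; they are not defined by this part of ISO/IEC 11172.

```python
def can_decode(decoder_layer: int, stream_layer: int) -> bool:
    """Return True if a Layer-N decoder can decode a bitstream of the given layer.

    Hypothetical helper: layers are encoded as the integers 1, 2 and 3.
    """
    for layer in (decoder_layer, stream_layer):
        if layer not in (1, 2, 3):
            raise ValueError(f"unknown layer: {layer}")
    # A Layer N decoder handles Layer N and all layers below N.
    return stream_layer <= decoder_layer
```

Under these assumptions a Layer III decoder accepts Layer I, II and III bitstreams, while a Layer I decoder accepts Layer I bitstreams only.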
`
`Layer I
`This layer contains the basic mapping of the digital audio input into 32 subbands, fixed segmentation to
`format the data into blocks, a psychoacoustic model to determine the adaptive bit allocation, and
`quantization using block companding and formatting. The theoretical minimum encoding/decoding delay for
`Layer I is about 19 ms.
`
Layer II
This layer provides additional coding of bit allocation, scalefactors and samples. Different framing is used.
The theoretical minimum encoding/decoding delay for Layer II is about 35 ms.
`
`Layer III
`This layer introduces increased frequency resolution based on a hybrid filterbank. It adds a different
(nonuniform) quantizer, adaptive segmentation and entropy coding of the quantized values. The theoretical
`minimum encoding/decoding delay for Layer III is about 59 ms.
`
`Joint Stereo coding can be added as an additional feature to any of the layers.
`
0.3 Storage
`
`Various streams of encoded video, encoded audio, synchronization data, systems data and auxiliary data may
`be stored together on a storage medium. Editing of the audio will be easier if the edit point is constrained to
`coincide with an addressable point.
`
`Access to storage may involve remote access over a communication system. Access is assumed to be
controlled by a functional unit other than the audio decoder itself. This control unit accepts user commands,
`reads and interprets data base structure information, reads the stored information from the media,
`demultiplexes non-audio information and passes the stored audio bitstream to the audio decoder at the
`required rate.
`
0.4 Decoding
`
The decoder accepts the compressed audio bitstream in the syntax defined in 2.4.1, decodes the data elements
according to 2.4.2, and uses the information to produce digital audio output according to 2.4.3.
`
[Figure 2: block diagram of the ISO/IEC 11172-3 decoder. Input: encoded bitstream. Blocks: frame
unpacking; reconstruction; inverse mapping. Outputs: PCM audio samples (32, 44,1 or 48 kHz) and
ancillary data.]

Figure 2 -- Sketch of the basic structure of a decoder
`
Figure 2 illustrates the basic structure of an audio decoder. Bitstream data is fed into the decoder. The
bitstream unpacking and decoding block does error detection if error-check is applied in the encoder (see
2.4.2.4). The bitstream data are unpacked to recover the various pieces of information. The
reconstruction block reconstructs the quantized version of the set of mapped samples. The inverse
mapping transforms these mapped samples back into uniform PCM.
`
`
`
`
`
`
INTERNATIONAL STANDARD ISO/IEC 11172-3:1993(E)
`
`Information technology — Coding of moving
`pictures and associated audio for digital storage
`media at up to about 1,5 Mbit/s —
`
`Part 3:
`Audio
`
`Section 1: General
`
1.1 Scope
`
This part of ISO/IEC 11172 specifies the coded representation of high quality audio for storage media and
`the method for decoding of high quality audio signals. The input of the encoder and the output of the decoder
`are compatible with existing PCM standards such as standard Compact Disc and Digital Audio Tape.
`
This part of ISO/IEC 11172 is intended for application to digital storage media providing a total
continuous transfer rate of about 1,5 Mbit/s for both audio and video bitstreams, such as CD, DAT and
magnetic hard disc. The storage media may either be connected directly to the decoder, or via other means
such as communication lines and the ISO/IEC 11172 multiplexed stream defined in ISO/IEC 11172-1.
This part of ISO/IEC 11172 is intended for sampling rates of 32 kHz, 44,1 kHz, and 48 kHz.
`
`1.2 Normative references
`
The following International Standards contain provisions which, through reference in this text, constitute
provisions of this part of ISO/IEC 11172. At the time of publication, the editions indicated were valid.
All standards are subject to revision, and parties to agreements based on this part of ISO/IEC 11172 are
encouraged to investigate the possibility of applying the most recent editions of the standards indicated
below. Members of IEC and ISO maintain registers of currently valid International Standards.
`
ISO/IEC 11172-1:1993 Information technology - Coding of moving pictures and associated audio for digital
storage media at up to about 1,5 Mbit/s - Part 1: Systems.

ISO/IEC 11172-2:1993 Information technology - Coding of moving pictures and associated audio for digital
storage media at up to about 1,5 Mbit/s - Part 2: Video.

CCIR Recommendation 601-2 Encoding parameters of digital television for studios.

CCIR Report 624-4 Characteristics of systems for monochrome and colour television.

CCIR Recommendation 648 Recording of audio signals.

CCIR Report 955-2 Sound broadcasting by satellite for portable and mobile receivers, including Annex IV
Summary description of Advanced Digital System II.

CCITT Recommendation J.17 Pre-emphasis used on Sound-Programme Circuits.

IEEE Draft Standard P1180/D2 1990 Specification for the implementation of 8 x 8 inverse discrete cosine
transform.

IEC publication 908:1987 CD Digital Audio System.
`
`
`Section 2: Technical elements
`
`2.1 Definitions
`
For the purposes of ISO/IEC 11172, the following definitions apply. If specific to a part, this is noted in
square brackets.
`
2.1.1 ac coefficient [video]: Any DCT coefficient for which the frequency in one or both dimensions
is non-zero.
`
2.1.2 access unit [system]: In the case of compressed audio an access unit is an audio access unit. In
the case of compressed video an access unit is the coded representation of a picture.
`
`2.1.3 adaptive segmentation [audio]: A subdivision of the digital representation of an audio signal
`in variable segments of time.
`
2.1.4 adaptive bit allocation [audio]: The assignment of bits to subbands in a time and frequency
varying fashion according to a psychoacoustic model.
`
`2.1.5 adaptive noise allocation [audio]: The assignment of coding noise to frequency bands in a
`time and frequency varying fashion according to a psychoacoustic model.
`
`2.1.6 alias [audio]: Mirrored signal component resulting from sub-Nyquist sampling.
`
2.1.7 analysis filterbank [audio]: Filterbank in the encoder that transforms a broadband PCM audio
`signal into a set of subsampled subband samples.
`
2.1.8 audio access unit [audio]: For Layers I and II an audio access unit is defined as the smallest
part of the encoded bitstream which can be decoded by itself, where decoded means "fully reconstructed
sound". For Layer III an audio access unit is part of the bitstream that is decodable with the use of
previously acquired main information.
`
2.1.9 audio buffer [audio]: A buffer in the system target decoder for storage of compressed audio data.
`
2.1.10 audio sequence [audio]: A non-interrupted series of audio frames in which the following
parameters are not changed:
- ID
- Layer
- Sampling Frequency
- For Layer I and II: Bitrate index
`
`2.1.11 backward motion vector [video]: A motion vector that is used for motion compensation
`from a reference picture at a later time in display order.
`
2.1.12 Bark [audio]: Unit of critical band rate. The Bark scale is a non-linear mapping of the frequency
scale over the audio range closely corresponding with the frequency selectivity of the human ear across the
band.
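
As an illustration of such a non-linear mapping, the sketch below uses a widely quoted closed-form approximation of the Bark scale (attributed to Zwicker and Terhardt). This approximation is an assumption brought in for illustration; it is not the mapping tabulated in this part of ISO/IEC 11172, and the function name is hypothetical.

```python
import math


def hz_to_bark(freq_hz: float) -> float:
    """Approximate critical band rate in Bark for a frequency in Hz.

    Uses a common closed-form approximation, not the tables of the standard.
    """
    if freq_hz < 0:
        raise ValueError("frequency must be non-negative")
    return (13.0 * math.atan(0.00076 * freq_hz)
            + 3.5 * math.atan((freq_hz / 7500.0) ** 2))
```

With this approximation, a 1 kHz step near the bottom of the audio range spans several Bark, while the same step near 10 kHz spans well under one Bark, which reflects the finer frequency selectivity of the ear at low frequencies.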
`
`2.1.13 bidirectionally predictive-coded picture; B-picture [video]: A picture that is coded
`using motion compensated prediction from a past and/or future reference picture.
`
`2.1.14 bitrate: The rate at which the compressed bitstream is delivered from the storage medium to the
`input of a decoder.
`
`2.1.15 block companding [audio]: Normalizing of the digital representation of an audio signal
`within a certain time period.
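
A minimal sketch of the idea, assuming the block is normalized by its largest absolute sample value. The function names and the choice of the peak value as the scale are illustrative assumptions; the scalefactor tables actually used by the layers are not reproduced here.

```python
def compress_block(block):
    """Normalize a block of samples by its peak magnitude.

    Returns (scale, normalized_samples). Hypothetical helper for illustration.
    """
    scale = max((abs(x) for x in block), default=0.0)
    if scale == 0.0:
        # An all-zero (or empty) block needs no scaling.
        return 0.0, [0.0 for _ in block]
    return scale, [x / scale for x in block]


def expand_block(scale, normalized):
    """Undo compress_block by multiplying every sample by the stored scale."""
    return [x * scale for x in normalized]
```

After normalization every sample lies in the range -1 to 1, so a fixed quantizer can be applied to each block and the scale transmitted once per block.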
`
2.1.16 block [video]: An 8-row by 8-column orthogonal block of pels.
`
`2.1.17 bound [audio]: The lowest subband in which intensity stereo coding is used.
`
`
`2.1.18 byte aligned: A bit in a coded bitstream is byte-aligned if its position is a multiple of 8-bits
`from the first bit in the stream.
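
The definition translates directly into a check on the bit position. The helper names below are hypothetical, and bit positions are assumed to be counted from 0 at the first bit of the stream.

```python
def is_byte_aligned(bit_position: int) -> bool:
    """True if the bit at this position (0 = first bit of the stream) is byte-aligned."""
    if bit_position < 0:
        raise ValueError("bit position must be non-negative")
    return bit_position % 8 == 0


def bits_to_next_byte_boundary(bit_position: int) -> int:
    """Number of bits to skip to reach the next byte-aligned position (0 if already aligned)."""
    if bit_position < 0:
        raise ValueError("bit position must be non-negative")
    return (-bit_position) % 8
```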
`
`2.1.19 byte: Sequence of 8-bits.
`
2.1.20 channel: A digital medium that stores or transports an ISO/IEC 11172 stream.

2.1.21 channel [audio]: The left and right channels of a stereo signal.
`
`2.1.22 chrominance (component) [video]: A matrix, block or single pel representing one of the
`two colour difference signals related to the primary colours in the manner defined in CCIR Rec 601. The
`symbols used for the colour difference signals are Cr and Cb.
`
`2.1.23 coded audio bitstream [audio]: A coded representation of an audio signal as specified in this
part of ISO/IEC 11172.
`
`2.1.24 coded video bitstream [video]: A coded representation of a series of one or more pictures as
`specified in ISO/IEC 11172-2.
`
`2.1.25 coded order [video]: The order in which the pictures are stored and decoded. This order is not
necessarily the same as the display order.
`
`2.1.26 coded representation: A data element as represented in its encoded form.
`
`2.1.27 coding parameters [video]: The set of user-definable parameters that characterize a coded video
bitstream. Bitstreams are characterised by coding parameters. Decoders are characterised by the bitstreams
`that they are capable of decoding.
`
2.1.28 component [video]: A matrix, block or single pel from one of the three matrices (luminance
and two chrominance) that make up a picture.

2.1.29 compression: Reduction in the number of bits used to represent an item of data.
`
`2.1.30 constant bitrate coded video [video]: A compressed video bitstream with a constant
`average bitrate.
`
2.1.31 constant bitrate: Operation where the bitrate is constant from start to finish of the compressed
bitstream.

2.1.32 constrained parameters [video]: The values of the set of coding parameters defined in
2.4.3.2 of ISO/IEC 11172-2.
`
2.1.33 constrained system parameter stream (CSPS) [system]: An ISO/IEC 11172
`multiplexed stream for which the constraints defined in 2.4.6 of ISO/IEC 11172-1 apply.
`
`2.1.34 CRC: Cyclic redundancy code.
`
2.1.35 critical band rate [audio]: Psychoacoustic function of frequency. At a given audible
frequency it is proportional to the number of critical bands below that frequency. The units of the critical
band rate scale are Barks.

2.1.36 critical band [audio]: Psychoacoustic measure in the spectral domain which corresponds to the
frequency selectivity of the human ear. This selectivity is expressed in Bark.
`
2.1.37 data element: An item of data as represented before encoding and after decoding.

2.1.38 dc-coefficient [video]: The DCT coefficient for which the frequency is zero in both
dimensions.
`
`
2.1.39 dc-coded picture; D-picture [video]: A picture that is coded using only information from
`itself. Of the DCT coefficients in the coded representation, only the dc-coefficients are present.
`
`2.1.40 DCT coefficient: The amplitude of a specific cosine basis function.
`
`2.1.41 decoded stream: The decoded reconstruction of a compressed bitstream.
`
2.1.42 decoder input buffer [video]: The first-in first-out (FIFO) buffer specified in the video
buffering verifier.
`
2.1.43 decoder input rate [video]: The data rate specified in the video buffering verifier and encoded
`in the coded video bitstream.
`
`2.1.44 decoder: An embodiment of a decoding process.
`
2.1.45 decoding (process): The process defined in ISO/IEC 11172 that reads an input coded bitstream
`and produces decoded pictures or audio samples.
`
`2.1.46 decoding time-stamp; DTS [system]: A field that may be present in a packet header that
`indicates the time that an access unit is decoded in the system target decoder.
`
`2.1.47 de-emphasis [audio]: Filtering applied to an audio signal after storage or transmission to undo
`a linear distortion due to emphasis.
`
`2.1.48 dequantization [video]: The process of rescaling the quantized DCT coefficients after their
`representation in the bitstream has been decoded and before they are presented to the inverse DCT.
`
`2.1.49 digital storage media; DSM: A digital storage or transmission device or system.
`
`2.1.50 discrete cosine transform; DCT [video]: Either the forward discrete cosine transform or the
inverse discrete cosine transform. The DCT is an invertible, discrete orthogonal transformation. The
`inverse DCT is defined in annex A of ISO/IEC 11172-2.
`
`2.1.51 display order [video]: The order in which the decoded pictures should be displayed. Normally
`this is the same order in which they were presented at the input of the encoder.
`
2.1.52 dual channel mode [audio]: A mode, where two audio channels with independent programme
contents (e.g. bilingual) are encoded within one bitstream. The coding process is the same as for the stereo
mode.
`
2.1.53 editing: The process by which one or more compressed bitstreams are manipulated to produce a
new compressed bitstream. Conforming edited bitstreams must meet the requirements defined in
ISO/IEC 11172.
`
`2.1.54 elementary stream [system]: A generic term for one of the coded video, coded audio or other
`coded bitstreams.
`
2.1.55 emphasis [audio]: Filtering applied to an audio signal before storage or transmission to
`improve the signal-to-noise ratio at high frequencies.
`
`2.1.56 encoder: An embodiment of an encoding process.
`
`2.1.57 encoding (process): A process, not specified in ISO/IEC 11172, that reads a stream of input
pictures or audio samples and produces a valid coded bitstream as defined in ISO/IEC 11172.
`
`2.1.58 entropy coding: Variable length lossless coding of the digital representation of a signal to
`reduce redundancy.
`
`2.1.59 fast forward playback [video]: The process of displaying a sequence, or parts of a sequence,
`of pictures in display-order faster than real-time.
`
`
2.1.60 FFT: Fast Fourier Transformation. A fast algorithm for performing a discrete Fourier transform
(an orthogonal transform).
`
`2.1.61 filterbank [audio]: A set of band-pass filters covering the entire audio frequency range.
`
`2.1.62 fixed segmentation [audio]: A subdivision of the digital representation of an audio signal
`into fixed segments of time.
`
2.1.63 forbidden: The term "forbidden" when used in the clauses defining the coded bitstream indicates
that the value shall never be used. This is usually to avoid emulation of start codes.
`
`2.1.64 forced updating [video]: The process by which macroblocks are intra-coded from time-to-time
`to ensure that mismatch errors between the inverse DCT processes in encoders and decoders cannot build up
`excessively.
`
`2.1.65 forward motion vector [video]: A motion vector that is used for motion compensation from
a reference picture at an earlier time in display order.
`
`2.1.66 frame [audio]: A part of the audio signal that corresponds to audio PCM samples from an
`Audio Access Unit.
`
`2.1.67 free format [audio]: Any bitrate other than the defined bitrates that is less than the maximum
`valid bitrate for each layer.
`
`2.1.68 future reference picture [video]: The future reference picture is the reference picture that
`occurs at a later time than the current picture in display order.
`
2.1.69 granules [Layer II] [audio]: The set of 3 consecutive subband samples from all 32 subbands
`that are considered together before quantization. They correspond to 96 PCM samples.
`
2.1.70 granules [Layer III] [audio]: 576 frequency lines that carry their own side information.
`
`2.1.71 group of pictures [video]: A series of one or more coded pictures intended to assist random
access. The group of pictures is one of the layers in the coding syntax defined in ISO/IEC 11172-2.
`
`2.1.72 Hann window [audio]: A time function applied sample-by-sample to a block of audio samples
`before Fourier transformation.
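
As an illustration, the sketch below builds the textbook (periodic) Hann window and multiplies it sample-by-sample with a block. Any extra scaling factor that a particular psychoacoustic model applies is not shown, and the function names are hypothetical.

```python
import math


def hann_window(n_samples: int):
    """Return the periodic Hann window w[i] = 0.5 * (1 - cos(2*pi*i / N))."""
    if n_samples <= 0:
        raise ValueError("window length must be positive")
    return [0.5 * (1.0 - math.cos(2.0 * math.pi * i / n_samples))
            for i in range(n_samples)]


def apply_window(block):
    """Multiply each sample of the block by the matching window coefficient."""
    window = hann_window(len(block))
    return [sample * coeff for sample, coeff in zip(block, window)]
```

The window tapers the block towards zero at its edges, which reduces spectral leakage in the Fourier transform that follows.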
`
`2.1.73 Huffman coding: A specific method for entropy coding.
`
`2.1.74 hybrid filterbank [audio]: A serial combination of subband filterbank and MDCT.
`
2.1.75 IMDCT [audio]: Inverse Modified Discrete Cosine Transform.

2.1.76 intensity stereo [audio]: A method of exploiting stereo irrelevance or redundancy in
stereophonic audio programmes based on retaining at high frequencies only the energy envelope of the right
and left channels.
`
`2.1.77 interlace [video]: The property of conventional television pictures where alternating lines of
`the picture represent different instances in time.
`
2.1.78 intra coding [video]: Coding of a macroblock or picture that uses information only from that
`macroblock or picture.
`
`2.1.79 intra-coded picture; I-picture [video]: A picture coded using information only from itself.
`
2.1.80 ISO/IEC 11172 (multiplexed) stream [system]: A bitstream composed of zero or more
`elementary streams combined in the manner defined in ISO/IEC 11172-1.
`
`
2.1.81 joint stereo coding [audio]: Any method that exploits stereophonic irrelevance or
stereophonic redundancy.

2.1.82 joint stereo mode [audio]: A mode of the audio coding algorithm using joint stereo coding.

2.1.83 layer [audio]: One of the levels in the coding hierarchy of the audio system defined in this part
of ISO/IEC 11172.

2.1.84 layer [video and systems]: One of the levels in the data hierarchy of the video and system
specifications defined in ISO/IEC 11172-1 and ISO/IEC 11172-2.
`
2.1.85 luminance (component) [video]: A matrix, block or single pel representing a monochrome
`representation of the signal and related to the primary colours in the manner defined in CCIR Rec 601. The
`symbol used for luminance is Y.
`
2.1.86 macroblock [video]: The four 8 by 8 blocks of luminance data and the two corresponding 8 by
8 blocks of chrominance data coming from a 16 by 16 section of the luminance component of the picture.
`Macroblock is sometimes used to refer to the pel data and sometimes to the coded representation of the pel
`values and other data elements defined in the macroblock layer of the syntax defined in ISO/IEC 11172-2.
`The usage is clear from the context.
`
2.1.87 mapping [audio]: Conversion of an audio signal from time to frequency domain by subband
`filtering and/or by MDCT.
`
2.1.88 masking [audio]: A property of the human auditory system by which an audio signal cannot be
perceived in the presence of another audio signal.

2.1.89 masking threshold [audio]: A function in frequency and time below which an audio signal
cannot be perceived by the human auditory system.
`
2.1.90 MDCT [audio]: Modified Discrete Cosine Transform.
`
`2.1.91 motion compensation [video]: The use of motion vectors to improve the efficiency of the
`prediction of pel values. The prediction uses motion vectors to provide offsets into the past and/or future
reference pictures containing previously decoded pel values that are used to form the prediction error signal.
`
`2.1.92 motion estimation [video]: The process of estimating motion vectors during the encoding
`process.
`
`2.1.93 motion vector [video]: A two-dimensional vector used for motion compensation that provides
`an offset from the coordinate position in the current picture to the coordinates in a reference picture.
`
2.1.94 MS stereo [audio]: A method of exploiting stereo irrelevance or redundancy in stereophonic
`audio programmes based on coding the sum and difference signal instead of the left and right channels.
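
The sum and difference idea can be sketched as follows. The scaling by one half is an assumption chosen so that the round trip is exact in this example; it is not taken from the standard, and the function names are hypothetical.

```python
def ms_encode(left, right):
    """Convert left/right sample lists to mid (sum) and side (difference) signals."""
    if len(left) != len(right):
        raise ValueError("channels must have the same length")
    mid = [(l + r) / 2.0 for l, r in zip(left, right)]
    side = [(l - r) / 2.0 for l, r in zip(left, right)]
    return mid, side


def ms_decode(mid, side):
    """Recover left/right sample lists from mid and side signals."""
    if len(mid) != len(side):
        raise ValueError("signals must have the same length")
    left = [m + s for m, s in zip(mid, side)]
    right = [m - s for m, s in zip(mid, side)]
    return left, right
```

When the two channels are similar, the side signal is close to zero and can be coded with few bits, which is where the redundancy saving comes from.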
`
2.1.95 non-intra coding [video]: Coding of a macroblock or picture that uses information both from
`itself and from macroblocks and pictures occurring at other times.
`
2.1.96 non-tonal component [audio]: A noise-like component of an audio signal.
`
`2.1.97 Nyquist sampling: Sampling at or above twice the maximum bandwidth of a signal.
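
A short sketch of this criterion, applied to the three sampling rates named in 1.1. The helper names are hypothetical.

```python
def satisfies_nyquist(sampling_rate_hz: float, bandwidth_hz: float) -> bool:
    """True if the sampling rate is at or above twice the signal bandwidth."""
    return sampling_rate_hz >= 2.0 * bandwidth_hz


def max_bandwidth_hz(sampling_rate_hz: float) -> float:
    """Largest signal bandwidth that can be sampled at this rate without aliasing."""
    return sampling_rate_hz / 2.0


# Sampling rates listed in the Scope clause of this part of ISO/IEC 11172.
SAMPLING_RATES_HZ = (32000, 44100, 48000)
BANDWIDTH_LIMITS_HZ = [max_bandwidth_hz(rate) for rate in SAMPLING_RATES_HZ]
```

A 20 kHz signal therefore satisfies the criterion at 44,1 kHz and 48 kHz, but not at 32 kHz, where the bandwidth limit is 16 kHz.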
`
2.1.98 pack [system]: A pack consists of a pack header followed by one or more packets. It is a layer
in the system coding syntax described in ISO/IEC 11172-1.

2.1.99 packet data [system]: Contiguous bytes of data from an elementary stream present in a packet.
`
`2.1.100 packet header [system]: The data structure used to convey information about the elementary
`stream data contained in the packet data.
`
`
2.1.101 packet [system]: A packet consists of a header followed by a number of contiguous bytes
`from an elementary data stream. It is a layer in the system coding syntax described in ISO/IEC 11172-1.
`
`2.1.102 padding [audio]: A method to adjust the average length in time of an audio frame to the
`duration of the corresponding PCM samples, by conditionally adding a slot to the audio frame.
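
The effect of padding can be illustrated with a sketch. It assumes a Layer II or Layer III frame of 1152 PCM samples and a slot of one byte; the function name, its parameters and the accumulator scheme are hypothetical choices for illustration and do not reproduce the procedure text of clause 2.4.

```python
def padding_pattern(bitrate_bps, sampling_rate_hz, n_frames,
                    samples_per_frame=1152, slot_bits=8):
    """Return (base_slots, flags) where flags[i] is 1 if frame i gets an extra slot.

    The exact average frame length in slots is
    samples_per_frame * bitrate / (slot_bits * sampling_rate), kept as a
    fraction so that no rounding error accumulates.
    """
    numerator = samples_per_frame * bitrate_bps
    denominator = slot_bits * sampling_rate_hz
    base_slots, remainder = divmod(numerator, denominator)
    flags = []
    accumulated = 0
    for _ in range(n_frames):
        accumulated += remainder
        if accumulated >= denominator:
            # Enough fractional slots have built up: add one padding slot.
            accumulated -= denominator
            flags.append(1)
        else:
            flags.append(0)
    return base_slots, flags
```

Under these assumptions, at 128 kbit/s and 44,1 kHz the average frame length is not a whole number of slots, so some frames carry 417 slots and others 418; at 48 kHz the frame length is exactly 384 slots and no padding is needed.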
`
`2.1.103 past reference picture [video]: The past reference picture is the reference picture that occurs
at an earlier time than the current picture in display order.
`
`2.1.104 pel aspect ratio [video]: The ratio of the nominal vertical height of pel on the display to its
`nominal horizontal width.
`
`2.1.105 pel [video]: Picture element.
`
`2.1.106 picture period [video]: The reciprocal of the picture rate.
`
2.1.107 picture rate [video]: The nominal rate at which pictures should be output from the decoding
`process.
`
2.1.108 picture [video]: Source, coded or reconstructed image data. A source or reconstructed picture
consists of three rectangular matrices of 8-bit numbers representing the luminance and two chrominance
signals. The Picture layer is one of the layers in the coding syntax defined in ISO/IEC 11172-2. Note that
the term "picture" is always used in ISO/IEC 11172 in preference to the terms field or frame.
`
`2.1.109 polyphase filterbank [audio]: A set of equal bandwidth filters with special phase
`interrelationships, allowing for an efficient implementation of the filterbank.
`
`2.1.110 prediction [video]: The use of a predictor to provide an estimate of the pel value or data
`element currently being decoded.
`
`2.1.111 predictive-coded picture; P-picture [video]: A picture that is coded using motion
`compensated prediction from the past reference picture.
`
`2.1.112 prediction error [video]: The difference between the actual value of a pel or data element and
`its predictor.
`
`2.1.113 predictor [video]: A linear combination of previously decoded pel values or data elements.
`
`2.1.114 presentation time-stamp; PTS [system]: A field that may be present in a packet header
`that indicates the time that a presentation unit is presented in the system target decoder.
`
`2.1.115 presentation unit; PU [system]: A decoded audio access unit or a decoded picture.
`
`2.1.116 psychoacoustic model [audio]: A mathematical model of the masking behaviour of the
`human auditory system.
`
2.1.117 quantization matrix [video]: A set of sixty-four 8-bit values used by the dequantizer.
`
`2.1.118 quantized DCT coefficients [video]: DCT coefficients before dequantization. A variable
`length coded representation of quantized DCT coefficients is stored as part of the compressed video
`bitstream.
`
`2.1.119 quantizer scalefactor [video]: A data element represented in the bitstream and used by the
`decoding process to scale the dequantization.
`
`2.1.120 random access: The process of beginning to read and decode the coded bitstream at an arbitrary
`point.
`
`
2.1.121 reference picture [video]: Reference pictures are the nearest adjacent I- or P-pictures to the
`current picture in display order.
`
`2.1.122 reorder buffer [video]: A buffer in the system target decoder for storage of a reconstructed I-
`picture or a reconstructed P-picture.
`
`2.1.123 requantization [audio]: Decoding of coded subband samples in order to recover the original
`quantized values.
`
2.1.124 reserved: The term "reserved" when used in the clauses defining the coded bitstream indicates
that the value may be used in the future for ISO/IEC defined extensions.

2.1.125 reverse playback [video]: The process of displaying the picture sequence in the reverse of
display order.

2.1.126 scalefactor band [audio]: A set of frequency lines in Layer III which are scaled by one
scalefactor.

2.1.127 scalefactor index [audio]: A numerical code for a scalefactor.

2.1.128 scalefactor [audio]: Factor by which a set of values is scaled before quantization.
`
`2.1.129 sequence header [video]: A block of data in the coded bitstream containing the coded
`representation of a number of data elements.
`
2.1.130 side information: Information in the bitstream necessary for controlling the decoder.
`
`2.1.131 skipped macroblock [video]: A macroblock for which no data are stored.
`
2.1.132 slice [video]: A series of macroblocks. It is one of the layers of the coding syntax defined in
ISO/IEC 11172-2.

2.1.133 slot [audio]: A slot is an elementary part in the bitstream. In Layer I a slot equals four bytes,
in Layers II and III one byte.
`
`2.1.134 source stream: A single non-multiplexed stream of samples before compression coding.
`
`2.1.135 spreading function [audio]: A function that describes the frequency spread of masking.
`
2.1.136 start codes [system and video]: 32-bit codes embedded in the coded bitstream that are
`unique. They are used for several purposes including identifying some of the layers in the coding syntax.
`
`2.1.137 STD inp