`Released by : ETSI/PT 12
`Release date: February 1992
`
`RELEASE NOTE
`
`Recommendation GSM 06.10
`
`GSM Full Rate Speech Transcoding
`
`New Released version February 92 : 3.2.
`
`Previously distributed version
`
`: 3 2.
`
`0 (Release 1/90)
`0
`(Release 92, Phase 1)
`
`1. Reason for changes
`
`No changes since the previously distributed version.
`
`Google Ex. 1217, pg. 1
`
`Google Ex. 1217, pg. 1
`
`
`
`Google Ex. 1217, pg. 2
`
`Google Ex. 1217, pg. 2
`
`
`
`ETSI/GSM
`
`Version 3.2.0
`
`GSM recommendation: 06.10
`
`Title: GSM full rate speech transcoding
`
`
`Date: February 1992
`
`List of contents: 1. General
`
`2. Transmission characteristics
`
`3. Functional description of the RPE-LTP codec
`
`4. Computational details of the RPE-LTP codec
`
`5. Digital test sequences
`
`Annex 1. Codec performance
`
`Annex 2. Subjective relevance of the speech
`coder output bits
`
`Annex 3. Format for test sequence distribution
`
`NOTE: This Recommendation is a reproduction of recommendation
`T/L/O3/1l "13 kbit/s Regular Pulse Excitation - Long Term
`Prediction - Linear Predictive Coder for use in the
`Pan-European Digital Mobile Radio System".
`
`Floppy disks containing the digital test sequences described
`in section 5 can be distributed by ETSI Secretariat on request.
`
`Google Ex. 1217, pg. 3
`
`Google Ex. 1217, pg. 3
`
`
`
`ETSI/GSM
`
`Version 3.2.0
`
`The contact information of the ETSI secretariat is:
`
`ETSI
`
`B.P. 152
`
`F 06561 Valbonne Cedex
`
`France
`
`Tel
`
`+33 92 94 42 00
`
`Fax
`
`+33 93 65 47 16
`
`Language of original: English
`
`Number of pages: 93
`
`Google Ex. 1217, pg. 4
`
`Google Ex. 1217, pg. 4
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 3
`
`Version 3.2.0
`
`Detailed list of contents
`************************~k
`
`1- GENERAL
`
`SCOPE
`1.1
`1.2 OUTLINE DESCRIPTION
`1.3. FUNCTIONAL DESCRIPTION OF AUDIO PARTS
`1.4. PCM FORMAT CONVERSION
`1.5
`PRINCIPLES OF THE RPE-LTP ENCODER
`1.6
`PRINCIPLES OF THE RPE-LTP DECODER
`1.7
`SEQUENCE AND SUBJECTIVE IMPORTANCE OF ENCODED PARAMETERS
`
`2- IBAEEMISSIQN_QEABA§I§BISIIQS
`
`2.1. PERFORMANCE CHARACTERISTICS OF THE ANALOGUE/DIGITAL
`INTERFACES
`2.2. TRANSCODER DELAY
`
`3. EQEQIIQNAL DESCRIEIIQE QE THE RPE-LIE QQDEQ
`
`3.1. FUNCTIONAL DESCRIPTION OF THE RPE-LTP ENCODER
`
`3.1.1. Offset compensation
`3.1.2. Preemphasis
`3.1.3. Segmentation
`3.1.4. Autocorrelation
`3.1.5. Schur Recursion
`3.1.6. Transformation of reflection coefficients to Log.—Area
`Ratios
`3.1.7. Quantization and coding of Log.-Area Ratios
`3.1.8 Decoding of the quantized Log.-Area Ratios
`3.1.9. Interpolation of Log.—Area Ratios
`3.1.10. Tranformation of Log.—Area Ratios into reflection
`coefficients
`3.1 11. short term Analysis Filtering
`3.1.12. Sub—segmentation
`3.1.13. Calculation of the LTP parameters
`3.1.14. Coding/Decoding of the LTP lags
`3.1.15. Coding/Decoding of the LTP gains
`3.1.16. Long term analysis filtering
`3.1.17. Long term synthesis filtering
`3.1.18. Weighting Filter
`3.1.19. Adaptive sample rate decimation by RPE grid selection
`3.1.20. APCM quantization of the selected RPE sequence
`3.1 21. APCM inverse quantization
`3.1.22. RPE grid positioning
`
`Google Ex. 1217, pg. 5
`
`Google Ex. 1217, pg. 5
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 4
`
`Version 3.2.0
`
`.2. DECODER
`
`.2.1. RPE decoding section
`.2.2
`Long Term Prediction section
`.2.3. Short
`term synthesis filtering section
`.2.4 Postprocessing
`
`.
`
`TAIL
`
`F
`
`RP -
`
`DE
`
`.1. DATA REPRESENTATION AND ARITHMETIC OPERATIONS
`2. FIXED POINT IMPLEMENTATION OF THE RPE-LTP CODER
`
`.0 Scaling of the input variable
`1. Downscaling of the input signal
`2. Offset compensation
`.3 Preemphasis
`.4 Autocorrelation
`.5. Computation of the reflection coefficients
`.6
`Transformation of reflection coefficients to Log. —Area
`Ratios
`.7
`Quantization and coding of the Log. —Area Ratios
`. Decoding of the coded Log. —Area Ratios
`.
`Computation of the quantized reflection coefficients
`
`8 9
`
`to get the LARp[l. .8]
`.2.9. l. Interpolation of the LARpp[1. .8]
`2.9. 2. Computation of the rp[1. .8]
`from the interpolated
`LARp[1. .8]
`
`term analysis filtering
`.10. Short
`.11. Calculation of the LTP parameters
`.12. Long term analysis filtering
`.13. Weighting filter
`.14. RPE grid selection
`.15. APCM quantization of the selected RPE sequence
`.16. APCM inverse quantization
`.17 RPE grid positioning
`18. Update of the reconstructed short term residual signal
`dp[——120. .—l]
`
`. FIXED POINT IMPLEMENTATION OF THE RPE-LTP DECODER
`
`#ibhpthnbbbnb
`
`NNbMNNNNN
`
`4 3
`
`4 3 1. RPE decoding section
`4 3 2. Long term synthesis filtering
`4.3 3. Computation of the decoded reflection coefficients
`4.3.4. Short
`term synthesis filtering section
`4 3 5. Deemphasis filtering
`4 3 6. Upscaling of the output signal
`4 3 7. Truncation of the output variable
`
`4 .4. TABLES USED IN THE FIXED POINT IMPLEMENTATION OF THE RPE-LTP
`CODER AND DECODER
`
`Google Ex. 1217, pg. 6
`
`Google Ex. 1217, pg. 6
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 5
`
`Version 3.2.0
`
`- DIQLIAL_IE§E_5§QEEHQ§S
`
`INPUT AND OUTPUT SIGNALS
`.1.
`.2. CONFIGURATION FOR THE APPLICATION OF THE TEST SEQUENCES
`
`
`
`LnU1U'IU1U1U1 .3. TEST SEQUENCES
`
`.2.l. Configuration 1
`.2.2. Configuration 2
`
`(encoder only)
`(Decoder only)
`
`5.3.1. Test sequences for configuration 1
`5.3.2. Test sequences for configuration 2
`
`ANNEX 1- CQDEQ_EEBEQBMAHQE
`
`INTRODUCTION‘
`Al.l.
`Al.2. SPEECH PERFORMANCE
`
`Al.2.l. Single encoding
`Al.2.2. Speech performance when interconnected with coding systems
`on an analogue basis
`
`D’W‘W
`
`1.2.2.1. Performance with 32 kbit/s ADPCM (G.721)
`1.2.2.2. Performance with another RPE-LTP codec
`1.2.2.3. Performance with encoding other than RPE-LTP and 32
`kbit/s ADPCM (G.721)
`
`A1.3. NON-SPEECH PERFORMANCE
`
`A.l.3.1. Performance with single sine waves
`Al.3.2. Performance with DTMF tones
`A1.3.3. Performance with information tones
`Al.3.4. Performance with voice—band data
`
`Al.4. DELAY
`Al.5. REFERENCES
`
`ANNEX 2.
`
`B
`
`TIVE REL V
`
`E
`
`F
`
`HE
`
`EE H
`
`TP
`
`BIT
`
`ANNEX 3. EQRMAT FQR IESI SEQQEEQE DISTRIBQTIQE
`
`A3.1. TYPE OF FILES PROVIDED
`A3.2. FILE FORMAT DESCRIPTION
`
`Google Ex. 1217, pg. 7
`
`Google Ex. 1217, pg. 7
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 6
`
`Version 3.2.0
`
`1.1. SCOPE
`
`The transcoding procedure specified in this recommendation is
`applicable for the full—rate traffic channel
`(TCH)
`in the Pan—
`European Digital Mobile Radio (DMR) system. The use of this
`transcoding scheme for other applications has not been considered.
`
`In recommendation GSM 06.01, a reference configuration for the
`speech transmission chain of the Pan—European DMR system is shown.
`According to this reference configuration,
`the speech encoder
`takes its input as a 13 bit uniform PCM signal either from the
`audio part of the mobile station or on the network side,
`from the
`PSTN via an 8 bit/A—law to 13 bit uniform PCM conversion. The
`encoded speech at the output of the speech encoder is delivered to
`a channel encoder unit which is specified in Rec.GSM 05.03. In the
`receive direction,
`the inverse operations take place.
`
`This recommendation describes the detailed mapping between input
`blocks of 160 speech samples in 13 bit uniform PCM format to
`encoded blocks of 260 bits and from encoded blocks of 260 bits to
`output blocks of 160 reconstructed speech samples.
`The sampling
`rate is 8000 sample/s leading to an average bit rate for the
`encoded bit stream of 13 kbit/s. The coding scheme is the
`so—called Regular Pulse Excitation - Long Term prediction — Linear
`Predictive Coder, here—after referred to as RPE—LTP.
`
`The recommendation also specifies the conversion between A—law PCM
`and 13 bit uniform PCM. Performance requirements for the audio
`input and output parts are included only to the extent that they
`affect the transcoder performance. The recommendation also
`describes the codec down to the bit level,
`thus enabling the
`verification of compliance to the recommendation to a high degree
`of confidence by use of a set of digital test sequences. These
`test sequences are also described and are available on floppy
`disks.
`
`1.2. OUTLINE DESCRIPTION
`
`The recommendation is structured as follows:
`
`Section 1.3 contains a functional description of the audio parts
`including the A/D and D/A functions. Section 1.4 describes the
`conversion between 13 bit uniform and 8 bit A—law samples.
`Sections 1.5 and 1.6 present a simplified description of the
`principles of the RPE—LTP encoding and decoding process
`respectively. In section 1.7,
`the sequence and subjective
`importance of encoded parameters are given.
`
`Google Ex. 1217, pg. 8
`
`Google Ex. 1217, pg. 8
`
`
`
`'ETSI/GSM
`
`GSM 06.10 / page 7
`
`Version 3.2.0
`
`Section 2 deals with the transmission characteristics of the audio
`parts that are relevant for the performance of the RPE—LTP codec.
`Some transmission characteristics of the RPE—LTP codec are also
`specified in section 2. Section 3 presents the functional descrip-
`tion of the RPE—LTP coding and decoding procedures, whereas
`section 4 describes the computational details of the algorithm.
`Procedures for the verification of the correct functioning of the
`RPE—LTP are described in section 5.
`
`Performance and network aspects of the RPE—LTP codec are contained
`in annex 1.
`
`1.3. FUNCTIONAL DESCRIPTION OF AUDIO PARTS
`
`The analogue—to-digital and digital—to—analogue conversion will in
`principle comprise the following elements:
`
`1) Analogue to uniform digital
`
`- microphone,
`— input level adjustment device,
`- input anti—aliasing filter,
`— sample—hold device sampling at 8 kHz,
`- analogue-to—uniform digital conversion to 13 bits
`representation.
`
`The uniform format shall be represented in two's complement.
`
`2) Uniform digital to analogue
`
`— conversion from 13 bit /8kHz uniform PCM to analogue,
`— a hold device,
`— reconstruction filter including x/sin x correction,
`— output level adjustment device,
`— earphone or loudspeaker.
`
`In the terminal equipment,
`
`the A/D function may be achieved either
`
`— by direct conversion to 13 bit uniform PCM format.
`- or by conversion to 8 bit/A—law companded format, based on a
`standard A-law codec/filter according to CCITT rec.
`G.711/714,
`followed by the 8-bit to 13—bit conversion
`according to the procedure specified in section 1.4.
`
`For the D/A operation,
`
`the inverse operations take place.
`
`In the latter case it should be noted that the specifications in
`CCITT recommendation G.714 are concerned with PCM equipment
`located in the central parts of the network. When used in the
`terminal equipment,
`this specification does not on its own ensure
`sufficient out—of—band attenuation.
`
`Google Ex. 1217, pg. 9
`
`Google Ex. 1217, pg. 9
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 8
`
`Version 3.2.0
`
`The specification of out—of-band signals is defined in section 2
`between the acoustic signal and the digital interface to take into
`account that the filtering in the terminal can be achieved both by
`electronic and acoustical design.
`
`1.4. PCM FORMAT CONVERSION
`
`The conversion between 8 bit A-law companded format and the 13—bit
`uniform format shall be as defined in CCITT Recommendation G.721,
`section 4.2.1,
`sub—block EXPAND and section 4.2.7,
`sub—block
`COMPRESS. The parameter LAW = 1 should be used.
`
`1.5. PRINCIPLES OF THE RPE—LTP ENCODER
`
`A simplified block diagram of the RPE—LTP encoder is shown in Fig
`1.1.
`In this diagram the coding and quantization functions are not
`shown explicitly.
`
`The input speech frame, consisting of 160 signal samples (uniform
`13 bit PCM samples),
`is first pre—processed to produce an
`offset—free signal, which is then subjected to a first order
`pre—emphasis filter. The 160 samples obtained are then analyzed to
`determine the coefficients for the short term analysis filter (LPC
`analysis). These parameters are then used for the filtering of
`the same 160 samples. The result is 160 samples of the short term
`residual signal. The filter parameters,
`termed reflection
`coefficients, are transformed to log.area ratios, LARs, before
`transmission.
`
`the speech frame is divided into 4
`For the following operations,
`sub—frames with 40 samples of the short term residual signal in
`each. Each sub—frame is processed blockwise by the subsequent
`functional elements.
`
`Before the processing of each sub-block of 40 short term residual
`samples,
`the parame—ters of the long term analysis filter,
`the LTP
`lag and the LTP gain, are estimated and updated in the LTP
`analysis block, on the basis of the current sub—block of the
`present and a stored sequence of the 120 previous reconstructed
`short term residual samples.
`
`A block of 40 long term residual signal samples is obtained by
`subtracting 40 estimates of the short term residual signal from
`the short term residual signal itself. The resulting block of 40
`long term residual samples is fed to the Regular Pulse Excitation
`analysis which performs the basic compression function of the
`algorithm.
`
`Google Ex. 1217, pg. 10
`
`Google Ex. 1217, pg. 10
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 9
`
`Version 3.2.0
`
`the block of 40 input long term
`As a result of the RPE-analysis,
`residual samples are represented by one of 4 candidate
`sub—sequences of 13 pulses each. The subsequence selected is
`identified by the RPE grid position (M). The 13 RPE pulses are
`encoded using Adaptive Pulse Code Modulation (APCM) with
`estimation of the sub—block amplitude which is transmitted to the
`decoder as side information.
`
`The RPE parameters are also fed to a local RPE decoding and recon—
`struction module which produces a block of 40 samples of the quan-
`tized version of the long term residual signal.
`
`By adding these 40 quantized samples of the long term residual to
`the previous block of short term residual signal estimates, a
`reconstructed version of the current short term residual signal is
`obtained.
`
`The block of reconstructed short term residual signal samples is
`then fed to the long term analysis filter which produces the new
`block of 40 short term residual signal estimates to be used for
`the next sub-block thereby completing the feedback loop.
`
`1.6. PRINCIPLES OF THE RPE-LTP DECODER
`
`The simplified block diagram of the RPE-LTP decoder is shown in
`fig 1.2. The decoder includes the same structure as the feed—back
`loop of the encoder. In error-free transmission,
`the output of
`this stage will be the reconstructed short term residual samples.
`These samples are then applied to the short term synthesis filter
`followed by the de—emphasis filter resulting in the reconstructed
`speech signal samples.
`
`1.7. SEQUENCE AND SUBJECTIVE IMPORTANCE OF ENCODED PARAMETERS
`
`As indicated in fig 1.1 the three different groups of data are
`produced by the enCoder are:
`
`— the short term filter parameters,
`- the Long Term Prediction (LTP) parameters
`- the RPE parameters
`
`The encoder will produce this information in a unique sequence and
`format, and the decoder must receive the same information in the
`same way. In table 1.1,
`the sequence of output bits b1 to b260 and
`the bit allocation for each parameter is shown.
`
`Google Ex. 1217, pg. 11
`
`Google Ex. 1217, pg. 11
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 10
`
`Version 3.2.0
`
`The different parameters of the encoded speech and their
`individual bits have unequal
`importance with respect to subjective
`quality. Before being submitted to the channel encoder,
`the bits
`have to be rearranged in the sequence importance as given in table
`1.2. The ranking has been determined by subjective testing and the
`procedure used is described in annex 2.
`
`Parameter Parameter
`number
`
`Parameter
`name
`
`Var. Number
`name- of bits
`
`Bit no.
`(LSB—MSB)
`
`FILTER
`
`PARAMETERS
`
`LTP
`PARAMETERS
`
`RPE
`PARAMETERS
`
`LTP
`PARAMETERS
`
`RPE
`PARAMETERS
`
`1
`2
`3
`4
`5
`6
`7
`8
`
`9
`10
`
`ll
`12
`l3
`l4
`
`éé
`
`26
`27
`
`28
`29
`30
`31
`
`ii
`
`Log. Area
`ratios
`l - 8
`
`LTP lag
`LTP gain
`
`LAR 1
`LAR 2
`LAR 3
`LAR 4
`LAR 5
`LAR 6
`LAR 7
`LAR 8
`
`N1
`b1
`
`RPE grid position
`Block amplitude
`RPE—pulse no.1
`RPE-pulse no.2
`
`M1
`Xmaxl
`xl(0)
`xl(l)
`
`RPE-pulse no.13
`
`x1(12)
`
`LTP lag
`LTP gain
`
`N2
`b2
`
`RPE grid position
`Block amplitude
`RPE—pulse no.1
`RPE-pulse no.2
`
`M2
`Xmax2
`x2(0)
`x2(l)
`
`RPE—pulse no.13
`
`x2(12)
`
`6
`6
`5
`5
`4
`4
`3
`3
`
`7
`2
`
`2
`6
`3
`3
`
`3
`
`7
`2
`
`2
`6
`3
`3
`
`3
`
`— b6
`bl
`- b12
`b7
`b13 - bl7
`b18 — b22
`b23 - b26
`b2? - b30
`b31 — b33
`b34 - b36
`
`b3? — b43
`b44 — b45
`
`b46 - b47
`b48 - b53
`b54 — b56
`b57 — b59
`
`b90 l b92
`
`b93 - b99
`leO- blOl
`
`b102— b103
`b104— b109
`bllO- b112
`b113— b115
`
`b146; b148
`
`Table 1.1a. Encoder output parameters in order of occurrence and
`bit allocation within the speech frame of 260 bits/20
`ms
`
`Google Ex. 1217, pg. 12
`
`Google Ex. 1217, pg. 12
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 11
`
`Version 3.2.0
`
`Sub-frame no.3
`
`LTP
`PARAMETERS
`
`RPE
`PARAMETERS
`
`LTP
`PARAMETERS
`
`RPE
`PARAMETERS
`
`43
`44
`
`45
`46
`47
`48
`
`éé
`
`60
`61
`
`62
`63
`64
`65
`
`ié
`
`LTP lag
`LTP gain
`
`N3
`b3
`
`RPE grid position
`Block amplitude
`RPE-pulse no.1
`RPE—pulse no.2
`
`M3
`Xmax3
`x3(0)
`x3(1)
`
`RPE—pulse no.13
`
`x3(12)
`
`LTP lag
`LTP gain
`
`N4
`b4
`
`RPE grid position
`Block amplitude
`RPE—pulse no.1
`RPE—pulse no.2
`
`M4
`Xmax4
`x4(0)
`x4(1)
`
`fiéE—pulse no.13
`
`x4(12)
`
`7
`2
`
`2
`6
`3
`3
`
`3
`
`7
`2
`
`2
`6
`3
`3
`
`3
`
`b149- b155
`b156- b157
`
`b158- b159
`b160— b165
`b166— b168
`b169— b17l
`
`b202— b204
`
`b205- b21l
`b212- b213
`
`b214- b215
`b216- b221
`b222— b224
`b225— b227
`
`bzsél'bzso
`
`Table 1.1b. Encoder output parameters in order of occurrence and
`bit allocation within the speech frame of 260 bits/20
`ms
`
`Google Ex. 1217, pg. 13
`
`Google Ex. 1217, pg. 13
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 12
`
`Version 3.2.0
`
`b53,b109,bl65.b221
`b5
`b12
`bl7
`b4
`b11
`bl6
`b22
`b43,b99,b155,b211
`b52,b108,b164,b220
`b10,b26,b30
`b42,b98,b154,b210
`b41,b97,b153,b209
`b40,b96,b152,b208
`b39,b95,b15l,b207
`b51,b107,b163,b219
`b3
`b21
`b33
`
`number
`
`12,29,46,63
`
`thI—‘UJNH
`
`9,26,43,60
`12,29,46,63
`2,5,6
`9,26,43,60
`9,26,43,60
`9,26,43,60
`9,26,43,60
`12,29,46,63
`
`l 4 7
`
`9,26,43,60
`
`b38,b94,b150,b206
`
`5,6
`10,27,44,61
`9,26,43,60
`ll,28,45,62
`1
`
`2,3,8,4
`5,7
`10,27,44,61
`12,29,46,63
`l3..25
`30..42
`47..59
`64..76
`ll,28,45,62
`12,29,46,63
`l3..25
`30..42
`47..59
`64..67
`
`b25,b29
`b45,b101,b157,b213
`b37,b93,bl49,b205
`b47,b103,b159,b215
`b2
`b9,b15,b36,b20
`b24,b32
`b44,b100,b156,b212
`b50,b106,bl62,b218
`b56,b59,..,b92
`b112,b115,..,b148
`bl68,bl71,..,b204
`b224,b227,..,b260
`b46,b102,b158,b214
`b49,b105,b161,b217
`b55,b58,..,b91
`blll,bll4,..,b147
`b167,b170,..,b203
`b223,b226,b229,b232
`
`Log.area ratio 1
`Block amplitude
`area
`ratio
`Log.
`area
`ratio
`Log.
`area
`ratio
`Log.
`area
`ratio
`Log.
`area
`ratio
`Log.
`area
`ratio
`Log.
`area
`ratio
`Log.
`LTP
`
`IbUJNb—‘UJNH
`
`lag
`Block amplitude
`Log.area ratio 2,5,6
`LTP
`lag
`LTP
`lag
`LTP
`lag
`LTP
`lag
`Block amplitude
`Log.area ratio 1
`Log.area ratio
`Log.area ratio
`
`Log.area ratio
`gain
`LTP lag
`Grid position
`Log.area ratio
`Log.area ratio
`Log.area ratio
`LTP gain
`Block amplitude
`RPE pulses
`RPE pulses
`RPE pulses
`RPE pulses
`Grid position
`Block amplitude
`RPE pulses
`RPE pulses
`RPE pulses
`RPE pulses
`
`Order of bit
`importance
`
`Table 1.2a. Subjective importance of encoded bits (the parameter
`and bit numbers refer to table 1.1)
`
`Google Ex. 1217, pg. 14
`
`Google Ex. 1217, pg. 14
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 13
`
`Version 3.2.0
`
`Class II
`
`183...
`
`259,260
`
`RPE pulses
`Log.area ratio 1
`Log.area ratio 2,3,6
`Log.area ratio 7
`Log.area ratio 8
`Log.area ratio 8,3
`Log.area ratio 4
`Log.area ratio 4,5
`Block amplitude
`RPE pulses
`RPE pulses
`RPE pulses
`RPE pulses
`Log.area ratio 2,6
`
`68..76
`1
`2,3,6
`7
`8
`8,3
`4
`4,5
`12,29,46,63
`13..25
`30..42
`47..59
`64..76
`2,6
`
`b235,b238,..,b259
`bl
`b8,bl4,b28
`b31
`b35
`b34,b13
`b19
`b18,b23
`b48,b104,b160,b216
`b54,b57,..,b90
`b110,b113,..,bl46
`bl66,bl69,..,b202
`b222,b225,..,b258
`b7,b27
`
`Table 1.2b. Subjective importance of encoded bits (the parameter
`and bit numbers refer to table 1.1)
`
`NQIE: The subdivisions in table 1.2 indicate the border between
`protection classes Ia,
`Ib and II as defined in recommendation
`GSM 05.03.
`
`Google Ex. 1217, pg. 15
`
`Google Ex. 1217, pg. 15
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 14
`
`Version 3.2.0
`
`Short term
`
`LPC
`
`
`.
`analysrs
`
`‘
`
`Short term
`
`
`
`analysis
`filter
`
`
`Reflew‘b"
`ooefticrents coded as
`Log. - Area Ratios
`_
`(36 bits/20 ms)
`
`RPE parameters
`(47 bits/5 ms)
`
`
`
`RPE grid
`selection
`
`and coding
`
`Input
`
`Pre —
`
`signal
`
`processing
`
`
`
`RPE grid
`(5)
`(4)
`° decoding and
`positioning
`
`analysis
`
`(1) Short term residual
`(2) Long term residual (40 samples)
`(3) Short term resrdual estimate (40 samples)
`(4) Reconstructed short term residual (40 samples)
`(5) Quantized long term residual (40 samples)
`
`Fi 1.1. Sim lified block dia ram of the RPE — LTP encoder
`
`LTP parameters
`(9 bits/5 ms)
`
`To
`radio
`subsystem
`
`Google Ex. 1217, pg. 16
`
`Google Ex. 1217, pg. 16
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 15
`
`Version 3.2.0
`
`I M
`
`ilection coefficients coded
`as Log-Area Ratios
`(35 bits/20 ms)
`
`l iIII i IPI
`
`
`
`
`Post —
`
`
`
`
`Short term
`RPE grid
`decoding and
`0
`synthesis
`processing
`
`
`
`
`
`positioning
`R E
`fitter
`
`Long term
`t
`
`synthesis
`parame 3'5
`(4? DIE/5 ms)
`filter
`
`
`pq'lrameters
`(9: bits/5 ms)
`
`From
`radio
`
`subsystem
`
`Fig 1.2. Simglified block diagram of the RPE — LTP decoder
`
`Google Ex. 1217, pg. 17
`
`Google Ex. 1217, pg. 17
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 16
`
`Version 3.2.0
`
`2. TRANSMISSION CHARACTERISTICS
`
`This section specifies the necessary performance characteristics
`of the audio parts for proper functioning of the speech trancoder.
`Some transmission performance characteristics of the RPE—LTP
`transcoder are also given to assist the designer of the speech
`transcoder function. The information given here is redundant and
`the detailed specifications are contained in recommendation GSM
`11.10.
`
`The performance characteristics are referred to the 13 bit uniform
`PCM interface.
`
`the
`EQTE: To simplify the verification of the specifications,
`performance limits may be referred to an A—law measurement
`interface according to CCITT Rec—ommendation G.711.
`In this
`way, standard measuring equipments for PCM systems can be
`utilized for measurements.
`The relationship between the 13
`bit format and the A—law companded shall follow the
`procedures defined in section 1.4.
`
`2.1. PERFORMANCE CHARACTERISTICS OF THE ANALOGUE/DIGITAL
`INTERFACES
`
`‘
`
`Concerning 1) discrimination against out—of—band signals (sending)
`and 2) spurious out-of—band signals (receiving),
`the same
`requirements as defined in ETSI standard TE 04—15 (digital
`telephone, candidate NET33) apply.
`
`2.2. TRANSCODER DELAY
`
`Consider a back to back configuration where the parameters
`generated by the encoder are delivered to the speech decoder as
`soon as they are available.
`
`The transcoder delay is defined as the time interval between the
`instant a speech frame of 160 samples has been received at the
`encoder input and the instant the corresponding 160 reconstructed
`speech samples have been out-put by the speech decoder at an 8 kHz
`sample rate.
`
`The theoretical minimum delay which can be achieved is 20 ms.
`requirement is that the transcoder delay should be less than 30
`ms.
`
`The
`
`Google Ex. 1217, pg. 18
`
`Google Ex. 1217, pg. 18
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 17
`
`Version 3.2.0
`
`3. FUNCTIONAL DESCRIPTION OF THE RPE-LTP CODEC
`
`The block diagram of the RPE-LTP-coder is shown in fig 3.1. The
`individual blocks are described in the following sections.
`
`3.1. FUNCTIONAL DESCRIPTION OF THE RPE-LTP ENCODER
`
`The E;§p;9§g§§i§g_§§§tign of the RPE—LTP encoder comprises the
`following two sub-blocks:
`
`* Offset compensation (3.1.1)
`* Preemphasis (3.1.2)
`
`The L£g_analysi§_s§gtign of the RPE—LTP encoder comprises the
`following five sub—blocks:
`
`*i-X-If
`
`*-
`
`Segmentation (3.1.3)
`Auto—Correlation (3.1.4)
`Schur Recursion (3.1.5)
`Transformation of reflection coefficients to Log.—Area Ratios
`(3.1.6)
`Quantization and coding of Log.—Area Ratios (3.1.7)
`
`TheW of the RPE-LTP comprises
`the following four sub-blocks:
`
`* Decoding of the quantized Log.-Area Ratios (LARs)
`* Interpolation of Log.-Area Ratios (3.1.9)
`* Transformation of Log.—Area Ratios into reflection coefficients
`(3.1.10)
`* Short term analysis filtering (3.1.11)
`
`(3.1.8)
`
`The Long Term Predigtgr (LEE) segtign comprises 4 sub—blocks
`working on subsegments (3.1.12) of the short term residual
`samples.
`
`iii-1-
`
`Calculation of LTP parameters (3.1.13)
`Coding of the LTP lags (3.1.14) and the LTP gains (3.1.15)
`Decoding of the LTP lags (3.1.14) and the LTP gains (3.1.15)
`Long term analysis filtering (3.1.16), and
`Long term synthesis filtering (3.1.17)
`
`Google Ex. 1217, pg. 19
`
`Google Ex. 1217, pg. 19
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 18
`
`Version 3.2.0
`
`The B£E_§n§gding_§§g;ign comprises five different sub—blocks:
`
`* Weighting filter (3.1.18)
`* Adaptive sample rate decimation by RPE grid selection (3.1.19)
`* APCM quantization of the selected RPE sequence (3.1.20)
`* APCM inverse quantization (3.1.21)
`* RPE grid positioning (3.1.22)
`
`PREPR
`
`E
`
`3.1.1. Offset compensation
`
`Prior to the speech encoder an offset compensation,by a notch
`filter is applied in order to remove the offset of the input
`signal 50 to produce the offset—free signal Sof-
`
`sof(k)
`
`= so(k)
`
`— sO(k—1) + alpha*sof(k—l)
`
`(3.1.1)
`
`alpha = 327392—15
`
`3.1.2. Preemphasis
`
`The signal Sof is applied to a first order FIR preemphasis
`filter leading to the input signal s of the analysis section.
`
`s(k)
`
`= sof(k)
`
`— beta*sof(k—l)
`
`(3.1.2)
`
`beta: 28180*2'15
`
`P
`
`ALY
`
`E TI
`
`3.1.3. Segmentation
`
`is divided into non—overlapping frames
`The speech signal s(k)
`having a length of To = 20 ms
`(160 samples). A new LPG—analysis
`of order p=8 is performed for each frame.
`
`Google Ex. 1217, pg. 20
`
`Google Ex. 1217, pg. 20
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 19
`
`Version 3.2.0
`
`3.1.4. Autocorrelation
`
`The first p+1 = 9 values of the Auto—Correlation function are
`calculated by
`
`ACF(k)=
`
`15.9.
`>
`’i=k
`
`s(i)s(i-k)
`
`,k = 0,1...,8
`
`(3.2)
`
`3.1.5. Schur Recursion
`
`The reflection coefficients are calculated as shown in Fig 3.2
`using the Schur Recursion algorithm. The term "reflection
`coefficient" comes from the theory of linear prediction of speech
`(LPC), where a vocal tract representation consisting of series of
`uniform cylindrical sections is assumed. Such a representation can
`be described by the reflection coefficents or the area ratios of
`connected sections.
`
`3.1.6. Transformation of reflection coefficients to Log.—Area
`Ratios
`'
`
`The reflection coefficients r(i),
`Schur algorithm, are in the range
`
`(i=1..8), calculated by the
`
`—1 <= r(i) <= + 1
`
`the reflection
`Due to the favourable quantization characteristics,
`coefficients are converted into Log.-Area Ratios which are
`strictly defined as follows:
`
`Logarea(i) = loglo
`
`(_"_"'7_')
`
`(3.3)
`
`Since it is the companding characteristic of this transformation
`that is of importance,
`the following segmented approximation is
`used.
`
`Google Ex. 1217, pg. 21
`
`Google Ex. 1217, pg. 21
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 20
`
`Version 3.2.0
`
`|r(i)l < o 675
`;
`r(i)
`, 0.675 <= |r(i)| < o 950
`mm = sign[r(i)]*[ |r(i)l-0.67S]
`sign[r(i)]*[8|r(i)I-6.375] , 0.950 <= |r(i)| <= 1.000
`
`(3.4)
`
`with the result that instead of having to divide and obtain the
`logarithm of particular values, it is merely necessary to
`multiply, add and compare these values.
`
`The following equation (3.5) gives the inverse transformation.
`
`LAR'(i)
`r' (i)=sign[LAR' (i)]*[0.500*ILAR' (i) |
`+0.337500]
`sign[LAR'(i)]*[0.125*|LAR'(i)|
`+0.796875]
`
`;
`
`|LAR'(i)l<O.675
`
`; O.675<=|LAR'(i)l<1.225
`
`;
`
`l.225<=lLAR'(i)|<=1_625
`
`(3.5)'
`
`3.1.7. Quantization and coding of Log.—Area Ratios
`
`The Log.—Area Ratios LAR(i) have different dynamic ranges and
`different asymmetric distribution densities. For this reason,
`transformed coefficients LAR(i) are limited and quantized
`differently according to the following equation (3.6), with LARC(i)
`denoting the quantized and integer coded version of LAR(i).
`
`the
`
`LARC(i) = Nint{A(i)*LAR(i)
`
`+ B(i)}
`
`with
`
`Nint{z} := int{z+sign{z}*0.5}
`
`(3.6)
`
`(3.6a)
`
`Function Nint defines the rounding to the nearest integer value,
`with the coefficients A(i), B(i), and different extreme values of
`LARC(i)
`for each coefficient LAR(i) given in table 3.1.
`
`Google Ex. 1217, pg. 22
`
`Google Ex. 1217, pg. 22
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 21
`
`Version 3.2.0
`
`| Minimum | Maximum |
`I B(1)
`ILAR No 1 l A(i)
`I LARC(i)
`|
`I
`I
`I LARCIi)
`|
`| ----------------- | -------------------------- |
`l
`I 20.000 |
`0.000 |
`-32
`|
`+31
`|
`l
`I
`2
`l 20.000 I
`0.000 I
`-32
`|
`+31
`|
`I
`3
`| 20.000 |
`4 000 I
`—16
`I
`+15
`|
`I
`4
`I 20.000 |
`-5 000 |
`-16
`|
`+15
`|
`I
`5
`| 13.637 I
`0.184 I
`— 8
`I
`+ 7
`|
`|
`6
`I 15.000 |
`-3.500 |
`- 8
`I
`+ 7
`|
`|
`7
`I
`8.334 I
`-0.666 |
`— 4
`|
`+ 3
`|
`I
`8
`|
`8.824 |
`-2.235 I
`- 4
`I
`+ 3
`|
`
`R-RM
`
`LT
`
`The current frame of the speech signal s is retained in memory
`until calculation of the LPC parameters LAR(i)
`is completed. The
`frame is then read out and fed to the short term analysis filter
`of order p=8. However, prior to the analysis filtering operation,
`the filter coefficients are decoded and preprocessed by
`interpolation.
`
`3.1.8. Decoding of the quantized Log.-Area Ratios
`
`. In this block the quantized and coded Log.—Area Ratios (LARCIi)) are
`decoded according to equation (3.7).
`
`LAR"(i) = I LARCIi)
`
`- B(i) )/ A(i)
`
`(3-7)
`
`3.1.9. Interpolation of Log.—Area Ratios
`
`To avoid spurious transients which may occur if the filter
`coefficients are changed abruptly,
`two subsequent sets of
`Log.—Area Ratios are interpolated linearly. Within each frame of
`160 analysed speech samples the short term analysis filter and the
`short term synthesis filter operate with four different sets of
`coefficients derived according to table 3.2.
`
`Google Ex. 1217, pg. 23
`
`Google Ex. 1217, pg. 23
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 22
`
`Version 3.2.0
`
`I I
`
`+ O.25*LAR"J(i)
`| 0.75*LAR"J_1(i)
`0...
`3...26 I 0.50*LAR"J_l(i) + 0.50*LAR"J(i)
`7
`I O.25*LAR"J_1(i) + O.75*LAR"J(i)
`o
`I
`LAR"J(i)
`
`3.1.10. Transformation of Log.—Area Ratios into reflection
`coefficients
`
`The reflection coefficients are finally determined using the
`inverse transformation according to equation (3.5).
`
`3.1.11. Short Term Analysis Filtering
`
`term analysis filter is implemented according to the
`The Short
`lattice structure depicted in fig 3.3.
`
`d0(k) = s(k)
`u0(k) = s(k)
`+ r'i*u-_1(k-l) with i=1, ...8
`011(k)
`= di_1(k)
`ui(k) = ui_ (k—l) + r'i* i-1(k)
`with i=1,...8
`d(k ) = d8(fi)
`
`(3.8a)
`(3.8b)
`(3.8e)
`(3.8d)
`(3.8e)
`
`-
`
`EDITR LT EI
`
`3.1.12. Sub—segmentation
`
`Each input frame of the short term residual signal contains 160
`samples, corresponding to 20 ms. The long term correlation is
`evaluated four times per frame,
`for each 5 ms subsegment. For
`convenience in the following, we note j=0,...,3 the sub—segment
`number, so that the samples pertaining to the j—th sub—segment of
`the residual signal are now denoted by d(kj+k) with j = 0,...,3; kj =
`k0 + j*40 and k = 0,...,39 where k0 corresponds to the first value of
`the current frame.
`
`Google Ex. 1217, pg. 24
`
`Google Ex. 1217, pg. 24
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 23
`
`Version 3.2.0
`
`3.1.13. Calculation of the LTP parameters
`
`For each of the four sub-segments a long term correlation lag Nj,
`(j=0,...,3), and an associated gain factor b-,
`(j=0,...,3)
`are determined. For each sub—segment,
`the determination of these
`parameters is implemented in three steps.
`
`1) The first step is the evaluation of the cross—correlation
`Rj(lambda) of the current sub-segment of short term residual
`Signal d(k'+i),(i=3,...,39) and the previous samples of the
`reconstrucged short term residual signal d'(kj+i),
`(i=-120,...,—l):
`
`Rj(lambda)
`
`=
`
`j =O,...3
`3.9.
`z_ d(kj+i)*d'(kj+i-lambda); kj = k0 + j*40
`i=0
`lambda = 40,...,120
`
`The cross—correlation is evaluated for lags lambda greater than
`or equal to 40 and less than or equal to 120,
`ie corresponding
`to samples outside the current sub—segment and not delayed by
`more than two sub—segments.
`
`(3.9)
`
`2) The second step is to find the position N- of the peak of
`the cross—correlation function within this interval:
`
`Rj(Nj)
`
`max { Rj(lambda);
`
`lambda = 40..120 };
`
`3) The third step is the evaluation of the gain factor bj
`according to:
`
`(3.10)
`
`bj =
`with
`
`Rj(Nj)
`
`/ Sj(Nj);
`
`j = 0,...,3
`
`(3.11)
`
`Sj(Nj)
`
`=
`
`2
`
`3.2
`i:0d'
`
`'
`(kj+1-Nj);
`
`.
`J = 0,...,3
`
`(3.12)
`
`Google Ex. 1217, pg. 25
`
`Google Ex. 1217, pg. 25
`
`
`
`ETSI/GSM
`
`GSM 06.10 / page 24
`
`Version 3.2.0
`
`It is clear that the last 120 samples of the reconstructed
`short term residual signal d'(kj+i),(i=—120,...,-1) must be
`retained until the next sub-segment so as to allow the
`evaluation of the relations (3.9),...,(3.12).
`
`3.1.14. Coding/Decoding of the LTP lags
`
`The long term correlation lags-Nj;(j=0,...,3) can have values
`in the range (40,...,120), and so must be coded using 7 bits with:
`
`ch =le.
`
`j =01-o-l3
`
`(3.13)
`
`At the receiving end, assuming an error free transmission,
`decoding of these values will restore the actual lags:
`
`the
`
`j = 0,...,3
`
`(3.14)
`
`The long term prediction gains b ,(j=0,...,3) are encoded with
`2 bits each, according to the foilowing algorithm:
`
`if
`
`bj <= DLB(i)
`
`then bcj
`
`= 0,
`
`i=0
`
`if DLB(i—l) < bj <= DLB(i)
`
`then bcj
`
`= i;
`
`i=l,2
`
`(3.15)
`
`if DLB(i—l) < bj
`
`then bcj
`
`= 3,
`
`i=3
`
`where DLB(i),(i=0,...,2) denotes the decision levels of the
`quantizer, and ij represents the coded gain value. Decision
`levels and quantizing levels are given in table 3.3.
`
`l
`|
`l
`I
`QLB(i)
`I
`DLB(i)
`I
`i
`I
`|---| ---------------- | ------------------ |
`I
`O
`l
`0.2
`I
`0.10
`I
`|
`1
`|
`0.5
`|
`0.35
`|
`|
`2
`|
`0.8
`l
`0.65
`l
`I
`3
`I
`l
`1.00
`I
`
`Table 3.3. Quantization table for the LTP gain
`
`Google Ex. 1217, pg. 26
`
`Google Ex. 1217, pg. 26
`
`
`
`ETSI/GSM
`
`GSM 06.10 /