throbber
Multirate Digital Filters, Filter Banks,
`Polyphase Networks, and Applications:
`A Tutorial
`
`P. P. VAIDYANATHAN, SENIOR MEMBER, IEEE
`
`Mu/tirate digital filters and filter banks find application in com(cid:173)
`munications, speech processing, image compression, antenna sys(cid:173)
`tems, analog voice privacy systems, and in the digital audio indus(cid:173)
`try. During the last several years there has been substantial progress
`in multirate system research. This includes design of decimation
`and interpolation filters, analysis/synthesis filter banks (also called
`quadrature mirror filters, or QMF), and the development of new
`sampling theorems. First, the basic concepts and building blocks
`in mu/tirate digital signal processing (DSP), including the digital
`polyphase representation, are reviewed. Next, recent progress as
`reported by several authors in this area is discussed. Several appli(cid:173)
`cations are described, including the following: subband coding of
`waveforms, voice privacy systems, integral and fractional sampling
`rate conversion (such as in digital audio), digital crossover net(cid:173)
`works, and multi rate coding of narrow-band filter coefficients. The
`M-band QMF bank is discussed in considerable detail, including
`an analysis of various errors and imperfections. Recent techniques
`for perfect signal reconstruction in such systems are reviewed. The
`connection between QMF banks and other related topics, such as
`block digital filtering and periodically time-varying systems, based
`on a pseudo-circulant matrix framework, is covered. Unconven(cid:173)
`tional applications of the polyphase concept are discussed.
`
`I.
`
`INTRODUCTION
`
`In recent years there has been tremendous progress in
`the multirate processing of digital signals. Unlike the sin(cid:173)
`gle-rate system, the sample spacing in a multirate system
`can vary from point to point[1], [2]. This often results in more
`efficient processing of signals because the sampling rates
`at various internal points can be kept as small as possible.
`Unfortunately, this also results in the introduction of a new
`type of error, i.e., aliasing, which should somehow be can(cid:173)
`celed eventually.
`The basic building blocks in a multi rate digital signal pro(cid:173)
`cessing (DSP) system are decimators and interpolators. In
`1981, an excellent tutorial article on decimation and inter(cid:173)
`polation appeared in [3]. Subsequent to this a text on the
`subject of multi rate systems has also been published by the
`same authors [4]. Since then, a number of new develop(cid:173)
`ments have taken place in the area, particularly in multi rate
`
`Manuscript received October 12, 1988; revised june 13, 1989. This
`work was supported in part by the National Science Foundation
`under grants DCI 8552579 and MIP 8604456.
`The author is with the Department of Electrical Engineering, Cal(cid:173)
`ifornia Institute of Technology, Pasadena, CA 91125, USA.
`IEEE Log Number 8933329.
`
`digital filter bank designs. A short summary of some of these
`developments was reported recently by this author at an
`IEEE international conference [5]. The purpose of this article
`is to provide a self-contained and more complete exposure
`to many recent contributions on multi rate systems, includ(cid:173)
`ing filter bank design.
`As mentioned in [3], multirate systems find application
`in communications, speech processing, spectrum analysis
`[6], radar systems, and antenna systems. In this tutorial, two
`sections are devoted to a review of applications. In Section
`Ill, we point out applications in digital audio systems, in
`subband coding techniques (used in speech and image
`compression), and in analog voice privacy systems (for stan(cid:173)
`dard telephone communications). In Section V-E, appli(cid:173)
`cations of special transfer functions (such as complemen(cid:173)
`tary functions) in digital audio is reviewed. In Section IX,
`several unconventional applications of multirate systems
`and polyphase theory are indicated. These include a) deri(cid:173)
`vation of new sampling theorems for efficient compression
`of signals, b) derivation of new techniques for efficient cod(cid:173)
`ing of impulse response sequences of narrow band filters,
`c) design of FIR filters with adjustable multilevel responses,
`and d) adaptive filtering in subbands.
`
`A. Paper Outline
`
`In section II, basic tools, such as decimators, interpola(cid:173)
`tors, decimation and interpolation filters, and digital filter
`banks, are reviewed, along with the interconnection prop(cid:173)
`erties of the building blocks. In section Ill, some applica(cid:173)
`tions of multirate DSP are indicated, in digital audio sys(cid:173)
`tems, in subband coding, and in voice privacy systems.
`Section IV reviews the digital polyphase decomposition due
`to Bellanger, along with applications such as the uniform
`DFT filter bank. The concept of multilevel polyphase
`decomposition is also introduced here as a tool for efficient
`implementation of fractional decimation filters. Several
`special types of filter banks, such as Nyquist filters, power(cid:173)
`complementary systems and Euclidean filter-banks, are
`studied in section V.ln section VI, the two-band QMF bank
`is studied in sufficient detail along with procedures for
`eliminating aliasing in such systems. Procedures for elim(cid:173)
`ination of amplitude and/or phase distortion are discussed.
`
`0018-9219/90/0100-0056$01.00 © 1990 IEEE
`
`56
`
`PROCEEDINGS OF THE IEEE, VOL. 78, NO. 1, JANUARY 1990
`
`Petitioner Apple Inc.
`Ex. 1010, p. 56
`
`

`
`The most crucial property of W that finds repeated use in
`multirate OSP is the following:
`[M
`'
`O,
`
`k = integer multiple of M
`
`otherwise.
`
`(1c)
`
`M-1
`2:; wkn =
`n=O
`
`Perfect-reconstruction two-channel QMF banks are intro(cid:173)
`duced by blending the polyphase concept with the classical
`network-theoretic concept of losslessness.
`The relation between M-band QMF banks and two other
`related topics (block filtering and periodically time-varying
`systems) is reviewed in section VII, based on an algebraic
`structure called the pseudo-circulant matrix. In section VIII,
`M-band QMF banks are discussed in greater detail, and
`techniques for elimination of aliasing, amplitude, and phase
`distortions are reviewed. Section IX discusses unconven(cid:173)
`tional applications, and Section X discusses some exten(cid:173)
`sions of multirate ideas to cases of multidimensional sig(cid:173)
`nals. The paper concludes with a discussion of open
`problems in multirate OSP.
`
`B. Notations and Terminology
`
`The variables nand ware used as frequency variables for
`continuous-time and discrete-time cases, respectively. In
`the discrete-time case the term normalized frequency is
`used to denote f = w/21r. The frequency response of a trans(cid:173)
`fer function H(z) is expressed as H(eiw) = IH(eiw)l e i<P<wl, where
`I H(eiw)l is the magnitude response and cf>(w) the phase
`response. The quantity T(w) = -dcf>(w)/dw is the group delay
`of H(z). If IH(eiw)l is constant for all w, H(z) is all-pass. If cf>(w)
`has the form k 0 - k1 w, then H(z) is said to have linear phase
`and the group delay is a constant k1; physically, if the input
`to such a filter H(z) has energy only in the passband of H(z),
`then the output is a delayed version of the input, by k1 sam(cid:173)
`ples. Unless mentioned otherwise, a low-pass filter has real
`coefficients, so that IH(eiw)l is symmetric and cf>(w) is anti(cid:173)
`symmetric with respect tow = 0. Usually IH(eiw)l is plotted
`for 0 s f s 0.5 (i.e., for 0 s w s 1r). If wp and w5 denote the
`passband and stopband edges of a low-pass filter, the quan(cid:173)
`tity we = (wp + w5)/2 is said to be the cutoff frequency.
`Bold-faced quantities denote matrices and vectors, as in
`A, H(z) etc. The symbollk denotes the k x k identity matrix
`(with subscript often omitted). The quantitiesAr,At and A*
`denote, respectively, the transpose, transpose conjugate,
`and conjugate of A. For functions H(z), the notation H*(z)
`denotes conjugation of the coefficients without conjugat(cid:173)
`, then H*(z) =a* +
`ing z. For example if H(z) = a + bz- 1
`b*z- 1
`• Thus, H*(z) = H*(z*). The notation H(z) stands for
`H~(z- 1). In other words, conjugate the coefficients, take
`transpose (if matrix), and replace z with z- 1• When z = eiw
`(i.e., on the unit circle), we have H(z) = Ht(z). Linear time(cid:173)
`invariant systems [7] are abbreviated as L Tl and linear
`periodically time-varying systems as LPTV. A p x r matrix
`A is said to be unitary (orthogonal if it is real) if AtA = cl,
`c =I= 0. Note that A is not restricted to be square. For exam(cid:173)
`ple, [~~·<~ll is unitary for any real 8. The symbol WM stands
`for e -jzrtM. The subscript M is usually deleted because its
`value is often clear from the context. This quantity appears
`in the definition of the discrete Fourier transform (OFT) [7],
`[8]. Thus an M-point sequence [x0, x1, • • • , xM_ 1] has the
`M-point OFT sequence
`
`M-1
`Xk = 2:; Xn Wkn,
`n=O
`
`0 s k s M- 1.
`
`(1a)
`
`The inverse OFT (lOFT) is given by
`
`(1b)
`
`For any pair of integers k, n, we have w• = wn, if and only
`if k - n is an integer multiple of M. In particular, therefore,
`Wk =1= wn for 0 s k < n s M - 1.
`State Space Descriptions: Consider a discrete time trans(cid:173)
`fer matrix H(z) with input vector u(n) and output vector y(n).
`Suppose we have implemented this transfer matrix using
`a structure, and let N denote the number of delay elements
`used. Label the outputs of the delay elements as the state
`variables xk(n), 0 s k s N - 1, and define the state vector
`x(n) = [x0(n) x1(n) · · · xN_ 1(n)]r. With u(n) and y(n) denoting
`the input and output (vector) sequences to the structure,
`one can always find equations of the form [9]
`
`x(n + 1) = Ax(n) + Bu(n),
`y(n) = Cx(n) + Du(n),
`
`to describe the structure. This is called the state-space
`description of the structure. The matrix A, called the state
`transition matrix, has size N x N, where N is the number
`of delays in the structure. The transfer function H(z) of the
`structure is given by H(z) = D + C(zl- A)- 18. The smallest
`number of delay elements (i.e., z- 1 elements) required to
`implement H(z) is called the McMillan degree (or simply,
`the degree) of H(z). If the number of delays N in the struc(cid:173)
`ture is equal to the degree, then the structure is said to be
`a minimal realization of H(z). This is equivalent to saying
`that A is as small as possible.
`A summary of acronyms and common notations used in
`this paper is found in the Nomenclature, which follows sec(cid:173)
`tion XI.
`
`II. BASIC BUILDING BLOCKS AND TOOLS
`
`In this section we introduce the basic multi rate building
`blocks, along with their frequency-domain characteriza(cid:173)
`tions, and interconnection behaviors.
`
`A. Decimators and Interpolators
`
`Fig. 1 shows block diagrams of these building blocks. The
`decimator is characterized by the input-output relation
`
`y0 (n) = x(Mn)
`
`(2a)
`
`x(n)~Y:r(n)
`
`(b)
`
`Fig. 1. Building blocks. (a) M-fold decimator. (b) L-fold
`interpolator.
`
`which says that the output at time n is equal to the input
`at time Mn. As a consequence, only the input samples with
`sample numbers equal to a multiple of Mare retained. This
`sampling-rate reduction by a factor of M is demonstrated
`in Fig. 2 for the case of M = 2. The L"fold interpolator is char-
`
`VAlDYANATHAN: MULTIRATE DIGITAL FILTERS
`
`57
`
`-----··--------------~
`
`Petitioner Apple Inc.
`Ex. 1010, p. 57
`
`

`
`x(n) L J l l l J I
`
`012345678
`
`'''"' lrlr
`0 1 2 3 4
`Fig. 2. Demonstration of decimation for M = 2.
`
`acterized by the input-output relation
`
`yM ~ [:(z)
`
`if n is a multiple of L
`
`otherwise.
`
`(2b)
`
`That is, the output y1(n) is obtained by inserting L - 1 zero(cid:173)
`valued samples between adjacent samples of x(n), as dem(cid:173)
`onstrated in Fig. 3 for L = 2. The decimator and interpolator
`
`x(n) L lll l J I
`
`012345678
`
`'n
`
`Fig. 3. Demonstration of interpolation for L = 2.
`
`are linear systems even though they are time-varying [4], [5],
`[10].
`The z transform of the interpolator output y1(n) is given
`by [4]:
`
`(3a)
`This means Y1(ei"') = X(ei"'L) i.e., Y1(ei"') is an L-fold com(cid:173)
`pressed version of X(ei"'), as demonstrated in Fig. 4(b). The
`appearance of multiple copies of the basic spectrum in Fig.
`4 is called the imaging effect and the extra copies are the
`images created by the interpolator.
`
`Since decimation corresponds to compression in the time
`domain, one might expect a stretching effect in the fre(cid:173)
`quencydomain. To be more precise, theztransform ofy0 (n)
`is given by
`
`(3b)
`
`1 X(ei(w-hkl/M). The term with
`which means MY0 (ei"') = Ei;l=-0
`k = 0 is indeed theM-fold stretched version of X(ei"'). The
`M - 1 terms with k > 0 are uniformly shifted versions of
`this stretched version. These M terms together make up a
`function with period 21r in w, which is the basic property
`of the Fourier transform of any sequence [8]. Fig. 4(c) dem(cid:173)
`onstrates this effect for M = 2. The terms with k > 0 are
`called the aliasing terms. As long as x(n) is bandlimited to
`lwl < 1riM, there is no overlap of these terms with the k =
`0 term.
`The fundamental difference between aliasing and imag(cid:173)
`ing is important to notice. Aliasing can cause loss of infor(cid:173)
`mation because of the possible overlap of the shifted ver(cid:173)
`sions of the stretched version of X(e 1"'). Imaging, on the other
`hand, does not lead to any loss of information (which is con(cid:173)
`sistent with the fact that no time-domain samples are lost).
`
`B. Interconnections
`
`Fig. 5 shows a cascade connection which is often encoun(cid:173)
`tered in filter-bank systems. The signal v(n) here is equal to
`
`x(n)~v(n)
`
`~ -7t
`
`o
`
`7t
`
`• ro
`
`-1t/M
`1t/M
`Fig. 5. Effect of decimation followed by interpolation.
`
`x(n) whenever n is a multiple of M, and zero otherwise. The
`transform-domain relation is
`1
`V(z) = .:!_ M:i:
`X(zWk)
`M k=O
`
`(4)
`
`~~ro
`
`0
`(a)
`
`1t
`
`27t
`
`-21t
`
`-1t
`
`which means that MV(e1"') is a sum of X(e 1"') with the M -
`1 uniformly shifted versions X(ei(w-hk/Ml). From the figure
`we see that x(n) can be recovered from v(n) by eliminating
`the images by filtering, provided none of the images has an
`overlap with X(ei"'). If such an overlap occurs, it implies
`aliasing and x(n) cannot be recovered. Notice that in order
`for x(n) to be recoverable it is not necessary for X(e 1"') to be
`restricted to lwl < 1riM. It is sufficient for the total band(cid:173)
`width of X(e 1"')to be less than 21r/M. Thus a general bandpass
`signal with energy in the region a :S w :S a + 21r/M can be
`decimated by M without creating overlap of the alias com(cid:173)
`ponents, and the decimated signal in general is a full-band
`signal.
`A different type of cascade is shown in Fig. 6(a). We shall
`have occasion to use this in section IV-B, which concen(cid:173)
`trates on multilevel-polyphase decompositions. It should
`be cautioned that the two building blocks in Fig. 6(a) are not,
`in general, interchangeable, i.e., the systems in Fig. 6(a) and
`6(b) are not equivalent. For example, with M = L, the system
`of Fig. 6(a) is an identity system, whereas the system of Fig.
`&(b) causes a loss of M - 1 out of M samples. It can be shown
`
`~c -21t
`
`-1t
`
`0
`
`1t
`
`27t
`
`(c)
`
`Fig. 4. Transform-domain effects of decimation and inter(cid:173)
`polation. (a) The z transform. (b) l-fold compressed version.
`(c) Demonstration of effect when M = 2.
`
`58
`
`PROCEEDINGS OF THE IEEE, VOL. 78, NO. 1, JANUARY 1990
`
`Petitioner Apple Inc.
`Ex. 1010, p. 58
`
`

`
`x(n)~Y1 (n)
`(a)
`
`x(n)~y2 (n)
`(b)
`Fig. 6. Two ways to cascade decimator and interpolator.
`These are equivalent if and only if M and L are relatively
`prime. (a) Example of identity system. (b) Example of loss of
`M - 1 out of M samples.
`
`(Appendix A) that the systems of Fig. 6(a) and 6(b) are iden(cid:173)
`tical if and only if L and M are relatively prime.
`Decimation Filters and Interpolation Filters: In most
`applications a decimator is preceded by a bandlimiting fil(cid:173)
`ter H(z) whose purpose is to avoid aliasing. For example, a
`low-pass filter with stopband edge w 5 = 1riM can serve as
`such a filter. The cascade shown in Fig. 7(a) is commonly
`called a decimation filter. An interpolation filter, on the
`other hand, is a device which follows an interpolator (Fig.
`7(b)), the purpose being to eliminate the images. The low(cid:173)
`pass filter of Fig. 7(c) again serves as an example (with L =
`M).
`
`The decimation filter
`
`(a)
`
`The interpolation filter
`
`(b)
`
`plished by setting M = 3, L = 2 in Fig. 8(a). The filter H(z)
`is then taken to be low pass, with passband edge at 7r/3 and
`stopband edge at 27f/3. Notice that in this application, the
`transition bandwidth of H(z) need not be unduly narrow.
`The various signals in Fig. 8(a) have transforms as in Fig. 8(c),
`so that Y(e 1w) is a fractionally stretched version of X(e 1w).
`Two Noble Identities: In Fig. 9(a) we have a decimator fol(cid:173)
`lowed by a transfer function G(z). It can be proved, based
`
`~=~
`(b)
`
`(a)
`
`~=~
`~)
`~)
`Fig. 9. Noble identities for multi rate systems. (a} Decimator
`followed by transfer function G(z}. (b) Equivalent cascade.
`(c) Example of transfer function preceding. (d) Equivalent
`cascade.
`
`on (3b), that this cascade is equivalent to the one in Fig. 9(b)
`provided G(z) is a rational transfer function (i.e., a ratio of
`polynomials in z- 1
`). In a similar manner, the two cascades
`in Figs. 9(c) and 9(d) are equivalent (provided G(z) is rational),
`as can be proved from (3a). These identities are very val(cid:173)
`uable in practically all applications for efficient implemen(cid:173)
`tation of filters and filter banks. We shall call these the
`"noble identities."
`
`C. Analysis and Synthesis Banks
`
`These are the two basic types of filter banks. An analysis
`bank is a set of analysis filters Hk(z) which splits a signal into
`M subband signals xk(n) as shown in Fig. 10(a). What we do
`
`(c)
`(a) Decimation filter. (b) Interpolation filter. (c) Low(cid:173)
`Fig. 7.
`pass filter.
`
`x(n)
`
`Fractional Sampling Rate Alterations: Fig. 8(a) shows a
`scheme for reducing the sampling rate by a nonintegral
`(rational) number MIL. Fractional reduction of sampling rate
`often results in data compression without loss of infor(cid:173)
`mation. As an example, if X(eiw) is as in Fig. 8(b), then a frac(cid:173)
`tional reduction by 3/2 is possible. This can be accom-
`
`.
`.
`.
`.
`~xM_,(n) YM_,(n)~x(n)
`
`(a)
`
`(b)
`
`x(n)~y(n)
`
`(a)
`
`~ ,ro
`
`(b)
`
`Fig. 8. Decimation by rational fraction of MIL. (a) General
`structure. (b) Example of band limited signal. (c) Effect of frac(cid:173)
`tional decimation of this signal (L = 2, M = 3).
`
`~ 'OJ
`
`0
`
`1t
`4
`
`21t
`
`(c)
`Fig. 10. Analysis and synthesis filter banks. (a) Analysis
`bank. (b) Synthesis bank. (c) Typical response of uniform
`OFT filter bank; here M = 4.
`
`with the subband signals depends on the application, as we
`shall see in sections Ill, VI, and IX. Next, a synthesis bank
`(Fig. 10(b)) consists of M synthesis filters Fk(z), which com(cid:173)
`bine M signals Yk(n) (possibly from an analysis bank) into
`a reconstructed signal x(n). There are several types of filter
`banks, i.e., the complementary type, the Nyquist type, etc.,
`to be described in Section V along with applications.
`Uniform OFT Filter Banks: An analysis bank with M filters
`(M > 1) is said to be a uniform DFTfilter bank if all the filters
`are derived from H 0(z) according to Hk(z) = H 0(zWk), 0 ~
`k ~ M - 1. Here H 0(z) is called the prototype filter. Note
`
`VAIDYANATHAN: MUlTIRATE DIGITAl FilTERS
`
`59
`
`Petitioner Apple Inc.
`Ex. 1010, p. 59
`
`

`
`that Hk(eiw) = H 0(ei(w-21rk!Ml), which means that the fre(cid:173)
`quency responses of Hk(z) are uniformly shifted versions
`of H 0(eiw). Fig. 10(c) shows a typical set of responses, where
`H0(z) is taken to be low pass. More details can be found in
`section IV-C and in [4] and [11].
`
`Ill. SOME APPLICATIONS OF MULTIRATE SYSTEMS
`
`We shall now review a number of important applications
`of multi rate filters and filter banks, with pointers to the lit(cid:173)
`erature for details, examples, and demonstrations. In sec(cid:173)
`tion IX, several unconventional applications are also out(cid:173)
`lined.
`Applications in the design of transmultiplexers (which
`are devices for conversion between frequency division
`multiplexing (FDM) and time-division multiplexing (TDM))
`are not discussed here in detail, primarily because of the
`excellent treatment already available in [13]. Also see [14]
`for the correspondence between transmultiplexers and
`analysis/synthesis filter banks. The input to a TDM-to-FDM
`converter is a signal y(n), which is the time-multiplexed ver(cid:173)
`sion of M signals yk(n), 0 :s; k :s; M- 1. Given y(n), the com(cid:173)
`ponents yk(n) can easily be separated out by use of a com(cid:173)
`mutator switch [4], [13]. These M signals are then modulated
`using distinct carrier frequencies. The carrier frequencies
`wk, 0 :s; k :s; M- 1 are chosen so that there is sufficient spec(cid:173)
`tral gap between the messages. A sum of these M signals
`(which is the FDM signal) is then transmitted through the
`channel. The total channel bandwidth is therefore required
`to exceed the sum of signal bandwidths because of the safe(cid:173)
`guard gap between adjacent spectra. The gap enables one
`to obtain perfect recovery of the multiplexed signals yk(n)
`at a future point.
`A novel approach to transmultiplexing was suggested in
`[36] and cited in [14], based on synthesis and analysis filter
`banks. This approach permits overlap between the spectra
`of successive messages in the frequency domain. The total
`required channel bandwidth is therefore less than that in
`conventional FDM channels. Conditions are derived under
`which cross-talk can be avoided and the set of M original
`signals can still be recovered from this version. Details can
`be found in reference [36] cited in [14].
`
`A. Digital Audio Systems
`
`In the digital audio industry, it is a common requirement
`to change the sampling rates of band-limited sequences.
`This arises for example when an analog music waveform
`x.(t) is to be digitized. Assuming that the significant infor(cid:173)
`mation is in the band 0 :s; !OI/27r :s; 22kHz [15], a minimum
`sampling rate of 44kHz is suggested (Fig. 11(a)). It is, how(cid:173)
`ever, necessary to perform analog filtering before sampling
`to eliminate aliasing of out-of-band noise. Now the require(cid:173)
`ments on the analog filter H.(j!J) (Fig. 11(b)) are strigent: it
`should have a fairly flat passband (so that x.(j!J) is not dis(cid:173)
`torted) and a narrow transition band (so that only a small
`amount of unwanted energy is let in). Optimal filters forth is
`purpose (such as elliptic filters [9], which are optimal in the
`minimax sense) have a very nonlinear phase response [16,
`page 82] around the bandedge (i.e., around 22kHz). In high(cid:173)
`quality music this is considered to be objectionable [15]. A
`common strategy to solve this problem is to oversample
`x.(t) by a factor of two (and often four). The filter H.(j!J) now
`has a much wider transition band, as in Fig. 11(c), so that
`
`-44
`
`I
`-22
`
`/
`
`-44
`
`-22
`
`!1
`
`0
`
`!1
`
`0
`
`minimum
`sampling rate
`
`over-sampling
`rate
`
`+
`44
`
`/kHz
`88
`
`• kHz
`
`22
`(a)
`
`I
`
`22
`(b)
`
`~ • kHz
`
`44
`
`22
`(c)
`
`Xa(t)
`
`x(n)
`
`(d)
`(a) Spectrum of x.(t). (b) Antialiasing filter response
`Fig. 11.
`for sampling at 44 kHz. (c) Antialiasing filter response for
`sampling at 88kHz. (d) Improved scheme for AID stage of
`digital audio system.
`
`the phase-response nonlinearity is acceptably low. A simple
`analog_Bessel filter (which has linear phase in the passband
`[9]) can be used in practice. The sequence x1(n) so generated
`is then lowpass filtered (Fig. 11(d)) by a digital filter H(z) and
`then decimated by the same factor of two to obtain the final
`digital signal x(n). The crucial point is that since H(z) is dig(cid:173)
`ital, it can be designed to have linear phase [7], [16], [17],
`while at the same time providing the desired degree of
`sharpness.
`A similar problem arises after the D/A conversion stage,
`where the digital music signal y(n) should be converted to
`an analog signal by lowpass filtering. To eliminate the
`images of Y(eiw) in the region outside 22kHz, a sharp cutoff
`(hence nonlinear phase) analog low-pass filter is required.
`This problem is avoided by using an interpolation filter, as
`in Fig. 7(b), which increases the sampling rate digitally. After
`this, D/A conversion is performed followed by analog fil(cid:173)
`tering. The interpolation filter H(z) is once again a linear(cid:173)
`phase FIR low-pass filter and introduces no phase distor(cid:173)
`tion.
`The obvious price paid in these systems is the increased
`internal rate of computation. However, by using the poly(cid:173)
`phase framework (section IV) the efficiency of these mul(cid:173)
`tirate systems can be dramatically improved.
`In digital audio, it is relatively economic (compared to the
`analog case) to produce high-quality copies of material from
`one medium to another [15]. Perhaps to discourage such
`practice, the sampling rates used for various media are often
`made different from each other. It is therefore necessary
`in studios to design efficient non integral sampling rate con(cid:173)
`verters (such as the one in Fig. 8(a)). See section IV-B for
`further details. Further applications of multi rate filter banks
`in digital audio can be found in section V-E.
`
`B. Subband Coding of Speech and Image Signals
`
`In practice, one often encounters signals with energy
`dominantly concentrated in a particular region of fre-
`
`60
`
`PROCEEDINGS OF THE IEEE, VOL. 78, NO. 1, JANUARY 1990
`
`Petitioner Apple Inc.
`Ex. 1010, p. 60
`
`

`
`quency. An extreme example was shown in Fig. 8(b), where
`all the energy is in 0 :S lwl < 27r/3. In this case it is possible
`to compress the signal simply by decimating it by a factor
`of 3/2 (using Fig. 8(a)) or less.
`It is more common, however, to encounter signals that
`are not bandlimited but still have dominant frequency
`bands. An example is shown in Fig. 12(a). The information
`
`~·;~F===
`
`0
`
`I
`
`(b)
`
`(c)
`
`Fig. 12. Splitting a signal into subband signals x0 (n) and
`x, (n). (a) Cas~ of dominant frequency bands. (b) Splitting the
`s1gnal by usmg an analysis bank with M = 2 (Fig. 10(a)). (c)
`Encoding the subband signal.
`
`in lwl > 1r/2 is not small enough to be discarded. And we
`cannot decimate x(n) without causing aliasing either. It does
`seem unfortunate that a small (but not negligible) fraction.
`of energy in the high-frequency region should prevent us
`from obtaining any kind of signal compression at all.
`Butthere is a way to get around this difficulty: we can split
`the signal into two frequency bands by using an analysis
`bank (Fig. 10(a) with M = 2), with responses as in Fig. 12(b).
`The subband signal x1(n) has less energy than x0(n), and so
`can be encoded with fewer bits than x0(n). As an example,
`let x(n) be a 10-kHz signal (10 000 samples/s) normally
`requiring 8 bits per sample so that the data rate is 80 kbits/
`s. Let us assume that the sub band signals x0(n) and x1(n) can
`be represented with 8 bits and 4 bits per sample, respec(cid:173)
`tively. Because these signals are also decimated by two, the
`data rate now works out to be 40 + 20 = 60 kbits/s, which
`is a compression by 4/3. This is the basic principle of sub(cid:173)
`band coding: split the signal into two or more subbands,
`decimate each subband signal, and allocate bits for samples
`in each subband depending on the energy content. This
`strategy is clearly a generalization of the simple decimation
`process (which works only for strictly bandlimited signals).
`In speech coding practice, further use of the perceptive
`properties in each subband is exploited before quantiza(cid:173)
`tion [18]-[21].
`The reconstruction of the full band signal is done using
`the interpolators and synthesis bank filters as in Fig. 13. The
`
`x(n)
`
`Fig. 13. Analysis/synthesis system for subband coding.
`(Also called two-band QMF bank; see text.)
`
`x(n)
`
`interpolators restore the original sampling rate, and the fil(cid:173)
`ters fk(z) eliminate the images. Further generalizations fol(cid:173)
`low immediately: the signal can be split into M subbands
`with each subband signal decimated by M and indepen(cid:173)
`dently quantized.
`Pioneering work on the application of this technique in
`speech coding has been done by Crochiere [18], [19] and
`by Galand and Estaban [21]. The coding in each subband is
`typically more sophisticated than just quantization. For
`example, techniques such as adaptive pulse code modu(cid:173)
`lation (APCM) and adaptive delta pulse code modulation
`(ADPCM) are commonly used [20]. The specific properties
`of speech signals and their relation to human perception
`are carefully exploited in the coding process; the appro(cid:173)
`priate number of subbands and the coding accuracy in each
`subband are judged based on the articulation index. See
`[18]-[20] for complete examples of speech coding, using this
`idea. The quality of subband coders is usually judged by
`what is called the mean opinion score (MOS). This score is
`obtained by performing listening tests with the help of a
`wide variety of unbiased I isteners, and asking them to assign
`a score for the quality of the reproduced signal x(n) (in com(cid:173)
`parison to x(n)). The maximum score is normalized to 5 by
`convention. Subband-coded speech with an average bit rate
`of 16 kbits/s can typically achieve aMOS of 3.1, whereas at
`32 kbits/s a score of 4.3 has been achieved in the past [20,
`chapter 11].
`In video signal processing, subband coding has been
`applied for image compression, and success has been
`reported by several authors [22]-[27]. The results in [22], in
`particular, use vector quantization [28] in each subband to
`obtain a coded image with only 0.48 bits per pixel. (The orig(cid:173)
`inal uncoded picture being an 8 bits per pixel image.)
`Several comments are now in order: first, in orderfor sub(cid:173)
`band coding to work, it is necessary to have some a priori
`knowledge about the energy distribution of X(eiw). In speech
`and image processing, for example, such knowledge is usu(cid:173)
`ally available because of the long history of experience with
`the coding of such signals. Second, the bandsplitting and
`decimation operation inevitably results in aliasing because
`the filters Hk(z) are not ideal. The filters fk(z) should be cho(cid:173)
`sen carefully in such a way that aliasing is actually canceled
`(see section VI, where we include a detailed review of all
`the distortions which arise in the analysis/synthesis sys(cid:173)
`tem).
`
`C. Analog Voice Privacy Systems
`
`These systems [29] are intended to communicate speech
`over standard analog telephone I inks, while at the same time
`ensuring voice privacy. The main idea is to split a signal x(n)
`into M subband signals xk(n) and then divide each subband
`signal into segments in the time domain. These segments
`of sub band signals are then permuted and recombined into
`a single encrypted signal y(n), which can then be trans(cid:173)
`mitted (after D/A conversion). For example, if there are five
`subbands and twenty-five time segments in each subband,
`then there are 125! possible permutations, and unless an
`eavesdropper has the key for decryption, he will be unable
`to perform a pleasant job of eavesdropping. The aims of the
`designer of such a privacy system are: the encrypted mes(cid:173)
`sage should be unintelligible, decryption without a key
`should be very difficult, and the decrypted signal should
`
`VAIDYANATHAN: MULTIRATE DIGITAL FILTERS
`
`61
`
`Petitioner Apple Inc.
`Ex. 1010, p. 61
`
`

`
`be of good quality retaining naturalness and voice char(cid:173)
`acteristics. These features have indeed been achieved by
`Cox eta/. [29].
`At the receiver end, y(n) is again split into subbands, and
`the time-segments of the subbands unshuffled to get xk(n),
`which can then be interpolated and recombined through
`the synthesis filters.
`
`IV. THE POLYPHASE DECOMPOSITION AND ITS APPLICATIONS
`
`The polyphase decomposition, which originated from the
`work by Bellanger eta/. [11], is very fundamental to many
`applications in multi rate DSP. These include efficient real(cid:173)
`time implementation of decimation and interpolation fil(cid:173)
`ters, fractional sampling-rate changing devices, uniform
`DFT filter banks, and perfect-reconstruction analysis/syn(cid:173)
`thesis systems. More recently, the polyphase idea has been
`applied for the derivati

This document is available on Docket Alarm but you must sign up to view it.


Or .

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge
throbber

Still Working On It

This document is taking longer than usual to download. This can happen if we need to contact the court directly to obtain the document and their servers are running slowly.

Give it another minute or two to complete, and then try the refresh button.

throbber

A few More Minutes ... Still Working

It can take up to 5 minutes for us to download a document if the court servers are running slowly.

Thank you for your continued patience.

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

  • Up-to-date information for this case.
  • Email alerts whenever there is an update.
  • Full text search for other cases.
  • Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.


Access Government Site

We are redirecting you
to a mobile optimized page.





Document Unreadable or Corrupt

Refresh this Document
Go to the Docket

We are unable to display this document.

Refresh this Document
Go to the Docket