Download - Optimization Algorithms for Realizable Signal-Adapted Filter Banks

Optimization Algorithms for RealizableSignal-Adapted Filter Banks

Thesis by

Andre Tkacenko

In Partial Fulfillment of the Requirements

for the Degree of

Doctor of Philosophy

California Institute of Technology

Pasadena, California

2004

(Submitted May 12, 2004)

ii

c© 2004

Andre Tkacenko

All Rights Reserved

iii

Acknowledgements

First of all, I would like to thank my advisor, Prof. P. P. Vaidyanathan, for the guidance he has given

me and for supporting me during my stay at Caltech. I first met P. P. when I was an undergrad at

Caltech, when he was assigned to be my advisor. Even though he was rather busy on account of

his Option Representative duties at the time, he was always there when I needed to see him and

attentative when I spoke with him. My respect for him grew when I took his course on digital signal

processing, EE 112. He was always able to take a daunting and difficult to understand concept and

simplify it and make it unintimidating. I was so grateful the day that P. P., after finishing his EE

112 lecture for the day, told me that I had been admitted to his group. As my graduate advisor,

he was always there for me and had my best interests at heart. He always pointed me in the right

direction when I was lost and supported me when I was on the right path. In addition to being my

mentor, he has also been a good friend to me. He has a good sense of humor and often times we

have exchanged jokes. (In addition to our love for digital signal processing and our sense of humor,

P. P. and I also have something else in common: a love for Three’s Company!) I will always be

grateful for the opportunity that he gave me to teach the multirate part of his signal processing

course, EE 112b, in the spring of 2003. Because of him, I aspire to return to academia some day.

I would like to thank the National Science Foundation (NSF) and the Office of Naval Research

(ONR) for financially supporting as a graduate student at Caltech. Also, I would like to thank

the members of my defense and candidacy examining committees: Prof. Robert J. McEliece, Prof.

Babak Hassibi, Prof. Jehoshua Bruck, Dr. Bojan Vrcelj, Dr. Payman Arabshahi, and Prof. Joel N.

Franklin. In particular, I would like to thank Prof. McEliece (The Chief) for being my teacher,

senior thesis advisor as an undergrad, and good friend. He was always willing to lend an ear to my

social travails and offer me good advise on what to do. Though his advise was always sagacious,

I unfortunately rarely followed it on account of my being young and naive. I would like to thank

him for his advise and friendship by taking him out to a certain restaurant again some time! In

addition to the above people, I would like to thank some of my teachers at Wilcox High School

iv

including Al Holland, my calculus teacher, and my gym teacher Jerry Louderback, whose wife was

able to procure a Caltech undergraduate application form for me at the proverbial eleventh hour.

I would like to thank my lab mates Bojan Vrcelj, Sony Akkarakaran, Murat Mese, Byung-Jun

Yoon, Sriram Murali, Mike Larsen, and Borching Su for making my time spent in the lab cheerful

and fun. My senior lab mates, Bojan, Sony, and Murat, were inspirational to me and helped me

mature from the first year graduate student that I once was to the senior leveled one that I am

now. In particular, I would especially like to thank Bojan for being both my friend and an elderly

brother figure for me. I miss our afternoon trips to Starbucks in Old Town Pasadena where we used

to sit, chat, and ogle at all of the pretty girls as they walked by. With any luck, I hope we will be

able to resume our old tradition once again. In addition, I would also like to thank Prof. Shayan

Mookherjea, my old friend from my undergraduate days at Caltech. I want to thank Shayan for

walking with me to all of those restaurants we used to eat at before any of us had cars and also

for introducing me to LATEX. Also, I want to thank Aristotelis Asimakopoulous, and friend of mine

from my undergraduate days who came back to Caltech during my graduate studies. He was my

partner in crime in various nefarious activities which I elect to omit here.

Also, I would like to thank my parents, Dusanka and Nikola Tkacenko, for their endless love

and support. I am indebted to them for the sacrifices that they made to put me through school

as an undergraduate. It is my hope that some day I will be able to be as good of a parent to my

children as my parents have been to me. I want to thank my brothers Nick, Boris, and Aleks for

being the best siblings that I could hope for. In particular, I want to thank my brother Nick for

being my best friend (and for introducing me to Buffy and Angel). I hope that we will be able

to spend more time with each other after he finishes school at UC Santa Cruz. Furthermore, I

would also like to thank my grandparents, Sofija and Nikola Milunovic, for their love and support.

In particular, I want to thank my grandfather Nikola, who is a master woodcarver, for offering to

carve me a throne for completing my doctorate. As with all of his other works, I am sure that it is

the “only one kind on planet [sic]”.

I would like to give a special thanks to my Princessa Victoria Delgadillo. She came into my life

when I least expected it and most needed it. It is because of her that I know what it is to feel alive.

She has been a light in my darkness and has shown me true happiness. I want to thank her for

loving me and supporting me during this tumultuous time in my life. Though I don’t know what

the future may hold for us, I want her to know that I love her and that she will always be a part

of my heart and my soul, now and forever.

v

Last, but certainly not least, I would like to thank God, the Infinite Being, for creating this

beautiful world in which we all live and beautiful mathematics (some of which I will present here in

my thesis). I also want to thank Him for always watching over me and protecting me (even when

I didn’t deserve it). Whenever I put my faith in His will, He always led me on the right path. I

pray that His love and divine presence will be with us all, now and ever, and unto the ages of ages.

Amen.

vi

Abstract

Multirate filter banks are fundamental systems commonly used in digital signal processing (DSP).

Typically, they are used to decompose a discrete-time signal into a set of frequency selective compo-

nents called subband signals. Filter banks have been found to be useful for lossy data compression

schemes such as MP3 and JPEG 2000, denoising, and signal estimation. In the last decade, trans-

multiplexers, the dual structures of multirate filter banks, have been shown to be useful in digital

communications systems such as discrete multitone (DMT) systems for channel equalization and

inter/intra-symbol interference cancellation in the presence of noise.

Recently, a special type of filter bank adapted to its input known as the principal component

filter bank (PCFB) has been shown to be simultaneously optimal for a wide variety of objectives.

Such filter banks are not only optimal for relevant data compression type objectives such as cod-

ing gain and multiresolution, but also for digital communications type objectives such as power

minimization, when the filter bank is implemented in its transmultiplexer form. The only problem

is that PCFBs, which are defined over classes of paraunitary (PU) filter banks, are only known

to exist for certain classes. In particular, PCFBs are in general known to exist only in the ex-

tremal cases where the analysis/synthesis polyphase matrix has zero memory and doubly infinite

memory, respectively. Furthermore, for many practical cases of inputs, the filters corresponding

to the infinite-order PCFB have ideal bandpass response and are as such unrealizable. When the

polyphase matrix has finite memory or a finite impulse response (FIR), it is believed that PCFBs

do not exist, although this has not yet been formally proven in the literature.

The main contribution of this thesis is to bridge the gap between the zero memory PCFB and

the infinite-order one. To that end, a variety of methods for the design of realizable signal-adapted

FIR filter banks is presented. It is shown that a popular conventional method for designing signal-

adapted FIR PU filter banks, which only requires the design of an optimal FIR compaction filter,

is in fact not well suited for designing good filter banks due to the exponential complexity caused

by the nonuniqueness of the FIR compaction filter. To avoid this dilemma, we propose a method

vii

by which all of the filters are obtained together. In particular, the method consists of finding an

FIR PU least-squares approximant to the infinite-order PCFB polyphase matrix. Using an elegant

complete parameterization of FIR PU systems in terms of canonical building blocks, an iterative

greedy algorithm for solving the least-squares problem is presented. Simulation results provided

here show that as the order or memory of the signal-adapted FIR PU filter bank increases, the filter

bank behaves more and more like the infinite-order PCFB in terms of a variety of objectives included

coding gain, multiresolution, and power minimization. This serves to bridge the gap between the

zero memory and infinite memory PCFBs, which previously has not been done in the literature.

In addition to being useful for the design of PCFB-like FIR filter banks, the proposed iterative

algorithm can also be used for a variety of other design problems including the FIR PU interpolation

problem. Unlike the traditional FIR interpolation problem, whose solution is known in closed form,

the FIR PU interpolation problem is far more difficult and is in fact still open. Despite this, the

proposed algorithm can be used to find an approximant to an interpolant and sometimes even find

an interpolant, as we show here through simulations.

In the second part of the thesis, we focus on the design of realizable signal-adapted quantized

filter banks in which the filters are FIR but otherwise unconstrained. The filters are chosen to

minimize the mean-squared error of the output, which is shown to be equivalent to maximizing the

coding gain of the system. By alternately optimizing the analysis and synthesis filters, an iterative

greedy algorithm, different from that mentioned above, is proposed for the design of such filter

banks. Simulation results provided show that the filter banks designed exhibit performance close

to the information theoretic rate-distortion bound.

Finally, we show how some of the techniques used in the above iterative algorithms can be

used for the design of a channel shortening equalizer. Channel shortening equalizers, which arise

in the context of digital communications, have been found to be necessary for DMT systems such

as the digital subscriber loop (DSL) in which the channel impulse response must be shortened to

the length of the cyclic prefix. In particular, we show how the eigenfilter technique which is used

in the above-mentioned FIR PU iterative greedy algorithm, can be used for the design of a noise

optimized channel shortening equalizer. As opposed to other techniques, which require a Cholesky

decomposition of a certain matrix for every delay parameter considered, the proposed method is

lower in complexity in that it only requires a single such decomposition for all delay values. Despite

this significant decrease in complexity, it is shown through simulations that the equalizers designed

using this technique perform nearly optimally in terms of observed bit rate.

viii

Contents

Acknowledgements iii

Abstract vi

1 Introduction 1

1.1 Review of Multirate Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

1.1.1 Discrete-Time Signal Processing . . . . . . . . . . . . . . . . . . . . . . . . . 2

1.1.2 Fundamental Building Blocks . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

1.1.3 Decimation/Interpolation Filter Systems . . . . . . . . . . . . . . . . . . . . . 7

1.1.3.1 Noble Identities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

1.1.3.2 Polyphase Decompositions . . . . . . . . . . . . . . . . . . . . . . . 8

1.1.4 Filter Banks and Transmultiplexers . . . . . . . . . . . . . . . . . . . . . . . . 11

1.1.4.1 Polyphase Representations of Uniform Systems . . . . . . . . . . . . 13

1.1.4.2 Biorthogonality and Orthonormality . . . . . . . . . . . . . . . . . . 14

1.2 Signal-Adapted Filter Banks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

1.2.1 Principal Component Filter Banks . . . . . . . . . . . . . . . . . . . . . . . . 17

1.2.1.1 Classes for Which PCFBs Are Known to Exist . . . . . . . . . . . . 18

1.2.1.2 Difficulties with the Class of FIR PU Systems . . . . . . . . . . . . 18

1.3 The FIR PU Interpolation Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

1.4 The Channel Shortening Equalizer Problem . . . . . . . . . . . . . . . . . . . . . . . 20

1.5 Outline of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

1.5.1 Eigenfilter Design of Overdecimated Compaction Filter Banks (Chapter 2) . 21

1.5.2 Multiresolution Optimal FIR PU Filter Banks (Chapter 3) . . . . . . . . . . 22

1.5.3 Direct FIR PU Approximation of the PCFB (Chapter 4) . . . . . . . . . . . 23

1.5.4 Coding Gain Optimal FIR Filter Banks (Chapter 5) . . . . . . . . . . . . . . 24

ix

1.5.5 Eigenfilter Channel Shortening Equalizer for DMT Systems (Chapter 6) . . . 24

1.6 Notations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25

2 Eigenfilter Design of Overdecimated Compaction Filter Banks 26

2.1 Outline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27

2.2 The Compaction Filter Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27

2.2.1 Solution in the Unconstrained Order Case . . . . . . . . . . . . . . . . . . . . 29

2.2.1.1 Special Case of WSS Input . . . . . . . . . . . . . . . . . . . . . . . 29

2.2.2 Overview of Previous Work on FIR Compaction Filters . . . . . . . . . . . . 30

2.3 Energy Compaction Problem for Overdecimated Filter Banks . . . . . . . . . . . . . 32

2.3.1 Derivation of the Energy Compaction Problem . . . . . . . . . . . . . . . . . 33

2.3.2 Imposing the FIR Constraint on the Matrix F(z) . . . . . . . . . . . . . . . . 34

2.4 Iterative Eigenfilter Method for Solving the FIR Compaction Problem . . . . . . . . 36

2.4.1 Solution to the Iterative Linearized Optimization Problem . . . . . . . . . . . 38

2.5 Simulation Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

2.6 Concluding Remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

Appendix: Least-Squares Signal Model Approximation Problem . . . . . . . . . . . . . . . 43

Blocked Form of LTI Systems and Pseudocirculant Matrices . . . . . . . . . . . . . . 44

Least-Squares Approximation Problem - Deterministic Case . . . . . . . . . . . . . . 46

Least-Squares Approximation Problem - Stochastic Case . . . . . . . . . . . . . . . . 50

3 Multiresolution Optimal FIR PU Filter Banks 54

3.1 Outline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55

3.2 Complete Parameterization of 1-D Causal FIR PU Systems . . . . . . . . . . . . . . 57

3.3 Least-Squares Design of FIR Magnitude Squared Nyquist Filters . . . . . . . . . . . 58

3.3.1 Solving the Least-Squares Optimization Problem Using the Method of La-

grange Multipliers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59

3.3.1.1 Optimal Choice of u0 . . . . . . . . . . . . . . . . . . . . . . . . . . 59

3.3.1.2 Optimal Choice of vk . . . . . . . . . . . . . . . . . . . . . . . . . . 60

3.3.2 Iterative Gradient Optimization Algorithm . . . . . . . . . . . . . . . . . . . 62

3.3.3 Simulation Results for Designing FIR Compaction Filters . . . . . . . . . . . 63

3.4 Phase Feedback Modification for Improving Compaction Gain . . . . . . . . . . . . . 65

x

3.5 Design of Signal-Adapted FIR PU Filter Banks

Using a Multiresolution Criterion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68

3.5.1 Application of the Factorization of FIR PU Systems to the MOC . . . . . . . 69

3.5.2 Algorithm for the Design of FIR PU Multiresolution Optimal

Filter Banks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72

3.6 Simulation Results for MOC Designed FIR PU Filter Banks . . . . . . . . . . . . . . 73

3.6.1 Coding Gain Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73

3.6.2 Multiresolution Optimality Results . . . . . . . . . . . . . . . . . . . . . . . . 76

3.6.3 Noise Reduction Using Zeroth-Order Wiener Filters . . . . . . . . . . . . . . 76

3.6.4 Power Minimization for Nonredundant PU Transmultiplexers . . . . . . . . . 78


4 Direct FIR PU Approximation of the PCFB 80

4.1 Outline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81

4.2 The FIR PU Approximation Problem . . . . . . . . . . . . . . . . . . . . . . . . . . 82

4.2.1 Optimal Choice of U . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83

4.2.2 Optimal Choice of vk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85

4.2.3 Iterative Greedy Algorithm for Solving the Approximation Problem . . . . . 87

4.3 Phase Feedback Modification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88

4.3.1 Phase-Type Ambiguity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88

4.3.2 Derivation of the Phase Feedback Modification . . . . . . . . . . . . . . . . . 89

4.3.3 Greediness of the Phase Feedback Modification . . . . . . . . . . . . . . . . . 91


4.4.1 Design of PCFB-like FIR PU Filter Banks . . . . . . . . . . . . . . . . . . . . 91

4.4.1.1 Multiresolution Optimality Results . . . . . . . . . . . . . . . . . . . 93

4.4.1.2 Coding Gain Results . . . . . . . . . . . . . . . . . . . . . . . . . . . 97

4.4.1.3 Noise Reduction Using Zeroth-Order Wiener Filters . . . . . . . . . 98

4.4.1.4 Power Minimization for DMT-Type Transmultiplexers . . . . . . . . 98

4.4.2 The FIR PU Interpolation Problem . . . . . . . . . . . . . . . . . . . . . . . 100

4.4.2.1 FIR PU Interpolation - Example 1 . . . . . . . . . . . . . . . . . . . 101

4.4.2.2 FIR PU Interpolation - Example 2 . . . . . . . . . . . . . . . . . . . 102


xi

5 Coding Gain Optimal FIR Filter Banks 104

5.1 Outline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105

5.2 Uniform Quantized Filter Bank Model . . . . . . . . . . . . . . . . . . . . . . . . . . 106

5.3 Optimizing the Mean-Squared Reconstruction Error . . . . . . . . . . . . . . . . . . 108

5.3.1 Optimal Synthesis Bank F(z) for Fixed Analysis Bank H(z) . . . . . . . . . 110

5.3.2 Optimal Analysis Bank H(z) for Fixed Synthesis Bank F(z) . . . . . . . . . 111

5.3.2.1 Simplifying the Quantization Noise Term . . . . . . . . . . . . . . . 111

5.3.2.2 Imposing the FIR Constraint on H(z) . . . . . . . . . . . . . . . . . 112

5.3.3 Iterative Greedy Analysis/Synthesis Filter Bank Optimization Algorithm . . 115


5.4.1 Overdecimated Filter Bank Design . . . . . . . . . . . . . . . . . . . . . . . . 116

5.4.1.1 Single Subband Design Example . . . . . . . . . . . . . . . . . . . . 116

5.4.1.2 Multiple Subband Design Example . . . . . . . . . . . . . . . . . . . 119

5.4.2 Quantized Filter Bank Design . . . . . . . . . . . . . . . . . . . . . . . . . . . 122

5.4.2.1 Single Channel Example - Pre/Postfiltering in Quantization . . . . 124

5.4.2.2 Maximally Decimated Quantized Filter Bank . . . . . . . . . . . . . 129


6 Eigenfilter Channel Shortening Equalizer for DMT Systems 134

6.1 Outline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134

6.2 The Channel Shortening Equalizer Problem . . . . . . . . . . . . . . . . . . . . . . . 135

6.2.1 Overview of Previous Work on Channel Shortening Equalizers . . . . . . . . 138

6.3 Proposed Eigenfilter Design Method . . . . . . . . . . . . . . . . . . . . . . . . . . . 141

6.3.1 Analysis of the Objective Function J . . . . . . . . . . . . . . . . . . . . . . . 143



Appendix: Equivalence between FSEs and the SIMO-MISO Channel/Equalizer Model . . 148

7 Conclusion 151

7.1 Open Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152

Bibliography 154

xii

List of Figures

1.1 Linear time-invariant (LTI) filtering operation. . . . . . . . . . . . . . . . . . . . . . . 3

1.2 Multiple-input multiple-output (MIMO) LTI filtering operation. . . . . . . . . . . . . 4

1.3 L-fold expander input/output relationship: (a) Block diagram, (b) Example for L = 2. 4

1.4 M -fold decimator input/output relationship: (a) Block diagram, (b) Example for

M = 3. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

1.5 (a) Given input signal spectrum X(ejω), (b) Expanded signal spectrum YE(ejω) (L =

4), (c) Decimated signal spectrum YD(ejω) (M = 4). . . . . . . . . . . . . . . . . . . . 6

1.6 (a) M -fold blocking system, (b) L-fold unblocking system. . . . . . . . . . . . . . . . . 7

1.7 (a) M -fold decimation filter system, (b) L-fold interpolation filter system. . . . . . . . 7

1.8 (a) Decimator noble identity, (b) Expander noble identity. . . . . . . . . . . . . . . . . 8

1.9 Time domain interpretation of the polyphase representation for K = 3. . . . . . . . . 9

1.10 Polyphase implementations of (a) decimation filter system (using (1.3)), (b) interpo-

lation filter system (using (1.4)). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

1.11 (a) General nonuniform multirate filter bank, (b) The dual structured transmultiplexer. 11

1.12 Polyphase representations of (a) uniform filter bank, (b) uniform transmultiplexer. . . 13

1.13 (a) Typical uniform M -channel maximally decimated filter bank system. (b) Polyphase

representation of the filter bank. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

1.14 Typical discrete multitone (DMT) system. . . . . . . . . . . . . . . . . . . . . . . . . 20

1.15 Channel shortening equalizer model. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

2.1 (a) One-channel overdecimated filter bank model. (b) Polyphase implementation. . . . 28

2.2 (a) Uniform overdecimated filter bank (L < M), (b) Polyphase representation. . . . . 32

2.3 Input power spectral density Sxx(ejω). . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

2.4 Orthonormality error ||εk||F vs. the iteration index k (L = 1, M = 7). . . . . . . . 40

xiii

2.5 Magnitude squared responses of the designed FIR synthesis filter F0(z) along with the

first filter of the infinite-order PCFB (L = 1, M = 7). . . . . . . . . . . . . . . . . . . 40

2.6 Compaction gain Gcomp vs. the filter order parameter N . . . . . . . . . . . . . . . . . 41

2.7 Orthonormality error ||εk||F vs. the iteration index k (L = 2, M = 7, N = 10). . . . . 42

2.8 Magnitude squared responses of the designed FIR synthesis filters F0(z) and F1(z)

along with the first two filters of the infinite-order PCFB (L = 2, M = 7, N = 10). . . 42

2.9 Rational nonuniform synthesis bank. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

2.10 (a) Decimator/expander cascade identity, (b) Polyphase identity. . . . . . . . . . . . . 45

2.11 M -fold blocked form of an LTI system H(z). . . . . . . . . . . . . . . . . . . . . . . . 45

2.12 (a) The k-th branch of the signal model from Fig. 2.9, (b) With Fk(z) implemented in

an mkN -fold block form, (c) Resulting structure after applying the polyphase identity. 48

2.13 System for obtaining the optimal driving signal C(z). . . . . . . . . . . . . . . . . . . 49

2.14 (a) Equivalent form of Fig. 2.9 with blocking/unblocking elements removed, (b) Method

for obtaining the optimal driving signal. . . . . . . . . . . . . . . . . . . . . . . . . . . 50

3.1 Input psd Sxx(ejω) and ideal compaction filter magnitude squared response∣∣D(ejω)

∣∣2for M = 3. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63

3.2 Mean-squared error ξm vs. the iteration index m. . . . . . . . . . . . . . . . . . . . . . 64

3.3 Magnitude squared responses of the ideal compaction filter D(ejω) and the designed

FIR compaction filter F (ejω). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64

3.4 Compaction gain Gcomp vs. the filter order parameter N . . . . . . . . . . . . . . . . . 65

3.5 Compaction gain Gcomp vs. the filter order parameter N using the phase feedback

modification. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67

3.6 Compaction filter design using the phase feedback modification. (a) Mean-squared

error vs. iteration. (b) Magnitude squared response. . . . . . . . . . . . . . . . . . . . 67

3.7 Uniform M -channel maximally decimated PU filter bank. . . . . . . . . . . . . . . . . 68

3.8 Implementation of the analysis bank using the factorization of F(z) from (3.25). . . . 70

3.9 Observed coding gain Gcode as a function of the synthesis polyphase order N . . . . . . 73

3.10 Zero locations of the optimal spectral factor for N = 11. . . . . . . . . . . . . . . . . . 74

3.11 Analysis filter magnitude squared responses for the optimal spectral factor for N = 11.

(a) First channel. (b) Second channel. (c) Third channel. . . . . . . . . . . . . . . . . 75

xiv

3.12 Analysis filter magnitude squared responses for the minimum-phase spectral factor

for N = 11. (a) First channel. (b) Second channel. (c) Third channel. . . . . . . . . . 75

3.13 Proportion P (L) of the total variance as a function of the number of subbands kept

L for (a) N = 3 and (b) N = 9. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76

3.14 Mean-squared error ε from (3.37) as a function of N for (a) η2 = 1 and (b) η2 = 4. . . 77

3.15 Uniform PU nonredundant transmultiplexer. . . . . . . . . . . . . . . . . . . . . . . . 78

3.16 Total required power P from (3.40) as a function of N . . . . . . . . . . . . . . . . . . 79

4.1 Input psd Sxx(ejω) of the AR(4) process x(n). . . . . . . . . . . . . . . . . . . . . . . 92

4.2 Mean-squared error ξm vs. iteration m for both the unmodified and phase feedback

modified iterative algorithms. (a) Plot of KN = 3, 000 iterations, (b) Magnified plot

of the first 20 iterations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93

4.3 Magnitude squared responses of the PCFB and FIR PU synthesis filters using the

unmodified iterative algorithm. (a) F0(z), (b) F1(z), (c) F2(z), (d) F3(z). . . . . . . . 94

4.4 Magnitude squared responses of the PCFB and FIR PU synthesis filters using the

phase feedback modified iterative algorithm. (a) F0(z), (b) F1(z), (c) F2(z), (d) F3(z). 95

4.5 Proportion of the total variance P (L) as a function of the number of subbands kept

L for an M = 4 channel system with (a) N = 3 and (b) N = 10. . . . . . . . . . . . . 96

4.6 Proportion of the total variance P (L) as a function of the number of subbands kept

L for an M = 8 channel system with (a) N = 3 and (b) N = 10. . . . . . . . . . . . . 97

4.7 Observed coding gain Gcode as a function of the FIR PU filter order parameter N . . . 98

4.8 Noise reduction performance (ε from (3.37)) with zeroth-order subband Wiener filters

as a function of the FIR PU filter order parameter N for (a) noise variance (η2) of 1

and (b) noise variance of 4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99

4.9 Nonredundant DMT-type transmultiplexer total required power P as a function of

the FIR PU filter order parameter N . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100

4.10 FIR PU interpolation problem - Example 1: Mean-squared error ξm vs. iteration m. . 101

4.11 FIR PU interpolation problem - Example 2: Mean-squared error ξm vs. iteration m. . 102

5.1 (a) Uniform filter bank with scalar quantizers in the subbands, (b) Polyphase repre-

sentation of the filter bank. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106

5.2 High bit rate quantizer model. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107

5.3 Equivalent quantized filter bank model under the high bit rate assumption. . . . . . . 108

xv

5.4 Input psd Sxx(ejω) to the system of Fig. 5.1. . . . . . . . . . . . . . . . . . . . . . . . 116

5.5 Mean-squared reconstruction error ξk as a function of the iteration index k. (Single

subband example) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117

5.6 Magnitude squared responses of the analysis and synthesis filters for (a) Nf = Nh = 7

and (b) Nf = Nh = 10. (Single subband example) . . . . . . . . . . . . . . . . . . . . 118

5.7 Deviation from (a) orthonormality δ⊥,k and (b) biorthogonality δBIO,k as a function

of the iteration index k. (Single subband example) . . . . . . . . . . . . . . . . . . . . 119

5.8 Mean-squared reconstruction error ξk as a function of the iteration index k. (Multiple

subband example) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120

5.9 Deviation from (a) orthonormality δ⊥,k and (b) biorthogonality δBIO,k as a function

of the iteration index k. (Multiple subband example) . . . . . . . . . . . . . . . . . . 120

5.10 Magnitude squared responses of the analysis filters Hk(z) and synthesis filters Fk(z)

for (a) k = 0, and (b) k = 1. (Multiple subband example) . . . . . . . . . . . . . . . . 121

5.11 Equivalent overdecimated filter bank model in the absence of quantizers. . . . . . . . 122

5.12 Pre/postfiltering quantization system. . . . . . . . . . . . . . . . . . . . . . . . . . . . 124

5.13 Mean-squared reconstruction error ξk as a function of the iteration index k for (a)

N = 10 and (b) N = 20. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125

5.14 Deviation from orthonormality δ⊥,k as a function of the iteration index k for (a)

N = 10 and (b) N = 20. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126

5.15 Deviation from biorthogonality δBIO,k as a function of the iteration index k for (a)

N = 10 and (b) N = 20. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126

5.16 Magnitude squared responses of the analysis filter H0(z) = H(z) and the synthesis

filter F0(z) = F (z) for (a) N = 10 and (b) N = 20. . . . . . . . . . . . . . . . . . . . . 127

5.17 Observed distortion DFB as a function of the average bit rate b plotted with the direct

quantization and rate-distortion bounds. . . . . . . . . . . . . . . . . . . . . . . . . . . 128

5.18 Observed coding gain Gcode as a function of the analysis/synthesis filter order N . . . . 128

5.19 Mean-squared reconstruction error ξk as a function of the iteration index k for (a)

N = 3 and (b) N = 6. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129

5.20 Deviation from orthonormality δ⊥,k as a function of the iteration index k for (a) N = 3

and (b) N = 6. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130

5.21 Deviation from biorthogonality δBIO,k as a function of the iteration index k for (a)

N = 3 and (b) N = 6. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130

xvi

5.22 Magnitude squared responses of the analysis and synthesis filters for N = 3. (a)

H0(z), F0(z), (b) H1(z), F1(z), (c) H2(z), F2(z). . . . . . . . . . . . . . . . . . . . . . 131

5.23 Magnitude squared responses of the analysis and synthesis filters for N = 6. (a)

H0(z), F0(z), (b) H1(z), F1(z), (c) H2(z), F2(z). . . . . . . . . . . . . . . . . . . . . . 131

5.24 Observed distortion DFB as a function of the average bit rate b plotted with (a) the

direct quantization and rate-distortion bounds, (b) the pre/postfiltering quantization

results. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132

5.25 Observed coding gain Gcode as a function of the analysis/synthesis polyphase filter

order N . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133

6.1 Channel shortening equalizer model. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135

6.2 Decomposition of the effective channel ceff(n) into a desired channel cdes(n) and resid-

ual channel cres(n). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136

6.3 Discrete-time model of a K-fold oversampled FSE. . . . . . . . . . . . . . . . . . . . . 141

6.4 SIMO-MISO channel/equalizer model. . . . . . . . . . . . . . . . . . . . . . . . . . . . 141

6.5 Noise psd Sηη(ejω) corresponding to NEXT noise with eight disturbers [46] plus

AWGN with power density −110 dBm/Hz. . . . . . . . . . . . . . . . . . . . . . . . . 146

6.6 Original and equalized channel impulse responses using the proposed eigenfilter method

for CSA loop #1. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146

6.7 Observed bit rate (as a percentage of the MFB bit rate of (6.2)) as a function of the

tradeoff parameter α for CSA loop #1. . . . . . . . . . . . . . . . . . . . . . . . . . . 146

6.8 Equivalent form of Fig. 6.3 upon using the noble identities. . . . . . . . . . . . . . . . 149

6.9 Equivalent form of Fig. 6.8 upon moving the noise process η(n) past the blocking system.149

6.10 Equivalent form of Fig. 6.9 upon removing the unblocking/blocking systems. . . . . . 150

xvii

List of Tables

6.1 Observed bit rates for CSA loops #1-8 using various TEQ design methods. (Bit rates

are expressed as a percentage of the MFB maximum achievable bit rate of (6.2) for

each loop.) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147

1

Chapter 1

Introduction

Filter banks are common systems that arise in the study of multirate digital signal processing [67, 13]

and wavelet theory [32, 47]. Essentially, a multirate filter bank decomposes a given input signal

into a set of lower rate components known as subband signals. Often times, some subband signals

contain more information about the original input than others. In these cases, the subbands can be

easily manipulated to remove some of the redundancy present in the original input signal, resulting

in data compression. Popular lossy data compression schemes such as MP3 (for audio signals)

as well as JPEG and JPEG 2000 (for image signals) use filter banks in this manner to remove

the redundancy present in the original signals [41, 39]. In addition to this, transmultiplexers

[67], which are dual structures of multirate filter banks, have been found to be useful in digital

communications [46, 15, 16]. By adding a minimal amount of redundancy, transmultiplexers have

been found to be able to achieve stable blind channel equalization [14, 42, 43] and the removal of

inter/intra-symbol interference [17, 18] all in the presence of noise. Several well known modern

communications systems such as digital subscriber loop (DSL) and orthogonal frequency division

multiplexing (OFDM), which are examples of discrete multitone (DMT) systems [26], consist of

transmultiplexers with redundancy [46].

Recently, a special filter bank adapted to its input statistics known as the principal components

filter bank (PCFB) [62, 1, 69] has been found to be simultaneously optimal for a wide variety of

objectives. In particular, PCFBs are optimal for data compression type objectives such as coding

gain and multiresolution, as well as for digital communications type objectives such as power

minimization when the filter bank is implemented in its transmultiplexer form [1]. Unfortunately,

PCFBs, which are defined over classes of paraunitary (PU) filter banks, are in general only known to

exist for two special classes [1]. In particular, PCFBs are known to exist only in the extremal cases

where the filter bank analysis/synthesis polyphase matrix has zero memory and doubly infinite

2

memory, respectively. For many practical input signals, this latter class consists of filters that

have ideal bandpass characteristics and are as such unrealizable [68]. Though several methods

for the design of realizable signal-adapted filter banks have been proposed (see [35, 37, 79] for

example), none have established a connection between their filter banks and the unrealizable PCFB.

In particular, though we might expect a realizable filter bank of finite order that is optimized for

a particular objective to behave more and more like the unrealizable PCFB as the order increases,

this has not yet been shown in the literature.

The thesis presents several optimization algorithms for the design of realizable signal-adapted

filter banks in which the filters all have a finite impulse response (FIR). Matlab code for these

algorithms will soon be available at [60]. One of the main contributions is to establish a link between

realizable signal-adapted filter banks and the unrealizable infinite-order PCFB. In particular, it will

be shown that FIR filter banks designed using our methods behave more and more like the infinite-

order PCFB as the filter order increases. This serves to bridge the gap between the zero memory

PCFB and the infinite-order one which previously has not been done in the literature.

The primary purpose of this introductory chapter is to set the stage for the remainder of

the thesis. To that end, every attempt has been made to keep the chapter as self-contained as

possible. In this chapter, a brief overview of multirate systems and identities which will be used

throughout the thesis, as well as commonly used notations and terminology, is given in Sec. 1.1.

We formally introduce PCFBs in this chapter (Sec. 1.2.1) and mention problems related to them.

In addition, we discuss other problems relevant to the thesis including the FIR paraunitary (PU)

interpolation problem in Sec. 1.3 and the channel shortening equalizer problem in Sec. 1.4. The

material presented here is by no means a complete coverage of multirate systems and filter banks.

For a more comprehensive treatment of multirate system theory and its applications, the reader is

referred to [67, 32, 47, 41, 39, 15, 16].

1.1 Review of Multirate Systems

1.1.1 Discrete-Time Signal Processing

All of the signals of interest in DSP are discrete-time sequences of either real or complex numbers.

These sequences are typically obtained by uniformly sampling a continuous-time signal, such as a

voltage signal as a function of time. For example, if xc(t) denotes a continuous-time signal, then the

sequence x(n) � xc(nTs) for n ∈ Z denotes a discrete-time signal obtained by uniformly sampling

3

� H(z) �x(n)

X(z)

y(n) =∞∑

k=−∞h(k)x(n − k)

Y (z) = H(z)X(z)

Figure 1.1: Linear time-invariant (LTI) filtering operation.

xc(t) every Ts units of time.

In order for the discrete-time signal x(n) to be stored digitally on a computer, for example, the

value of x(n) at each n must be approximated by a value obtained from a finite set of possible values.

This process is called quantization or discretization. Typically, we will assume that the number of

discretization levels is large enough so as to ignore the effects of quantization (fine quantization).

However, in many data compression applications, it becomes necessary to use only a small number

of levels (coarse quantization), so as to decrease the overall bit rate and obtain lossy compression

[41, 39]. For the remainder of the thesis, we will ignore the effects of quantization unless stated

otherwise.

Signal processing analysis is often facilitated by considering alternative representations of discrete-

time signals and systems. Perhaps the most commonly used alternative representations of a discrete-

time signal x(n) are its z-transform X(z) and its Fourier transform X(ejω). The z-transform is

defined as

X(z) �∞∑

n=−∞x(n)z−n

for regions of z ∈ C for which the above summation converges. Also, the Fourier transform X(ejω)

is simply X(z) evaluated on the unit circle z = ejω, assuming that the summation converges there.

In cases where the Fourier transform X(ejω) exists but the z-transform X(z) does not [67], we will

write X(z) as a shorthand notation for X(ejω).

1.1.2 Fundamental Building Blocks

The majority of multirate DSP systems consists of three fundamental building blocks. These are

the linear time-invariant (LTI) filter, the expander, and the decimator. An LTI filter, such as the

one shown in Fig. 1.1, is characterized by its impulse response h(n) (i.e., its response to the unit

Kronecker delta impulse function x(n) = δ(n) [67]), or equivalently by its z-transform H(z), which

is called the transfer function. The system of Fig. 1.1 is an example of a single-input single-output

(SISO) LTI system. More generally, a multiple-input multiple-output (MIMO) LTI filter, such

4

H(z)r p

x(n)

X(z)

y(n) =∞∑

k=−∞h(k)x(n − k)

Y(z) = H(z)X(z)

Figure 1.2: Multiple-input multiple-output (MIMO) LTI filtering operation.

� ↑ L �x(n) yE(n) = [x(n)]↑L =

{x(

nL

), n = multiple of L

0 , otherwise

(a)

� � n

−2 −1 0 1 2

x(n)

��

� � �

� � n

−4 −3 −2 −1 0 1 2 3 4

yE(n) = [x(n)]↑2

(b)

Figure 1.3: L-fold expander input/output relationship: (a) Block diagram, (b) Example for L = 2.

as the one shown in Fig. 1.2 operates on a r × 1 vector input x(n) to produce an p × 1 vector

output y(n). Such a filter is characterized by its p × r impulse response matrix sequence h(n) or

equivalently by its p × r transfer function matrix H(z).

While LTI systems are used to filter discrete-time signals, expanders and decimators are used

to alter the rates of such signals. An L-fold expander, such as the one shown in Fig. 1.3(a), is used

to increase the rate by a factor of L. The L-fold expander essentially pads (L − 1) zeros between

each sample of its input and as such, the output must operate at a rate of L times the input rate

in order to maintain the proper sampling rate of the input signal. This is shown in Fig. 1.3(b) for

L = 2. In contrast to the L-fold expander, the M -fold decimator, such as the one shown in Fig.

1.4(a), is used to decrease the rate by a factor of M . The M -fold decimator only preserves every

5

� ↓ M �x(n) yD(n) = [x(n)]↓M = x(Mn)

(a)

� � n

−3 −2 −1 0 1 2 3

x(n)

�

�

� � n

−1 0 1

yD(n) = [x(n)]↓3

(b)

Figure 1.4: M -fold decimator input/output relationship: (a) Block diagram, (b) Example forM = 3.

M -th sample of its input and as such, the output must operate at an M times lower rate in order

to keep the proper sampling rate of the input signal. This is shown in Fig. 1.4(b) for M = 3.

Frequency or z-domain representations of the input/output relationships of expanders and dec-

imators provide more insight about the nature of these rate altering building blocks. From Fig.

1.3(a) and 1.4(a), it can be shown [67] that we have

YE(z) = [X(z)]↑L = X(zL)

(L-fold expander)

YD(z) = [X(z)]↓M =1M

M−1∑k=0

X(z

1M W k

M

)(M -fold decimator)

(1.1)

where WM � e−j 2πM denotes the M -th root of unity. In the frequency domain, the L-fold expander

output YE(ejω) consists of an L-fold compressed version of X(ejω) along with (L − 1) uniformly

shifted copies called images. Similarly, the M -fold decimator output YD(ejω) consists of an M -

fold stretched version of X(ejω) (scaled by 1/M) along with (M − 1) uniformly shifted scaled

copies called alias components. This is shown in Fig. 1.5 for both the expander and decimator for

L = M = 4. The presence (L− 1) images for an L-fold expander indicate a redundancy of a factor

6

�

α π2 π 3π

2 2πω

0

X(ejω)

1

(a)

�

α4

π2 π 3π

2 2πω

0

YE(ejω) original spectrum compressed

�

images

� �1

(b)

�

4α 2πω

0

YD(ejω)14

original

spectrum

stretched

�alias

components��

(c)

Figure 1.5: (a) Given input signal spectrum X(ejω), (b) Expanded signal spectrum YE(ejω) (L = 4),(c) Decimated signal spectrum YD(ejω) (M = 4).

of L, whereas the (M − 1) alias components for an M -fold decimator indicate a loss of information

of a factor of M . While the original input X(z) can always be recovered by its L-fold expanded

version YE(z), it can not in general be recovered by its M -fold decimated version YD(z).

Special combinations of LTI systems and decimators/expanders can be used to parallelize/serialize

discrete-time operations. One common way of parallelizing or vectorizing a scalar signal is the M -

fold blocking system shown in Fig. 1.6(a). The blocked signal vector x(n) is obtained by stacking

every nonoverlapping block of M samples of x(n) into a vector. Note that the overall rate of the

blocked signal x(n) is the same as that of the input x(n), since x(n) consists of M components each

operating at (1/M) times the input rate. The dual structure of the blocking system of Fig. 1.6(a)

is the L-fold unblocking system shown in Fig. 1.6(b). This system is used to serialize or scalarize

7

� �

�

�

�

�

�

↓ M

↓ M

↓ M

�

�

�

x(n)

x(n) =

x(Mn)

x(Mn + 1)...

x(Mn + (M − 1))

z

z

z

(a)

�

�

�

↑ L

↑ L

↑ L

�

�

�

�

z−1

z−1

z−1

y(n)

y(n) =

y(Ln)

y(Ln + 1)...

y(Ln + (L − 1))

(b)

Figure 1.6: (a) M -fold blocking system, (b) L-fold unblocking system.

� H(z) � ↓ M �x(n) y(n)

(a)

� ↑ L � H(z) �x(n) y(n)

(b)

Figure 1.7: (a) M -fold decimation filter system, (b) L-fold interpolation filter system.

a vector signal input y(n). The components of the input vector y(n) are interlaced with each

other to produce the output y(n). As with the blocking system, the overall rate of the unblocking

system output y(n) is the same as its input y(n), since y(n) consists of (1/L) of the components of

y(n) operating at L times the input rate. Blocking and unblocking systems naturally arise upon

exploiting polyphase decompositions of LTI systems in conjunction with decimators and expanders,

as is shown in the next section.

1.1.3 Decimation/Interpolation Filter Systems

Recall that decimators and expanders are responsible for the undesirable phenomena of aliasing

and imaging, respectively. In multirate signal processing, LTI filtering is commonly used to remove

these artifacts. To remove the effects of aliasing a signal is filtered before being decimated, whereas

to counteract the effects of imaging, a signal is filtered after being expanded. This is shown in

Fig. 1.7(a) and (b), respectively. The system of Fig. 1.7(a) is known as a decimation filter system

whereas that shown in Fig. 1.7(b) is known as an interpolation filter system [67].

Though these systems are capable of attenuating or even eliminating the effects of aliasing and

imaging, their current implementations shown in Fig. 1.7 are not computationally efficient. This is

8

� ↓ M � G(z) �≡� G(zM)

� ↓ M �

(a)

� G(z) � ↑ L �≡� ↑ L � G(zL)

�

(b)

Figure 1.8: (a) Decimator noble identity, (b) Expander noble identity.

because the decimation filter system only keeps every M -th output sample and the interpolation

filter system is only feeding a nonzero input to its filter at every L-th instance of time. By using

polyphase decompositions (Sec. 1.1.3.2) of the filters appearing in the decimation/interpolation filter

systems together with the noble identities (Sec. 1.1.3.1), we can obtain computationally efficient

structures for these systems.

1.1.3.1 Noble Identities

Certain LTI systems can be moved across decimators/expanders by using the noble identities [67].

The noble identities are shown in Fig. 1.8 for (a) decimators and (b) expanders. Note that the

noble identities, when they can be used, provide computationally efficient implementations, since

they give us a way to filter after decimating and before expanding. However, note that the decima-

tor/expander noble identities only apply when the original filter system is a function of zM or zL,

respectively. As such, they cannot immediately be applied to the decimation/interpolation filter

systems of Fig. 1.7. By using polyphase decompositions of the filters appearing in Fig. 1.7, we can

overcome this problem.

1.1.3.2 Polyphase Decompositions

A polyphase decomposition of an LTI system H(z) with impulse response h(n) is simply an alternate

way of expressing H(z) or h(n). Note that for any integer K, the impulse response h(n) can be

expressed as the interlaced sum of K lower rate signals. One example of this is shown in Fig. 1.9

for K = 3. This partitioning of h(n) into K lower rate signals is analogous to partitioning the set of

integers into K equivalence classes modulo the integer K. Though there are infinitely many ways

to express h(n) or H(z) as a sum of K interlaced lower rate signals, the two most common ways

9

� � n

−3 −2 −1 0 1 2 3 4 5

h(n)

�

�

�

H(z) = E0(z3)

+ z−1E1(z3)

+ z−2E2(z3)

� �

−1 0 1

n

e0(n)

� �

−1 0 1

n

e1(n)

� �

−1 0 1

n

e2(n)

Figure 1.9: Time domain interpretation of the polyphase representation for K = 3.

are shown below.

H(z) =K−1∑k=0

z−kEk

(zK)

(Type I)

H(z) =K−1∑k=0

zkRk

(zK)

(Type II)

(1.2)

where the impulse responses of the lower rate signals Ek(z) and Rk(z) are given by

ek(n) = h(Kn + k) ⇐⇒ Ek(z) =[zkH(z)

]↓K (Type I)

rk(n) = h(Kn − k) ⇐⇒ Rk(z) =[z−kH(z)

]↓K (Type II)

for 0 ≤ k ≤ K − 1. The alternate expressions for H(z) given in (1.2) are known as polyphase

decompositions of H(z) [67].

Using polyphase decompositions of the filter H(z) appearing in the decimation/interpolation

filter systems of Fig. 1.7, we can then apply the noble identities of Fig. 1.8 to obtain computationally

10

� �

�

�

�

�

�

↓ M

↓ M

↓ M

�

�

�

�

�

R0(z)

R1(z)

RM−1(z)

x(n)x(Mn)

x(Mn + 1)

x(Mn + (M − 1))

z

z

z

�

�

�

� y(n)

x(n)

M -fold blocked

version of x(n)

(a)

� �

�

�

�

�

�

↑ L

↑ L

↑ L

z−1

z−1

z−1

�

�

�

�

�

E0(z)

E1(z)

EL−1(z)

x(n)y(Ln)

y(Ln + 1)

y(Ln + (L − 1))

�

�

�

� y(n)

y(n)

L-fold blocked

version of y(n)

(b)

Figure 1.10: Polyphase implementations of (a) decimation filter system (using (1.3)), (b) interpo-lation filter system (using (1.4)).

efficient structures for these systems. In particular, if we use a Type II decomposition of the form

H(z) =M−1∑k=0

zkRk

(zM

)(1.3)

for the decimation filter system of Fig. 1.7(a) and a Type I decomposition of the form

H(z) =L−1∑k=0

z−kEk

(zL)

(1.4)

for the interpolation filter system of Fig. 1.7(b), then the equivalent computationally efficient struc-

tures that result upon using the noble identities are shown in Fig. 1.10. Note that the decimation

filter system of Fig. 1.10(a) consists of blocking the input signal and then filtering the blocked ver-

sion by a multiple-input single-output (MISO) system. Similarly, the interpolation filter system of

Fig. 1.10(b) consists of filtering the input by a single-input multiple-output (SIMO) system followed

by unblocking the filter output. Since blocking and unblocking operations require no computations,

the systems of Fig. 1.10 are indeed more computationally efficient than those shown in Fig. 1.7.

Polyphase decompositions play a prominent role in the study of multirate filter banks and their

dual structures transmultiplexers, as will be shown in the next section.

11

� � H0(z)

�

� H1(z)

�

�

� HL−1(z)

�

�

�

↓ n0

↓ n1

↓ nL−1

�

�

�

Analysis Bank

x(n)w0(n)

w1(n)

wL−1(n)

↑ n0

↑ n1

↑ nL−1

�

�

�

F0(z)

F1(z)

FL−1(z)

�

�

�

�

�

�

�

Synthesis Bank

x(n)

(a)

� � H0(z)

�

� H1(z)

�

�

� HL−1(z)

�

�

�

↓ n0

↓ n1

↓ nL−1

�

�

�

Analysis Bank

x0(n)

x1(n)

xL−1(n)

w(n)�

�

�

↑ n0

↑ n1

↑ nL−1

�

�

�

F0(z)

F1(z)

FL−1(z)

�

�

�

Synthesis Bank

x0(n)

x1(n)

xL−1(n)

(b)

Figure 1.11: (a) General nonuniform multirate filter bank, (b) The dual structured transmultiplexer.

1.1.4 Filter Banks and Transmultiplexers

As mentioned above, a multirate filter bank is used to decompose a given input signal into a set

of lower rate signals called subbands. This is often done by feeding the input, say x(n), to a bank

of decimation filter systems called the analysis bank, as shown in Fig. 1.11(a). Here, the filters

{Hk(z)} are called analysis filters. The subbands {wk(n)} are then usually processed in some way,

though this is not shown in Fig. 1.11(a). For example, to achieve lossy data compression, {wk(n)}are quantized [41, 39]. Afterwards, the subbands are typically used to try to reconstruct the original

input. This is often done by feeding the subbands {wk(n)} into a bank of interpolation filter systems

called the synthesis bank, as shown in Fig. 1.11(a). Here, the filters {Fk(z)} are called synthesis

filters. The analysis/synthesis banks of Fig. 1.11(a) together constitute a multirate filter bank [67].

12

By reversing the roles of the analysis and synthesis banks, we obtain the dual to the multirate

filter bank known as the transmultiplexer [67], as shown in Fig. 1.11(b). Whereas filter banks are

typically used for source coding applications such as data compression, transmultiplexers are often

used for channel coding applications in digital communications. The transmultiplexer model of

Fig. 1.11(b) may represent, for example, a digital communications system in which we have L users

{xk(n)} who wish to transmit data over a common path. After passing through the channel, the

data from each user must be isolated and recovered, yielding the received signals {xk(n)}.The systems of Fig. 1.11 are given special names according to the overall rate of the subband

signals, which depend on the nature of the decimator/expander values {nk} used. Note that the

overall rate of the subband signals is simply

(L−1∑k=0

1nk

)times the input rate. If we have

L−1∑k=0

1nk

= 1

then the filter bank is said to be maximally decimated, whereas the transmultiplexer will be said

to be minimally expanded. In this case, we may have neither a loss of information nor redundancy.

On the other hand, if we haveL−1∑k=0

1nk

< 1

then the filter bank will be said to be overdecimated, whereas the transmultiplexer will be said to

be underexpanded. In this case, the filter bank incurs a loss of information, whereas the transmul-

tiplexer system possesses redundancy. Finally, if we have

L−1∑k=0

1nk

> 1

then the filter bank will be said to be underdecimated, whereas the transmultiplexer will be said to

be overexpanded. In this case, the filter bank system has redundancy, whereas the transmultiplexer

incurs a loss of information.

If x(n) = x(n) for all x(n) for the filter bank system or xk(n) = xk(n) for all k and for all {xk(n)}for the transmultiplexer system, then each system is said to possess the perfect reconstruction (PR)

property1 [67]. If all of the decimator/expander values {nk} are equal, the systems of Fig. 1.11 are

said to be uniform. For arbitrary values of {nk}, these systems are said to be nonuniform.1A more general definition of the PR condition [67] is a distortionless reconstruction in which we have x(n) =

cx(n − D) for the filter bank system or xk(n) = ckxk(n − Dk) for the transmultiplexer system. Here, c and {ck}represent scale factors, whereas D and {Dk} denote delay values. Since scale factors and delays can often easily beabsorbed into the analysis/synthesis filters, for the remainder of the thesis, we will only be concerned with the stricterPR property defined above.

13

� �

�

�

↓ M

↓ M

↓ M

�z

�z

�z

�

�

�

H(z)

�

�

�

x(n)

x(n)

� w0(n)

w1(n)

wL−1(n)

�

�

�

F(z)

�

�

�

z−1

z−1

z−1

�↑ M

↑ M

↑ M

x(n)

x(n)

�

(a)

�

�

�

F(z)

�

�

�

x0(n)

x1(n)

xL−1(n)

�

�

�

H(z)

x0(n)

x1(n)

xL−1(n)

(b)

Figure 1.12: Polyphase representations of (a) uniform filter bank, (b) uniform transmultiplexer.

1.1.4.1 Polyphase Representations of Uniform Systems

Suppose that the systems of Fig. 1.11 are uniform with nk = M for all k. Also, let the Type II and

Type I polyphase decompositions of the analysis and synthesis filters, respectively, be given by

Hk(z) =M−1∑�=0

z�Hk,�

(zM

)(Type II)

Fk(z) =M−1∑�=0

z−�Fk,�

(zM

)(Type I)

for 0 ≤ k ≤ L−1. Using these decompositions, together with the noble identities from Sec. 1.1.3.1,

we can redraw the systems of Fig. 1.11(a) and (b) as in Fig. 1.12(a) and (b), respectively, where

H(z) and F(z) are, respectively, L × M and M × L matrices with

[H(z)]k,� = Hk,�(z) , [F(z)]�,k = Fk,�(z)

for 0 ≤ k ≤ L − 1 and 0 ≤ � ≤ M − 1. Here, H(z) is called the analysis polyphase matrix, whereas

F(z) is called the synthesis polyphase matrix.

14

Note that the polyphase form of the filter bank system shown in Fig. 1.12(a) consists of blocking

the input x(n) by M , filtering the blocked signal x(n) by the MIMO LTI system F(z)H(z), and then

unblocking the filtered signal x(n) by M to obtain the output x(n). Also note that the polyphase

form of the transmultiplexer consists of simply filtering the input vector signal {xk(n)} by the

MIMO LTI system H(z)F(z) to obtain the output vector signal {xk(n)}. Analogous properties

hold true for the general nonuniform systems shown in Fig. 1.11, although the details are much

more involved than for the uniform case [12] (see the Appendix of Chapter 2 for the analysis

required in the general nonuniform case).

1.1.4.2 Biorthogonality and Orthonormality

Recall that the filter bank system of Fig. 1.11(a) is said to be PR iff x(n) = x(n) for all x(n). For

the polyphase representation of the uniform system of Fig. 1.12(a), it can be shown [67] that PR

is equivalent to

F(z)H(z) = IM (Filter Bank Biorthogonality Condition) (1.5)

The filter bank system described by F(z) and H(z) is said to be biorthogonal if (1.5) holds. In other

words, biorthogonality is equivalent to the PR property. (This is true not only for filter banks, but

also for transmultiplexers as discussed below.) Clearly, we must have L ≥ M in order for (1.5) to

hold, since otherwise the left-hand side of (1.5) can never be of full normal rank M [67]. This is

consistent with the fact that the overall rate of the subbands, which is LM times the input rate, is

strictly less than the input rate when L < M , indicating a loss of information.

If H(z) satisfies a paraunitary (PU) condition [67] of the form

H(z)H(z) = IM (Paraunitary Condition) (1.6)

where the tilde notation is defined as H(z) � H†(1/z∗) [67], a special class of filter banks known

as orthonormal filter banks are generated upon choosing F(z) as

F(z) = H(z) (Ensures PR)

In other words, the filter bank is said to be orthonormal iff

H(z)H(z) = IM ,F(z) = H(z) (Filter Bank Orthonormality Condition) (1.7)

Orthonormal filter banks have been found to be useful for many applications [67] including data

compression [47]. Several popular data compression schemes such as JPEG and JPEG 2000 use

15

orthonormal filter banks to obtain good lossy data compression [41, 39]. Orthonormal filter banks

have several advantages which make them very attractive to use. For example, orthonormal filter

banks preserve the energy of the input signal in the subbands (i.e., the �2 norm of the subband

signals equals that of the input signal [67]), which may be useful if we wish to avoid overamplifying

or overattenuating the subbands. Also, orthonormal filter banks only require the design of either the

analysis or synthesis polyphase matrix, since the analysis/synthesis polyphase matrices are related

as F(z) = H(z). Finally, if the analysis filters {Hk(z)} or equivalently the analysis polyphase matrix

H(z) of an orthonormal filter bank are FIR, then the corresponding synthesis filters {Fk(z)} and

synthesis polyphase matrix F(z) are also necessarily FIR.

Biorthogonality and orthonormality conditions also exist for the transmultiplexer system of

Fig. 1.11(b). In particular, for the uniform system of Fig. 1.12(b), the biorthogonality condition

becomes

H(z)F(z) = IL (Transmulitplexer Biorthogonality Condition) (1.8)

Clearly, we must have L ≤ M here in order for (1.8) to hold, since otherwise the left-hand side

of (1.8) can never be of full normal rank L [67]. Analogous with the filter bank system, this is

consistent with the fact that when L > M , we have a loss of information. Similar to (1.7), the

orthonormality condition for the transmultiplexer system of Fig. 1.12(b) is given by

F(z)F(z) = IL ,H(z) = F(z) (Transmultiplexer Orthonormality Condition) (1.9)

It should be noted that biorthogonality and orthonormality conditions analogous to those given

in (1.5), (1.8), (1.7), and (1.9) exist for the general nonuniform systems shown in Fig. 1.11. The

interested reader is referred to [67, 45, 12] for more details.

1.2 Signal-Adapted Filter Banks

Any filter bank whose filters somehow depend on any knowledge of the input statistics is called

a signal-adapted filter bank. A typical model for a signal-adapted filter bank that will be used

throughout the thesis is the uniform M -channel maximally decimated filter bank shown in Fig.

1.13(a). The M -fold polyphase representation of this filter bank is shown in Fig. 1.13(b). Here,

the subband processors {Pk} may be nonlinear systems such as scalar quantizers or thresholding

devices or linear filters for denoising.

Throughout the thesis, we will assume that the input signal x(n) is a cyclo wide sense station-

ary process with period M (abbreviated CWSS(M)) [40]. This means that the mean µ(n) and

16

� � H0(z)

�

� H1(z)

�

�

� HM−1(z)

�

�

�

↓ M

↓ M

↓ M

�

�

�

Analysis Bank

x(n) w0(n)

w1(n)

wM−1(n)

�

P0

P1

PM−1

�

�

�

↑ M

↑ M

↑ M

�

�

�

F0(z)

F1(z)

FM−1(z)

�

�

�

�

�

�

�

Synthesis Bank

x(n)

w0(n)

w1(n)

wM−1(n)

�

SubbandProcessors

(a)

� �

�

�

↓ M

↓ M

↓ M

�z

�z

�z

�

�

�

H(z)

�

�

�

x(n)w0(n)

w1(n)

wM−1(n)

�

x(n)

� P0

P1

PM−1

�

�

�

F(z)

�

�

�

�

�

�

z−1

z−1

z−1

�↑ M

↑ M

↑ M

x(n)w0(n)

w1(n)

wM−1(n)

�

x(n)

�

(b)

Figure 1.13: (a) Typical uniform M -channel maximally decimated filter bank system. (b) Polyphaserepresentation of the filter bank.

autocorrelation Rxx(n, k) of x(n), which are given by

µ(n) = E [x(n)]

Rxx(n, k) = E [x(n)x∗(n − k)]

are periodic in n with period M . (Here E denotes the expectation operator [67].) It should be

noted that x(n) is CWSS(M) iff its M -fold blocked version x(n) shown in Fig. 1.13(b) is wide sense

stationary (WSS) [40], meaning that the mean and autocorrelation of x(n) do not depend on n.

Here, the mean µ and autocorrelation Rxx(k) are given by

µ = E [x(n)]

Rxx(k) = E[x(n)x†(n − k)

]

17

For the remainder of the thesis, we will assume that x(n) and hence x(n) are zero mean. Also, we

will assume that we only have knowledge of the second order statistics of x(n) (namely, Rxx(k)),

in addition to the given zero mean assumption on x(n). An equivalent representation of the

autocorrelation Rxx(k) that will be commonly used is the power spectral density (psd) Sxx(z),

which is simply the z-transform of Rxx(k).

1.2.1 Principal Component Filter Banks

Focusing on Fig. 1.13(b), often times, for simplicity of design as well as for other reasons, we will

restrict our attention to orthonormal or PU filter banks in which we have2

F(z)F(z) = I , H(z) = F(z) (1.10)

Recently, it has been shown that a special type of PU filter bank matched to the input statistics

Sxx(z) known as the principal component filter bank (PCFB) [62] is simultaneously optimal for

a variety of objective functions [1]. Among these objectives are included several important data

compression objectives such as mean-squared error under the presence of quantization noise [28]

(for any bit allocation) and coding gain [68, 69] (with optimal bit allocation). By definition, a

PCFB for an input psd Sxx(z) and for a class C of filter banks, if it exists, is one whose subband

variance vector

σ �[

σ2w0

σ2w1

· · · σ2wM−1

]T(1.11)

majorizes [22] any other subband variance vector arising from any other filter bank in C. (Recall

that a vector a �[

a0 a1 · · · aP−1

]T

with a0 ≥ a1 ≥ · · · ≥ aP−1 ≥ 0 is said to majorize [22]

a vector b �[

b0 b1 · · · bP−1

]T

with b0 ≥ b1 ≥ · · · ≥ bP−1 ≥ 0 iff we have

p∑k=0

ak ≥p∑

k=0

bk ∀ 0 ≤ p ≤ P − 2 ,P−1∑k=0

ak =P−1∑k=0

bk .)

In addition to being optimal for coding gain and mean-squared error in the presence of quantization

noise, the PCFB has also been shown to be optimal for any concave objective function of σ [1].2It should be noted that (1.10) is equivalent to the orthonormality condition given in (1.7) since the filter bank

is maximally decimated. Here, we have opted for the form given in (1.10), in which we focus on the design of thesynthesis bank, as it will be more natural when we constrain the synthesis filters to be causal FIR. These filters arecausal FIR iff F(z) is as well. This is not however true for the analysis filters and H(z) on account of the advancechain present in the blocking system.

18

1.2.1.1 Classes for Which PCFBs Are Known to Exist

Though PCFBs exhibit many optimal characteristics, they are only known to exist for special

classes of filter banks [1]. One notable exception to this is for the special case where M = 2, in

which case a PCFB always exists for any class of PU filter banks [1]. For general M , however,

PCFBs are known to exist only for two special classes. If C is the class of all transform coders

Ct, in which F(z) is a constant unitary matrix T, then the PCFB exists and is the Karhunen-

Loeve transform (KLT) for the input process x(n) (i.e., T diagonalizes the autocorrelation matrix

Rxx(0)) [23, 1]. Furthermore, if C is the class of all (unconstrained order) PU filter banks Cu, then

the PCFB exists and is the pointwise in frequency KLT for x(n) [68, 1, 69]. By this, we mean

that F(ejω) diagonalizes (i.e., totally decorrelates) Sxx(ejω) for every ω such that the frequency

dependent eigenvalues are always arranged in decreasing order, which is a property called spectral

majorization [68]. For many practical cases of inputs (for example, if the scalar input signal x(n)

is itself WSS), the corresponding analysis and synthesis filters are ideal bandpass filters called

compaction filters [68, 66, 65] (see Chapter 2 for more on compaction filters). As such, they are

unrealizable in practice. However, they serve to compute an upper bound on the performance that

we can expect from a PU filter bank.

1.2.1.2 Difficulties with the Class of FIR PU Systems

The problem with the class of FIR PU filter banks in which F(z) has finite memory (or more

appropriately finite McMillan degree3) is that it is believed that a PCFB doesn’t exist [27, 1, 24],

although this has not yet been formally proven. Instead, for this class, F(z) is typically chosen

to optimize a specific objective for a given input psd, such as coding gain [11, 8, 35, 79], rate-

distortion [36], or a multiresolution energy compaction criterion [37]. All such methods require the

numerical optimization of nonlinear and nonconvex objective functions which offer little insight

into the behavior of the solutions as the filter order (i.e., the memory of F(z)) increases.

Another common approach is to calculate an optimal FIR compaction filter [64, 59] (for the

first filter F0(z)) and then obtain the rest of the filters via an appropriate filter bank completion for

a multiresolution criterion [37, 49]. Though elegant in the sense that the filter bank design problem

is tantamount to calculating an FIR compaction filter followed by an appropriate KLT, it suffers

from the ambiguity caused by the nonuniqueness of the FIR compaction filter. Different compaction3The McMillan degree of a causal MIMO system is defined as the minimum number of delay elements required to

implement the system [67].

19

filter spectral factors lead to different filter banks which in turn yield different performances (as

is shown in Chapter 3). As such, all such spectral factors need to be tested for their performance

[49], which is exponentially computationally complex with respect to the compaction filter order.

Finally, none of the above-mentioned methods for the design of FIR PU signal-adapted filter

banks in the literature have been shown to tend toward the infinite-order PCFB solution as the

FIR degree or order increases. Intuition tells us that as the order increases, any FIR PU filter bank

designed to optimize any one objective for which the PCFB is optimal will tend to behave more

and more like the infinite-order PCFB, though this has not previously been shown in the literature.

One of the major contributions of this thesis is to show this behavior and to bridge the gap between

the zeroth-order KLT and infinite-order PCFB (see Chapters 3 and 4).

1.3 The FIR PU Interpolation Problem

In certain applications, it may be necessary for an FIR PU system, say F(ejω), to take on a

prescribed set of values over a prescribed set of frequencies. For example, suppose that for the

frequencies ω0, ω1, . . . , ωL−1, we require

F(ejωk) = Uk ∀ 0 ≤ k ≤ L − 1 (1.12)

Evidently, the matrices {Uk} must be unitary in light of the PU assumption on F(ejω). The problem

of finding an FIR PU system of a certain McMillan degree which satisfies (1.12) is known as the

FIR PU interpolation problem [71].

In the traditional FIR interpolation problem, in which the only restriction made on the inter-

polant is the FIR constraint, we can always find an interpolant of length at most equal to the

number of interpolation conditions by using the Lagrange interpolation formula [22]. However, for

the FIR PU interpolation problem of (1.12), in general, it is not known whether there even exists

an interpolant of finite degree which will satisfy all L conditions from (1.12). For the special case in

which F(ejω) is scalar, it is known that in general, only one condition from (1.12) can be satisfied

(since in this case, F(z) is necessarily a pure delay [71]).

Though there is no known solution to the FIR PU interpolation problem, using the optimization

algorithm presented in Chapter 4, we can find an approximant to an interpolant. For cases where an

interpolant of a certain degree is known to exist, this algorithm can be used to find the interpolant.

One of the contributions of the thesis here is thus a numerical approach to solve a theoretically

intractable problem.

20

M

W−1

IDFTM

Cyclic Prefix−L

& Unblocking� C(z) �

�

Blocking& GuardInterval

RemovalM

W

DFTM

ΛC

FEQM

s(n) s(n)

Channel

η(n)

Noise

ΛC is diagonal with

[ΛC

]k,k

=1

C(ejωk), ωk =

2πk

M, 0 ≤ k ≤ M − 1

Figure 1.14: Typical discrete multitone (DMT) system.

1.4 The Channel Shortening Equalizer Problem

Modern discrete multitone (DMT) digital communications systems such as the digital subscriber

loop (DSL) [46] have become popular and have revolutionized telephone wireline communications.

With the advent of such systems has come the need for channel shortening equalizers [33, 3, 6].

To see this, consider the typical DMT system shown in Fig. 1.14. Here, s(n) denotes a vector

of data corresponding to M users in the communications system. Also, W denotes the M × M

discrete Fourier transform (DFT) matrix [67], W−1 denotes the inverse DFT (IDFT), and ΛC is

the frequency domain equalizer (FEQ) matrix used to undo the effects of the channel. Perhaps the

most important feature of the DMT system of Figure 1.14 is the inclusion of redundancy in the

form of a cyclic prefix of length L [46]. The beauty of this redundancy is that it can be shown that

in the absence of noise, we have perfect reconstruction, i.e., s(n) = s(n), if the channel C(z) is of

length less than or equal to L + 1. Essentially, the DMT system is able to equalize an FIR channel

using only FIR components (namely, the blocking components along with the DFT matrices W

and W−1 as well as the FEQ). This is only possible because of the inherent redundancy.

From a practical point of view, we want the cyclic prefix length L as small as possible, since it

represents a redundancy which hinders the overall rate of the system by a factor of M(M+L) . However,

the channel may be very long, as is the case in practical DMT systems such as asymmetric DSL

(ADSL) in which the channel is a telephone wire line [46]. For example, in ADSL, M = 512 and

L = 32 [46], although the channels are typically hundreds of samples long. This suggests the need

for an equalizer at the receiver which shortens the channel, as shown in Figure 1.15. Such an

equalizer is typically called a time domain equalizer or TEQ [46]. As the channel may have zeros

21

� C(z) � � H(z) �

x(n)

Channel Equalizer

y(n)

η(n)

Noise

Figure 1.15: Channel shortening equalizer model.

near or outside the unit circle, the equalizer H(z) is usually not chosen to exactly shorten the

channel C(z), as this may result in spurious noise amplification. Instead, H(z) is typically an FIR

filter chosen to concentrate the energy of the effective channel Ceff(z) � H(z)C(z) in a window of

length L + 1.

The eigenfilter method [74, 56], which is one of the techniques used in some of the iterative

optimization algorithms proposed in the thesis, can be used for the design of channel shortening

equalizers. In Chapter 6, we show how the eigenfilter technique can be applied to obtain a low

complexity equalizer that accounts for both the effects of the channel length and the noise. There

it is shown through simulations that the equalizers designed perform nearly optimally in terms of

throughput or bit rate.

1.5 Outline of the Thesis

1.5.1 Eigenfilter Design of Overdecimated Compaction Filter Banks

(Chapter 2)

In Chapter 2, we present an iterative method for the design of FIR compaction filters and filter

banks. The model used is an overdecimated filter bank in which the PR property can never

hold in general. There, it is shown that the compaction filter design problem is tantamount to

minimizing the mean-squared error of the output and consists of optimizing a quadratic form

subject to quadratic PU constraints. To solve this problem, an iterative method is proposed in

which the constraints are linearized at each step. At each iteration, the linearized problem can

then be solved using the eigenfilter method [74, 54]. The proposed iterative algorithm, though

suboptimal, is low in complexity on account of the eigenfilter method, can be used to design more

than one filter at a time, and is shown to yield filters that behave like the infinite-order PCFB as

the order increases.

22

1.5.2 Multiresolution Optimal FIR PU Filter Banks (Chapter 3)

Though very low in complexity, the iterative eigenfilter method for compaction filter design suffers

from the drawbacks that the quadratic PU constraints may not be satisfied and also that the

performance in terms of compaction gain tends to saturate well below that of the infinite-order

PCFB as the order increases. To alleviate this problem, in Chapter 3, a different iterative algorithm

is proposed for the design of compaction filters. Using a complete parameterization of all FIR

PU systems in terms of degree-one Householder building blocks [75, 67], an iterative algorithm

for designing an FIR compaction filter is presented in which the objective is to approximate the

infinite-order PCFB compaction filter in the least-squares sense. This ensures that the desired

PU constraint is always in effect. At each iteration, one set of parameters in the Householder-like

characterization of the compaction filter is globally optimized assuming all other parameters are

fixed. As such, the algorithm is greedy and the mean-squared error at every iteration is guaranteed

to be monotonic nonincreasing per iteration.

Since the phase of the infinite-order PCFB compaction filter is arbitrary (see Chapters 2 and

4), a modification is proposed in which the phase of the FIR compaction filter is fed back to the

desired infinite-order compaction filter response. In essence, this modification, which we call the

phase feedback modification, allows the algorithm to find an easier phase to approximate with an

FIR solution. With this modification in effect, it is shown that the algorithm not only still remains

greedy, but also results in better FIR compaction filters in terms of compaction gain. Simulation

results show a monotonic increase in compaction gain as a function of filter order that comes very

close to the infinite-order compaction filter bound.

To obtain the rest of the filters of a maximally decimated signal-adapted orthonormal filter

bank, a multiresolution optimality criterion [62, 37] is used. With this criterion, the entire filter

bank is elegantly designed using only an FIR compaction filter followed by a simple KLT. Though

elegant, this criterion is shown to suffer from the inherent nonuniqueness of the FIR compaction

filter in terms of its different spectral factors. Different spectral factors lead to different filter banks

which in turn yield different performances. By choosing the spectral factor which yields the largest

coding gain, it is shown that the FIR filter banks designed perform more and more like the infinite-

order PCFB in terms of several objectives as the FIR order increases. This serves to bridge the gap

between the zeroth-order KLT and infinite-order PCFB.

23

1.5.3 Direct FIR PU Approximation of the PCFB (Chapter 4)

Due to the inherent nonuniqueness of an FIR compaction filter in terms of its spectral factors,

designing signal-adapted filter banks using the multiresolution criterion used in Chapter 3 becomes

computationally intractable as the filter order increases. This is because the number of spectral

factors and hence filter banks whose performance must be tested grows exponentially with order.

To avoid this problem, in Chapter 4, the iterative algorithm of Chapter 3 is generalized to obtain all

of the synthesis filters at the same time. In particular, a method is proposed for solving the general

weighted least-squares approximation problem for the matrix case using an FIR PU approximant.

Using the above-mentioned Householder-like factorization of such systems, an iterative algorithm is

proposed in which a set of parameters is globally optimized at each iteration assuming all others are

fixed. As such, the algorithm is greedy in the same way in which the one from Chapter 3 is as well.

To design an FIR PU signal-adapted filter bank, the proposed algorithm is used to approximate

the synthesis polyphase matrix of an infinite-order PCFB. With this method, all of the filters are

obtained at the same time with only a marginal increase in complexity over the method of Chapter

3. As the infinite-order PCFB synthesis polyphase matrix exhibits a phase-type ambiguity (which

we formally define in Chapter 4), a generalization of the phase feedback modification from Chapter 3

to the matrix case is proposed. With this modification in effect, the algorithm not only still remains

greedy but also yields a better magnitude-type fit for the synthesis filters. As all of the filters are

found together, this eliminates the need to complete the filter bank using an FIR compaction

filter as well as the problems caused by the nonuniqueness of this filter. Simulation results show

that the FIR PU signal-adapted filter banks designed using this method behave more and more

like the infinite-order PCFB in terms of numerous objectives as the order increases. As with the

method proposed in Chapter 3, this serves to bridge the gap between the zeroth-order KLT and the

infinite-order PCFB. However, this method avoids the need to check the performance of different

compaction filter spectral factors, which is exponentially computationally complex with filter order.

In addition to being useful for the design of signal-adapted filter banks, the proposed algorithm

can also be used for the FIR PU interpolation problem through proper choice of the weight function.

Though this problem is still open, the proposed algorithm can always be used to find an approximant

to a desired interpolant. For cases where an interpolant is known to exist, the proposed algorithm

can be used to find this interpolant, as simulations show.

24

1.5.4 Coding Gain Optimal FIR Filter Banks (Chapter 5)

In Chapter 5, we consider the optimal design of a signal-adapted filter bank in which the filters are

FIR but otherwise unconstrained. The model used is a uniform filter bank with a variable number of

channels with scalar quantizers in the subbands and the objective is to minimize the mean-squared

error observed at the filter bank output. This is equivalent to maximizing the coding gain of the

system. Assuming that the analysis (synthesis) bank is fixed, globally optimizing the corresponding

synthesis (analysis) bank is very simple and can be done using the principle of completing the square

[22], as is shown. This leads to an iterative greedy algorithm in which the analysis and synthesis

banks are alternately optimized. Simulation results verify the greediness of the algorithm and

exhibit its usefulness. When we ignore the effects due to quantization, the algorithm can be used

to design overdecimated filter banks for optimal reconstruction. In many cases, the filters designed

compact the energy of the input, much like the compaction filters of an infinite-order PCFB. Upon

accounting for the effects of quantization, it is shown that the quantization systems designed come

close to the information theoretic rate-distortion bound [10, 7] and coding gain bound [25]. As the

filter orders increase, the quantized filter banks designed come closer to these bounds, in line with

intuition. This phenomenon has previously not been shown in the literature.

1.5.5 Eigenfilter Channel Shortening Equalizer for DMT Systems (Chapter 6)

Chapter 6 differs in theme from the previous chapters in the sense that it is concerned with the

design of channel shortening equalizers for DMT systems. There, it is shown that the eigenfilter

method used in Chapter 2 can be used for this problem. In particular, we show how the eigenfilter

method can be used for the design of a fractionally spaced equalizer (FSE) for channel shortening

which jointly accounts for the effects due to the channel length as well as noise. One advantage

of this method is that it is lower in complexity than other commonly used methods. As opposed

to other similar methods which require a different Cholesky decomposition [22] of a certain matrix

for every delay parameter chosen, the proposed method only requires one. Despite a significant

decrease in complexity, it is shown that there is only a minimal loss in the observed bit rate or

throughput. Simulation results for various practical channels encountered in a DSL environment

show that the proposed method performs nearly optimally in terms of bit rate.

25

1.6 Notations

The notation used in the thesis closely parallels that used in [67]. In particular, the subscripts (∗),(T), and

(†) denote, respectively, the conjugate, transpose, and conjugate transpose of a general

matrix quantity. The real and imaginary parts of a complex number c will be denoted by Re[c]

and Im[c], respectively. Typically, boldface letters are used for matrices and vectors. The (k, �)-th

element of a matrix P will be denoted by [P]k,�. The notation diag (a0, a1, . . . , aM−1) is used to

denote an M × M diagonal matrix whose diagonal elements are the values a0, a1, . . . , aM−1 (i.e.,

if Λ = diag (a0, a1, . . . , aM−1), then [Λ]k,k = ak). Here, Tr [A] will be used to denote the trace

of a matrix A, which is the sum of its diagonal elements. Also, we will use ||A||F to denote the

Frobenius norm of any matrix A, which is defined as ||A||F �√

Tr [A†A] [22].

Lowercase letters are commonly used for discrete-time sequences, whereas uppercase letters are

often used for Fourier and z-transforms. Together with Fig. 1.3(a) and 1.4(a), (1.1) establishes

the notation that will be used for expanders and decimators. For any matrix system H(z), its

paraconjugate H†(1/z∗) [67] will be denoted by H(z).

Finally, in line with standard notation used in statistical signal processing [48], E [·] will be

used to denote the expectation operator.

26

Chapter 2

Eigenfilter Design of OverdecimatedCompaction Filter Banks

In this chapter, a method is proposed for the design of a uniform overdecimated FIR PU filter

bank optimized for energy compaction. If no FIR constraint is imposed, then the corresponding

optimal filters are unconstrained order PCFB filters. For many practical cases of inputs, these

filters are ideal bandpass filters called compaction filters, as is shown here. As such, these filters

are necessarily of infinite-order.

When an FIR constraint is imposed on the energy compaction objective, the problem becomes

tantamount to maximizing a quadratic form subject to quadratic (and often singular) constraints,

as we show here. The method proposed to solve this problem consists of iteratively linearizing the

singular quadratic constraints. At each iteration, the eigenfilter approach [74, 56] can be used to

solve this problem, as is shown. Simulations provided show that the filters designed behave like

those of the infinite-order PCFB as the order increases.

Finally, we expand on some of the optimality properties of PU overdecimated filter banks in

general. This is shown not only for the uniform overdecimated filter bank considered here, but also

for a more general rational nonuniform overdecimated synthesis bank signal approximation model.

It turns out that for this model, the optimal driving signals are generated by a corresponding

overdecimated analysis bank as we show here. If the synthesis bank satisfies a PU condition similar

to (1.6), then the corresponding optimal analysis bank is just the paraconjugate of the synthesis

bank, yielding an orthonormal overdecimated system.

The content of this chapter is mainly drawn from [54, 49] and portions of it have been presented

at [57].

27

2.1 Outline

In Sec. 2.2, we review the traditional compaction filter problem. The solution in the unconstrained

order case is presented in Sec. 2.2.1, with a special emphasis in Sec. 2.2.1.1 for the case in which the

input is WSS. An overview of previous work on FIR compaction filters is presented in Sec. 2.2.2.

In Sec. 2.3, we introduce the energy compaction problem for uniform overdecimated PU filter

banks. The relation between energy compaction and minimizing the mean-squared error of the

output is revealed in Sec. 2.3.1. As shown in the Appendix, this relation holds not only for the

uniform overdecimated model, but also for a more general rational nonuniform model as well. In

Sec. 2.3.2, the FIR constraint is imposed on the energy compaction problem and it is shown that

the problem is quadratic in the filter coefficients with quadratic constraints.

The iterative approach to solving the FIR energy compaction problem by linearizing the singular

quadratic constraints is proposed in Sec. 2.4. In Sec. 2.4.1, it is shown that the linearized problem

at each iteration can be solved using the eigenfilter method. Simulation results provided in Sec. 2.5

show that the FIR filters designed have a tendency to behave more and more like the corresponding

PCFB filters as the order increases. Concluding remarks are made in Sec. 2.6.

Finally, in the Appendix, we focus on the general rational nonuniform overdecimated model

for signal approximation and derive the optimal driving signals for this model. First, the opti-

mal driving signals are derived for the deterministic case and then proven to be optimal for the

stochastic case as well. When the synthesis bank satisfies a PU condition analogous to (1.6), the

optimality of the corresponding orthonormal filter bank is shown as well as the relation between

energy compaction and minimizing mean-squared error.

2.2 The Compaction Filter Problem

Consider the one-channel overdecimated filter bank system shown in Fig. 2.1(a). This is essentially

an example of the system shown in Fig. 1.13(a) in which the subband processors {Pk} keep only

one subband and discard the rest, i.e., Pk = δ(k) for 0 ≤ k ≤ M −1. As before, we assume that the

input x(n) is CWSS(M) whose M -fold blocked version x(n) has psd Sxx(z). The compaction filter

problem is to maximize the variance of the subband process w(n) subject to an orthonormality

condition on the filter bank similar to the one from (1.10). In particular, this condition is

H(z) = F (z) ,[F (z)F (z)

]↓M

= 1 (2.1)

28

� H(z) � ↓ M �x(n) F (z) �↑ M � x(n)w(n)

(a)

� �

�

�

↑ M

↑ M

↑ M

�

�

�

��

�

�

↓ M

↓ M

↓ M

�

�

�

�

�

�

�x(n) x(n)

z

z

z z−1

z−1

z−1

x(n)

�

x(n)

�w(n)f(z)

M × 1

h(z)

1 × M

(b)

Figure 2.1: (a) One-channel overdecimated filter bank model. (b) Polyphase implementation.

This constraint on F (z) (and hence H(z)) is called the magnitude squared Nyquist(M) constraint,

since the magnitude squared response∣∣F (ejω)

∣∣2 is Nyquist(M) [67]. It can be shown that the

compaction problem is tantamount to minimizing the mean-squared error of the filter bank output

in blocked form (see Sec. 2.3.1 and the Appendix). In other words, the error is minimized when we

compact the energy of the input x(n) in the subband signal w(n).

If we express H(z) and F (z), respectively, in terms of their Type II and Type I M -fold polyphase

decompositions, then the system of Fig. 2.1(a) can be redrawn as in Fig. 2.1(b). Here, h(z) is a

row vector consisting of polyphase components of H(z) and f(z) is a column vector consisting of

polyphase components of F (z). In terms of h(z) and f(z), the orthonormality condition of (2.1) is

h(z) = f(z) , f(z)f(z) = 1 (2.2)

which is similar to the condition of (1.10). With (2.2) in effect, the variance of the subband signal

w(n), namely, σ2w, becomes the following.

σ2w =

12π

∫ 2π

0f †(ejω)Sxx(ejω)f(ejω) dω (2.3)

Combining (2.3) with (2.2), the compaction filter problem is to choose F (z) or equivalently f(z) to

solve the following problem.

Maximize σ2w =

12π

∫ 2π

0f †(ejω)Sxx(ejω)f(ejω) dω subject to f †(ejω)f(ejω) = 1 ∀ ω. (2.4)

29

2.2.1 Solution in the Unconstrained Order Case

If no restrictions are made on the order of F (z) (or equivalently f(z)), then the optimal choice of

f(ejω) which solves the compaction filter problem of (2.4) is any frequency dependent unit norm

eigenvector v(ejω) of the matrix Sxx(ejω) which corresponds to its largest frequency dependent

eigenvalue λmax(ejω) [62, 68]. This follows from Rayleigh’s principle [22], which states that the max-

imum value of the integrand of (2.3) for any ω, namely, f †(ejω)Sxx(ejω)f(ejω), is simply λmax(ejω),

with the unit norm constraint f †(ejω)f(ejω) = 1 in effect. This maximum value occurs iff f(ejω)

is any unit norm vector in the eigenspace corresponding to λmax(ejω). Since this choice of f(ejω)

maximizes the integrand of (2.3) for all ω, it hence maximizes its integral σ2w as well. The resulting

optimal compaction filter F (ejω) (or equivalently F ∗(ejω)) is in fact the first synthesis (analysis)

filter of the unconstrained order PCFB [62, 68, 1]. It should be noted that the remaining synthesis

(as well as analysis) filters are themselves optimal compaction filters. In particular, the k-th syn-

thesis (or analysis) filter corresponding to the unconstrained order PCFB is an optimal compaction

filter for the psd Sxx(ejω) with the k largest eigenvalues removed (i.e., set to zero). This psd will

be called the k-th peeled spectrum of x(n), since it represents the psd Sxx(ejω) with the largest k

eigenvalues peeled off. It should be noted here that the optimal filter F (ejω) can only be expressed

in terms of the psd Sxx(ejω) pointwise in frequency. The order of the optimal F (ejω) depends

greatly on the nature of the psd Sxx(ejω), as will soon be shown.

2.2.1.1 Special Case of WSS Input

For many practical scenarios, the scalar input signal x(n) is itself WSS. In this case, the psd Sxx(z)

of the blocked version x(n) specifically has a pseudocirculant structure [40] (see the Appendix). If

Sxx(z) denotes the psd of x(n) here, then the compaction filter problem of (2.4) can be simplified

as follows [68].

Maximize σ2w =

12π

∫ 2π

0Sxx(ejω)

∣∣F (ejω)∣∣2 dω subject to

[∣∣F (ejω)∣∣2]

↓M= 1 ∀ ω. (2.5)

Note that in this special case where x(n) is WSS, the compaction filter problem of (2.5) only

depends on the magnitude of F (ejω) and not on its phase. As such, the optimum compaction filter

is not unique.

If no order constraint is made on F (z), then the optimum compaction filter is in general an

ideal bandpass filter. This was first shown by Unser [66] for the special case of M = 2 and later

generalized by Vaidyanathan [68] for arbitrary M . In particular, the optimum compaction filter

30

must satisfy ∣∣F (ejω)∣∣2 =

M , ω ∈ ωx

0 , otherwise(2.6)

where ωx is the set of frequencies defined as follows.

ωx �{

ω ∈ [0, 2π) : Sxx(ejω) ≥ Sxx

(ej(ω+ 2πm

M ))

∀ 1 ≤ m ≤ M − 1}

(2.7)

If ties exist in (2.7), then only one frequency is chosen. In addition to the nonuniqueness of the

compaction filter with respect to phase, there is also a possible nonuniqueness with respect to mag-

nitude if ties exist in the comparison step in (2.7). Regardless of these sources of nonuniqueness,

it is clear that when the input is WSS, any unconstrained order optimal compaction filter is nec-

essarily an ideal bandpass filter in general. As such, these filters are unrealizable in practice and

only serve as a benchmark for the performance we can expect from practical, realizable filters.

Finally, we should note that when x(n) is WSS, the PCFB analysis/synthesis filters, which

are obtained from Sxx(ejω) and its peeled spectra, are themselves infinite-order bandpass filters

whose magnitude responses are similar to the form given in (2.6). Hence, all of the filters of an

unconstrained PCFB in this case are necessarily infinite in order.

2.2.2 Overview of Previous Work on FIR Compaction Filters

If, in addition to the PU constraint of (2.2), we impose an FIR constraint on F (z) (or equivalently

f(z)), the compaction filter problem of (2.4) becomes much more complicated. The reason for this

is that, in terms of the filter coefficients, the problem is tantamount to maximizing a quadratic

form subject to a quadratic unit norm condition as well as several singular quadratic constraints

[9], as is shown in Sec. 2.3.2. This problem is nonconvex in terms of the filter coefficients [9, 35, 64]

and as such, complicated numerical techniques must be employed for the design of FIR compaction

filters. A brief survey of some of the methods proposed for their design is given below. It should

be noted here that all of these methods apply only for the special case in which the input x(n) is

WSS (i.e., the compaction filter problem simplifies to (2.5)).

1. Quadratically Constrained Optimization: Several methods were proposed for solving the above-

mentioned quadratically constrained quadratic form maximization problem numerically using

the method of Lagrange multipliers (see [8] for the special case where M = 2 and [9] for

arbitrary M). Though these methods do not require spectral factorization, they are only

guaranteed to reach a local optimum due to the nonconvex nature of the optimization problem.

31

2. Optimizing the FIR Lattice Structure: For the special case where M = 2, it is well known

that any real coefficient FIR PU filter bank is completely parameterized by a lattice structure

[67] which is characterized by a finite set of rotation angles. Several methods were proposed

for the optimization of these angles [70, 11]. Though these methods automatically satisfy

the desired PU constraint and do not require spectral factorization, they have the drawback

that they are only guaranteed to reach a local optimum and only work for the special case of

M = 2.

3. Optimizing the Product Filter: As the compaction filter problem of (2.5) is only in terms

of the magnitude squared response∣∣F (ejω)

∣∣2, several methods have been proposed for the

design of the product filter G(z) � F (z)F (z). In this case, the objective function σ2w becomes

linear in terms of the coefficients of G(z), but, in addition to the usual PU constraint of (2.2),

the positivity constraint G(ejω) ≥ 0 must be also be enforced. Hence, the problem is a linear

programming (LP) problem with infinitely many positivity inequality constraints (which is

typically called a semi-infinite programming (SIP) problem [35]). Several approaches were

proposed for finding the optimal product filter, including the window method of [29] and

the frequency discretization method of [35, 37], in which the positivity constraints are only

satisfied on a finite set of frequency points. These methods have been shown to yield good

performance, despite the fact that there is no guarantee of global optimality and a spectral

factorization step is required at the end.

4. State Space Approach: Recently an elegant method for the design of optimum FIR compaction

filters was proposed based on a state space description of the compaction filter [64]. This

method, which is based on a semi-definite programming (SDP) technique, ensures a globally

optimal compaction filter and doesn’t require a spectral factorization step. However, this

method only applies for WSS inputs x(n) and becomes extremely computationally intensive

as the filter order increases.

Using the proposed iterative eigenfilter method for the overdecimated filter bank model of Sec.

2.3, we can design several PCFB-like compaction filters simultaneously. Furthermore, since the

eigenfilter approach is used here, the method is very low in complexity [74, 56] and also can be

used when the input x(n) is CWSS(M).

32

� �

�

�

H0(z)

H1(z)

HL−1(z)

�

�

�

�

�

�

↓ M

↓ M

↓ M

�

�

�

x(n) �

�

�

F0(z)

F1(z)

FL−1(z)

�

�

�

�

↑ M

↑ M

↑ M

�

�

� y(n)

(a)

�

�

�

�

�

�

↑ M

↑ M

↑ M

�

�

�

��

�

�

↓ M

↓ M

↓ M

�

�

�

�

�

�

�x(n) y(n)

z

z

z z−1

z−1

z−1

x(n)

�

y(n)

�

w(n)

�

F(z)

M × L

H(z)

L × M

(b)

Figure 2.2: (a) Uniform overdecimated filter bank (L < M), (b) Polyphase representation.

2.3 Energy Compaction Problem for Overdecimated Filter Banks

Here, we focus on the overdecimated uniform filter bank shown in Fig. 2.2(a). In accordance with

Sec. 1.1.4, by overdecimated, we mean that the number of channels L satisfies L < M , i.e., the

number of subbands is strictly less than the decimation ratio. Also, recall from Sec. 1.1.4 that with

such a system, we have a loss of information and properties such as alias cancellation and PR are

in general impossible. If we consider the following polyphase decompositions (see Sec. 1.1.3.2) of

the analysis filters Hk(z) and synthesis filters Fk(z) for 0 ≤ k ≤ L − 1,

Hk(z) =M−1∑�=0

z�Hk,�(zM ) (Type II)

Fk(z) =M−1∑�=0

z−�Fk,�(zM ) (Type I)

33

then the system of Fig. 2.2(a) can be redrawn as in Fig. 2.2(b), where

[H(z)]�,m = H�,m(z) , [F(z)]m,� = F�,m(z)

for 0 ≤ � ≤ L − 1 and 0 ≤ m ≤ M − 1. Note that here, the vector signals x(n) and y(n) denote,

respectively, the M -fold blocked versions [67] of the filter bank input x(n) and output y(n).

2.3.1 Derivation of the Energy Compaction Problem

Let us denote the autocorrelation sequence and psd of x(n) by Rxx(k) and Sxx(z), respectively.

In addition to this stationarity assumption on x(n), we will also assume that the filter bank is

orthonormal. This means that the matrices H(z) and F(z) from Fig. 2.2(b) satisfy [67]

H(z) = F(z) , F(z)F(z) = IL (2.8)

With the above assumptions on the input and filter bank, it can easily be shown that minimizing

the error of the output is equivalent to compacting the energy of the signal w(n).

Suppose that we wish to choose H(z) and F(z) subject to the orthonormality constraint of (2.8)

to minimize the expected mean-squared error between x(n) and y(n), defined as follows.

ξ � E[||x(n) − y(n)||2

](2.9)

If we define the blocked filter error e(n) as e(n) � x(n) − y(n) and denote the psd of e(n) by

See(z), then from (2.9), we have

ξ = Tr[E[e(n)e†(n)

]]=

12π

∫ 2π

0Tr[See(ejω)

]dω (2.10)

From Fig. 2.2(b) and [67], it can be shown that we have

See(z) = Sxx(z) − F(z)H(z)Sxx(z) − Sxx(z)H(z)F(z)

+ F(z)H(z)Sxx(z)H(z)F(z) (2.11)

Imposing the orthonormality constraint of (2.8) in (2.11) yields

Tr [See(z)] = Tr [Sxx(z)] − Tr[F(z)Sxx(z)F(z)

]Substituting this into (2.10) leads to the following.

ξ =12π

∫ 2π

0Tr[Sxx(ejω)

]dω

− 12π

∫ 2π

0Tr[F†(ejω)Sxx(ejω)F(ejω)

]dω︸︷︷︸

σ2w

(2.12)

= Tr [Rxx(0)] − σ2w (2.13)

34

Hence, from (2.13), with the orthonormality constraint of (2.8) in effect, minimizing ξ from (2.9) is

equivalent to maximizing σ2w. But σ2

w is just the energy of the subband vector process w(n) from

Fig. 2.2(b), i.e., σ2w = Tr [Rww(0)], where Rww(k) denotes the autocorrelation of w(n). Thus,

minimizing the mean-squared error of the overdecimated filter bank is equivalent to maximizing or

compacting the energy of the subband process w(n). It can be shown that if no length constraints

are made on the matrix F(z) from Fig. 2.2(b), then an optimal set of synthesis filters Fk(z) for

0 ≤ k ≤ L − 1 from Fig. 2.2(a) which maximize σ2w from (2.13) are the first L ideal compaction

filters appearing in the infinite-order PCFB for Sxx(z) [62, 68].

2.3.2 Imposing the FIR Constraint on the Matrix F(z)

Suppose now that in addition to the orthonormality constraint of (2.8), the matrix F(z) is causal

and FIR of length N . In other words, suppose that we have the following.

F(z) =N−1∑n=0

f(n)z−n (2.14)

where f(n) is the M ×L impulse response of F(z). Define the MN ×L impulse response matrix f

and M × MN block delay matrix d(z) as follows.

f �[

fT (0) fT (1) · · · fT (N − 1)]T

d(z) �[

IM z−1IM · · · z−(N−1)IM

]From (2.14), we clearly have F(z) = d(z)f and F(z) = f †d(z). Substituting this into (2.12) yields

the following.

σ2w = Tr

[f †(

12π

∫ 2π

0d†(ejω)Sxx(ejω)d(ejω) dω

)︸︷︷︸

R

f]

(2.15)

Here, the MN × MN matrix R is positive semidefinite and can be expressed in terms of the

autocorrelation of x(n) as follows.

R =

Rxx(0) Rxx(−1) · · · Rxx(−(N − 1))

Rxx(1) Rxx(0) · · · Rxx(−(N − 2))...

.... . .

...

Rxx(N − 1) Rxx(N − 2) · · · Rxx(0)

(2.16)

35

From (2.16), note that R is the N -fold block autocorrelation matrix corresponding to x(n) and

that R is a block Toeplitz matrix [22]. In the special case where the scalar input signal x(n) is WSS

with autocorrelation Rxx(k), then we have

[Rxx(k)]�,m = Rxx(Mk + � − m) , 0 ≤ �, m ≤ M − 1

and so R in this case is actually Toeplitz.

To analyze the orthonormality condition of (2.8) with the FIR constraint on F(z) in effect,

define G(z) � F(z)F(z). Then, from (2.8), in the time domain, we require

g(n) = f †(−n) ∗ f(n) =∑m

f †(m)f(m + n) = ILδ(n) (2.17)

where g(n) is the impulse response of G(z). Assuming F(z) to be causal and FIR as in (2.14), then

g(n) can only be nonzero for −(N − 1) ≤ n ≤ (N − 1). As g†(−n) = g(n), the orthonormality

conditions of (2.17) only need to be satisfied for 0 ≤ n ≤ N−1. These constraints can be compactly

written in terms of the matrix f as follows [9].

f †Sk f = ILδ(k) , 0 ≤ k ≤ N − 1 (2.18)

where Sk (MN × MN) is the k-th block shift matrix given by the following.

Sk =

0(N−k)M×kM I(N−k)M

0kM×kM 0kM×(N−k)M

, 0 ≤ k ≤ N − 1 (2.19)

For example, we have

S1 =

0 IM 0 · · · 0

0 0 IM. . .

......

.... . . . . . 0

......

. . . IM

0 0 · · · · · · 0

Note that for 1 ≤ k ≤ N − 1, the matrix Sk from (2.19) is singular. Hence, from (2.18), it follows

that there are (N − 1) singular quadratic constraints. The key to deriving the iterative eigenfilter

technique lies in linearizing these singular constraints. This is done by expressing the constraints

of (2.17) in an implicit form. Note that for n = 0, (2.17) can be expressed in terms of the matrix

f as follows.

f †f = IL (2.20)

36

Similarly, for 1 ≤ n ≤ N − 1, (2.17) can be expressed in terms of f as follows.

0 f †(0) f †(1) · · · f †(N − 2)

0 0 f †(0) · · · f †(N − 3)...

.... . . . . .

...

0 0 · · · 0 f †(0)

︸︷︷︸

C

f(0)

f(1)...

f(N − 1)

︸︷︷︸

f

=

0

0...

0

︸︷︷︸

0L(N−1)×L

(2.21)

It should be noted that the L(N − 1) × MN matrix C from (2.21) is a function of the impulse

response coefficients f(n). As such, the constraint in (2.21) is an implicit quadratic constraint.

Combining (2.15), (2.20), and (2.21), the energy compaction problem in the presence of the FIR

constraint on F(z) can be expressed as follows.

Maximize σ2w = Tr

[f †Rf

]subject to f †f = IL and Cf = 0L(N−1)×L

(2.22)

with R and C as in (2.16) and (2.21), respectively.

In general, the optimization problem of (2.22) is nonlinear and nonconvex in terms of the

elements of the matrix f . What makes the problem difficult to solve is the implicit quadratic

constraint Cf = 0 from (2.21). Using the iterative approach for solving the optimization problem

of (2.22) to be discussed in the next section, it is possible to turn this implicit quadratic constraint

into an explicit linear constraint. Once this constraint becomes linear, the optimization at each

iteration can be solved exactly using the eigenfilter technique [74, 56], which is low in complexity

and numerically stable. Before showing this, we will first formally present the iterative algorithm

for solving the optimization problem of (2.22).

2.4 Iterative Eigenfilter Method for Solving the FIR Compaction

Problem

In what follows, let fk(n) denote the impulse response f(n) at the k-th iteration. Also, define the

MN × L matrix fk and L(N − 1) × MN matrix Ck as follows.

fk �[

fTk (0) fT

k (1) · · · fTk (N − 1)

]T(2.23)

37

Ck �

0 f †k(0) f †k(1) · · · f †k(N − 2)

0 0 f †k(0) · · · f †k(N − 3)...

.... . . . . .

...

0 0 · · · 0 f †k(0)

(2.24)

Then, the proposed iterative algorithm is as follows.

Initialization:

Choose any f0(n) which satisfies the orthonormality constraints of (2.17). This can be eas-

ily done using the complete characterization of FIR PU systems in terms of degree-one

Householder-like building blocks [75, 67]. Compute f0 from f0(n).

Iteration: For k ≥ 1, do the following.

1. Compute the constraint matrix Ck−1.

2. Solve the linearized optimization problem

Maximize σ2w,k = Tr

[f †kRfk

]subject to f †k fk = IL and Ck−1fk = 0L(N−1)×L

(2.25)

3. To measure the convergence of the iteration to an orthonormal solution, calculate the

orthonormality error matrix at the k-th iteration defined by

εk � Ck fk (2.26)

As we need εk = 0 in theory (from (2.21), (2.23), (2.24), and (2.26)), terminate the

iteration when we have

||εk||F < δT (2.27)

where ||εk||F denotes the Frobenius norm of εk [22] and δT is a some small threshold

value.

Before proceeding, it should be noted that there is no guarantee that the iterative algorithm

will converge to an orthonormal solution, although in simulations it often does so as shown below.

At present, there is no known method as to what should be done if the iteration fails to converge

to an orthonormal solution. Furthermore, even if there is convergence, there is no guarantee that

the resulting solution is globally optimal.

38

Despite this, often times in simulations such as those presented below, the algorithm performs

well in terms of approaching the behavior of the ideal compaction filters of the infinite-order PCFB.

Also, the algorithm can be used for relatively large orders N , as the linearized optimization problem

of (2.25) can be solved using the eigenfilter approach [67]. We now proceed to show how to solve

the linearized optimization problem of (2.25).

2.4.1 Solution to the Iterative Linearized Optimization Problem

Consider the linear constraint Ck−1fk = 0 from (2.25). This constraint holds iff the columns of fk

lie in the null space of Ck−1 [22]. Let Uk−1 denote a unitary matrix whose columns span the null

space of Ck−1. If ρ denotes the dimension of the null space of Ck−1, then Uk−1 is MN × ρ. As

the columns of fk must lie in the null space of Ck−1, fk must be of the form fk = Uk−1a for some

arbitrary ρ × L matrix a. Hence, we have

Ck−1fk = 0 ⇐⇒ fk = Uk−1a (2.28)

Given that Ck−1 is L(N − 1) × MN , we can easily argue that the dimension of its null space ρ

satisfies ρ ≥ L. Hence, the linear constraint Ck−1fk = 0 transforms the problem of finding fk into

that of finding the ρ × L matrix a. The quantity a is arbitrary but must be such that the unitary

constraint f †k fk = IL from (2.25) is satisfied. Clearly, from (2.28), we have

f †k fk = IL ⇐⇒ a†a = IL

upon exploiting the unitarity of Uk−1. As can be seen, the constraints of (2.25) transform the

problem of finding fk into that of finding a where a is allowed to by any ρ × L unitary matrix.

Hence, the optimization problem of (2.25) can be recast as follows.

Maximize σ2w,k = Tr

[a†Rk−1a

]where Rk−1 � U†

k−1RUk−1

subject to the constraint a†a = IL

(2.29)

The solution to this problem follows from a generalization of Rayleigh’s principle [22, p. 191] and

is as follows. Suppose that Rk−1 has the following unitary diagonalization.

Rk−1 = Vk−1Λk−1V†k−1

where Vk−1 is a ρ × ρ matrix of eigenvectors of Rk−1 and Λk−1 is a diagonal matrix consisting of

the eigenvalues of Rk−1. In addition, suppose that Λk−1 = diag (λk−1,0, λk−1,1, . . . , λk−1,ρ−1) and

39

0 0.1 0.2 0.3 0.4 0.50

2

4

6

8

f = ω/2πM

agni

tude

Figure 2.3: Input power spectral density Sxx(ejω).

that the eigenvalues have been ordered in decreasing order, i.e., λk−1,0 ≥ λk−1,1 ≥ · · · ≥ λk−1,ρ−1.

Then, the solution to the optimization problem of (2.29) is given to be [22]

σ2w,k =

L−1∑i=0

λk−1,i

which occurs iff we have

a = Vk−1b

where b is a ρ × L matrix of the form

b =

B

0(ρ−L)×L

(2.30)

and B is any L×L square unitary matrix. In other words, the optimal a is a such that its columns

are unitary combinations of the first L eigenvectors of Rk−1. Once an optimal a has been found,

the corresponding optimal fk can be found using fk = Uk−1a. As computing the optimal synthesis

filter matrix fk requires the eigendecomposition of a particular matrix, it follows that the original

linearized optimization problem of (2.25) is an eigenfilter type problem [74, 56]. The simulation

results presented in the next section show the merit of the proposed iterative eigenfilter method.

2.5 Simulation Results

To test the proposed iterative eigenfilter algorithm, the input process x(n) was chosen to be a real

WSS autoregressive process of order 4 (AR(4)) whose power spectrum Sxx(ejω) is plotted in Fig.

2.3. For all of the simulation results presented here, we chose M = 7, i.e., the block size of the

filter bank used was 7. Also, the matrix B from (2.30) was chosen to be IL for all examples.

40

0 100 200 300 400−350

−300

−250

−200

−150

−100

−50

0

k||ε

k|| F (

dB)

N = 3N = 10

Figure 2.4: Orthonormality error ||εk||F vs. the iteration index k (L = 1, M = 7).

0 0.1 0.2 0.3 0.4 0.50

2

4

6

8

f = ω/2π

Mag

nitu

de

1st PCFBN = 3N = 10

Figure 2.5: Magnitude squared responses of the designed FIR synthesis filter F0(z) along with thefirst filter of the infinite-order PCFB (L = 1, M = 7).

We first considered the design of a single channel of the overdecimated system (i.e., L = 1). The

observed error in orthonormality using the iterative eigenfilter method is shown in Fig. 2.4 in dB

for two values of orders, namely, N = 3 and 10. In order to observe the behavior of our algorithm,

we ran it for 500 iterations and opted not to choose a stopping threshold value δT from (2.27). As

can be seen from Fig. 2.4, the proposed method indeed converged toward an orthonormal solution

for both cases of filter orders. The error ||εk||F saturated at around −300 dB for both cases, most

likely due to quantization effects as a result of finite precision arithmetic.

To gauge the performance of the algorithm, a plot of the magnitude squared response of the

resulting synthesis filter F0(z) from Fig. 2.2(a) is shown in Fig. 2.5 for the two orders N = 3 and

10, along with that of the first filter of the infinite-order PCFB. As can be seen, both FIR filters

have a response close to that of the ideal compaction filter. Furthermore, the higher order filter

offers a better approximation than the lower order one, in line with intuition.

41

5 10 153

3.1

3.2

3.3

3.4

3.5

3.6

3.7

N

Gco

mp

ideal compaction filteriterative eigenfilter

Figure 2.6: Compaction gain Gcomp vs. the filter order parameter N .

To quantitatively measure the performance of the proposed iterative algorithm, we opted to

calculate the compaction gain [68] of the designed filters, which is simply the subband variance

σ2w from (2.3) normalized by the average input power. Though this quantity has only previously

been defined for the case in which the input x(n) is WSS, we extend this definition here for the

CWSS(M) case considered in Sec. 2.2 and 2.3. For this case, we define the compaction gain Gcomp

as follows.

Gcomp � σ2w

1M Tr [Rxx(0)]

(2.31)

In the special case in which x(n) is WSS, (2.31) becomes

Gcomp =σ2

w

σ2x

=

12π

∫ 2π

0Sxx(ejω)

∣∣F (ejω)∣∣2 dω

12π

∫ 2π

0Sxx(ejω) dω

(2.32)

which is consistent with the definition given in [68]. The ideal compaction filter maximizes this

quantity over all filters satisfying the required orthonormality condition of (2.8) [68]. A plot of the

observed compaction gain as a function of the filter order parameter N is shown in Fig. 2.6. Though

the compaction gain increases monotonically as N increases, it appears to saturate well below the

ideal compaction gain. At this time, it is not known why this phenomenon occurs. Despite this,

however, for small orders, the observed compaction gain is reasonably large.

To further test the algorithm, we then considered the design of two channels of the overdecimated

system (i.e., L = 2) and fixed the order to be N = 10. The observed error in orthonormality (in

dB) as a function of iteration is shown in Fig. 2.7. As before, it can be seen that the algorithm is

converging to an orthonormal solution. The magnitude squared responses of the designed synthesis

42

0 100 200 300 400−350

−300

−250

−200

−150

−100

−50

0

k

||εk|| F

(dB

)

Figure 2.7: Orthonormality error ||εk||F vs. the iteration index k (L = 2, M = 7, N = 10).

0 0.1 0.2 0.3 0.4 0.50

2

4

6

8

f = ω/2π

Mag

nitu

de

1st PCFB

|F0(ejω)|

2

2nd PCFB

|F1(ejω)|

2

Figure 2.8: Magnitude squared responses of the designed FIR synthesis filters F0(z) and F1(z)along with the first two filters of the infinite-order PCFB (L = 2, M = 7, N = 10).

filters F0(z) and F1(z) are shown in Fig. 2.8 along with those of the first two filters of the infinite-

order PCFB. From this, it is clear that the proposed algorithm is yielding filters close to the ideal

compaction filters of the infinite-order PCFB, as desired.

2.6 Concluding Remarks

The iterative eigenfilter method for designing signal-adapted overdecimated filter banks for energy

compaction was shown to yield filters similar to the optimal infinite-order compaction filters corre-

sponding to the unconstrained order PCFB. Furthermore, as the eigenfilter approach is used at each

iteration, the method is very low in computational complexity. Despite this, however, the iterative

eigenfilter method is not without shortcomings. First of all, the method is not guaranteed to yield

43

a PU or orthonormal solution. Though in all of the examples shown in Sec. 2.5, the algorithm

converged to an orthonormal solution, there are many examples in which this is not the case. In

addition to this, even if we have convergence, there is no guarantee that the resulting solution is

optimal. This was best shown in Fig. 2.6, where the compaction gain was shown to saturate at

a level well below that of the optimal compaction gain. Finally, the method only applies strictly

for the overdecimated case in which the number of channels L satisfies L < M , where M is the

decimation ratio. If we wish to obtain a maximally decimated system in which L = M , then the

algorithm breaks down, since the subband variance σ2w from (2.12) becomes fixed in this case.

As such, if we wish to obtain a good maximally decimated system from L < M filters, we must

complete the filter bank using some criterion to obtain the remaining (M − L) filters.

With the iterative algorithms of Chapters 3 and 4, these problems no longer exist. Using the

complete parameterization of FIR PU systems in terms of Householder-like degree-one building

blocks [75, 67], the PU constraint is always satisfied as it is structurally imposed. Both methods are

shown to saturate close to the performance of the infinite-order PCFB with filter order in terms of

many objectives including compaction gain and coding gain. Finally, for the method of Chapter

3, a multiresolution criterion, for which the PCFB is optimal, is used to complete the filter bank

given only the first compaction filter, whereas in Chapter 4, no filter bank completion is necessary

as all of the filters are found simultaneously.

Appendix: Least-Squares Signal Model Approximation Problem

For the overdecimated filter bank system of Fig. 2.2(b) in which the orthonormality condition of

(2.8) is satisfied, it turns out that the subband vector process w(n) is the optimal driving signal

to the synthesis bank for minimizing the mean-squared error between the input x(n) and output

y(n). This optimality holds not only for the system of Fig. 2.2, but also for a more general model

that we will introduce below. We will prove this optimality first for the deterministic case and then

for the stochastic case.

Consider the rational nonuniform overdecimated synthesis bank shown in Fig. 2.9. By overdec-

imated, we mean thatP−1∑k=0

mk

nk< 1

and so the inputs {ck(n)} operate at a lower overall rate than the output y(n). As such, for

fixed synthesis filters {Fk(z)}, we cannot steer the output y(n) to be any signal which we desire.

44

� ↑ n0 � F0(z) � ↓ m0 �

�

� ↑ n1 � F1(z) � ↓ m1 �

�

�� ↑ nP−1 � FP−1(z) � ↓ mP−1 � �

c0(n)

c1(n)

cP−1(n)

y0(n)

y1(n)

yP−1(n)y(n)

Figure 2.9: Rational nonuniform synthesis bank.

Mathematically, as the synthesis bank is overdecimated, the subspace of signals V generated by the

model given by

V �{

y(n) : y(n) =P−1∑k=0

∞∑m=−∞

ck(m)fk(mkn − nkm) , ck(n) ∈ �2 ∀ k

}(2.33)

is a proper subspace of �2. The question then naturally arises as to how to choose the input driving

signals {ck(n)} to minimize the mean-squared error between a desired signal x(n) and the synthesis

bank output y(n). In this section, we address this problem first for the deterministic case and then

for the stochastic case. This is a generalization of the results given in [77] for integer nonuniform

filter banks for the deterministic case. Though rational nonuniform filter banks can be shown to

be transformable to integer nonuniform filter banks [30], the approach here avoids this complicated

transformation and solves the least-squares problem in a more direct way. Prior to proceeding, we

present a few important results of multirate system theory which will be needed here.

Blocked Form of LTI Systems and Pseudocirculant Matrices

Two important multirate identities which will greatly facilitate the least-squares approximation

problem we will consider here is the decimator/expander cascade identity shown in Fig. 2.10(a)

and the polyphase identity shown in Fig. 2.10(b) [67]. The decimator/expander cascade identity

allows us to interchange the order of decimation and expansion when the ratios are relatively prime,

whereas the polyphase identity allows us to replace certain expander/filter/decimator cascades by

an LTI system.

In certain cases, we will want to represent an LTI system using multirate building blocks. This

can be done by using the blocked form of an LTI system. If H(z) represents any LTI system, then

it can be implemented in an M -fold blocked form [67] as shown in Fig. 2.11.

45

� ↓ M � ↑ L � ≡ � ↑ L � ↓ M �

iff gcd(L, M) = 1

(a)

� ↑ M � G(z) � ↓ M � ≡ � [G(z)]↓M �

(b)

Figure 2.10: (a) Decimator/expander cascade identity, (b) Polyphase identity.

� �

�

�

↓ M

↓ M

↓ M

�z

�z

�z

�

�

�

H(z)

�

�

�

x(n)

H(z)� �x(n) y(n)

�

�

�

z−1

z−1

z−1

�↑ M

↑ M

↑ M

y(n)

[H(z)]k,� =[zk−�H(z)

]↓M , 0 ≤ k, � ≤ M − 1

Figure 2.11: M -fold blocked form of an LTI system H(z).

The matrix H(z) from Fig. 2.11 is said to be a pseudocirculant matrix [67], since it has a form

similar to a circulant matrix [22]. To see this, suppose that H(z) has the following M -fold Type I

polyphase decomposition.

H(z) =M−1∑k=0

z−kEk

(zM

)(Type I)

46

Then, from Fig. 2.11, it can be shown that we have

H(z) =

E0(z) z−1EM−1(z) z−1EM−2(z) · · · z−1E1(z)

E1(z) E0(z) z−1EM−1(z) · · · z−1E2(z)

E2(z) E1(z) E0(z). . .

......

......

. . . z−1EM−1(z)

EM−1(z) EM−2(z) EM−3(z) · · · E0(z)

which is simply a down circulant matrix [22] formed from {E0(z), E1(z), . . . , EM−1(z)} with z−1

delays appearing above the main diagonal. Pseudocirculant matrices play a major role in the study

of alias-free filter banks [67].

Together with the multirate identities of Fig. 2.10, the blocked form of LTI systems as shown

in Fig. 2.11 will greatly facilitate solving the least-squares approximation problem as we now show.

Least-Squares Approximation Problem - Deterministic Case

Consider again the rational nonuniform synthesis bank of Fig. 2.9. We will make the following

assumptions here.

• gcd(mk, nk) = 1 ∀ k (Coprimeness of mk and nk)

•P−1∑k=0

mk

nk< 1 (Overdecimated system)

There is no loss of generality in making the first assumption, as common factors between mk and

nk can be absorbed into the filter Fk(z) by using the decimator/expander cascade identity along

with the polyphase identity (see Fig. 2.10). The second assumption ensures that the subspace V in

(2.33) is a proper subspace of �2. Let us define the following integers.

• N � lcm(n0, n1, . . . , nP−1)

• pk � Nnk

∀ k

• K �P−1∑k=0

mkpk

Note that as the system is overdecimated, we have K < N .

The goal here is to choose the driving signals {ck(n)} to minimize the deterministic mean-

squared error objective

ξdet �∑

n

|x(n) − y(n)|2

47

where x(n) is any signal in �2. If x(n) and y(n) denote, respectively, the N -fold blocked versions

of x(n) and y(n), we have

ξdet =∑

n

||x(n) − y(n)||2 (2.34)

Using Parseval’s relation [67], ξdet can be expressed as follows.

ξdet =12π

∫ 2π

0

∣∣∣∣X(ejω) − Y(ejω)∣∣∣∣2 dω (2.35)

where X(z) and Y(z) denote, respectively, the z-transforms of x(n) and y(n).

To simplify Y(z), consider the k-th branch of the system of Figure 2.9 reproduced in Figure

2.12(a). If we implement Fk(z) in an mkN -fold block form (see Fig. 2.11), we obtain the system

shown in Figure 2.12(b), where Ak(z) is an mkN × mkN pseudocirculant matrix [67] with

[Ak(z)]r,s =[zr−sFk(z)

]↓mkN

for 0 ≤ r, s ≤ mkN − 1. By applying the polyphase identity of Fig. 2.10(b), the expander on the

left (↑nk) as well as the decimator on the right (↓mk) can be moved across the network resulting

in the system of Figure 2.12(c). The N × mkpk transfer matrix Fk(z) is obtained by preserving

only the N rows of Ak(z) which are multiples of mk and the mkpk columns which are multiples of

nk. In other words,

[Fk(z)]c,d =[zcmk−dnkFk(z)

]↓mkN

=[zc[z−dnkFk(z)

]↓mk

]↓N

(2.36)

for 0 ≤ c ≤ N − 1 and 0 ≤ d ≤ mkpk − 1. Note that from Figure 2.12(c), ck(n) is simply the

mkpk-fold blocked version of ck(n) and yk(n) is the N -fold blocked version of yk(n). Clearly, we

have the following.

Yk(z) = Fk(z)Ck(z) (2.37)

But note that we have

y(n) =P−1∑k=0

yk(n) ⇐⇒ y(n) =P−1∑k=0

yk(n)

Thus, using (2.37), we get

Y(z) =P−1∑k=0

Yk(z) =P−1∑k=0

Fk(z)Ck(z)

48

� ↑ nk� Fk(z) � ↓ mk

�ck(n) yk(n)

(a)

� ↑ nk� �

�

�

�

��

↓ mkN

↓ mkN

↓ mkN

�

�

�

Ak(z)

�

�

�

↑ mkN

↑ mkN

↑ mkN

�

�

�

� ↓ mk�ck(n) yk(n)

z

z

z z−1

z−1

z−1

Fk(z)� �

(b)

� �

�

�

�

��

↓ mkpk

↓ mkpk

↓ mkpk

�

�

�

�

�

�

↑ N

↑ N

↑ N

�

�

�

�ck(n) yk(n)

z

z

z

z−1

z−1

z−1

ck(n) yk(n)

� �

mkpk inputs

N outputs

�

Fk(z)

(c)

Figure 2.12: (a) The k-th branch of the signal model from Fig. 2.9, (b) With Fk(z) implementedin an mkN -fold block form, (c) Resulting structure after applying the polyphase identity.

49

�

�

�

�

�

�

↑ N

↑ N

↑ N

�

�

�

��

�

�

↓ N

↓ N

↓ N

�

�

�

�

�

�

�x(n) y(n)

z

z

z z−1

z−1

z−1

C(z)

N inputs

K outputs/inputs

N outputs

�

F(z)

N × K

H(z)

K × N

Figure 2.13: System for obtaining the optimal driving signal C(z).

This can be expressed as

Y(z) =[

F0(z) F1(z) · · · FP−1(z)]

︸︷︷︸F(z)

C0(z)

C1(z)...

CP−1(z)

︸︷︷︸

C(z)

(2.38)

where F(z) is an N ×K matrix and C(z) is a K×1 vector. Note that even though the fixed matrix

F(z) has a restricted structure as can be seen from (2.36), the vector C(z) is completely arbitrary.

Substituting (2.38) into (2.35), we have

ξdet =12π

∫ 2π

0

∣∣∣∣ F(ejω)C(ejω) − X(ejω)︸︷︷︸ε(ω)

∣∣∣∣2 dω

and so we can minimize ξdet by minimizing ||ε(ω)||2 pointwise in ω. The solution to this well-known

least-squares problem is [22]

C(ejω) = F#(ejω)X(ejω) =[F†(ejω)F(ejω)

]#F†(ejω)X(ejω)

where A# denotes the Moore-Penrose pseudoinverse of the matrix A [22].

In the z-domain, the optimum driving signal C(z) is given by

C(z) = F#(z)︸︷︷︸H(z)

X(z) =[F(z)F(z)

]#F(z)︸︷︷︸

H(z)

X(z) (2.39)

50

F(z)

N × KK Nc(n)

C(z)

y(n)

Y(z) = F(z)C(z)

(a)

H(z)

K × NN Kx(n)

X(z)

c0(n)

C0(z) = H(z)X(z)

H(z) = F#(z) =[F(z)F(z)

]#F(z)

(b)

Figure 2.14: (a) Equivalent form of Fig. 2.9 with blocking/unblocking elements removed, (b)Method for obtaining the optimal driving signal.

Hence, the optimal C(z) from (2.39) can be obtained via the system shown in Figure 2.13. In

other words, the optimal driving signals to the overdecimated synthesis bank of Fig. 2.9 are the

subbands corresponding to an appropriate overdecimated analysis bank as shown in Fig. 2.13. Here,

the polyphase matrix H(z) corresponding to the analysis bank must be the pseudoinverse of the

polyphase matrix F(z) corresponding to the synthesis bank.

Least-Squares Approximation Problem - Stochastic Case

Using several properties of multirate systems, we were able to show that the rational nonuniform

overdecimated model of Fig. 2.9 could be equivalently redrawn as the LTI MIMO system model

shown in Fig. 2.14(a) in which the blocking/unblocking mechanisms have been omitted for simplicity

here. In the previous section, we showed that the optimal driving signal c0(n) which minimized

the deterministic mean-squared error

ξdet =∑

n

||x(n) − y(n)||2

from (2.34) could be obtained by filtering x(n) as shown in Fig. 2.14(b). The question then naturally

arises as to whether the same driving signal will be optimal for the stochastic case when we want

to minimize the expected mean-squared error given by

ξsto � E[||x(n) − y(n)||2

](2.40)

where we assume here that the driving signal c(n) and desired signal x(n) are jointly WSS [67]. It

turns out that the answer is in the affirmative, however the proof in this case is far different from

the one given in the previous section for the deterministic case. In particular, we will show that

the error obtained using the system of Fig. 2.14(b) is less than or equal to the error obtained for

any other driving signal c(n).

51

To start, first note that since we assume c(n) and x(n) are jointly WSS, it follows that the

vector

v(n) �

c(n)

x(n)

is WSS [67]. Its psd Svv(z) is given by

Svv(z) =

Scc(z) Scx(z)

Sxc(z) Sxx(z)

(2.41)

As Svv(z) is a psd, it follows that it is positive semidefinite [22] on the unit circle z = ejω, i.e.,

w†Svv(ejω)w ≥ 0 for all ω and all nonzero vectors w. This property will prove to be useful here.

To proceed, define the error signal to be e(n) � x(n) − y(n). As Y(z) = F(z)C(z) from Fig.

2.14(a), it follows that c(n) and y(n) are jointly WSS, which implies that x(n) and y(n) are also

jointly WSS, which in turn implies that e(n) is WSS [67]. From (2.40), we have

ξsto = E[e†(n)e(n)

]= Tr

[E[e(n)e†(n)

]]= Tr [Ree(0)] =

12π

∫ 2π

0Tr[See(ejω)

]dω (2.42)

where Ree(k) and See(z) denote, respectively, the autocorrelation and psd of e(n). Also, using

e(n) = x(n) − y(n), we have See(z) = Sxx(z) − Sxy(z) − Syx(z) + Syy(z). As Y(z) = F(z)C(z),

we have Syy(z) = F(z)Scc(z)F(z), Syx(z) = F(z)Scx(z), and Sxy(z) = Syx(z) = Sxc(z)F(z) [67].

Hence, we have

See(z) = Sxx(z) − Sxc(z)F(z) − F(z)Scx(z) + F(z)Scc(z)F(z) (2.43)

Now, let Se0e0(z) denote the error psd obtained when the driving signal is obtained using the

system of Fig. 2.14(b). Also, let ξsto,0 denote the corresponding mean-squared error. Then, from

(2.42), we have

ξsto,0 =12π

∫ 2π

0Tr[Se0e0(e

jω)]

dω

Hence, we get

ξsto − ξsto,0 =12π

∫ 2π

0

{Tr[See(ejω)

]− Tr[Se0e0(e

jω)]}︸︷︷︸

D(ejω)

dω (2.44)

Note that from Fig. 2.14(b), we have

Se0e0(z) = Sxx(z) − Sxx(z)H(z)F(z) − F(z)H(z)Sxx(z) + F(z)H(z)Sxx(z)H(z)F(z) (2.45)

52

Using the following properties of the pseudoinverse [22],

A#A =(A#A

)†, AA# =

(AA#

)†(Hermitian Product Properties) (2.46)

A#AA# = A# , AA#A = A (Projection Properties) (2.47)

as well as the fact that Tr [AB] = Tr [BA] whenever A and B are conformable [22], from (2.45),

we get the following.

Tr [Se0e0(z)] = Tr [Sxx(z)] − Tr[F(z)H(z)Sxx(z)H(z)F(z)

]Thus, from (2.43) and (2.44), we get

D(z) = Tr [See(z)] − Tr [Se0e0(z)]

= Tr[F(z)Scc(z)F(z)

]− Tr [F(z)Scx(z)] − Tr

[Sxc(z)F(z)

]+ Tr

[F(z)H(z)Sxx(z)H(z)F(z)

](2.48)

Using (2.47), we have

F(z)Scx(z) = F(z)H(z)F(z)Scx(z)

Then, using (2.46), we get

Tr [F(z)Scx(z)] = Tr [F(z)Scx(z)F(z)H(z)] = Tr[F(z)Scx(z)H(z)F(z)

](2.49)

Similarly, we have

Sxc(z)F(z) = Sxc(z)F(z)H(z)F(z) = Sxc(z)F(z)F(z)H(z)

and hence, we get

Tr[Sxc(z)F(z)

]= Tr

[F(z)H(z)Sxc(z)F(z)

](2.50)

Combining (2.49) and (2.50) in (2.48), we get

D(z) = Tr[F(z)Scc(z)F(z) − F(z)Scx(z)H(z)F(z)

−F(z)H(z)Sxc(z)F(z) + F(z)H(z)Sxx(z)H(z)F(z)]

= Tr[F(z)

(Scc(z) − Scx(z)H(z) − H(z)Sxc(z) + H(z)Sxx(z)H(z)

)F(z)

]Note that from (2.41), we can express D(z) as follows.

D(z) = Tr[F(z)

[IK −H(z)

]︸︷︷︸

G(z)

Scc(z) Scx(z)

Sxc(z) Sxx(z)

︸︷︷︸

Svv(z)

IK

−H(z)

︸︷︷︸

G(z)

F(z)]

(2.51)

53

But recall that Svv(ejω) is positive semidefinite as it is the psd of v(n). Thus, from (2.51), we have

D(z) = Tr[F(z)G(z)Svv(z)G(z)F(z)︸︷︷︸positive semidefinite on z = ejω

]As the trace of any positive semidefinite matrix is nonnegative [22], D(ejω) ≥ 0. From (2.44), this

implies

ξsto − ξsto,0 =12π

∫ 2π

0D(ejω) dω ≥ 0 ⇐⇒ ξsto,0 ≤ ξsto

Hence, the error obtained using the system of Fig. 2.14(b) is always less than or equal to the error

obtained with any other driving signal. Thus, the driving signal c0(n) from Fig. 2.14(b) is always

an optimal driving signal and so the optimal driving signal in the deterministic case is also an

optimal driving signal in the stochastic case. In parting, the following comments are in order. If the synthesis polyphase matrix F(z) is PU,

i.e., F(z)F(z) = IK , then the optimal analysis bank H(z) = F#(z) =[F(z)F(z)

]#F(z) is simply

H(z) = F(z), i.e., H(z) is the paraconjugate of F(z). In this case, the resulting filter bank system

is orthonormal. The uniform PU filter bank of Fig. 2.2 is an example of such a filter bank.

In addition to this, with the optimal driving signal c(n), it can easily be shown that if F(z) is

PU, then minimizing the mean-squared error further by choice of F(z) is equivalent to compacting

the energy of c(n). To see this, note that with the optimal C(z) from (2.39), the mean-squared

error in the deterministic and stochastic cases are respectively given by (2.34) and (2.40) to be

ξdet =∑

n

|x(n)|2 − Tr[

12π

∫ 2π

0H(ejω)X(ejω)X†(ejω)F(ejω) dω

]︸︷︷︸

σ2c,det

ξsto = Tr [Rxx(0)] − Tr[

12π

∫ 2π

0H(ejω)Sxx(ejω)F(ejω) dω

]︸︷︷︸

σ2c,sto

where H(z) = F#(z). Clearly, minimizing ξdet and ξsto is equivalent to maximizing σ2c,det and σ2

c,sto,

respectively. If F(z) is PU, then H(z) = F(z) or equivalently F(z) = H(z). In this case, we get

σ2c,det = Tr

[12π

∫ 2π

0H(ejω)X(ejω)X†(ejω)H†(ejω) dω

]σ2c,det = Tr

[12π

∫ 2π

0H(ejω)Sxx(ejω)H†(ejω) dω

]which is simply the energy of the optimal driving signal from Fig. 2.14(b) for the deterministic

and stochastic cases, respectively. Hence, with the PU constraint in effect, minimizing the mean-

squared error is equivalent to compacting the energy of the subband vector process, as was shown

for the uniform case in Sec. 2.3.1.

54

Chapter 3

Multiresolution OptimalFIR PU Filter Banks

One major drawback of the iterative eigenfilter method of the previous chapter is that there is no

guarantee of convergence to a PU solution for the synthesis bank. In this chapter, an alternate

method for designing a compaction filter is proposed in which the PU constraint is structurally

imposed. This follows from the complete parameterization of all 1-D causal FIR PU systems in

terms of Householder-like degree-one building blocks [75, 67], which we review here.

Using this characterization, an iterative algorithm is proposed to approximate, in the least-

squares sense, any desired response by an FIR filter whose vector of polyphase components is PU.

At each iteration, one set of parameters in the characterization of the FIR PU system is globally

optimized assuming all other parameters to be fixed. As such, the resulting algorithm is greedy and

so the error is guaranteed to be monotonic nonincreasing with iteration.

When the desired response is the response of an optimal infinite-order compaction filter, the

algorithm can be used for the design of FIR compaction filters. As the desired response in this case

suffers from a phase ambiguity, which we show here, a modification to the algorithm is proposed

in which the phase of the FIR solution is fed back to the desired response. With this modification,

which we call the phase feedback modification, in effect, the iterative algorithm not only still remains

greedy, but also outperforms the unmodified one in terms of compaction gain. In simulations

provided, we show that when this modification is used, the compaction gain monotonically increases

with filter order and comes very close to the performance of the infinite-order compaction filter.

To complete a maximally decimated filter bank given an FIR compaction filter, a multiresolution

criterion [62, 37] for which the PCFB is optimal, is used. Using the above-mentioned characteriza-

tion of FIR PU systems, it can be shown that the entire filter bank can be elegantly designed with

55

only one FIR compaction filter followed by an appropriate KLT, as we review here. Though ele-

gant, this approach suffers from the nonuniqueness of the FIR compaction filter. Different spectral

factors of a given FIR compaction filter lead to different FIR PU filter banks which in turn yield

different performances in terms of certain objective functions. This phenomenon has not previously

been reported in the literature. By choosing the spectral factor which yields the largest coding gain,

in simulations, we show that as the filter order increases, the resulting FIR PU filter bank behaves

more and more like the unconstrained or infinite-order PCFB in terms of numerous objectives.

In particular, this behavior is shown for coding gain, multiresolution, denoising with zeroth-order

Wiener filters in the subbands, and power minimization in nonredundant PU transmultiplexers.

This serves to bridge the gap between the zeroth-order memory KLT and infinite-order PCFB,

which has not previously been reported in the literature.

Since all spectral factors of a given FIR compaction filter must be tested for performance, the

computational complexity grows exponentially with filter order. As such, designing FIR PU filter

banks using the multiresolution criterion of [62, 37] becomes less feasible as the filter order increases.

In essence, what we show here is that due to the inherent nonuniqueness of an FIR compaction

filter, the FIR compaction filter problem, though elegant and well studied, is not well suited for

the design of complete signal-adapted FIR PU filter banks. This previously has not been reported

in the literature.

The content of this chapter is mainly drawn from [49] and portions of it will been presented at

[59].

3.1 Outline

In Sec. 3.2, we review the complete parameterization of 1-D causal FIR PU systems in terms of

Householder-like degree-one building blocks. There, the conditions under which the factorization is

unique are mentioned, which will be pivotal for designing FIR PU filter banks using only the FIR

compaction filter.

With this characterization, in Sec. 3.3, we proceed to the problem of least-squares design of

FIR filters whose vector of polyphase components is PU. Such filters have the property that their

magnitude squared response satisfies a Nyquist condition [67] and hence they are referred to as

magnitude squared Nyquist filters. The optimal parameters of these filters are derived in Sec. 3.3.1

using the method of Lagrange multipliers. In Sec. 3.3.2, the iterative algorithm for solving the

56

least-squares problem is presented and shown to be greedy. Simulation results for the design of FIR

compaction filters are shown in Sec. 3.3.3. There, the problems due to the phase ambiguity of the

desired infinite-order compaction filter are shown.

The phase feedback modification, which we propose to mitigate the effects of phase ambiguity,

is introduced in Sec. 3.4. There, it is shown that the iterative algorithm still remains greedy with

this modification in effect. Simulations using the phase feedback modification are also provided

in Sec. 3.4. It is shown that with the modification in effect, we obtain a monotonic increase in

compaction gain with filter order which comes very close to the performance of the ideal infinite-

order compaction filter, in line with intuition.

In Sec. 3.5, we review the multiresolution optimality criterion (MOC) [62, 37] for completing

a maximally decimated FIR PU filter bank. This criterion is well suited for the design of signal-

adapted filter banks since the PCFB, if it exists, is optimal for this criterion. Using the complete

parameterization of FIR PU systems from Sec. 3.2, in Sec. 3.5.1, we show that with this criterion,

the entire filter bank can be obtained from a single FIR compaction filter using an appropriate

KLT.

In Sec. 3.6, we present simulation results for FIR PU filter banks designed using the MOC. There,

the problems associated with the inherent nonuniqueness of the FIR compaction filter become

apparent. Different spectral factors lead to different filter banks which in turn lead to different

performances. By choosing the spectral factor yielding the largest coding gain, we show that the

FIR PU filter banks designed behave more and more like the infinite-order PCFB as the filter

order increases. This PCFB-like monotonic behavior is shown in terms of several objectives such

as coding gain, multiresolution, denoising with zeroth-order Wiener filters in the subbands, and

power minimization for nonredundant PU transmultiplexers. The work here serves to bridge the

gap between the zeroth-order memory KLT and infinite-order PCFB which previously has not been

shown in the literature.

Finally, concluding remarks are made in Sec. 3.7. There, we discuss the merits and shortcomings

of the iterative algorithm and MOC for designing FIR PU signal-adapted filter banks.

57

3.2 Complete Parameterization of 1-D Causal FIR PU Systems

Suppose that A(z) is a p × r causal FIR transfer function with p ≥ r. Then A(z) is PU with

McMillan degree (N − 1) iff it can be expressed as [75, 67]

A(z) = WN−1(z)WN−2(z) · · ·W1(z)A0 (3.1)

where A0 is a unitary p × r matrix, i.e., A†0A0 = Ir, and Wk(z) is a p × p Householder-like PU

degree-one building block of the form

Wk(z) = Ip − wkw†k + z−1wkw

†k , 1 ≤ k ≤ N − 1 (3.2)

and wk is a p × 1 unit norm vector for all k.

In general, the matrix A0 from (3.1) is unique [67]. When r > 1, the vectors wk from (3.2)

are in general not unique. In contrast to this, when r = 1, i.e., when A(z) is a vector transfer

function, the vectors wk are unique upto a scale factor of unit magnitude and hence the diadic

terms wkw†k appearing in (3.2) are unique. This uniqueness in the vector case, as will be shown

in Sec. 3.5.1, greatly facilitates the design of multiresolution optimal FIR PU filter banks once a

suitable compaction filter has been designed.

It should be noted that the factorization of (3.1) is only complete for the one-dimensional case

where z is a single complex variable. For D-dimensional FIR PU systems of the form A(z), where

z is a D-dimensional vector of complex variables, an analogous parameterization to that given in

(3.1) is in general not complete. The reason for this is that when D > 1, the notion of factoring

polynomials into roots no longer exists.

We will now proceed to use the factorization of (3.1) for the least-squares design of a causal FIR

magnitude squared Nyquist filter [67], for which the vector of polyphase components is a causal FIR

PU system. As the ultimate goal here will be the design of an FIR compaction filter F (z) for the M -

channel maximally decimated filter bank of Fig. 1.13, we will require the orthonormality condition

of (2.1) to be satisfied. In other words, we will consider the design of a causal FIR magnitude

squared Nyquist(M) filter whose vector of M -fold polyphase components is equivalently a causal

FIR PU system.

58

3.3 Least-Squares Design of FIR Magnitude Squared

Nyquist Filters

The design problem we focus on here is as follows. Suppose that D(ejω) is a desired response that

we wish to approximate (in the least-squares sense) with a causal FIR filter F (ejω) of length MN

whose magnitude squared response is Nyquist(M) [67]. For example, D(ejω) may represent the

spectrum of an infinite-order compaction filter. (In the case where the input x(n) to the system of

Fig. 2.1 is WSS, then D(ejω) would represent any spectral factor of the magnitude squared response

given in (2.6).) Then, the problem is to minimize the mean-squared error ξ, defined to be

ξ � 12π

∫ 2π

0

∣∣D(ejω) − F (ejω)∣∣2 dω (3.3)

subject to an FIR constraint on F (ejω) and the magnitude squared Nyquist(M) constraint [67][∣∣F (ejω)∣∣2]

↓M= 1 (3.4)

Expanding ξ from (3.3) yields

ξ =12π

∫ 2π

0

∣∣D(ejω)∣∣2 dω +

12π

∫ 2π

0

∣∣F (ejω)∣∣2 dω − 1

2π

∫ 2π

0

(D∗(ejω)F (ejω) + F ∗(ejω)D(ejω)

)dω

(3.5)

The first term of the right-hand side of (3.5) is just the �2 norm squared of the desired impulse

response d(n), namely, ||d(n)||22. Using (3.4), it can be shown that the second term is simply unity

[67]. Hence, the only quantities in ξ depending on F (z) are the cross product terms from the third

term in (3.5). Thus, the problem is linear in f(n), which greatly simplifies the optimization problem.

In order for F (z) to be a nondegenerate causal FIR filter of length MN which satisfies the

magnitude squared Nyquist(M) constraint of (2.1), the M × 1 vector of Type I M -fold polyphase

components of F (z), namely, f(z) from Section 2.2, must be a causal FIR PU vector system with

McMillan degree (N − 1) [75, 67]. From the results of Sec. 3.2, it follows that we have

F (z) = a(z)f(zM

), where f(z) = V(z)u0 (3.6)

Here a(z) is the M × 1 advance chain vector given by

a(z) =[

1 z · · · zM−1]T

Also, the quantity V(z) is an M ×M lossless matrix consisting of (N − 1) Householder-like parau-

nitary degree-one building blocks of the form

V(z) =1∏

i=N−1

Vi(z) (3.7)

59

Vi(z) = I − viv†i + z−1viv

†i , 1 ≤ i ≤ N − 1 (3.8)

where the M × 1 vectors vi are unit norm vectors. Finally, the quantity u0 is some M × 1 unit

norm vector.

As can be seen, F (z) in this case is completely characterized by the unit norm vectors vi and

u0. Though it is difficult to jointly find the vectors vi and u0 which minimize ξ from (3.3), it will be

shown that given the rest of the vectors, optimizing only one vector at a time is very simple. This

will serve as the basis for our proposed iterative technique. Prior to proceeding, let us define V to

be the set of all vis, i.e., V � {vi : 1 ≤ i ≤ N − 1}, and, in accordance with standard set theoretic

notation, let V \ vk denote the set V with the element vk removed.

3.3.1 Solving the Least-Squares Optimization Problem Using the Method of

Lagrange Multipliers

In order to minimize ξ from (3.5) subject to the unit norm constraints on u0 and the vis, we

construct the Lagrangian objective function [48]

J(u0,V) � ξ(u0,V) + λ0

(1 − u†

0u0

)+

N−1∑i=1

λi

(1 − v†

ivi

)(3.9)

where the λis are Lagrange multipliers chosen to satisfy the unit norm constraints. In what follows,

we consider the problem of optimizing one vector at a time, assuming that the rest of the vectors

are fixed.

3.3.1.1 Optimal Choice of u0

To find the optimal choice of u0, we set the conjugate gradient of J(u0,V) with respect to u0 to

be the zero vector [48]. From (3.9), we have [48]

∇u†

0J = ∇

u†0ξ − λ0u0 = 0 (3.10)

Here, the notation ∇x†f denotes a p × 1 column vector whose k-th component is given by

[∇x†f ]k � ∂f

∂ [[x]∗k]

where x is a p × 1 column vector. Using (3.6) in (3.5) yields

ξ(u0,V) = ||d(n)||22 + 1 − b†(V)u0 − u†0b(V) (3.11)

60

where b(V) is the M × 1 vector

b(V) � 12π

∫ 2π

0V†(ejωM

)a(ejω)D(ejω) dω (3.12)

Differentiating ξ from (3.11) with respect to u†0 yields [48]

∇u†

0ξ = −b(V)

Substituting this into (3.10) yields the optimal choice of u0 given by u0 = − 1λ0

b(V). In order to

satisfy the unit norm constraint u†0u0 = 1, it follows that the optimal u0 is of the form

u0 = ejα

(b(V)

||b(V)||)

(3.13)

where α ∈ [0, 2π) is some phase factor. To find the optimal choice of α here, we substitute (3.13)

into (3.11). This yields

ξ = ||d(n)||22 + 1 − 2 ||b(V)|| cos α

Clearly, to minimize ξ, we must maximize the third term, which occurs when we choose α = 0.

Hence, the optimal choice of u0 and corresponding ξ are given by the following.

u0,opt =b(V)

||b(V)|| , ξopt = ||d(n)||22 + 1 − 2 ||b(V)|| (3.14)

3.3.1.2 Optimal Choice of vk

In order to find the optimal choice of vk assuming that all other vectors are fixed, we must cleverly

extract only those portions of ξ which depend on vk. For simplicity, let us define the following

M × M matrices.

Lk(z) �

k+1∏

i=N−1

Vi(z) , 0 ≤ k ≤ N − 2

I , k = N − 1

(3.15)

Rk(z) �

I , k = 1

1∏i=k−1

Vi(z) , 2 ≤ k ≤ N(3.16)

Note that Lk(z) and Rk(z) are, respectively, the left and right neighbors of the matrix Vk(z) for

1 ≤ k ≤ N − 1 appearing in V(z) from (3.7). In other words, we have

V(z) = Lk(z)Vk(z)Rk(z) , 1 ≤ k ≤ N − 1 (3.17)

61

Also note that by construction, we have L0(z) = RN (z) = V(z). Substituting (3.17) and (3.8) into

(3.6) and (3.5) yields

ξ = ||d(n)||22 + 1 − 2 Re [c(u0,V \ vk)] + v†kT(u0,V \ vk)vk (3.18)

where the 1 × 1 scalar c(u0,V \ vk) and M × M matrix T(u0,V \ vk) are defined as follows.

c(u0,V \ vk) � 12π

∫ 2π

0D∗(ejω)a†(ejω)Lk

(ejωM

)Rk

(ejωM

)u0 dω (3.19)

T(u0,V \ vk) � A(u0,V \ vk) + A†(u0,V \ vk), where we have,

A(u0,V \ vk) � 12π

∫ 2π

0Rk

(ejωM

)u0

(1 − e−jωM

)D∗(ejω)a†(ejω)Lk

(ejωM

)dω (3.20)

It should be emphasized that from (3.19) and (3.20) the quantities c(u0,V \ vk) and T(u0,V \ vk)

do not depend on the vector vk.

As before, to find the optimal choice of vk, we set the conjugate gradient of the Lagrangian

J(u0,V) from (3.9) with respect to vk to be the zero vector. From (3.9), we have [48]

∇v†

kJ = ∇

v†kξ − λkvk = 0 (3.21)

Differentiating ξ from (3.18) with respect to vk yields [48]

∇v†

kξ = T(u0,V \ vk)vk

Substituting this into (3.21) yields

T(u0,V \ vk)vk = λkvk

which is just an eigenvector equation. In order to minimize ξ from (3.18), by Rayleigh’s principle

[48, 67], vk must be a unit norm eigenvector corresponding to the smallest eigenvalue of T(u0,V \vk), which we will denote here by µk,min. If wk,min denotes any unit norm eigenvector corresponding

to µk,min, then the optimal vk and corresponding error ξ are given by the following.

vk,opt = wk,min ,

ξopt = ||d(n)||22 + 1 − 2 Re [c(u0,V \ vk)] + µk,min

(3.22)

To summarize the results of this section, using the complete factorization of causal FIR mag-

nitude squared Nyquist(M) systems in terms of Householder-like building blocks, the optimal pa-

rameter vectors (u0 and the vis) can be easily found one at a time assuming that the rest of the

vectors are fixed. This property forms the basis of the proposed iterative algorithm for solving the

original least-squares problem.

62

3.3.2 Iterative Gradient Optimization Algorithm

Let ξm denote the mean-squared error at the m-th iteration for m ≥ 0. Then, the iterative gradient

optimization algorithm is as follows.

Initialization:

1. Generate N random unit norm vectors u0, vi, 1 ≤ i ≤ N − 1.

2. Compute the matrix RN (z) using (3.16).

Iteration: For m ≥ 0, do the following.

1. If m is a multiple of N :

(a) Calculate the optimal vector u0 and corresponding error ξm using (3.14) and (3.12)

with V(z) = RN (z).

(b) Compute L0(z) = V(z) and R1(z) = I.

Otherwise, if m ≡ k mod N where 1 ≤ k ≤ N − 1:

(a) From (3.15), update the left matrix as Lk(z) = Lk−1(z)Vk(z).

(b) Calculate the optimal vector vk and corresponding error ξm using (3.22), (3.19),

and (3.20).

(c) From (3.16), update the right matrix as Rk+1(z) = Vk(z)Rk(z).

2. Increment m by 1 and return to Step 1.

As the iterations progress, the left matrix is shortened by the old optimal vectors vk whereas the

right matrix is lengthened by the newly computed ones. After all of the vks have been optimized,

the left matrix assumes the value of the right matrix while the right matrix is then refreshed to be

the identity matrix.

Since at each stage in the iteration, we are globally optimizing one vector while fixing the rest,

the above technique is a greedy algorithm. As such, the mean-squared error ξm is guaranteed to be

monotonic nonincreasing as a function of the iteration index m. Furthermore, as ξm has a lower

bound (i.e., we always have ξm ≥ 0), ξm is guaranteed to have a limit as m → ∞ [67]. Simulation

results provided here verify this monotonic and limiting behavior as we now show.

63

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

f = ω/2π

Mag

nitu

de

Sxx

(ejω)

|D(ejω)| 2

Figure 3.1: Input psd Sxx(ejω) and ideal compaction filter magnitude squared response∣∣D(ejω)

∣∣2for M = 3.

3.3.3 Simulation Results for Designing FIR Compaction Filters

Here, we considered the design of an FIR compaction filter F (z) of length MN for the system of

Fig. 2.1 for a WSS input x(n) with psd Sxx(ejω). As such, we chose the desired response D(ejω)

to be any optimal unconstrained order compaction filter. From (2.6), we know that we must have

∣∣D(ejω)∣∣ =

√

M , ω ∈ ωx

0 , otherwise

where ωx is the set of frequencies given by (2.7). Though the phase of the ideal compaction filter

is arbitrary in this case, we opted to choose a linear phase spectral factor with a delay equal to half

of the FIR filter order. Namely, we chose

D(ejω) =∣∣D(ejω)

∣∣ ejφ(ω) , where φ(ω) =(

MN − 12

)ω (3.23)

For the simulation results here, we chose x(n) to be a real autoregressive order 4 (AR(4))

process with psd Sxx(ejω) as shown in Fig. 3.1. Due to symmetry, only the region ω ∈ [0, π] is

shown. Also in Fig. 3.1, we have plotted the magnitude squared response of the ideal compaction

filter for M = 3. In accordance with intuition, the compaction filter preserves the significant

portions of Sxx(ejω) while discarding the rest.

To test the proposed iterative algorithm, we considered designing an FIR compaction filter with

N = 16 (implying a filter length of MN = 48). All integrals were evaluated numerically using 1,024

uniformly spaced frequency samples. A plot of the observed error ξm as a function of the iteration

index m is shown in Fig. 3.2 for a total of KN iterations, where we chose K =⌈

1,000N

⌉. (We

64

0 200 400 600 800 10000

0.5

1

1.5

2

m

ξ m

Figure 3.2: Mean-squared error ξm vs. the iteration index m.

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

f = ω/2π

Mag

nitu

de

|D(ejω)| 2

|F(ejω)| 2

Figure 3.3: Magnitude squared responses of the ideal compaction filter D(ejω) and the designedFIR compaction filter F (ejω).

opted for an integer multiple of N iterations to ensure that all of the vectors were optimized the

same number of times.) As can be seen from Fig. 3.2, the observed mean-squared error is indeed

monotonic nonincreasing and appears to be approaching a limit. In Fig. 3.3, plots of the magnitude

squared responses of the ideal and FIR compaction filters are shown. As can be seen, the FIR filter

designed here is a good approximation to the ideal response.

To quantitatively measure the performance of the proposed algorithm, we opted to calculate

the compaction gain [68] of the designed filters. Recall from (2.32) that this quantity is given by

Gcomp =σ2

w

σ2x

=

12π

∫ 2π

0Sxx(ejω)

∣∣F (ejω)∣∣2 dω

12π

∫ 2π

0Sxx(ejω) dω

65

5 10 150.8

1

1.2

1.4

1.6

1.8

2

N

Gco

mp

idealFIR

Figure 3.4: Compaction gain Gcomp vs. the filter order parameter N .

when the input x(n) is WSS. Also recall that the ideal compaction filter maximizes this quantity

over all magnitude squared Nyquist(M) filters. A plot of the observed compaction gain as a function

of the filter order parameter N is shown in Fig. 3.4. The compaction gain often comes close to

the optimal gain, but does not monotonically increase with N . Though counterintuitive, it is most

likely due to the fact that we are constraining the desired response to have linear phase as in (3.23).

Despite this, the observed compaction gain in many cases comes close to the ideal one. Here, the

largest compaction gain observed was 2.0230 for N = 16 compared to the ideal one of 2.0501.

3.4 Phase Feedback Modification for Improving Compaction Gain

One of the problems which arose in the design of FIR compaction filters for WSS inputs using the

proposed iterative method was the fact that the phase of the desired response D(ejω) is arbitrary.

For the linear phase choice of (3.23), it was qualitatively seen that the iterative method yielded filters

close to the ideal compaction filter. However, quantitatively, the algorithm yielded a nonmonotonic

behavior with compaction gain as a function of the polyphase order parameter N , contrary to

intuition. As the mean-squared error ξm is monotonic nonincreasing with m, this suggests that the

algorithm is often times putting more of an emphasis on trying to match the phase of D(ejω) as

opposed to its magnitude. This is undesirable for designing compaction filters here since all of the

emphasis should be focused on the magnitude.

To mitigate this effect, we propose a modification to the iterative algorithm whereby the phase

of the desired response D(ejω) is changed to match that of the FIR filter F (ejω). We call this the

66

phase feedback modification, since after a certain number of iterations, the phase of F (ejω) is fed

back as the phase of D(ejω). Heuristically, we should expect that with the phase of the desired

response matched to that of the FIR filter, the design emphasis for future iterations will be more

focused on magnitude than phase.

Suppose that we wish to update the phase after every NPF iterations (i.e., NPF denotes the

number of iterations to run before performing a phase feedback). Then, the modification is as

follows. If Fm(ejω) =∣∣Fm(ejω)

∣∣ ejφm(ω) denotes the FIR filter obtained at the m-th iteration, then

when m = kNPF, where k is some nonnegative integer, we update the desired response as follows.

D(ejω) =⇒ ∣∣D(ejω)∣∣ ejφkNPF

(ω)

If KPF denotes the number of phase feedback cycles desired, then the total number of iterations

is KPFNPF.

Prior to showing design examples with the phase feedback modification, we should note the

following. With the modification in effect, it can be shown that the proposed algorithm still

remains greedy. To see this, suppose that a phase feedback is performed at the m-th iteration and

let ξm,before and ξm,after denote, respectively, the error before and after the phase feedback. From

(3.5), we have

ξm,before = ||d(n)||22 + 1 − 1π

∫ 2π

0

∣∣D(ejω)∣∣ ∣∣Fm(ejω)

∣∣ cos (φD(ω) − φm(ω)) dω

ξm,after = ||d(n)||22 + 1 − 1π

∫ 2π

0

∣∣D(ejω)∣∣ ∣∣Fm(ejω)

∣∣ dω

Clearly we have ξm,after ≤ ξm,before. As ξm+1,after ≤ ξm,after, since the unmodified algorithm is

greedy, we thus have ξm+1,after ≤ ξm,before. Hence, the algorithm remains greedy even with the

phase feedback modification in effect.

For the simulations run using the phase feedback modification, we choose NPF = N and KPF =⌈1,000

N

⌉here, corresponding to same number of iterations considered without the modification.

This corresponds to feeding back the phase each time all of the vectors (u0 and the vis) have been

updated once. A plot of the compaction gain observed using the phase feedback modification as a

function of the polyphase order N is shown in Fig. 3.5. The compaction gain observed without the

modification is also included here for comparison. As can be seen, the phase feedback modification

yielded a much better compaction gain than without, especially at low filter orders. Furthermore,

here, the observed gain was monotonically increasing with N . It should be noted that this behavior

did not always occur in our simulations. Due to the fact that the initial conditions were always

67

5 10 150.8

1

1.2

1.4

1.6

1.8

2

NG

com

p

idealFIR (with PF)FIR (without PF)

Figure 3.5: Compaction gain Gcomp vs. the filter order parameter N using the phase feedbackmodification.

0 200 400 600 800 10000

0.2

0.4

0.6

0.8

m

ξ m

(a)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

f = ω/2π

Mag

nitu

de|D(ejω)|

2

|F(ejω)| 2

(b)

Figure 3.6: Compaction filter design using the phase feedback modification. (a) Mean-squarederror vs. iteration. (b) Magnitude squared response.

randomly chosen, there were some fluctuations with the compaction gain. However, even with

the fluctuations, the phase feedback modification always yielded a larger gain than that observed

without it. Here, the largest gain observed was 2.0403 for N = 16, which is very close to the ideal

one of 2.0501. In Fig. 3.6(a) and (b), respectively, we have plotted the mean-squared error versus

iteration and the magnitude squared response of the optimal filter obtained for N = 16. From

Fig. 3.3 and Fig. 3.6(b), it can be seen that the optimal filter designed using the phase feedback

approach yielded a better approximation to the ideal compaction filter than that without it.

68

� �

�

�

↓ M

↓ M

↓ M

�z

�z

�z

�

�

�

F(z)

M × M

�

�

�

x(n)w0(n)

w1(n)

wM−1(n)

x(n)

� �

�

�

F(z)

M × M

�

�

�

z−1

z−1

z−1

�↑ M

↑ M

↑ M

x(n)

x(n)

�

Figure 3.7: Uniform M -channel maximally decimated PU filter bank.

3.5 Design of Signal-Adapted FIR PU Filter Banks

Using a Multiresolution Criterion

In this section, we focus on the PU filter bank shown in Fig. 3.7 in which the M × M synthesis

polyphase matrix F(z) satisfies F(z)F(z) = I and the synthesis filters are given by [67][F0(z) F1(z) · · · FM−1(z)

]= a(z)F

(zM

)(3.24)

Note that this filter bank is just a special case of the system shown in Fig. 1.13. In order to mimic

the behavior of the infinite-order PCFB for the case where F(z) is FIR, we will opt to design F(z)

according to the following multiresolution optimality criterion originally considered by Moulin and

Mıhcak [37].

Multiresolution Optimality Criterion (MOC):

1. Maximize σ2w0

subject to[F0(z)F0(z)

]↓M

= 1 (Compaction Filter Problem)

2. For i = 1, 2, . . . , M − 1, successively maximize σ2wi

subject to[Fi(z)Fi(z)

]↓M

= 1 (Nyquist(M) Criterion)[Fk(z)Fi(z)

]↓M

= 0 ∀ 0 ≤ k ≤ i − 1 (Orthogonality Criterion)

As can be seen, the MOC consists of optimizing the filters one at a time starting with the

design of the compaction filter. At each stage, the next filter is chosen to maximize its subband

variance subject to the remaining degrees of freedom dictated by the PU constraint and the other

filters already designed. A multiresolution optimal filter bank is also said to be optimal in terms of

69

scalability [24] since such a filter bank represents the best approximation for any given scale (i.e.,

number of subband signals kept). It can easily be shown that the PCFB, if it exists for a class

of PU filter banks considered, is multiresolution optimal, by virtue of the fact that it majorizes

the vector of subband variances. As we might expect, FIR filter banks designed using the MOC

should exhibit infinite-order PCFB-like behavior as the filter order increases. This will be seen via

simulation examples presented later in the chapter.

Moulin and Mıhcak [37] were the first to use the MOC for the design of FIR PU filter banks.

By using the complete parameterization of FIR PU systems from Sec. 3.2, they showed that the

design process for a multiresolution optimal filter bank elegantly consisted of the construction of an

FIR compaction filter, followed by an appropriate KLT. However, contrary to intuition, they found

that filter banks constructed in this manner were not exhibiting PCFB-like behavior. In particular,

it was shown that traditional nonadaptive filter banks (i.e., filter banks with uniformly stacked

frequency support) performed better than those designed to satisfy the MOC in terms of coding

gain. This is most likely because they did not exploit the nonuniquness of the compaction filter for

the WSS inputs they considered. Given any valid FIR compaction filter, any spectral factor of it is

also a valid compaction filter since both have the same magnitude response. As we might expect,

different spectral factors lead to different filter bank parameterizations which in turn yield different

performance with respect to the MOC. In [37], the authors only chose the minimum-phase spectral

factor as their compaction filter. As we will show below through simulation examples, this spectral

factor is often far from being optimal in terms of the MOC.

Prior to presenting our simulation results for the different compaction filter spectral factors, we

show how the parameterization of FIR PU systems from Sec. 3.2 can be applied to the MOC. In

particular, we will show that using the MOC, the FIR PU filter bank design problem is tantamount

to designing an FIR compaction filter followed by an appropriate KLT.

3.5.1 Application of the Factorization of FIR PU Systems to the MOC

Suppose that F(z) from Fig. 3.7 is a causal FIR PU M ×M system with McMillan degree (N − 1).

This implies that the synthesis filters {Fk(z)} are causal and FIR of length MN by using (3.24).

Then, from (3.1), F(z) has a factorization of the form

F(z) = VN−1(z)VN−2(z) · · ·V1(z)︸︷︷︸V(z)

U (3.25)

70

� �

�

�

↓ M

↓ M

↓ M

�z

�z

�z

�

�

�

V(z)

�

�

�

U†

�

�

�

x(n) w0(n)

w1(n)

wM−1(n)

x(n)

�

t(n)

�

w(n)

�

Figure 3.8: Implementation of the analysis bank using the factorization of F(z) from (3.25).

where U is an M × M unitary matrix and Vk(z) is an M × M degree-one system of the form

Vk(z) = I − vkv†k + z−1vkv

†k , 1 ≤ k ≤ N − 1 (3.26)

where vk is a unit norm M × 1 vector for all k. The implementation of the analysis bank F(z)

using the factorization of (3.25) is shown in Fig. 3.8. Assuming x(n) is WSS, then so is t(n) and

hence w(n). Clearly, we have [67]

Sww(z) = U†Stt(z)U =⇒ Rww(0) = U†Rtt(0)U (3.27)

where Sww(z) and Stt(z) denote, respectively, the psds of w(n) and t(n). From Fig. 3.8, we get [67]

Stt(z) = V(z)Sxx(z)V(z) =⇒ Rtt(0) =12π

∫ 2π

0V†(ejω)Sxx(ejω)V(ejω) dω (3.28)

To simplify the MOC using the factorization of F(z) from (3.25), partition U into its columns as

U =[

u0 u1 · · · uM−1

](3.29)

Then, from (3.27), we have

σ2wi

= [Rww(0)]i,i = u†iRtt(0)ui (3.30)

Furthermore, note that from (3.24) the Nyquist(M) and orthogonality constraints appearing in the

MOC have the following equivalent expressions.[Fi(z)Fi(z)

]↓M

= 1 ⇐⇒ u†iui = 1 (3.31)[

Fk(z)Fi(z)]↓M

= 0 ∀ 0 ≤ k ≤ i − 1 ⇐⇒ u†kui = 0 ∀ 0 ≤ k ≤ i − 1 (3.32)

These constraints are in addition to the constraint that the vectors vk from (3.26) be of unit norm.

71

Recall that Step 1 of the MOC requires the computation of a compaction filter. Suppose that

a causal FIR compaction filter F0(z) of length MN has already been designed using either the

proposed iterative algorithm or any of the methods described in Section 2.2.2. Substituting (3.25)

and (3.29) in (3.24), it follows that we know the value of

F0(z) = a(z)V(zM

)u0︸︷︷︸

f0(zM )

(3.33)

where f0(z) = V(z)u0 is an M × 1 causal FIR PU vector system of degree (N − 1) consisting of

Type I polyphase components of F0(z). As f0(z) is a vector system, it follows from Sec. 3.2 that

the diadic forms present in V(z), namely, the quantities vkv†k for 1 ≤ k ≤ N − 1, are unique.

Hence, the matrix V(z) itself is unique. Thus, a given compaction filter F0(z) corresponds to a

unique V(z). It should be noted here that this uniqueness holds iff F0(z) is a nondegenerate causal

FIR filter of length MN , i.e., the length of F0(z) can not be less than MN . In all practical cases,

however, including the FIR compaction filters designed here, this is never a problem since the filters

are always nondegenerate.

To summarize, once a nontrivial FIR compaction filter F0(z) has been designed, we know the

unique value of the matrix V(z) appearing in Fig. 3.8. Hence, the second order statistics of the

process t(n) from Fig. 3.8 are known, namely, the autocorrelation sequence Rtt(k).

It should also be noted that with F0(z) given, we also know the first column u0 of the matrix

U from (3.29). By Rayleigh’s principle [22], it follows that u0 must be a unit norm eigenvector

corresponding to the largest eigenvalue of the matrix Rtt(0) given by (3.27) and (3.28). To see this,

note that with Rtt(0) uniquely determined, using (3.30) and (3.31), Step 1 of the MOC becomes

Maximize σ2w0

= u†0Rtt(0)u0 subject to u†

0u0 = 1.

which is precisely the problem statement in Rayleigh’s principle [22].

To determine the other columns u1,u2, . . . ,uM−1 of the matrix U from (3.29), note that with

Rtt(0) uniquely determined, from (3.30), (3.31), and (3.32), Step 2 of the MOC becomes

Maximize σ2wi

= u†iRtt(0)ui subject to u†

iui = 1 and u†kui = 0 ∀ 0 ≤ k ≤ i − 1.

This is done successively for i = 1, 2, . . . , M − 1. As such, the vectors u1,u2, . . . ,uM−1 are found

sequentially beginning with u1 and ending with uM−1. By an extension of Rayleigh’s principle [22],

it follows that u1 must be a unit norm eigenvector corresponding to the second largest eigenvalue

of Rtt(0). Similarly, by extension of Rayleigh’s principle, ui must be a unit norm eigenvector

72

corresponding to the i-th largest eigenvalue of Rtt(0) for all i. In other words, the optimal matrix

U from (3.29) is the unitary matrix which diagonalizes Rtt(0). Hence, the optimal U is a KLT for

the process t(n) from Fig. 3.8. Furthermore, the subband variances σ2wi

are simply the eigenvalues

of Rtt(0).

In conclusion, designing an FIR PU filter bank using the MOC consists of computing an optimal

FIR compaction filter followed by an appropriate KLT. A summary of this design algorithm is given

below.

3.5.2 Algorithm for the Design of FIR PU Multiresolution Optimal

Filter Banks

1. Design an optimal causal FIR compaction filter F0(z) of length MN . Identify the M×1 causal

PU vector f0(z) of degree (N − 1) from (3.33) via a Type I M -fold polyphase decomposition.

2. Find the unique Vk(z) from (3.26) corresponding to f0(z) as follows. Define the M ×1 vector

system Pk(z) from k = N − 1, N − 2, . . . , 1 as

PN−1(z) � f0(z) , Pk−1(z) � Vk(z)Pk(z) (3.34)

Then, in order for Pk(z) to be a polynomial in z−1 of degree k of the form

Pk(z) =k∑

n=0

pk(n)z−n (3.35)

we must have

vk = ckpk(k)

||pk(k)|| for some ck with |ck| = 1 =⇒ vkv†k =

pk(k)p†k(k)

||pk(k)||2 =pk(k)p†

k(k)

p†k(k)pk(k)

(3.36)

Hence, the diadic terms vkv†k appearing in Vk(z) from (3.26) are found recursively starting

from k = N − 1 and ending at k = 1 using (3.34), (3.35), and (3.36) successively.

3. Compute V(z) = VN−1(z)VN−2(z) · · ·V1(z). Then calculate Rtt(0) using (3.28).

4. Choose U to be the KLT of t(n), i.e., the unitary matrix which diagonalizes Rtt(0).

5. Compute the synthesis polyphase matrix F(z) using F(z) = V(z)U.

73

2 4 6 8 10

1.2

1.3

1.4

1.5

1.6

1.7

N

Cod

ing

Gai

n

PCFBFIR PU (opt)FIR PU (min−φ)KLT

Figure 3.9: Observed coding gain Gcode as a function of the synthesis polyphase order N .

3.6 Simulation Results for MOC Designed FIR PU Filter Banks

3.6.1 Coding Gain Results

Suppose that the input x(n) to the PU filter bank of Fig. 3.7 is the real AR(4) process considered

in Sec. 3.3.3, whose psd Sxx(ejω) is shown in Fig. 3.1. Here, we considered the design of an M = 3

channel system. For each filter bank designed, the filters designed in Sec. 3.4 using the proposed

iterative algorithm with the phase feedback modification were used as the optimum FIR compaction

filter required for multiresolution optimality. The synthesis polyphase matrix order N was varied

from 1 to 11 in order to see the behavior of the filter banks designed. For each N , the filter banks

corresponding to all compaction filter spectral factors were computed and the one judged to be

optimal was the one which yielded the largest coding gain. (See Chapter 5 for more on coding

gain.) Assuming optimal bit allocation, the coding gain is given by [67]

Gcode =

1M

M−1∑i=0

σ2wi(

M−1∏i=0

σ2wi

) 1M

In this case, the coding gain is simply the arithmetic mean/geometric mean (AM/GM) ratio of the

subband variances. Due to the subband variance majorization property of the PCFB, it can be

shown that the PCFB, if it exists for a class of filter banks under consideration, maximizes this

quantity [1]. Hence, the coding gain is lower bounded by unity (because of the AM/GM inequality

[67]) and upper bounded by the gain produced by the PCFB.

74

−1 0 1 2−2

−1

0

1

2

Re[z]

Im[z

]

Figure 3.10: Zero locations of the optimal spectral factor for N = 11.

A plot of the largest coding gain observed as a function of N is shown in Fig. 3.91. Along

with the coding gain of the optimal filter bank, the coding gain of the filter bank yielded by the

minimum-phase spectral factor is also shown. As can be seen, there is a large gap between these

two solutions. Furthermore, the optimal filter bank always exhibited a monotonically increasing

coding gain, whereas the minimum-phase filter bank had a fluctuating one.

From Fig. 3.9, it appears as though the optimal filter bank is approaching the performance

of the infinite-order PCFB as N increases. It would be interesting to view the behavior for much

larger N , however this is not feasible since the number of compaction filter spectral factors increases

exponentially with N . In general, a compaction filter of order MN − 1 has 2MN−1 spectral factors.

For sake of simplicity, since the input process x(n) is real here, we only considered real coefficient

spectral factors. In this case, if the compaction filter has Nr real roots and Nc complex roots

(with Nc being even here of course), then only 2Nr+(Nc/2) real coefficient spectral factors exist.

For N = 11 (MN = 33), there were a total of 22+ 302 = 217 = 131, 072 different spectral factors

that needed to be computed! The locations of the zeros corresponding to the best spectral factor

for N = 11 is shown in Fig. 3.10. In Fig. 3.11 and 3.12, the magnitude squared responses of the

analysis filters is shown for the optimal spectral factor and the minimum-phase one, respectively.

As can be seen, the filter responses corresponding to the optimal spectral factor appear to be a

better fit to the infinite-order PCFB compaction filters than those of the minimum-phase one.1For N = 1, the KLT of x(n) was chosen as the optimal solution, since it is the PCFB for this class of filter banks.

75

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

f = ω/2π

Mag

nitu

de

|H0(ejω)|2

1st PCFB

(a)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

f = ω/2π

Mag

nitu

de

|H1(ejω)|2

2nd PCFB

(b)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

f = ω/2π

Mag

nitu

de

|H2(ejω)|2

3rd PCFB

(c)

Figure 3.11: Analysis filter magnitude squared responses for the optimal spectral factor for N = 11.(a) First channel. (b) Second channel. (c) Third channel.

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

f = ω/2π

Mag

nitu

de

|H0(ejω)|2

1st PCFB

(a)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

f = ω/2π

Mag

nitu

de

|H1(ejω)|2

2nd PCFB

(b)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

f = ω/2π

Mag

nitu

de

|H2(ejω)|2

3rd PCFB

(c)

Figure 3.12: Analysis filter magnitude squared responses for the minimum-phase spectral factor forN = 11. (a) First channel. (b) Second channel. (c) Third channel.

76

1 1.5 2 2.5 30.6

0.7

0.8

0.9

1

Number of subbands retained

Per

cent

age

of to

tal v

aria

nce


(a)

1 1.5 2 2.5 30.6

0.7

0.8

0.9

1

Number of subbands retained

Per

cent

age

of to

tal v

aria

nce


(b)

Figure 3.13: Proportion P (L) of the total variance as a function of the number of subbands keptL for (a) N = 3 and (b) N = 9.

3.6.2 Multiresolution Optimality Results

In addition to being good for coding gain, the spectral factor optimized for coding gain also yielded

good performance with respect to the MOC. As a measure of multiresolution optimality, we opted

to consider the proportion of the partial subband variances to the total. By preserving only L out

of M subbands, the proportion of the total variance carried by these L subbands is given by

P (L) �

L−1∑i=0

σ2wi

M−1∑i=0

σ2wi

, 1 ≤ L ≤ M

Due to the subband variance majorization property of the PCFB (see Sec. 1.2.1), the PCFB max-

imizes P (L) for all L. The proportion P (L) as a function of the number of subbands kept L is

shown in Fig. 3.13 for N = 3 and N = 9. As can be seen, the variances of the optimized filter

bank are closer to the infinite-order PCFB variances than those obtained using the minimum-phase

spectral factor. From Fig. 3.13(b), we see that the difference in performance between these two

solutions is rather noticable. In line with intuition, it can be seen that as N increases from 4 to 9,

the optimized filter bank subband variances are trying to come closer and closer to the PCFB ones.

3.6.3 Noise Reduction Using Zeroth-Order Wiener Filters

Recall from Sec. 1.2.1, that the PCFB, if it exists, is optimal for a wide variety of objective functions.

In particular, it has been shown [1] that the PCFB is optimal for noise reduction with zeroth-order

77

2 4 6 8 100.54

0.56

0.58

0.6

0.62

N

Mea

n S

quar

ed E

rror


(a)

2 4 6 8 101.16

1.18

1.2

1.22

1.24

1.26

N

Mea

n S

quar

ed E

rror PCFB

FIR PU (opt)FIR PU (min−φ)KLT

(b)

Figure 3.14: Mean-squared error ε from (3.37) as a function of N for (a) η2 = 1 and (b) η2 = 4.

Wiener filters in the subbands if the noise is white. In other words, if the input to the filter bank of

Fig. 1.13(a) is x(n) = s(n)+µ(n), where s(n) is a pure signal and µ(n) is a white noise process and

if the subband processors {Pk} are taken to be zeroth-order Wiener filters (i.e., multipliers), then

the PCFB for x(n) (which is also the PCFB for s(n) in this case) is optimal in terms of minimizing

the mean-squared value of the error e(n) � s(n) − x(n) [1]. In general, the mean-squared error ε

in the presence of zeroth-order Wiener filters is given by

ε =1M

M−1∑i=0

σ2wi

η2

σ2wi

+ η2(3.37)

where σ2wi

denotes the variance of the i-th subband when the input is the desired signal s(n) and

η2 denotes the variance of the white noise process µ(n). As ε is a concave function of the subband

variance vector σ from (1.11), the PCFB for s(n), if it exists, is optimal for this objective [1].

Interestingly enough, the filter banks optimized over different spectral factors for coding gain

yielded PCFB-like performance with respect to the mean-squared error ξ from (3.37). The observed

mean-squared errors as a function of the synthesis polyphase order N is shown in Fig. 3.14 for (a)

η2 = 1 and (b) η2 = 4. As can be seen in both cases, the observed mean-squared error for the

optimum spectral factor filter bank noticably outperforms the minimum-phase one. Furthermore,

in both cases, it can be seen that the error for the optimized filter bank monotonically decreased

as N increased. Finally, in accordance with our intuition, the optimized filter bank is trying to

emulate the behavior of the infinte order PCFB.

78

� �

�

�

↓ M

↓ M

↓ M

�z

�z

�z

�

�

�

F(z)

M × M

�

�

�

x0(n)

x1(n)

xM−1(n)

�

�

�

�

�

�

F(z)

M × M

�

�

�

z−1

z−1

z−1

�

Channel

C(z) �

η(n)

Noise

�

Equalizer

1C(z)↑ M

↑ M

↑ M

x0(n)

x1(n)

xM−1(n)

Figure 3.15: Uniform PU nonredundant transmultiplexer.

3.6.4 Power Minimization for Nonredundant PU Transmultiplexers

In addition to applications in data compression, the theory of PCFBs has also been found useful

in digital communications involving the design of optimal PU transmultiplexers [73]. A typical

nonredundant PU transmultiplexer [67] in polyphase form is shown in Fig. 3.15. We distinguish

nonredundant transmultiplexers from redundant ones such as those used in DMT transceivers in

which the polyphase matrix F(z) is L×M with L < M . The system of Fig. 3.15 represents a digital

communications system in which M users {xk(n)} transmit data over a common path. Prior to

receiving the data and separating the users at the receiver, the incoming signal undergoes a linear

distortion in the form of the channel C(z) and a noise process η(n) is added to it. To undo the

effects of the channel, we assume that a zero-forcing equalizer [38, 73] of 1C(z) has been used here.

Assuming that the k-th input signal xk(n) consists of pulse amplitude modulated (PAM) sym-

bols [38] with bk bits and power Pk, then if the noise η(n) is Gaussian, the probability of error in

detecting the symbol xk(n) is given by [73]

Pe(k) = 2(1 − 2−bk

)Q(√

3Pk

(22bk − 1) σ2qk

)(3.38)

Here Q(x) is the Marcum Q function which is frequently used in communications [38]. Also, σ2qk

denotes the noise power seen at the k-th output xk(n). Solving (3.38) for Pk yields

Pk = β (Pe(k), bk)σ2qk

where β (Pe(k), bk) =

(22bk − 1

)3

[Q−1

( Pe(k)2 (1 − 2−bk)

)]2

(3.39)

As Pk is a linear function of σ2qk

, it follows that the total power P given by

P =M−1∑k=0

Pk (3.40)

is a convex function of the variances{σ2

qk

}. As such, this power is minimized if F(z) is chosen

to be a PCFB for the effective noise process seen at the input to the receiver [73]. If η(n) is a

79

2 4 6 8 10

2

2.5

3

3.5

4

4.5

5

x 104

N

Tot

al R

equi

red

Pow

er PCFBFIR PU (opt)FIR PU (min−φ)KLT

Figure 3.16: Total required power P from (3.40) as a function of N .

WSS process with psd Sηη(ejω), then the effective noise seen at the receiver input is WSS with psdSηη(ejω)

|C(ejω)|2 . Hence, the total power P from (3.40) is minimized if F(z) is a PCFB for the psd Sηη(ejω)

|C(ejω)|2 .

As an example, suppose that the desired probability of error is Pe(k) = 10−9 for all k. Also,

suppose that b0 = 2, b1 = 4, and b2 = 6. (This is a not an optimal bit allocation [73] and is only

chosen as such for simplicity.) Finally, suppose that the effective noise psd Sηη(ejω)

|C(ejω)|2 is simply the

psd Sxx(ejω) shown in Fig. 3.1. Then, the observed required powers as a function of the synthesis

polyphase order N is shown in Fig. 3.16. As can be seen, the optimal spectral factor not only

outperforms the minimum-phase one, but also monotonically decreases with N . The optimized

filter bank appears to be approaching the performance of the infinite-order, in line with intuition.


In this chapter, we presented a method for the design of FIR compaction filters based on the

complete parameterization of FIR PU systems shown in Sec. 3.2. Using this same characterization,

we constructed FIR PU signal-adapted filter banks based on the MOC of Sec. 3.5 using only these

filters. Through simulations provided, we showed examples of FIR PU filter banks exhibiting an

increasingly PCFB-like behavior as the filter order increased. This behavior has not previously been

seen in the literature. However, this came at the expense of an exponential increase in complexity on

account of the nonuniqueness of the FIR compaction filter. In the next chapter, we present a direct

method for the design of signal-adapted FIR PU filter banks that avoids the problems caused by

the ambiguity of the compaction filter. Furthermore, with this new method, the above-mentioned

PCFB-like behavior continues to hold true.

80

Chapter 4

Direct FIR PU Approximationof the PCFB

Though the signal-adapted filter bank design algorithm of the previous chapter was shown to yield

FIR PU filter banks that behaved more like the infinite-order PCFB as the filter order increased,

it was shown to suffer from one major drawback. In particular, it was shown that the inherent

nonuniqueness of the FIR compaction filter in terms of its spectral factors added an exponential

increase in computational complexity, since each spectral factor needed to be tested for its perfor-

mance. This suggests that the FIR compaction filter problem is perhaps not well suited for the

design of complete signal-adapted filter banks. To avoid this dilemma, in this chapter, we present

a signal-adapted filter bank design algorithm in which all of the filters are found simultaneously.

In particular, using the complete parameterization of FIR PU systems given in [75, 67], an

iterative algorithm is proposed to approximate, in a weighted least-squares sense, any MIMO desired

response by an FIR PU MIMO approximant. This is both a generalization of and departure from

the FIR compaction filter design method from the previous chapter in the sense that it applies to

general MIMO systems and allows for the incorporation of a weight function. As with the previous

method, at each iteration, one set of parameters in the characterization of the FIR PU system

is globally optimized assuming all other parameters to be fixed. Because of this, as before, the

resulting algorithm is greedy and so the error is guaranteed to be monotonic nonincreasing with

iteration.

When the desired response is the synthesis polyphase matrix of an infinite-order PCFB, the

algorithm can be used for the design of the entire FIR PU synthesis bank. As the desired response in

this case suffers from a phase-type ambiguity, which we define here, a modification to the algorithm

is proposed in which the phase of the FIR approximant is cleverly fed back to the desired response.

81

With this phase feedback modification, which is a generalization of the one proposed in the previous

chapter, the iterative algorithm not only still remains greedy, but also yields a better magnitude-type

fit to the desired response. Simulation results provided here show that, with the phase feedback

modification in effect, the FIR PU filter banks designed exhibit an increasingly PCFB-like behavior

as the filter order increases. In particular, in terms of objectives such as coding gain, denoising

using zeroth-order Wiener filters in the subbands, and power minimization for nonredundant PU

transmultiplexers, the FIR PU filter banks designed monotonically approached the performance

of the infinite-order PCFB as the filter order increased. As with the method presented in the

previous chapter, this serves to bridge the gap between the zeroth-order KLT and infinite-order

PCFB. However, unlike the previous method, there is no additional exponential overhead as we

are no longer plagued by the nonuniqueness of the FIR compaction filter as before. In fact, the

algorithm proposed here is only nominally more computationally complex than the compaction

filter design algorithm of the previous chapter.

As we are able to incorporate a weight function in the design algorithm, in addition to being

useful for the design of signal-adapted filter banks, the algorithm can also be used for the FIR PU

interpolation problem mentioned in Sec. 1.3. Though this problem is still open, the iterative greedy

algorithm proposed here can always be used to approximate a desired interpolant. Furthermore, for

cases in which an interpolant is known to exist, the algorithm can be used to find the interpolant,

as shown through simulations presented. Thus, the main contribution of the proposed algorithm

here is a numerical approach to solve a theoretically intractable problem.

The content of this chapter is drawn from [50].

4.1 Outline

In Sec. 4.2, we analyze the weighted least-squares FIR PU approximation problem. Using the

complete parameterization of FIR PU systems introduced in Sec. 3.2, we show how to obtain the

optimal parameters in Sec. 4.2.1 and 4.2.2. The iterative algorithm for obtaining the FIR PU

approximant is formally introduced in Sec. 4.2.3 and is shown to be greedy there.

In Sec. 4.3, we introduce the phase feedback modification to the iterative algorithm for cases in

which the desired response has a phase-type ambiguity. We begin by formally defining the phase-

type ambiguity in Sec. 4.3.1 and then proceed to derive the phase feedback modification in Sec.

4.3.2. In Sec. 4.3.3, we show that the iterative algorithm continues to be greedy with the phase

82

feedback modification in effect.

Simulation results for the proposed iterative greedy algorithm are presented in Sec. 4.4. In Sec.

4.4.1, we focus on the design of infinite-order PCFB-like FIR PU filter banks. There, the FIR

PU filter banks designed are shown to monotonically behave more and more like the infinite-order

PCFB in terms of several objectives. In Sec. 4.4.2, simulation results are presented for the FIR PU

interpolation problem.

Finally, concluding remarks are made in Sec. 4.5. There, we discuss the bridging of the gap be-

tween the zeroth-order KLT and infinite-order PCFB presented here, which has not been previously

reported in the literature.

4.2 The FIR PU Approximation Problem

Let D(ejω) be any p×r desired response matrix that we wish to approximate with a p×r causal FIR

PU system F(ejω) of McMillan degree (N − 1). Note that we require p ≥ r in order to satisfy the

PU condition F(z)F(z) = Ir. Here, we opt to choose F(ejω) to minimize a weighted mean-squared

Frobenius norm error between D(ejω) and F(ejω) given by

ξ � 12π

∫ 2π

0W (ω)

∣∣∣∣D(ejω) − F(ejω)∣∣∣∣2

Fdω (4.1)

Here, W (ω) is a scalar nonnegative weight function and ||A||F denotes the Frobenius norm of any

matrix A given by ||A||F =√

Tr [A†A] [22]. If we only impose an FIR constraint on F(z), then

the optimal filter coefficients of F(z) can be found in closed form in terms of an appropriate matrix

inverse as shown by Tufts and Francis in 1970 [63]. With the additional PU constraint that we

impose here, however, this problem becomes more complicated as we show.

Expanding (4.1) and using the PU condition F(z)F(z) = Ir on F(z) yields the following.

ξ =12π

∫ 2π

0W (ω)

∣∣∣∣D(ejω)∣∣∣∣2

Fdω +

r

2π

∫ 2π

0W (ω) dω︸︷︷︸

a

− 12π

∫ 2π

0W (ω) Tr

[D†(ejω)F(ejω) + F†(ejω)D(ejω)

]dω (4.2)

Note that the quantity a in (4.2) is simply a constant and that the only quantity that depends on

the system F(z) is the last term of (4.2). Hence, with the PU constraint in effect, the error ξ is

linear or first-order in F(z). This will greatly simplify the optimization problem as will soon be

shown.

83

To help solve this optimization problem with the PU constraint on F(z), we exploit the complete

parameterization of causal FIR PU systems in terms of Householder-like degree-one building blocks

[75, 67]. In particular, F(z) is a causal FIR PU system of McMillan degree (N − 1) iff it is of the

form

F(z) = V(z)U (4.3)

where V(z) is a p× p PU matrix consisting of (N − 1) degree-one Householder-like building blocks

of the form

V(z) =1∏

i=N−1

Vi(z) , Vi(z) = Ip − viv†i + z−1viv

†i , 1 ≤ i ≤ N − 1 (4.4)

where the vectors vi are unit norm vectors, i.e., v†ivi = 1 for all i. Also, the matrix U is some p× r

unitary matrix, i.e., U†U = Ir.

Though it is difficult to jointly optimize the parameters U and {vk} which minimize ξ from

(4.2), it will be shown that optimizing each parameter separately while holding all other parameters

fixed is very simple. This will lead to the proposed iterative algorithm whereby the parameters are

individually optimized at each iteration.

4.2.1 Optimal Choice of U

Substituting (4.3) into (4.2) yields the following.

ξ = a − 12π

∫ 2π

0W (ω) Tr

[D†(ejω)V(ejω)U + U†V†(ejω)D(ejω)

]dω

= a − Tr[ (

12π

∫ 2π

0W (ω)D†(ejω)V(ejω) dω

)︸︷︷︸

A†

U]

− Tr[

U†(

12π

∫ 2π

0W (ω)V†(ejω)D(ejω) dω

)︸︷︷︸

A

](4.5)

= a − 2 Re[Tr[U†A

]]︸︷︷︸

µ

(4.6)

Note that minimizing ξ from (4.6) is equivalent to maximizing µ. To find the optimal p× r unitary

matrix U which maximizes µ, we must exploit the singular value decomposition (SVD) [22] of A.

Suppose that A has the following SVD.

A = TΣW† (4.7)

84

Here, T and W are, respectively, p × p and r × r unitary matrices. The quantity Σ is a p × r

diagonal matrix of the form

Σ =

Σ0 0ρ×(r−ρ)

0(p−ρ)×ρ 0(p−ρ)×(r−ρ)

(4.8)

where ρ = rank(A) and Σ0 is a diagonal matrix of the singular values of A. In other words, we

have Σ0 = diag (σ0, σ1, . . . , σρ−1) where {σi} are the singular values of A which satisfy σi > 0 for

all 0 ≤ i ≤ ρ − 1. Substituting (4.7) into (4.6) yields the following.

µ = Re[Tr[U†TΣW†

]]= Re

[Tr[ΣW†U†T︸︷︷︸

G†

] ](4.9)

Note that the p × r matrix G = T†UW is unitary, i.e., G†G = Ir. As such, the components of G

satisfy

Re[[G]k,�

]≤ 1 (4.10)

with equality iff [G]k,q = δ(q−�) and [G]p,� = δ(p−k), since the columns of G form an orthonormal

set of vectors [22]. Using (4.8) in (4.9) yields

µ = Re[Tr[ΣG†

]]= Re

[ρ−1∑i=0

σi [G]∗i,i

]=

ρ−1∑i=0

σi Re[[G]∗i,i

]=

ρ−1∑i=0

σi Re[[G]i,i

](4.11)

In light of (4.10) and the fact that σi > 0 for all i, from (4.11), we have

µ ≤ρ−1∑i=0

σi (4.12)

with equality iff [G]i,q = δ(q − i) and [G]p,i = δ(p − i) for all 0 ≤ i ≤ ρ − 1. Since G is unitary, we

have equality iff

G = Gopt =

Iρ 0ρ×(r−ρ)

0(p−ρ)×ρ G0

(4.13)

where G0 is an arbitrary (p−ρ)× (r−ρ) unitary matrix, i.e., G†0G0 = I(r−ρ). As G = T†UW, we

have U = TGW†, and so the optimum U and corresponding optimal value of ξ is given by (4.12)

and (4.6) to be the following.

Uopt = TGoptW† with Gopt as in (4.13) , ξopt = a − 2

(ρ−1∑i=0

σi

)(4.14)

85

In the special case where ρ = r (i.e., A has full rank), we have

Uopt = T

W†

0(p−r)×r

, ξopt = a − 2

(r−1∑i=0

σi

)

Since the matrices T and W from (4.14) depend on V(z), the choice of U from (4.14) is optimal

for fixed W (ω), D(ejω), and V(z).

4.2.2 Optimal Choice of vk

In order to find the optimal choice of vk assuming that all other parameters are fixed, we must

cleverly extract only those portions of ξ which depend on vk. Similar to what was done for the

iterative method of the previous chapter (see (3.15) and (3.16)), for simplicity, let us define the

following p × p matrices.

Lk(z) �

k+1∏

i=N−1

Vi(z) , 0 ≤ k ≤ N − 2

Ip , k = N − 1

(4.15)

Rk(z) �

Ip , k = 1

1∏i=k−1

Vi(z) , 2 ≤ k ≤ N(4.16)

As before, note that Lk(z) and Rk(z) are, respectively, the left and right neighbors of the matrix

Vk(z) for 1 ≤ k ≤ N −1 appearing in V(z) from (4.4). In other words, the following relation holds

true here.

V(z) = Lk(z)Vk(z)Rk(z) , 1 ≤ k ≤ N − 1 (4.17)

86

Also note that by construction, we have L0(z) = RN (z) = V(z). Substituting (4.17) and (4.4) into

(4.3) and (4.2) yields the following.

ξ = a − 12π

∫ 2π

0W (ω) Tr

[D†(ejω)Lk(ejω)Rk(ejω)U

]dω︸︷︷︸

c

+12π

∫ 2π

0W (ω) Tr

[D†(ejω)Lk(ejω)

(1 − e−jω

)vkv

†kRk(ejω)U

]dω

− 12π

∫ 2π

0W (ω) Tr

[U†R†

k(ejω)L†

k(ejω)D(ejω)

]dω︸︷︷︸

c∗

+12π

∫ 2π

0W (ω) Tr

[U†R†

k(ejω)

(1 − ejω

)vkv

†kL†

k(ejω)D(ejω)

]dω (4.18)

= a − 2 Re [c] + v†k

[12π

∫ 2π

0W (ω)

(1 − e−jω

)Rk(ejω)UD†(ejω)Lk(ejω) dω

]︸︷︷︸

B

vk

+ v†k

[12π

∫ 2π

0W (ω)

(1 − ejω

)L†k(e

jω)D(ejω)U†R†k(e

jω) dω

]︸︷︷︸

B†

vk (4.19)

= a − 2 Re [c] + v†k

(B + B†

)︸︷︷︸

Q

vk = a − 2 Re [c] + v†kQvk︸︷︷︸

ν

(4.20)

Here, the quantity c defined in (4.18) depends on all of the parameters except vk. Hence, to

minimize ξ with respect to vk, we must minimize the quantity ν from (4.20). But note that

ν = v†kQvk is simply a quadratic form corresponding to the Hermitian matrix Q [22]. As vk must

satisfy v†kvk = 1, it follows from Rayleigh’s principle [22] that the optimal vk must be a unit norm

eigenvector corresponding to the smallest eigenvalue of Q. If λmin denotes the smallest eigenvalue

of Q and wmin is any unit norm eigenvector corresponding to λmin, then the optimum choice of vk

and corresponding optimal ξ are given by (4.20) to be the following.

vk,opt = wmin , ξopt = a − 2 Re [c] + λmin (4.21)

Note that since wmin from (4.21) depends on Lk(z), Rk(z), and U, it follows that the choice of vk

from (4.21) is optimal for fixed W (ω), D(ejω), U, and all vi for which i �= k.

In summary, finding the optimal parameters corresponding to the Householder-like factorization

of causal FIR PU systems is simple if the parameters are optimized individually. The process of

updating the individual parameters to their optimal values forms the basis of the proposed iterative

algorithm for solving the FIR PU approximation problem, which we now present.

87

4.2.3 Iterative Greedy Algorithm for Solving the Approximation Problem

Let ξm denote the mean-squared error at the m-th iteration for m ≥ 0. Then, the iterative algorithm

for solving the FIR PU approximation problem is as follows.

Initialization:

1. Generate a random p× r unitary matrix U and (N − 1) p× 1 random unit norm vectors

vi, 1 ≤ i ≤ N − 1.

2. Compute the matrix RN (z) using (4.16).

Iteration: For m ≥ 0, do the following.

1. If m is a multiple of N :

(a) Calculate the optimal U and corresponding ξm using (4.14), (4.7), and (4.5) with

V(z) = RN (z).

(b) Compute L0(z) = V(z) and R1(z) = Ip.

Otherwise, if m ≡ k mod N where 1 ≤ k ≤ N − 1:

(a) From (4.15), update the left matrix as Lk(z) = Lk−1(z)Vk(z).

(b) Calculate the optimal vk and corresponding ξm using (4.21), (4.20), (4.19), and

(4.18).

(c) From (4.16), update the right matrix as Rk+1(z) = Vk(z)Rk(z).

2. Increment m by 1 and return to Step 1.

Note that like the iterative algorithm of the previous chapter (see Sec. 3.3.2), as the iterations

progress, the left matrix is shortened by the old optimal vectors vk whereas the right matrix is

lengthened by the newly computed ones. After all vks have been optimized, the left matrix assumes

the value of the right matrix while the right matrix is then refreshed to be the identity matrix.

Similar to the iterative algorithm of Sec. 3.3.2, at each stage in the iteration, we are optimizing

one parameter while fixing the rest, and so the above technique is a greedy algorithm. As such, the

error ξm is guaranteed to be monotonic nonincreasing as a function of m. Furthermore, as ξm has a

lower bound (i.e., we always have ξm ≥ 0), ξm is guaranteed to have a limit as m → ∞ [67]. Thus,

the algorithm is guaranteed to converge monotonically to a local optimum. Prior to presenting

simulation results for the iterative algorithm, we introduce the phase feedback modification for

cases in which the desired response D(ejω) has a phase-type ambiguity, which we will define shortly.

88

4.3 Phase Feedback Modification

4.3.1 Phase-Type Ambiguity

Referring back to Fig. 1.13(b), suppose that we would like to design an FIR PU synthesis polyphase

matrix approximant to that of the infinite-order PCFB as described in Section 1.2.1. In this case,

the desired response D(ejω) is any system that totally decorrelates and spectrally majorizes the

blocked input signal x(n) (i.e., D(ejω) diagonalizes Sxx(ejω) for every ω in such a way that the

eigenvalues are arranged in descending order [68, 1]). This implies a nonuniqueness for the desired

response D(ejω). To see this, note that D(ejω) must contain the unit norm eigenvectors of Sxx(ejω)

arranged in some order to preserve the spectral majorization property. Partitioning D(ejω) into its

columns as

D(ejω) =[

d0(ejω) d1(ejω) · · · dM−1(ejω)]

(4.22)

it follows that dk(ejω) is a unit norm eigenvector of Sxx(ejω) for all ω. As any unit magnitude

scale factor of a unit norm eigenvector is itself a unit norm eigenvector, it follows that any system

of the form

Da(ejω) =[

d0(ejω)ejφ0(ω) d1(ejω)ejφ1(ω) · · · dM−1(ejω)ejφM−1(ω)]

= D(ejω)Λ(ejω) , where Λ(ejω) = diag(ejφ0(ω), ejφ1(ω), . . . , ejφM−1(ω)

)(4.23)

is a valid desired response for an infinite-order PCFB. If the eigenvalues of Sxx(ejω) are distinct

for all ω, then all valid desired responses are related to each other as in (4.23). On the other hand,

if the eigenvalues are not distinct at some frequency, say ω0, then at that frequency, the columns of

any one desired response corresponding to the nondistinct eigenvalues can be expressed as a unitary

combination of the same columns of any other desired response. As an example, suppose that at

ω0, the largest eigenvalue of Sxx(ejω) has multiplicity 2. Then, given any desired response D(ejω)

89

of the form given in (4.22), we can obtain another desired response Da(ejω) in which we have

Da(ejω0) =[ [

d0(ejω0) d1(ejω0)]Φ(ω0) d2(ejω0)ejφ2(ω0) · · · dM−1(ejω0)ejφM−1(ω0)

]

= D(ejω0)

Φ(ω0) 0 · · · · · · 0

0 ejφ2(ω0) 0 · · · 0... 0

. . . . . ....

......

. . . . . . 0

0 0 · · · 0 ejφM−1(ω0)

︸︷︷︸

Λ(ejω0 )

where Φ(ω0) is a 2 × 2 unitary matrix. In general, for an eigenvalue with multiplicity µ, the

corresponding eigenvectors of one desired response can be related in terms of any other via a µ×µ

unitary matrix.

Any p × r desired response Da(ejω) which has a nonuniqueness of the form

Da(ejω) = D(ejω)Λ(ejω) (4.24)

where D(ejω) is some p× r given desired response and Λ(ejω) is an r × r block diagonal matrix of

unitary matrices will be said to have a phase-type ambiguity, since the phases of the columns are

arbitrary in this case. (In the PCFB example described here, the number of blocks of Λ(ejω) is

equal to the number of distinct eigenvalues of Sxx(ejω) and the size of each block is equal to the

multiplicity of each of these eigenvalues.) When the desired response has a phase-type ambiguity,

some desired responses may yield a better overall FIR PU approximant than others. The reason for

this is that the causal FIR constraint we assume here imposes severe restrictions on the allowable

phase of the FIR PU approximant. Since we do not know the best desired response to choose a

priori, we propose a phase feedback modification to the iterative greedy algorithm of Section 4.2.3

in order to learn the proper desired response.

4.3.2 Derivation of the Phase Feedback Modification

Suppose that we are given a desired response D(ejω) with a phase-type ambiguity as in (4.24).

In addition, suppose that the matrix Λ(ejω) from (4.24) corresponds only to a simple phase-type

ambiguity of the form

Λ(ejω) = diag(ejφ0(ω), ejφ1(ω), . . . , ejφr−1(ω)

)∀ ω

90

The question then arises as to how to choose the phases {φk(ω)} to minimize the mean-squared

error in (4.1) with the desired response D(ejω) replaced by Da(ejω), which is given to be

ξ =12π

∫ 2π

0W (ω)

∣∣∣∣Da(ejω) − F(ejω)∣∣∣∣2

Fdω (4.25)

To solve this problem, we partition the old given desired response D(ejω) and the FIR PU approx-

imant F(ejω) as follows.

D(ejω) =[

d0(ejω) d1(ejω) · · · dr−1(ejω)]

F(ejω) =[

f0(ejω) f1(ejω) · · · fr−1(ejω)]

Then, from (4.25), it can easily be shown that we have

ξ =12π

∫ 2π

0W (ω)

(r−1∑k=0

∣∣∣∣∣∣dk(ejω)ejφk(ω) − fk(ejω)∣∣∣∣∣∣2

2

)dω (4.26)

Note that we can minimize ξ from (4.26) by minimizing each term of the summation pointwise in

frequency. This can be done here since the phases {φk(ω)} are independent functions of k that

have arbitrary response (in terms of ω). Hence, minimizing ξ is tantamount to minimizing

�k(ω) �∣∣∣∣∣∣dk(ejω)ejφk(ω) − fk(ejω)

∣∣∣∣∣∣22

(4.27)

for each k. Upon expanding �k(ω) in (4.27), we get the following.

�k(ω) =∣∣∣∣dk(ejω)

∣∣∣∣22+∣∣∣∣fk(ejω)

∣∣∣∣22− e−jφk(ω)d†

k(ejω)fk(ejω) − f †k(ejω)dk(ejω)ejφk(ω) (4.28)

Expressing d†k(e

jω)fk(ejω) as

d†k(e

jω)fk(ejω) =∣∣∣d†

k(ejω)fk(ejω)

∣∣∣ ejθk(ω) (4.29)

then we have f †k(ejω)dk(ejω) =∣∣∣d†

k(ejω)fk(ejω)

∣∣∣ e−jθk(ω), and so from (4.28), we get

�k(ω) =∣∣∣∣dk(ejω)

∣∣∣∣22+∣∣∣∣fk(ejω)

∣∣∣∣22− 2

∣∣∣d†k(e

jω)fk(ejω)∣∣∣ cos(θk(ω) − φk(ω)) (4.30)

Hence, to minimize �k(ω), we must choose φk(ω) as follows.

φk,opt(ω) = θk(ω) (4.31)

Thus, from (4.31) and (4.29), it can be seen that the optimal thing to do for each column of the

desired response is to mix its phase with that of the FIR PU approximant. In other words, the

phase of the FIR PU approximant must be fed back to the desired response in order to minimize

the mean-squared error.

91

4.3.3 Greediness of the Phase Feedback Modification

With the phase feedback modification of (4.31) in effect, it can be shown that the iterative algorithm

from Section 4.2.3 still remains greedy. To see this, suppose that a phase feedback is performed

at the m-th iteration and let ξm,before and ξm,after denote, respectively, the error before and after

the phase feedback. Note that ξm,before and ξm,after are given by (4.1) and (4.25), respectively. For

simplicity of notation, let fk;m(ejω) denote the k-th column of F(ejω) at the m-th iteration and

let θk;m(ω) denote the phase of the inner product d†k(e

jω)fk;m(ejω) as in (4.29). Using (4.30) and

(4.27) in (4.26), we get

ξm,before − ξm,after =1π

∫ 2π

0W (ω)

(r−1∑k=0

∣∣∣d†k(e

jω)fk;m(ejω)∣∣∣ (1 − cos (θk;m(ω)))

)dω ≥ 0

since the integrand from above is always nonnegative. Hence, it follows that ξm,after ≤ ξm,before. As

ξm+1,after ≤ ξm,after, since the unmodified algorithm is greedy, we have ξm+1,after ≤ ξm,before. Thus,

the algorithm remains greedy even with the phase feedback modification in effect. As will be shown

in Sec. 4.4.1 regarding the design of PCFB-like FIR PU filter banks, the phase feedback modification

can offer a better magnitude-type fit to the desired response than the unmodified algorithm.


4.4.1 Design of PCFB-like FIR PU Filter Banks

Recall that the proposed iterative algorithm can be used to design a PCFB-like FIR PU filter bank

when the desired response D(ejω) is the synthesis polyphase matrix of any infinite-order PCFB for

the psd Sxx(z) of the blocked filter bank input x(n) from Fig. 1.13(b). Suppose that the unblocked

scalar input signal x(n) from Fig. 1.13 is a real WSS autoregressive order 4 (AR(4)) process whose

psd Sxx(ejω) is as shown in Fig. 4.1. From Sec. 2.2.1.1, recall that if x(n) is itself WSS with psd

Sxx(z), then the psd of the blocked process Sxx(z) is a pseudocirculant matrix (see the Appendix

of Chapter 2) formed from Sxx(z). In this case, the synthesis filters {Fk(z)} corresponding to any

infinite-order PCFB are ideal bandpass compaction filters corresponding to Sxx(ejω) and its peeled

spectra [68]. Also recall that because of the assumed orthonormality condition of (1.10), it follows

that the corresponding analysis filters {Hk(z)} are also ideal compaction filters.

Here, we considered the design of an M = 4 channel system. To first test the proposed algorithm,

we opted to see the effects of the phase-type ambiguity of (4.24) on the desired response D(ejω) for a

fixed synthesis polyphase matrix length of N = 10 and fixed weight function W (ω) ≡ 1. This implies

92

0 0.1 0.2 0.3 0.4 0.50

10

20

30

40

f = ω/2π

Mag

nitu

de

Figure 4.1: Input psd Sxx(ejω) of the AR(4) process x(n).

that the corresponding synthesis filters {Fk(z)} are causal and FIR of length MN = 40. All integrals

were evaluated numerically using 1,024 uniformly spaced frequency samples. A plot of the observed

error ξm as a function of the iteration index m for both the unmodified and phase feedback modified

algorithms is shown in Fig. 4.2(a) for a total of KN iterations1, where we chose K =⌈

3,000N

⌉. A

magnified plot of the first 20 iterations is shown in Fig. 4.2(b). For the phase feedback modified

algorithm, a phase feedback was performed at every iteration. As can be seen in Fig. 4.2(a), both

algorithms exhibit a monotonically nonincreasing error that appears to be approaching a limit as

expected. Both errors appear to saturate after about 500 iterations. In addition, it can be seen

that the error for the phase feedback modified algorithm is much lower overall than that of the

unmodified one. Though it is difficult to see (view Fig. 4.2(b) for more clarity), with the randomly

chosen initial conditions, we had ξ0 = 6.8570 for the unmodified algorithm and ξ0 = 7.2634 for

the phase feedback modified one. Clearly, the phase feedback modification here yielded an overall

lower mean-squared error by finding, in some sense, the best PCFB to accomodate the causal FIR

PU constraint in effect.

To see the effects of the phase feedback modification more clearly, in Figs. 4.3 and 4.4, we have

plotted, respectively, the magnitude squared responses of the resulting synthesis filters {Fk(z)}for the original and modified algorithms together with the responses of the infinite-order PCFB

synthesis filters. (Due to the phase-type ambiguity present in D(ejω) here, only the magnitude has1We opted for an integer multiple of N iterations to ensure that all of the parameters were optimized the same

number of times. Also, for all of the simulation results presented in this section, a total of KN iterations was usedeach time, where K =

⌈3,000

N

⌉.

93

0 500 1000 1500 2000 25000

2

4

6

8

10

m

ξ m

unmodifiedPF modified

(a)

0 5 10 150

2

4

6

8

10

m

ξ m

unmodifiedPF modified

(b)

Figure 4.2: Mean-squared error ξm vs. iteration m for both the unmodified and phase feedbackmodified iterative algorithms. (a) Plot of KN = 3, 000 iterations, (b) Magnified plot of the first 20iterations.

been plotted since the infinite-order PCFB filters can have arbitrary phase.) As can be seen, the

FIR synthesis filters designed with the phase feedback modification offer a better magnitude-type

fit to the infinite-order PCFB filters than those designed with the unmodified algorithm. Due to

this observed phenomenon, we opted to carry out the rest of the PCFB simulations using the phase

feedback modification. It should also be noted that the remainder of the PCFB simulations in this

section were carried out for the real AR(4) process x(n) with psd Sxx(ejω) as in Fig. 4.1.

4.4.1.1 Multiresolution Optimality Results

Recall from Sec. 1.2.1 and 3.6.2 that due to the subband majorization property of the PCFB,

the PCFB is optimal for maximizing the proportion of the partial subband variances to the total

variance. By preserving only L out of M subbands, this proportion is given by

P (L) �

L−1∑i=0

σ2wi

M−1∑i=0

σ2wi

, 1 ≤ L ≤ M

Note that P (L) for each L is a measure of multiresolution optimality as it measures the amount of

signal energy compacted into the first L subbands of the filter bank.

94

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

f = ω/2π

Mag

nitu

de

PCFBFIR PU

(a)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

f = ω/2πM

agni

tude

PCFBFIR PU

(b)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

f = ω/2π

Mag

nitu

de

PCFBFIR PU

(c)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

f = ω/2π

Mag

nitu

de

PCFBFIR PU

(d)

Figure 4.3: Magnitude squared responses of the PCFB and FIR PU synthesis filters using theunmodified iterative algorithm. (a) F0(z), (b) F1(z), (c) F2(z), (d) F3(z).

95

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

f = ω/2π

Mag

nitu

de

PCFBFIR PU

(a)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

f = ω/2πM

agni

tude

PCFBFIR PU

(b)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

f = ω/2π

Mag

nitu

de

PCFBFIR PU

(c)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

f = ω/2π

Mag

nitu

de

PCFBFIR PU

(d)

Figure 4.4: Magnitude squared responses of the PCFB and FIR PU synthesis filters using the phasefeedback modified iterative algorithm. (a) F0(z), (b) F1(z), (c) F2(z), (d) F3(z).

96

1 2 3 40.3

0.4

0.5

0.6

0.7

0.8

0.9

1

L

P(L

)

PCFBFIR PUKLT

(a)

1 2 3 40.3

0.4

0.5

0.6

0.7

0.8

0.9

1

L

P(L

)

PCFBFIR PUKLT

(b)

Figure 4.5: Proportion of the total variance P (L) as a function of the number of subbands kept Lfor an M = 4 channel system with (a) N = 3 and (b) N = 10.

Using the proposed iterative algorithm for the design of a PCFB-like filter bank for the real

AR(4) process x(n) considered here, a plot of the observed proportion P (L) as a function of the

number of subbands preserved L is shown in Fig. 4.5 for N = 3 and N = 10. Included in Fig. 4.5

are the performances of the zeroth-order PCFB (namely, the KLT) as well as the infinite-order one.

As can be seen, both FIR filter banks designed outperform the KLT. Furthermore, by comparing

Fig. 4.5(a) and (b), it can be seen that as the filter order increased, the subband variances came

closer to those of the infinite-order PCFB.

To show another example of this phenomenon, we considered the design of an M = 8 channel

system. For this case, a plot of P (L) as a function of L is shown in Fig. 4.6(a) and (b) for N = 3

and N = 10, respectively. As before, it can be seen that as the filter order increased, the subband

variances of the FIR filter banks came closer to those of the infinite-order PCFB2. This is in

accordance with intuition which states that as the filter order increases, the designed FIR PU filter

banks should behave more and more like the infinite-order PCFB.2It should be noted that this phenomenon continues to hold true for larger M , however the results become less

dramatic since the gap between the KLT and infinite-order PCFB shrinks as M increases.

97

2 4 6 80.3

0.4

0.5

0.6

0.7

0.8

0.9

1

L

P(L

)

PCFBFIR PUKLT

(a)

2 4 6 80.3

0.4

0.5

0.6

0.7

0.8

0.9

1

L

P(L

)

PCFBFIR PUKLT

(b)

Figure 4.6: Proportion of the total variance P (L) as a function of the number of subbands kept Lfor an M = 8 channel system with (a) N = 3 and (b) N = 10.

4.4.1.2 Coding Gain Results

From Sec. 1.2.1 and 3.6.1, recall that the PCFB is optimal for coding gain with optimal bit allocation

in the subbands [62, 1]. Assuming optimal bit allocation, the coding gain is given by [67]

Gcode =

1M

M−1∑i=0

σ2wi(

M−1∏i=0

σ2wi

) 1M

Here, the proposed iterative algorithm was used to design an M = 4 channel PCFB-like filter

bank in which the synthesis polyphase matrix length N was varied from 1 to 10. A plot of the

coding gain observed as a function of N is shown in Fig. 4.73. In addition, we have included the

coding gain of the KLT (2.1276 dB) along with that of the infinite-order PCFB (8.3081 dB). From

Fig. 4.7, we can see that even at small filter orders the FIR PU filter banks designed yielded a

much larger coding gain than the KLT. Furthermore, the optimized FIR filter banks exhibited a

monotonically increasing coding gain. This is consistent with intuition which dictates that as the

filter order increases, the FIR filter banks designed should become more and more PCFB-like. From

Fig. 4.7, it appears as though the coding gain of the FIR filter banks will asymptotically achieve

the infinite-order PCFB performance as N → ∞.3For N = 1, the KLT of x(n) was chosen as the optimal solution, since it is the PCFB for this class of filter banks.

98

2 4 6 8 101

2

3

4

5

6

7

8

9

N

Cod

ing

Gai

n (d

B) PCFB

FIR PUKLT

Figure 4.7: Observed coding gain Gcode as a function of the FIR PU filter order parameter N .

4.4.1.3 Noise Reduction Using Zeroth-Order Wiener Filters

In addition to being optimal for coding gain, recall from Sec. 3.6.3 that the PCFB is optimal for

denoising of white noise with zeroth-order Wiener filters in the subbands. From (3.37), recall that

the mean-squared error in the presence of zeroth-order Wiener filters is given by

ε =1M

M−1∑i=0

σ2wi

η2

σ2wi

+ η2

where σ2wi

denotes the desired signal variance of the i-th subband and η2 denotes the variance of

the white noise process.

Using the same FIR PU filter banks as those computed in Section 4.4.1.2, the observed mean-

squared error ε from (3.37) as a function of N is shown in Fig. 4.8 for (a) η2 = 1 and (b) η2 = 4.

As can be seen in both cases, the FIR filter banks significantly outperform the KLT. Furthermore,

it can be seen that the error monotonically decreased as N increased, in accordance with intuition.

Asymptotically, it appears as though the optimized FIR filter bank is trying to emulate the behavior

of the infinte order PCFB.

4.4.1.4 Power Minimization for DMT-Type Transmultiplexers

From Sec. 3.6.4, recall that the PCFB is optimal for minimizing the total power of a PU transmul-

tiplexer digital communications system in which the noise is Gaussian, the input signals consist of

PAM symbols [38], a zero-forcing equalizer [38] has been used, and the symbol error probabilities

99

2 4 6 8 100.5

0.55

0.6

0.65

0.7

N

Mea

n S

quar

ed E

rror PCFB

FIR PUKLT

(a)

2 4 6 8 101.1

1.2

1.3

1.4

1.5

1.6

N

Mea

n S

quar

ed E

rror PCFB

FIR PUKLT

(b)

Figure 4.8: Noise reduction performance (ε from (3.37)) with zeroth-order subband Wiener filtersas a function of the FIR PU filter order parameter N for (a) noise variance (η2) of 1 and (b) noisevariance of 4.

and bit allocations are fixed. Recall from (3.39) and (3.40) that the total power is given by

P =M−1∑k=0

β (Pe(k), bk)σ2qk

where β (Pe(k), bk) is a constant that depends only on the symbol error probability and bit allocation

for the k-th subband and σ2qk

denotes the noise power seen at the k-th output. As P is a convex

function of the variances σ2qk

, it is minimized if the transmultiplexer polyphase matrix F(z) is chosen

to be a PCFB for the effective noise process seen at the input to the receiver (i.e., the original noise

filtered by the zero-forcing equalizer).

As an example, suppose that the desired probability of error is Pe(k) = 10−9 for all k. Also,

suppose that we have b0 = 2, b1 = 3, b2 = 4, and b3 = 5. It should be noted that this is a not

an optimal bit allocation [73] and is only chosen here as such for simplicity. Finally, suppose that

the effective noise psd is simply the psd Sxx(ejω) shown in Fig. 4.1. Then, using the proposed

iterative algorithm, the required powers as a function of the synthesis polyphase order N is shown

in Fig. 4.9. As can be seen, the FIR filter banks designed here significantly outperform the KLT

and exhibit a monotonically decreasing power as a function of N , in accordance with intuition.

Furthermore, as before, the optimized FIR filter bank appears to be approaching the performance

of the infinite-order PCFB as the order increases.

100

2 4 6 8 10

0.8

1

1.2

1.4

1.6

1.8

2

x 104

N

Tot

al R

equi

red

Pow

er

PCFBFIR PUKLT

Figure 4.9: Nonredundant DMT-type transmultiplexer total required power P as a function of theFIR PU filter order parameter N .

4.4.2 The FIR PU Interpolation Problem

Recall from Section 1.3 that the FIR PU interpolation problem involves finding an FIR PU sys-

tem of a certain McMillan degree, say F(ejω), which takes on a prescribed set of L values, say

U0,U1, . . . ,UL−1, over a prescribed set of L frequencies, say ω0, ω1, . . . , ωL−1. In other words, we

seek an FIR PU F(z) of a certain degree such that F(ejωk) = Uk for all 0 ≤ k ≤ L − 1. (Clearly

the matrices {Uk} must be unitary.) As mentioned in Section 1.3, there is no known solution to

the FIR PU interpolation problem. However, for this problem, the proposed iterative algorithm

can be used to approximate an interpolant. In this case, the desired response D(ejω) is as follows.

D(ejω) =

Uk , ω = ωk ∀ 0 ≤ k ≤ L − 1

don’t care, otherwise

As we don’t care about the response at all frequencies not in the set {ωk}, it only makes sense

that these regions be given no weight in the approximation problem. One weight function which

accomodates this need is the interpolation weight function Wint(ω), given by the following.

Wint(ω) = 2π

L−1∑k=0

pkδ(ω − ωk) (4.32)

Here, the pks are discrete weight parameters used to emphasize the design of some interpolation

conditions over others which satisfies

pk ≥ 0 ,L−1∑k=0

pk = 1

101

0 100 200 300 4000

2

4

6

8

10

12

14

m

ξ m

Figure 4.10: FIR PU interpolation problem - Example 1: Mean-squared error ξm vs. iteration m.

In other words, {pk} is a discrete probability density function (pdf). Substituting (4.32) into the

expression for the weighted mean-squared error ξ from (4.1) yields

ξ =L−1∑k=0

pk

∣∣∣∣D(ejωk) − F(ejωk)∣∣∣∣2

F=

L−1∑k=0

pk

∣∣∣∣Uk − F(ejωk)∣∣∣∣2

F

Hence, with the interpolation weight function Wint(ω) from (4.32), the mean-squared error integral

becomes a discrete summation. This simplifies the proposed iterative algorithm since no numerical

integration is required.

4.4.2.1 FIR PU Interpolation - Example 1

As an example, suppose that we seek a 3 × 2 FIR PU system F(z) such that F(ejω) = Uk for

0 ≤ k ≤ 3, where U0, . . . ,U3 are randomly chosen 3 × 2 unitary matrices. Furthermore, suppose

that the frequencies are chosen as

ω0 = 0 , ω1 =π

2, ω2 =

3π

4, ω3 =

5π

4

Since there are 4 interpolation conditions, we might expect that we need N ≥ 4 for the FIR PU

interpolant in general. Using the proposed iterative algorithm for N = 4, the observed mean-

squared error ξm as a function of iteration m is shown in Fig. 4.10. Here, we used p0 = p1 =

p2 = p3 = 14 (i.e., uniform weighting) and KN iterations, where we chose K =

⌈500N

⌉. As the error

appears to have saturated at a nonzero value (in this case 4.1844), this suggests that there may

not exist an FIR PU system with N = 4 that satisfies the desired interpolation conditions. Despite

this, the algorithm has found a good approximant to the desired interpolant.

102

0 10 20 30 400

0.5

1

1.5

2

2.5

3

m

ξ m

Figure 4.11: FIR PU interpolation problem - Example 2: Mean-squared error ξm vs. iteration m.

4.4.2.2 FIR PU Interpolation - Example 2

To further test the performance of the proposed iterative algorithm, we can use it to obtain an

FIR PU system for which we know that an interpolant exists. For example, suppose that we seek

a 3 × 2 FIR PU system F(z) such that

F(ejω0) = U0 =(I − vv† + e−jω0vv†

)U

F(ejω1) = U1 =(I − vv† + e−jω1vv†

)U

Here, v is an arbitrary 3× 1 unit norm vector and U is a 3× 2 arbitrary unitary matrix. As there

are 2 interpolation conditions, we expect that in general, we need N ≥ 2 here. Clearly, for N = 2,

the choice

F(z) =(I − vv† + z−1vv†

)U (4.33)

satisfies the desired interpolation conditions. Using the proposed iterative algorithm, we can see if

the algorithm can converge to the interpolant of (4.33). For this simulation, we chose ω0 = 0 and

ω1 = 3π4 and p0 = p1 = 1

2 (i.e., uniform weighting). A plot of the observed mean-squared error as

a function of iteration is shown in Fig. 4.11 for KN iterations, where we chose K =⌈

50N

⌉. As we

can see, it appears as though the algorithm does in fact converge to the interpolant of (4.33).

In summary, even though there is no general solution to the FIR PU interpolation problem, the

proposed algorithm offers a way to approximate a suitable interpolant.

103


Using the complete characterization of FIR PU systems in terms of Householder-like degree-one

building blocks, in this chapter we proposed an iterative greedy algorithm for solving a general

matrix version of the weighted least-squares approximation problem for an FIR PU approximant.

For cases in which the desired response matrix exhibited a phase-type ambiguity, which we formally

defined here, a phase feedback modification was proposed in which the phase of the FIR PU

approximant was fed back to the desired response. With this modification in effect, the resulting

algorithm was shown to still remain greedy and provide a better magnitude-type fit of the desired

response.

As opposed to most traditional signal-adapted filter bank design methods which optimize a

specific objective for which the PCFB is optimal, the method used here was to approximate the

infinite-order PCFB itself using a realizable FIR PU filter bank. In contrast to popular criteria for

signal-adapted design, such as the MOC of Chapter 3, which require a filter bank completion step

after a suitable FIR compaction filter has been computed, the method used here avoids this entirely

as all of the filters are found simultaneously. The advantage of this is that we avoid the problems

due to the nonuniqueness of the compaction filter, which plagued the method presented in Chapter

3 with an exponential increase in complexity. Through simulations presented here, it was shown

that the FIR PU filter banks designed monotonically behaved more and more like the infinite-order

PCFB as the filter order increased in terms of numerous objectives. This serves to bridge the gap

between the zeroth-order KLT and infinite-order PCFB. Together with the results of the design

method of Chapter 3, this phenomenon has not previously been reported in the literature.

In addition to being useful for the design of signal-adapted filter banks, we showed that the

iterative algorithm could also be used for the FIR PU interpolation problem. Though this problem

is still open, the algorithm always offers a way to approximate a desired interpolant, which may or

may not even exist. With simulations provided here, we showed that the algorithm could converge

to an interpolant, when one was known to exist. In essence, the algorithm provides us with a

numerical approach to solve a theoretically open problem.

104

Chapter 5

Coding Gain OptimalFIR Filter Banks

This chapter differs somewhat in theme from the rest of the chapters in that we focus on the design

of FIR signal-adapted filter banks in which no PU constraint is enforced. As opposed to traditional

filter bank design algorithms which enforce a PU or biorthogonality condition to be satisfied, we

make no such constraints here. The only constraint that we adhere to is that the analysis/synthesis

filters be FIR and hence realizable.

Here, the model we focus on is a uniform filter bank with scalar quantizers in the subbands.

Such a model is commonly used to achieve good lossy data compression in methods such as JPEG

and MP3 [41, 39]. The goal here is to choose the best analysis/synthesis filters, subject to an FIR

constraint, to minimize the filter bank mean-squared reconstruction error for a fixed bit allocation

among the quantizers. This is shown to be equivalent to maximizing the coding gain of the system.

Under the high bit rate assumption [25], we show how to derive the optimal analysis/synthesis

bank assuming that the corresponding synthesis/analysis bank is fixed. This will lead to an iterative

greedy algorithm for designing the filter bank where the analysis and synthesis banks are alternately

optimized.

Simulation results presented here show the versatility and merit of the proposed greedy algo-

rithm. By neglecting the effects of the quantizers, we show how the method can be used to design

an overdecimated filter bank which minimizes the mean-squared error of its output. Though the

PU constraint is not enforced here, we show that many similarities exist between these FIR filter

banks designed and optimal PU filter banks or PCFBs. In particular, we show that the FIR filters

designed appear to compact the energy of the input process, a property which is also shared with

the PCFB.

105

When we account for the effects of the quantizers, we show how the algorithm can be used to

design optimal FIR pre/postfilters for quantization. The performance in this case is measured in

terms of coding gain and distortion or mean-squared error. We compare the distortion observed

to the rate-distortion bound given by information theory [25, 10, 7]. It is shown that the method

comes close to this bound and comes closer when we increase the filter orders, as expected.

Finally, we consider the design of a maximally decimated system with quantizers in the sub-

bands. As before, the performance is measured in terms of coding gain and distortion. When

compared to the rate-distortion bound, it is shown that the method comes closer to the bound as

we increase the filter orders, in line with intuition. Furthermore, we show that the quantized filter

bank system outperforms the scalar pre/postfiltering quantization system, as expected.

The content of portions of this chapter will be presented at [58].

5.1 Outline

In Sec. 5.2, we present the quantized filter bank model which we will focus on here. There, we

review the high bit rate assumption for scalar quantizers, which greatly simplifies the mathematical

analysis required for the objective under consideration.

In Sec. 5.3, we derive the optimal analysis/synthesis banks for minimizing the reconstruction

error at the output. The optimal synthesis bank for a fixed analysis bank is derived in Sec. 5.3.1,

whereas the optimal analysis bank for a fixed synthesis bank is derived in Sec. 5.3.2. In Sec. 5.3.3,

we formally present an iterative greedy algorithm for obtaining the optimal FIR filter bank for

minimizing the reconstruction error.

Simulation results are presented in Sec. 5.4. In Sec. 5.4.1, we neglect the effects due to quanti-

zation and show how the algorithm can be used to design an overdecimated filter bank. There, in

Sec. 5.4.1.1 and Sec. 5.4.1.2, the similarities and differences between the FIR filter banks designed

and the PCFB are shown. The effects of quantization are discussed in Sec. 5.4.2 along with several

important measures of optimality. In Sec. 5.4.2.1, we use the iterative algorithm to design optimal

FIR pre/postfilters for quantization. The method is shown there to exhibit distortion behavior close

to the rate-distortion bound. In Sec. 5.4.2.2, we consider the design of a maximally decimated filter

bank with quantizers in the subbands. There, it is shown that such filter banks yield performance

close to the rate-distortion bound and outperform the scalar pre/postfiltering quantization system.

Finally, concluding remarks are made in Sec. 5.5.

106

� � H0(z)

�

� H1(z)

�

�

� HL−1(z)

�

�

�

↓ M

↓ M

↓ M

�

�

�

x(n) w0(n)

w1(n)

wL−1(n)

�

Q0

Q1

QL−1

�

�

�

↑ M

↑ M

↑ M

�

�

�

F0(z)

F1(z)

FL−1(z)

�

�

�

�

�

�

�x(n)

w0(n)

w1(n)

wL−1(n)

�

(a)

�

�

�

�

�

�

↑ M

↑ M

↑ M

�

�

�

�

z−1

z−1

z−1

x(n)

x(n)

�

F(z)

M × L

w(n)

��

�

�

↓ M

↓ M

↓ M

�

�

�

�

�

�

� �

�

�

Q0

Q1

QL−1

w(n)

�x(n)

z

z

z

x(n)

�

H(z)

L × M

(b)

Figure 5.1: (a) Uniform filter bank with scalar quantizers in the subbands, (b) Polyphase represen-tation of the filter bank.

5.2 Uniform Quantized Filter Bank Model

The model we focus on here is a uniform filter bank with scalar quantizers in the subbands as shown

in Fig. 5.1(a). This model is often used in data compression to achieve good lossy compression

[41, 39]. Here, the number of channels L is variable, but for the purposes of data compression, we

will often desire L ≤ M , since if L > M , we are introducing an unnecessary redundancy into the

system. If we consider the following M -fold polyphase decompositions of the analysis filters Hk(z)

and synthesis filters Fk(z) for 0 ≤ k ≤ L − 1,

Hk(z) =M−1∑�=0

z�Hk,�

(zM

)(Type II)

Fk(z) =M−1∑�=0

z−�Fk,�

(zM

)(Type I)

107

� Qk�wk(n) wk(n) � �

wk(n) wk(n)

qk(n)

≡

Figure 5.2: High bit rate quantizer model.

then the system of Fig. 5.1(a) can be redrawn as in Fig. 5.1(b), where we have

[H(z)]�,m = H�,m(z) , [F(z)]m,� = F�,m(z) for 0 ≤ � ≤ L − 1 , 0 ≤ m ≤ M − 1

Here, x(n) and x(n) denote, respectively, the M -fold blocked versions of the filter bank input x(n)

and x(n). Also, w(n) and w(n) denote, respectively, the vector of quantizer inputs and outputs

given by

w(n) �[

w0(n) w1(n) · · · wL−1(n)]T

, w(n) �[

w0(n) w1(n) · · · wL−1(n)]T

The quantizer Qk is used to limit the number of possible output values of each sample of its

input wk(n) to a finite amount of possibilities. Most often, the quantizer approximates wk(n) using

a finite amount of discrete levels. For example, if the sample wk(n) is a 16 bit word stored on a

computer in memory, the quantizer Qk may truncate wk(n) to 4 bits by preserving only the 4 most

significant bits of wk(n). Since the quantizer output requires less information to store than its input

in general, it allows us to achieve data compression. As can be seen, the quantizer decreases the

rate of the overall system while introducing distortion, since we are always discarding information

in general. Hence, any compression achieved by the quantizer is necessarily lossy.

Since most if not all digital signals are processed on computers, quantizers are typically charac-

terized by the number of bits that have been allocated to it. For example, if the quantizer Qk from

Fig. 5.1 is allocated bk bits, then its output wk(n) can take on 2bk possible values. Though quan-

tizers are nonlinear devices which often become mathematically intractable to account for exactly,

under certain assumptions and approximations, they become very simple to analyze.

For the remainder of the thesis, we will assume that the high bit rate assumption [25] is valid

for each quantizer Qk here. The high bit rate assumption, which approximately holds true if the

number of bits bk is large enough [25], allows us to treat the quantizer Qk as an additive zero mean

white noise process. This is shown in Fig. 5.2. The variance of the additive white noise qk(n) is

given by

σ2qk

= ck2−2bkσ2wk

(5.1)

108

H(z)

L × MM L

w(n)

L

q(n)

L

w(n)F(z)

M × L Mx(n)x(n)

Figure 5.3: Equivalent quantized filter bank model under the high bit rate assumption.

Here, ck is a quantity called the quantizer performance factor that depends on the coding method

used as well as the probability density function (pdf) of the quantizer input sample wk(n) [25]

(which is assumed to be the same for all n). For example, if Qk is a Lloyd-Max quantizer [25],

which is optimized for the input pdf, then ck = 1.00 for a uniform pdf and ck = 2.71 for a Gaussian

pdf [25]. In addition to the above assumptions on qk(n), it is assumed that qk(n) is uncorrelated

with the other noise processes q�(n) for all � �= k as well as the input signals wk(n) for all k.

Using the high bit rate additive noise quantizer model of Fig. 5.2, the filter bank model of Fig.

5.1(b) can be redrawn as in Fig. 5.3. The vector q(n) simply consists of the noise processes qk(n)

and is given by

q(n) �[

q0(n) q1(n) · · · qL−1(n)]T

Here, we have removed the blocking/unblocking structures used to obtain the scalar signals x(n)

and x(n), as we will only be concerned about their respective equivalent M -fold blocked versions

x(n) and x(n).

As before, we will assume that x(n) is WSS with psd Sxx(z). From the above assumptions, it

follows that the vector q(n) is WSS with psd Sqq(z), where we have

Sqq(z) = Q where Q = diag(σ2

q0, σ2

q1, . . . , σ2

qL−1

)(5.2)

Also, the processes q(n) and w(n) are uncorrelated here.

5.3 Optimizing the Mean-Squared Reconstruction Error

The quantized filter bank model of Fig. 5.1 is plagued with distortion introduced by the quantizers

as well as errors caused by aliasing, amplitude, and phase distortions. As we wish to minimize these

effects, we will consider the problem of minimizing the expected M -fold blocked mean-squared error

ξ given by

ξ � E[||x(n) − x(n)||2

](5.3)

109

for a fixed bit allocation {bk}. Using the high bit rate model of the quantized filter bank shown

in Fig. 5.3, it follows that x(n) and x(n) are jointly WSS. Hence, the error e(n) � x(n) − x(n) is

WSS. We have

ξ = E[e†(n)e(n)

]= Tr

[E[e(n)e†(n)

]]= Tr [Ree(0)] = Tr

[12π

∫ 2π

0See(ejω) dω

](5.4)

where Ree(k) and See(z) denote, respectively, the autocorrelation and psd of e(n). As e(n) =

x(n) − x(n), we have

See(z) = Sxx(z) − Sxx(z) − Sxx(z) + Sxx(z) (5.5)

where Sxx(z) and Sxx(z) denote the cross psds of x(n) and x(n) [48], and Sxx(z) denotes the psd

of x(n). Note that from Fig. 5.3, we have

X(z) = F(z)H(z)X(z) + F(z)Q(z)

Thus, assuming x(n) and q(n) are uncorrelated, we have [67]

Sxx(z) = Sxx(z)H(z)F(z) (5.6)

Sxx(z) = Sxx(z) = F(z)H(z)Sxx(z) (5.7)

Sxx(z) = F(z)H(z)Sxx(z)H(z)F(z) + F(z)Sqq(z)F(z)

= F(z)H(z)Sxx(z)H(z)F(z) + F(z)QF(z) (5.8)

Here, we used (5.2) in (5.8). Substituting (5.6), (5.7), and (5.8) in (5.5), we get

See(z) = Sxx(z) − Sxx(z)H(z)F(z) − F(z)H(z)Sxx(z)

+ F(z)H(z)Sxx(z)H(z)F(z) + F(z)QF(z)(5.9)

Though it is difficult to jointly choose H(z) and F(z) subject to FIR constraints to optimize ξ

(see [20, 19, 21] for a Lagrange multiplier technique for a fixed quantization term Q), doing so one

at a time is simple and can be done globally. This will lead to an iterative algorithm whereby the

analysis and synthesis banks are alternately optimized. Before proceeding, note that the matrix Q

from (5.9) depends on the analysis bank H(z). To see this, note that from (5.2) and (5.1), we have

Q = diag(c02−2b0σ2

w0, c12−2b1σ2

w1, . . . , cL−12−2bL−1σ2

wL−1

)(5.10)

where we have, from Fig. 5.3,

σ2wk

=12π

∫ 2π

0

[H(ejω)Sxx(ejω)H†(ejω)

]k,k

dω (5.11)

Hence, optimizing the analysis bank H(z) for a fixed synthesis bank F(z) is more mathematically

challenging than optimizing F(z) for a fixed H(z). As such, we first consider optimizing F(z) for

a fixed H(z) and then move on to the more challenging task of optimizing H(z) for a fixed F(z).

110

5.3.1 Optimal Synthesis Bank F(z) for Fixed Analysis Bank H(z)

Suppose that the analysis bank H(z) is fixed and that the synthesis bank F(z) is FIR of length Nf

and of the form

F(z) = zPFc(z)

where P is an advance parameter and Fc(z) is a causal FIR system of the form

Fc(z) =Nf−1∑n=0

fc(n)z−n

Note that the impulse response fc(n) is an M × L sequence. If we define the M × LNf matrix f c

and LNf × L delay matrix d(z) as

f c �[

fc(0) fc(1) · · · fc(Nf − 1)]

d(z) �[

zP IL zP−1IL · · · zP−(Nf−1)IL

]T

then clearly we have F(z) = f cd(z) and all of the degrees of freedom in choosing F(z) with the FIR

constraint lie in the choice of the constant matrix f c. From (5.9), we have

See(z) = Sxx(z) − Sxx(z)H(z)d(z)f †c − f cd(z)H(z)Sxx(z)

+ f cd(z)[H(z)Sxx(z)H(z) + Q

]d(z)f †c

(5.12)

Substituting (5.12) into (5.4) yields

ξ = Tr[Rxx(0) − B†f †c − f cB + f cAf

†c

](5.13)

where Rxx(k) denotes the autocorrelation of x(n) and the LNf × LNf matrix A and LNf × M

matrix B are defined as follows.

A � 12π

∫ 2π

0d(ejω)

[H(ejω)Sxx(ejω)H†(ejω) + Q

]d†(ejω) dω (5.14)

B � 12π

∫ 2π

0d(ejω)H(ejω)Sxx(ejω) dω (5.15)

Note that A is simply the Nf -fold block autocorrelation matrix of the process w(n) from Fig. 5.3

[48]. In other words, if Rww(k) denotes the autocorrelation of w(n), then we have

A =

Rww(0) Rww(1) · · · Rww(Nf − 1)

Rww(−1) Rww(0) · · · Rww(Nf − 2)...

.... . .

...

Rww(−(Nf − 1)) Rww(−(Nf − 2)) · · · Rww(0)

111

As such, the matrix A is strictly positive definite and hence invertible [48]. Since A is invertible,

using the trick of completing the square [22, 34], we can express ξ in (5.13) as follows

ξ = Tr[ (

f c − B†A−1)A(f c − B†A−1

)†︸︷︷︸

positive semidefinite

+Rxx(0) − B†A−1B]

As we wish to minimize ξ, this can only be done by setting the first term on the right-hand side

of the above equation equal to the zero matrix. This yields the following optimal choice of f c and

corresponding optimal ξ.

f c,opt = B†A−1 , ξopt = Tr[Rxx(0) − B†A−1B

](5.16)

Here, A and B are as in (5.14) and (5.15), respectively. The choice of f c given in (5.16) is optimal

for a fixed Sxx(z), H(z), and Q.

5.3.2 Optimal Analysis Bank H(z) for Fixed Synthesis Bank F(z)

5.3.2.1 Simplifying the Quantization Noise Term

Prior to optimizing the mean-squared error ξ with respect to the analysis bank H(z), it will help to

simplify the quantization noise term that appears in the expression for See(z) given in (5.9). Note

that from (5.4), we have

ξ =12π

∫ 2π

0Tr[See(ejω)

]dω (5.17)

Also, from (5.9), we have

Tr [See(z)] = Tr [Sxx(z)] − Tr[Sxx(z)H(z)F(z)

]− Tr [F(z)H(z)Sxx(z)]

+ Tr[F(z)H(z)Sxx(z)H(z)F(z)

]+ Tr

[F(z)QF(z)

]︸︷︷︸

G(z)

(5.18)

Before simplifying Tr [See(z)] further, we will simplify the term G(z), which is due to the

quantization noise q(n) filtered by the synthesis bank F(z) as can be seen in Fig. 5.3. The reason

for this is that the matrix Q appearing in G(z) depends on the analysis bank H(z) as can be seen

from (5.10) and (5.11). Using the fact that Tr [AB] = Tr [BA] whenever A and B are conformable

[22], we have

G(z) = Tr[QF(z)F(z)

]Upon using (5.10), we get

G(z) =L−1∑k=0

ck2−2bkσ2yk

[F(z)F(z)

]k,k

=L−1∑k=0

ck2−2bk

[F(z)F(z)

]k,k

σ2yk

112

As σ2yk

is given by (5.11), we have

G(z) =L−1∑k=0

ck2−2bk

[F(z)F(z)

]k,k

(12π

∫ 2π

0

[H(ejλ)Sxx(ejλ)H†(ejλ)

]k,k

dλ

)

=12π

∫ 2π

0

(L−1∑k=0

ck2−2bk

[F(z)F(z)

]k,k

[H(ejλ)Sxx(ejλ)H†(ejλ)

]k,k

)dλ (5.19)

Define the L × L matrix Λ(z) as follows.

Λ(z) � diag(

c02−2b0[F(z)F(z)

]0,0

, c12−2b1[F(z)F(z)

]1,1

, . . . , cL−12−2bL−1

[F(z)F(z)

]L−1,L−1

)(5.20)

Note that Λ(z) does not depend on H(z). Using Λ(z) in (5.19), we can express G(z) as

G(z) =12π

∫ 2π

0Tr[Λ(z)H(ejλ)Sxx(ejλ)H†(ejλ)

]dλ (5.21)

With the simplification of the noise term G(z) given in (5.21), we are now ready to optimize ξ

with respect to H(z) with an FIR constraint in effect.

5.3.2.2 Imposing the FIR Constraint on H(z)

Suppose now that the synthesis bank F(z) is fixed and that the analysis bank H(z) is FIR of length

Nh and of the form

H(z) = zQHc(z)

where Q is an advance parameter and Hc(z) is a causal FIR system of the form

Hc(z) =Nh−1∑n=0

hc(n)z−n

Note that the impulse response hc(n) is an L × M sequence and that there are Nh such matrix

coefficients. Hence, H(z) is characterized by a total of LMNh degrees of freedom with the FIR

constraint in effect. In order to minimize ξ from (5.17) in this case, we need to group all of these

degrees of freedom together, which can be done through the use of the vec operator [34].

Recall that if A is any M × N matrix, then vec(A) is an MN × 1 column vector with [34]

[vec(A)]k � [A]k mod M,� kM � , 0 ≤ k ≤ MN − 1

In other words, the vec operator stacks the columns of any matrix on top of each other creating

a large column vector. Through clever use of the vec operator, we can express ξ from (5.17) as a

quadratic function in terms of the filter coefficients of H(z).

113

For simplicity, define the LM × 1 column vectors hn as follows.

hn � vec (hc(n)) , 0 ≤ n ≤ Nh − 1

Furthermore, define the LMNh×1 vector h, the LM ×1 vector v(z) and the LMNh×LM advance

matrix a(z) as follows.

h �[

hT0 h

T1 · · · h

TNh−1

]T

v(z) � vec(F(z)Sxx(z)

)a(z) �

[z−QILM z1−QILM · · · zNh−1−QILM

]T

Note that h contains all of the degrees of freedom of H(z) here. Also note that due to the linearity

property of the vec operator [34], we have

vec (H(z)) =Nh−1∑n=0

zQ−n vec (hc(n))︸︷︷︸hn

=[

zQILM zQ−1ILM · · · zQ−(Nh−1)ILM

]︸︷︷︸

a(z)

h0

h1

...

hNh−1

︸︷︷︸

h

(5.22)

By exploiting the following properties of the trace and vec operators [34],

Tr[A†B

]= (vec(A))† vec(B) (5.23)

vec(AXB) =(BT ⊗ A

)vec(X) (5.24)

where ⊗ denotes the Kronecker product operator [34], we can express the error ξ as a quadratic

function of the vector h, which can then be minimized by completing the square as was done in

Sec. 5.3.1. Note that for any M ×N matrix A and P ×Q matrix B, the Kronecker product A⊗B

is a MP × NQ matrix defined as follows [34].

A ⊗ B �

[A]0,0 B [A]0,1 B · · · [A]0,N−1 B

[A]1,0 B [A]1,1 B · · · [A]1,N−1 B...

.... . .

...

[A]M−1,0 B [A]M−1,1 B · · · [A]M−1,N−1 B

114

Substituting (5.21) into (5.18) and exploiting (5.23), (5.24), and (5.22), we have the following.

Tr [See(z)] = Tr [Sxx(z)] − Tr[H(z)F(z)Sxx(z)

]− Tr [Sxx(z)F(z)H(z)]

+ Tr[H(z)F(z)F(z)H(z)Sxx(z)

]+

12π

∫ 2π

0Tr[H†(ejλ)Λ(z)H(ejλ)Sxx(ejλ)

]dλ

= Tr [Sxx(z)] − h†a(z)v(z) − v(z)a(z)h

+ h†a(z) vec

(F(z)F(z)H(z)Sxx(z)

)+

12π

∫ 2π

0h†a(ejλ) vec

(Λ(z)H(ejλ)Sxx(ejλ)

)dλ

= Tr [Sxx(z)] − h†a(z)v(z) − v(z)a(z)h

+ h†a(z)

(ST

xx(z) ⊗ F(z)F(z))a(z)h

+ h†(

12π

∫ 2π

0a(ejλ)

(ST

xx(ejλ) ⊗ Λ(z))a†(ejλ) dλ

)h (5.25)

Upon substituting (5.25) into (5.17), we get the following.

ξ = Tr [Rxx(0)] − h†g − g†h + h

†Ch (5.26)

where the LMNh × 1 vector g and LMNh × LMNh matrix C are defined as

g � 12π

∫ 2π

0a(ejω)v(ejω) dω (5.27)

C � 12π

∫ 2π

0a(ejω)

(ST

xx(ejω) ⊗(F†(ejω)F(ejω) + D

))a†(ejω) dω (5.28)

and the L × L matrix D is defined to be

D � 12π

∫ 2π

0Λ(ejω) dω

where Λ(z) is as in (5.20).

As can be seen from (5.26), the mean-squared error ξ is a quadratic function of the coefficients

of H(z) which are contained in the vector h. To minimize ξ from (5.26), we can complete the

square as was done in Sec. 5.3.1. In order to do so, first note that the matrix C from (5.28) is

invertible. To see this, note that from (5.8), h†Ch represents the energy of x(n) from Fig. 5.3, i.e.,

h†Ch = Tr [Rxx(0)] > 0 for all h �= 0, since we assume that the energy of x(n) is nonzero here.

Hence, by completing the square [22], the optimal h and corresponding optimal ξ are given by

hopt = C−1g , ξopt = Tr [Rxx(0)] − g†C−1g (5.29)

Here, g and C are given by (5.27) and (5.28), respectively. The choice of h given in (5.29) is optimal

for a fixed Sxx(z), F(z), {ck}, and {bk}.

115

5.3.3 Iterative Greedy Analysis/Synthesis Filter Bank Optimization Algorithm

By alternately optimizing the analysis and synthesis banks, we obtain an iterative greedy algorithm

for designing an FIR filter bank adapted to the psd Sxx(z) and the bit allocation {bk}. In what

follows, let Fk(z), Hk(z), and ξk denote, respectively, the synthesis bank, analysis bank, and

reconstruction error at the k-th iteration for k ≥ 0. Then, the iterative filter bank optimization

algorithm is as follows.

Initialization:

1. Select a set of values for the desired filter bank parameters L, M , P , Nf , Q, and Nh.

2. Select a desired bit allocation {bk} along the subbands and choose appropriate quantizer

performance factors {ck} for the subband pdfs.

3. Choose an initial synthesis bank F0(z).

4. Compute the corresponding optimal analysis bank H0(z) and reconstruction error ξ0

using (5.29).

Iteration: For k ≥ 1, do the following.

1. With a fixed analysis bank Hk−1(z), compute the optimal synthesis bank Fk(z) using

(5.16).

2. With a fixed synthesis bank Fk(z), compute the optimal analysis bank Hk(z) and cor-

responding reconstruction error ξk using (5.29).

3. Increment k by 1 and return to Step 1.

Since this algorithm is greedy, the error ξk is guaranteed to be a monotonic nonincreasing

function of the iteration index k. As the error is always lower bounded by zero (i.e., ξk ≥ 0), ξk

is also guaranteed to have a limit as k → ∞ [67]. Simulation results provided here verify this

monotonic and limiting behavior. In particular, it will be seen that the error appears to quickly

converge to its limit after only a few iterations.

116

0 0.1 0.2 0.3 0.4 0.50

10

20

30

40

f = ω/2π

Mag

nitu

de

Figure 5.4: Input psd Sxx(ejω) to the system of Fig. 5.1.


5.4.1 Overdecimated Filter Bank Design

The proposed iterative algorithm of this chapter can be used for the design of overdecimated

compaction-like filter banks in which the only constraint made on the analysis and synthesis filters

is that they be FIR. This is done by setting L < M and the quantizer performance factors ck = 0.

In this case, the filter bank only suffers from aliasing, amplitude, and phase distortions due to the

fact that the filter bank is overdecimated (see Chapter 2). Even though we are not enforcing a

PU condition here, it will be seen that in some instances, the filters designed share something in

common with those of the infinite-order PCFB. In particular, it will be seen that in some cases,

the filters designed try to compact the energy of the input signal, a property shared by the PCFB

filters.

5.4.1.1 Single Subband Design Example

Suppose that the input x(n) to Fig. 5.1 is a real WSS AR(4) process whose psd Sxx(ejω) is shown

in Fig. 5.4. Furthermore, suppose that we chose the following filter bank parameters here.

• Number of channels – L = 1, Decimation ratio – M = 3.

• Synthesis advance parameter – P = 0, Synthesis polyphase matrix length – Nf = 7.

• Analysis advance parameter – Q = Nh − 1, Analysis polyphase matrix length – Nh = 7.

117

0 10 20 30 401

2

3

4

5

6

7

8

k

ξ k

Proposed algorithmPCFB

Figure 5.5: Mean-squared reconstruction error ξk as a function of the iteration index k. (Singlesubband example)

In other words, for this example, we have opted to design one subband of a three channel system

in which the synthesis filter F0(z) is a causal FIR filter of length MNf = 21 and the analysis filter

is an anticausal FIR filter of length MNh = 21. Here, we chose the synthesis filter to be causal

and the analysis filter to be anticausal and of the same length as the synthesis filter so as not to

introduce a temporal bias into the filter bank. For the initialization, the initial synthesis bank

F0(z) was chosen to be a random causal FIR PU system of degree (Nf − 1) using the complete

parameterization of such systems given in Sec. 3.2. All integrals required in the proposed algorithm

were computed numerically using 256 uniformly spaced frequency samples.

A plot of the reconstruction error ξk as a function of the iteration index k is shown in Fig.

5.5 for a total of 50 iterations. In addition to the FIR filter bank error, we have shown the error

obtained using the infinite-order PCFB compaction filters for comparison. As can be seen, the

error ξk indeed is monotonic nonincreasing and appears to be approaching a limit. After the last

iteration, the error was 2.3026, which is close to the corresponding PCFB error of 2.0583. As we

increase the order of the filters Nf and Nh, the observed error comes closer to the PCFB error

and may in fact surpass this error since outside of the inherent FIR constraint, we are imposing

no other constraints such as orthonormality or biorthogonality. When we ran the same simulations

but increased Nf and Nh both to 10, the observed reconstruction error after 50 iterations was found

to be 2.2163.

It would be interesting to further increase Nf and Nh to see their asymptotic behavior. However,

the proposed algorithm becomes excessively computationally intensive in this case and more prone

118

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

6

7

f = ω/2π

Mag

nitu

de

|H0(ejω)|2

|F0(ejω)|2

1st PCFB

(a)

0 0.1 0.2 0.3 0.4 0.50

2

4

6

8

f = ω/2π

Mag

nitu

de

|H0(ejω)|2

|F0(ejω)|2

1st PCFB

(b)

Figure 5.6: Magnitude squared responses of the analysis and synthesis filters for (a) Nf = Nh = 7and (b) Nf = Nh = 10. (Single subband example)

to numerical inaccuracies. This is because the matrices required to be inverted, namely, A in

(5.14) and C in (5.28), grow linearly in size with Nf and Nh, respectively, and hence become more

complex to invert and more susceptible to numerical errors as a result.

A plot of the magnitude squared responses of the analysis and synthesis filters designed is shown

in Fig. 5.6 for (a) Nf = Nh = 7 and (b) Nf = Nh = 10. In addition, we have also plotted the

magnitude squared response of the first PCFB analysis/synthesis filter, which is an ideal compaction

filter. As can be seen, the filters designed appear to have most of their energy contained in the

same frequency support region as that of the ideal compaction filter. Though no PU constraint was

enforced here, the filters designed appear to be compacting the energy of the input signal, much

like the ideal compaction filter of the PCFB.

In order to gauge the behavior of the solutions obtained with the proposed algorithm, we opted

to calculate the deviation of the observed solutions from orthonormality and biorthogonality. To

measure the deviation from orthonormality, we considered the metric

δ⊥,k � 12π

∫ 2π

0

∣∣∣∣∣∣IL − F†k(e

jω)Fk(ejω)∣∣∣∣∣∣2

2dω

whereas to measure the deviation from biorthogonality, we used

δBIO,k � 12π

∫ 2π

0

∣∣∣∣IL − Hk(ejω)Fk(ejω)∣∣∣∣2

2dω

119

0 10 20 30 400

10

20

30

40

k

δ ⊥,k

(a)

0 10 20 30 400

0.002

0.004

0.006

0.008

0.01

0.012

k

δ BIO

,k

(b)

Figure 5.7: Deviation from (a) orthonormality δ⊥,k and (b) biorthogonality δBIO,k as a function ofthe iteration index k. (Single subband example)

In Fig. 5.7(a) and (b), we have plotted, respectively, δ⊥,k and δBIO,k as functions of k for the case

Nf = Nh = 7. From Fig. 5.7(a), it can be seen that the solution obtained deviates monotonically

from orthonormality, whereas from Fig. 5.7(b), the solution fluctuates but appears approximately

biorthogonal. Similar phenomena occurred for Nf = Nh = 10 but the results are omitted here for

sake of brevity.

5.4.1.2 Multiple Subband Design Example

For this section, we consider the same design example considered in Sec. 5.4.1.1, except that this

time, we will take L = 2 here. In other words, we consider here the design of two subbands of a

three channel system. As before, we ignore the effects due to quantization and so the only sources of

error are those due to aliasing, amplitude, and phase distortions as the filter bank is overdecimated.

A plot of the observed reconstruction error ξk as a function of the iteration index k is shown in

Fig. 5.8 for 50 iterations. As before, we have included the corresponding error obtained with the

PCFB. After the last iteration, the error was 0.2455, which is close to the corresponding PCFB

error of 0.2294. Note that, as before, if we increase the filter order parameters Nf and Nh, the

error comes closer to the PCFB error and may surpass it. Here, when we chose Nf = Nh = 10,

the observed error after 50 iterations was 0.2377, which indeed is closer to the PCFB error than

0.2455, which was obtained for Nf = Nh = 7.

120

0 10 20 30 400.2

0.3

0.4

0.5

k

ξ k

Proposed algorithmPCFB

Figure 5.8: Mean-squared reconstruction error ξk as a function of the iteration index k. (Multiplesubband example)

0 10 20 30 400

0.5

1

1.5

2

k

δ ⊥,k

(a)

0 10 20 30 400

0.005

0.01

0.015

0.02

0.025

k

δ BIO

,k

(b)

Figure 5.9: Deviation from (a) orthonormality δ⊥,k and (b) biorthogonality δBIO,k as a function ofthe iteration index k. (Multiple subband example)

121

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

6

f = ω/2π

Mag

nitu

de

|H0(ejω)|2

|F0(ejω)|2

1st PCFB

(a)

0 0.1 0.2 0.3 0.4 0.50

1

2

3

4

5

6

7

f = ω/2π

Mag

nitu

de

|H1(ejω)|2

|F1(ejω)|2

2nd PCFB

(b)

Figure 5.10: Magnitude squared responses of the analysis filters Hk(z) and synthesis filters Fk(z)for (a) k = 0, and (b) k = 1. (Multiple subband example)

In Fig. 5.9(a) and (b), we have plotted the deviation in orthonormality and biorthogonality,

respectively. As was the case in Sec. 5.4.1.1, the solution obtained monotonically deviates from

orthonormality but appears approximately biorthogonal after some fluctuation.

In Fig. 5.10, we have plotted the magnitude squared responses of the analysis and synthesis

filters. For comparison, we have included the corresponding responses of the first two analy-

sis/synthesis filters of the infinite-order PCFB. Note that unlike in Sec. 5.4.1.1, the filters designed

do not resemble the PCFB ideal compaction filters.

The designed FIR filters do not individually compact the energy of the input signal but rather

collaborate to minimize the mean-squared error of the output. It turns out that there is always a

nonunique way for the filters to collaborate. To see this, note that in the absence of quantizers, the

overdecimated filter bank can be redrawn as in Fig. 5.11, where T is any L × L invertible matrix.

Clearly, if H(z) and F(z) are optimal FIR systems for minimizing the mean-squared reconstruction

error ξ, so too are H(z) = T−1H(z) and F(z) = F(z)T. As T need not be diagonal, given any

optimal set of analysis/synthesis filters, we can obtain a new set of optimal filters which collaborate

with one another to minimize ξ. In the special case where L = 1, then T becomes simply a scale

factor, and so in this case, no such collaboration is possible. Hence, the analysis and synthesis

filters must individually compact the energy of the input so as to minimize the reconstruction error

ξ, which was observed in Sec. 5.4.1.1.

122

T

L × L

�

�

�

�

�

�

↑ M

↑ M

↑ M

�

�

�

�

z−1

z−1

z−1

x(n)

x(n)

�

F(z)

M × L

T−1

L × L

�

�

�

�

�

�

↓ M

↓ M

↓ M

�

�

�

�

�

�

� �

�

�

x(n)

z

z

z

x(n)

�

H(z)

L × M

Figure 5.11: Equivalent overdecimated filter bank model in the absence of quantizers.

5.4.2 Quantized Filter Bank Design

In this section, we focus on the design of quantized filter banks. For all of the examples we will

consider here, the filter bank will be assumed to be maximally decimated, i.e., L = M here. To

gauge the performance of the filter banks designed, we will consider several relevant measures.

The first measure of optimality we consider here is the coding gain [25, 67] of the filter banks

designed. This quantity measures the improvement in distortion by using a filter bank quantization

system as opposed to direct quantization at the same average bit rate. The coding gain Gcode is

defined as the ratio between the distortion incurred by direct quantization and that incurred by

using the filter bank system. In other words, we have

Gcode � Ddir

DFB(5.30)

where Ddir is the distortion incurred from direct quantization given by (5.1) to be

Ddir = c2−2bσ2x

where c, b, and σ2x are the quantizer performance factor, average bit rate, and input variance,

respectively, and DFB is the average distortion incurred by the filter bank system given by

DFB =1M

ξ (5.31)

where ξ is the filter bank blocked reconstruction error given in (5.3). Here, in order for Gcode to

be a meaningful measure, we require that the bit rate b of the directly quantized signal be equal to

123

the average bit rate of the filter bank quantization system, i.e., we have

b =1M

M−1∑k=0

bk (5.32)

Hence, the coding gain is a measure of the improvement in terms of distortion offered by using

a sophisticated filter bank quantization scheme as opposed to an unsophisticated single quantizer

system at the same bit rate.

In certain special cases, the coding gain can be simplified greatly. For example, if the quantized

filter bank system of Fig. 5.1 is a maximally decimated orthonormal or PU filter bank, the coding

gain becomes [25, 72]

Gcode =c2−2bσ2

x

1M

M−1∑k=0

σ2qk

=c2−2bσ2

x

1M

M−1∑k=0

ck2−2bkσ2wk

Assuming that all of the subband quantizer performance factors ck are all equal to c, which occurs

if the input pdfs are all the same, then we have

Gcode =2−2bσ2

x

1M

M−1∑k=0

2−2bkσ2wk

It turns out [25, 67] that for any fixed PU filter bank, there is an optimal way to allocate the

subband bit rates {bk} subject to the average bit rate constraint of (5.32) to maximize Gcode. This

occurs as a result of using the arithmetic mean/geometric mean (AM/GM) inequality [22, 67].

With optimal bit allocation, the coding gain becomes

Gcode =σ2

x(M−1∏k=0

σ2wk

) 1M

=

1M

M−1∑k=0

σ2wk(

M−1∏k=0

σ2wk

) 1M

which is itself the AM/GM ratio of the subband variances. By the AM/GM inequality, we always

have Gcode ≥ 1 and so for any PU filter bank, there is always an improvement in terms of distortion

if the bits have been optimally allocated. With optimal bit allocation, among all PU filter banks

of a certain class, the PCFB, if it exists, exhibits the largest coding gain [1, 68].

Using the proposed iterative greedy algorithm of this chapter, it can be seen from (5.30) and

(5.31) that we obtain a filter bank system whose coding gain is monotonic nondecreasing with

iteration for a fixed bit allocation {bk}. This is more practical than optimizing the bit allocation

for a fixed filter bank, which will often times yield unrealizable noninteger bit allocations [25, 67].

124

� H(z) � Q � F (z) �x(n)

Prefilter Postfilter

x(n)

Figure 5.12: Pre/postfiltering quantization system.

Another measure we will focus on here is the distortion itself. It turns out that for a given

average bit rate b, there is a minimum distortion Dmin which can be achieved using any compression

scheme. This bound, known as the rate-distortion bound [10, 7] represents an information theoretic

limit as to the performance that can ever be achieved. For a Gaussian CWSS(M) input x(n) and

for a sufficiently large bit rate b, the rate-distortion bound becomes [25, 67]

Dmin(b) = γ2x2−2bσ2

x (5.33)

where γ2x is the spectral flatness measure [25, 31] of the process x(n) given by

γ2x �

exp{

12π

∫ 2π

0log

[(det

(Sxx(ejω)

)) 1M

]dω

}12π

∫ 2π

0

1M

Tr[Sxx(ejω)

]dω

=exp

{1

2πM

∫ 2π

0log

[det

(Sxx(ejω)

)]dω

}σ2

x

where Sxx(z) denotes the psd of the M -fold blocked version of x(n), namely, x(n). Using (5.33) in

(5.30), it follows that the coding gain Gcode can be upper bounded as follows.

Gcode ≤ Gcode,max =c

γ2x

(5.34)

where c is the quantizer performance factor of the direct quantization scheme. Through the sim-

ulation results we present here, it will be shown that the filter banks designed come closer to the

rate-distortion bound and coding gain bound as the filter orders are increased, in line with intuition.

5.4.2.1 Single Channel Example - Pre/Postfiltering in Quantization

The proposed iterative algorithm can be used for the design of an optimal pre/postfiltering quan-

tization scheme such as the one shown in Fig. 5.12. Note that this system is simply a special case

of the quantized filter bank system of Fig. 5.1 in which we have L = M = 1. The system shown in

Fig. 5.12 is often used for lossy data compression applications and the popular quantization scheme

differential pulse code modulation (DPCM) is itself a special case of this system.

125

0 20 40 60 800

0.5

1

1.5

2

2.5

3

k

ξ k

(a)

0 20 40 60 800

0.5

1

1.5

2

k

ξ k

(b)

Figure 5.13: Mean-squared reconstruction error ξk as a function of the iteration index k for (a)N = 10 and (b) N = 20.

To test the proposed algorithm, we chose the following parameters here.

• Quantizer performance factor – c0 = c = 2.71 (Gaussian pdf)

• Bit rate b0 = b was varied from 2 to 6 bits.

• Synthesis filter F0(z) = F (z) was causal FIR and of length N . Analysis filter H0(z) = H(z)

was anticausal FIR and of length N also. Here N was varied from 10 to 20.

For each run of the algorithm, a total of 100 iterations was used. Here, the initial synthesis bank

F0(z) was chosen completely randomly.

In Fig. 5.13, we have plotted the observed reconstruction error ξk for a bit rate of b0 = b = 2 as a

function of the iteration index k for (a) N = 10 and (b) N = 20. As can be seen, the error decreases

monotonically and appears to saturate as before. For N = 10, the error at the last iteration was

0.2884, whereas for N = 20, the error was 0.2627. As expected, when N increased here, the steady

state error decreased.

In Fig. 5.14 and 5.15 we have respectively plotted the deviation in orthonormality and biorthog-

onality for (a) N = 10 and (b) N = 20, respectively. As can be seen, in all of these examples,

the solutions appear to become more orthonormal and biorthogonal as the iterations progress. For

N = 10, we have δ⊥,k = 0.3910 and δBIO,k = 0.0972 for the last iteration, whereas for N = 20, we

126

0 20 40 60 800

1

2

3

4

5

k

δ ⊥,k

(a)

0 20 40 60 800

10

20

30

40

k

δ ⊥,k

(b)

Figure 5.14: Deviation from orthonormality δ⊥,k as a function of the iteration index k for (a)N = 10 and (b) N = 20.

0 20 40 60 800

0.2

0.4

0.6

0.8

1

k

δ BIO

,k

(a)

0 20 40 60 800

0.1

0.2

0.3

0.4

0.5

k

δ BIO

,k

(b)

Figure 5.15: Deviation from biorthogonality δBIO,k as a function of the iteration index k for (a)N = 10 and (b) N = 20.

127

0 0.1 0.2 0.3 0.4 0.50

0.5

1

1.5

2

2.5

3

f = ω/2π

Mag

nitu

de

|H(ejω)|2

|F(ejω)|2

(a)

0 0.1 0.2 0.3 0.4 0.50

0.5

1

1.5

2

2.5

3

3.5

f = ω/2π

Mag

nitu

de

|H(ejω)|2

|F(ejω)|2

(b)

Figure 5.16: Magnitude squared responses of the analysis filter H0(z) = H(z) and the synthesisfilter F0(z) = F (z) for (a) N = 10 and (b) N = 20.

have δ⊥,k = 10.9306 and δBIO,k = 0.0821 for the same iteration. Hence, both solutions appear to be

approximately biorthogonal but not quite orthonormal. This is in line with intuition, since if the

solution were also orthonormal, then the system of Fig. 5.12 would be a simple direct quantization

system, which we do not expect to be optimal here.

In Fig. 5.16, we have plotted the magnitude squared responses of the designed pre/postfilters

for (a) N = 10 and (b) N = 20. As can be seen, in both cases the filter responses appear to be very

similar to each other in terms of shape and support. This suggests that the algorithm is possibly

striving to approximate a globally optimal set of filter responses.

In Fig. 5.17, we have plotted the observed distortion as a function of the average bit rate b for

N = 10 and N = 20. For sake of comparison, we have included the distortion obtained through

direct quantization, as well as the rate-distortion bound. As can be seen, the distortion always

decreased monotonically with rate and always outperformed direct quantization. Furthermore, as

the order increased, the distortion came closer to the rate-distortion bound, as expected.

Finally, in Fig. 5.18, we have plotted the observed coding gain Gcode as a function of the filter

order parameter N for an average bit rate of b = 2. As can be seen, the coding gain monotonically

increased with N , in line with intuition. Furthermore, it came close to the maximum coding

gain. Here, we had Gcode = 1.9405 for N = 20 in contrast to the maximum coding gain of

Gcode,max = 3.0068.

128

2 3 4 5 60

0.05

0.1

0.15

0.2

0.25

Average bit rate − b

Dis

tort

ion

− D

Ddir

DFB

(N = 10)D

FB (N = 20)

Dmin

Figure 5.17: Observed distortion DFB as a function of the average bit rate b plotted with the directquantization and rate-distortion bounds.

10 12 14 16 18 201.75

1.8

1.85

1.9

1.95

N

Gco

de

Figure 5.18: Observed coding gain Gcode as a function of the analysis/synthesis filter order N .

129

0 20 40 60 800

1

2

3

4

5

6

k

ξ k

(a)

0 20 40 60 800

1

2

3

4

5

k

ξ k

(b)

Figure 5.19: Mean-squared reconstruction error ξk as a function of the iteration index k for (a)N = 3 and (b) N = 6.

5.4.2.2 Maximally Decimated Quantized Filter Bank

In this section, we consider the design of a maximally decimated filter bank with quantizers in the

subbands. We chose the following parameters here.

• Number of subbands – L = 3. Decimation ratio – M = 3

• Quantizer performance factors – ck = c = 2.71 for all k (Gaussian pdf)

• Average bit rate b was varied from 2 to 6 bits. For simplicity, the bits in the subbands were

allocated as follows.

b0 = b + 1 , b1 = b , b2 = b − 1

• Synthesis polyphase matrix F(z) was causal FIR and of length N . Analysis polyphase matrix

H(z) was anticausal FIR and of length N also. Here, N was varied from 1 to 6.

As before, the input x(n) considered was the WSS process whose psd Sxx(ejω) is as shown in Fig.

5.4. For each run of the algorithm, a total of 100 iterations was used. Here, the initial synthesis

bank F0(z) was chosen randomly as was done in the previous section.

In Fig. 5.19, we have plotted the observed reconstruction error ξk as a function of the iteration

index k for (a) N = 3 and (b) N = 6. As can be seen, the error decreases monotonically and

130

0 20 40 60 800

10

20

30

40

50

k

δ ⊥,k

(a)

0 20 40 60 800

20

40

60

80

100

k

δ ⊥,k

(b)

Figure 5.20: Deviation from orthonormality δ⊥,k as a function of the iteration index k for (a) N = 3and (b) N = 6.

0 20 40 60 800

0.5

1

1.5

2

k

δ BIO

,k

(a)

0 20 40 60 800

0.5

1

1.5

2

k

δ BIO

,k

(b)

Figure 5.21: Deviation from biorthogonality δBIO,k as a function of the iteration index k for (a)N = 3 and (b) N = 6.

131

0 0.1 0.2 0.3 0.4 0.50

0.2

0.4

0.6

0.8

1

1.2

1.4

f = ω/2π

Mag

nitu

de

|H0(ejω)|2

|F0(ejω)|2

(a)

0 0.1 0.2 0.3 0.4 0.50

0.5

1

1.5

2

2.5

3

3.5

f = ω/2π

Mag

nitu

de

|H1(ejω)|2

|F1(ejω)|2

(b)

0 0.1 0.2 0.3 0.4 0.50

0.5

1

1.5

2

f = ω/2π

Mag

nitu

de

|H2(ejω)|2

|F2(ejω)|2

(c)

Figure 5.22: Magnitude squared responses of the analysis and synthesis filters for N = 3. (a) H0(z),F0(z), (b) H1(z), F1(z), (c) H2(z), F2(z).

0 0.1 0.2 0.3 0.4 0.50

0.5

1

1.5

2

2.5

3

f = ω/2π

Mag

nitu

de

|H0(ejω)|2

|F0(ejω)|2

(a)

0 0.1 0.2 0.3 0.4 0.50

0.5

1

1.5

2

f = ω/2π

Mag

nitu

de

|H1(ejω)|2

|F1(ejω)|2

(b)

0 0.1 0.2 0.3 0.4 0.50

0.2

0.4

0.6

0.8

1

1.2

1.4

f = ω/2π

Mag

nitu

de

|H2(ejω)|2

|F2(ejω)|2

(c)

Figure 5.23: Magnitude squared responses of the analysis and synthesis filters for N = 6. (a) H0(z),F0(z), (b) H1(z), F1(z), (c) H2(z), F2(z).

appears to saturate as before. For N = 3, the error at the last iteration was 0.2339, whereas for

N = 6, the error was 0.1919. As expected, when N increased here, the steady state error decreased.

In Fig. 5.20 and 5.21 we have respectively plotted the deviation in orthonormality and biorthog-

onality for (a) N = 3 and (b) N = 6, respectively. As can be seen, in all of these examples, the

solutions appear to become more orthonormal and biorthogonal as the iterations progress. For

N = 3, we have δ⊥,k = 16.0173 and δBIO,k = 0.2127 for the last iteration, whereas for N = 6, we

have δ⊥,k = 28.2432 and δBIO,k = 0.2347 for the same iteration. Hence, both solutions appear to

be neither orthonormal nor biorthogonal.

In Fig. 5.22 and 5.23, we have plotted the magnitude squared responses of the designed analysis

and synthesis filters for N = 3 and N = 6, respectively. As opposed to the pre/postfiltering quanti-

132

2 3 4 5 60

0.05

0.1

0.15

0.2

0.25


Dis

tort

ion

− D

Ddir

DFB

(N = 3)D

FB (N = 6)

Dmin

(a)

2 3 4 5 60

0.05

0.1

0.15

0.2

0.25


Dis

tort

ion

− D

Pre/post (N = 10)Pre/post (N = 20)FB (N = 3)FB (N = 6)

(b)

Figure 5.24: Observed distortion DFB as a function of the average bit rate b plotted with (a) thedirect quantization and rate-distortion bounds, (b) the pre/postfiltering quantization results.

zation system in which the different order solutions looked similar to each other, the corresponding

responses here look very different.

In Fig. 5.24, we have plotted the observed distortion as a function of the average bit rate b

for N = 1 and N = 6. For sake of comparison, in Fig. 5.24(a) we have included the distortion

obtained through direct quantization and the rate-distortion bound, whereas in Fig. 5.24(b) we

have included the results obtained using the pre/postfiltering quantization system analyzed in Sec.

5.4.2.1. As can be seen in Fig. 5.24(a), the distortion always decreased monotonically with rate

and always outperformed direct quantization. Furthermore, as the order increased, the distortion

came closer to the rate-distortion bound, in line with intuition. More importantly, in Fig. 5.24(b),

it can be seen that each filter bank system outperformed both of the pre/postfiltering systems.

This justifies the use of a sophisticated filter bank system for quantization as opposed to a less

sophisticated pre/postfiltering quantization system.

Finally, in Fig. 5.25, we have plotted the observed coding gain as a function of the polyphase

order N for an average bit rate of b = 2. As can be seen, the coding gain monotonically increased

with N , in line with intuition. Furthermore, it came very close to the maximum coding gain in this

case. Here, we had Gcode = 2.6576 for N = 6 in contrast to the maximum gain of Gcode,max = 3.0068.

It should be noted that with the filter bank system here, the coding gain came much closer to the

maximum possible gain than the pre/postfiltering quantization system considered in Sec. 5.4.2.1.

133

1 2 3 4 5 61

1.5

2

2.5

3

N

Gco

de

Figure 5.25: Observed coding gain Gcode as a function of the analysis/synthesis polyphase filterorder N .


In this chapter, we presented a method for the design of purely FIR filter banks for optimal

reconstruction with the presence of scalar quantizers in the subbands. Under the high bit rate

assumption for the quantizers [25], we showed that the mean-squared error objective was quadratic

in terms of either the analysis or synthesis filter coefficients and hence could be minimized using

the trick of completing the square [22]. By alternately optimizing the analysis and synthesis banks,

we proposed an iterative greedy algorithm for the design of such FIR filter banks.

Simulation results presented here brought to light the merit of the proposed iterative algorithm.

We showed how the algorithm could be used to obtain minimum mean-squared error overdecimated

filter banks. In addition, we showed that sometimes the filter banks designed exhibited behavior

similar to the ideal compaction filters of the PCFB.

Afterwards, we focused on the effects of quantization and used the algorithm to design pre/post-

filtering quantization schemes as well as general filter bank quantization systems. It was shown

that in terms of coding gain and distortion, the systems designed came close to the theoretical

bounds established by information theory. In line with intuition, we showed that as the filter order

increased, the quantization systems monotonically came closer to these bounds. Furthermore,

we showed the advantages of using a sophisticated filter bank quantization system over a simple

pre/postfiltering quantization scheme. In particular, it was observed that the filter bank system

could come much closer to the optimal information theoretic bounds than the pre/postfiltering

system. These phenomena have not previously been formally presented in the literature.

134

Chapter 6

Eigenfilter Channel ShorteningEqualizer for DMT Systems

In this chapter, we show how the eigenfilter technique [74, 56], which was widely used in the

optimization algorithms presented for designing signal-adapted filter banks, can be applied to the

design of good channel shortening equalizers for discrete multitone (DMT) systems. This chapter

differs in theme from the others in that the optimization algorithm is not iterative as the equalizer

is designed in one step. The design objective we focus on here takes into account both the effects

due to the channel length as well as the noise. We show how our method can be used for the design

of fractionally spaced equalizers (FSEs). In contrast to other methods which require a Cholesky

decomposition of a certain matrix for every delay parameter chosen, the method presented here

only requires one such decomposition, as we show here. As such, this method is lower in complexity

than those mentioned above. Despite this decrease in complexity, we show that the method yields

nearly optimal performance in terms of bit rate or throughput.

The content of this chapter is mainly drawn from [55] and portions of it have been presented

at [53, 51, 52].

6.1 Outline

In Sec. 6.2, we briefly review the channel shortening equalizer design problem and how it relates

to DMT systems. A survey of some of the previous work related to the design of such equalizers is

given in Sec. 6.2.1.

135

� C(z) � � H(z) �

x(n)

Channel Equalizer

y(n)

η(n)

Noise

Figure 6.1: Channel shortening equalizer model.

In Sec. 6.3, we introduce the design objective that we will focus on here. The objective consid-

ered jointly accounts for the channel shortening objective, as well as the effects due to noise. In

Sec. 6.3.1, we analyze the objective function and show how it can be solved using the eigenfilter

approach. There, the low complexity advantage of the proposed method is revealed.

In Sec. 6.4, we present simulation results for the proposed channel shortening equalizer design

method, along with those of several other methods. There, it is shown that the method outperforms

most methods and performs nearly optimally in terms of observed bit rate.

Concluding remarks are made in Sec. 6.5. In the Appendix, we show the equivalence between

FSEs and the SIMO-MISO channel/equalizer model we focus on in Sec. 6.3.

6.2 The Channel Shortening Equalizer Problem

Suppose that we have a causal FIR system or channel C(z) of length Lc that we would like to

shorten to a length Ld < Lc. A typical model for the system used to shorten the channel is shown

in Fig. 6.1. The goal of the equalizer H(z) is to shorten the effective channel Ceff(z) � H(z)C(z) to

a length of Ld. Though this can be done exactly by choosing H(z) = A(z)C(z) , where A(z) is some FIR

system of length Ld, there are often problems with this approach. As C(z) may have zeros on or

outside the unit circle, the choice of H(z) = A(z)C(z) , which we assume must be causal here, can never

be stable. This in turn will result in the undesired phenomenon of spurious noise amplification.

Instead, what is typically done is to choose H(z) to be a causal FIR filter used to concentrate

the energy of the effective channel Ceff(z) into a window of length Ld. If H(z) is of length Le,

then Ceff(z) = H(z)C(z) is a causal FIR system of length Leff � (Lc + Le − 1). In this case, we

can decompose the effective channel impulse response ceff(n) as a sum of two responses, namely, a

desired response cdes(n) which is exactly of length Ld, and a residual response cres(n) which consists

of whatever remains of ceff(n) after removing cdes(n). This is illustrated in Fig. 6.2. Here, ∆ is

136

��

ceff(n)

0 ∆ ∆ + Ld − 1� �Ld

Lc + Le − 2n

cdes(n) − •cres(n) − �

Figure 6.2: Decomposition of the effective channel ceff(n) into a desired channel cdes(n) and residualchannel cres(n).

a delay parameter which satisfies 0 ≤ ∆ ≤ Leff − Ld. As the effects of the equalizer H(z) are

most evident in the time domain, it is often referred to as a time-domain equalizer or TEQ [46, 6].

Through proper choice of H(z) here, we can often times not only adequately shorten the channel,

but also combat the effects due to the noise as well.

The channel shortening problem arises in DMT systems such as digital subscriber loop (DSL)

[46]. A typical DMT system is shown in Fig. 1.14. Perhaps the most important feature of the DMT

system is the inclusion of redundancy in the form of a cyclic prefix. Essentially, this redundancy

allows an FIR channel of length at most equal to one sample more than the cyclic prefix length to be

equalized using only FIR components (as discussed in Sec. 1.4). In other words, if LCP denotes the

cyclic prefix length, then the DMT system can equalize a channel of length at most Ld = LCP + 1.

The problem with this is that the channel typically encountered in a DMT system is very long.

For example, in practical DMT systems such as asymmetric DSL (ADSL), the channel is typically

a telephone wire line [46] whose impulse response may be several hundreds of samples long. Instead

of increasing the cyclic prefix length to accomodate the long channel, which would result in more

redundancy and hence a lower overall rate, a TEQ is used to shorten the channel to the desired

length Ld = LCP + 1.

Often times, we may know something about the statistics of the noise encountered in a typical

DMT system. For example, in ADSL, the noise can sometimes be modeled as a combination of

near-end crosstalk (NEXT) and far-end crosstalk (FEXT) [46]. In these cases, we can design the

TEQ to jointly shorten the channel and account for the effects due to noise.

137

For a DMT system, the primary measure of optimality of a TEQ is the overall bit rate or

throughput of the system. This measure accounts for both the effects due to the channel length as

well as the noise. If fs denotes the analog sampling frequency and NDFT denotes the size of the

DFT matrix used in the DMT system (see Fig. 1.14), then the observed bit rate is given by [6]

Rb =fs

NDFT + LCP

∑k∈S

bk (6.1)

where bk denotes the number of bits to allocate for the k-th real subcarrier and S is a subset of the

set of real subcarriers{

0, 1, . . . , NDFT2 − 1

}that are used. Here, S is chosen according to the famous

“water filling” principle [10, 7], as the subcarriers are modeled as parallel independent Gaussian

channels [46]. As such, the number of bits to allocate for the k-th subcarrier is [6]

bk = log2

(1 +

SNRk

Γ

)where SNRk is the signal-to-noise (SNR) ratio for the k-th subcarrier which takes into account the

effects due to both the channel length and the noise given by

SNRk =Sxx(ejωk)

∣∣Cdes(ejωk)∣∣2

Sxx(ejωk) |Cres(ejωk)|2 + Sηη(ejωk) |H(ejωk)|2 , where ωk =2πk

NDFT

and Γ is an SNR gap [46] for achieving the Shannon capacity limit [10, 7] that is assumed to be the

same for all subcarriers. Also, Sxx(ejω) and Sηη(ejω) denote, respectively, the psds of the input to

the channel and noise (see Fig. 6.1), which we have assumed are WSS here. The quantity Γ depends

on the modulation format used as well as the desired probability of error [46, 6]. For uncoded

quadrature amplitude modulation (QAM) [38] with a probability of error of 10−7, Γ = 9.8 dB [46].

The maximum achievable bit rate is given by the matched-filter bound (MFB) [6], which is

RMFB =fs

NDFT + LCP

∑k∈S

bk (6.2)

where bk is the number of bits allocated to the k-th subcarrier given by

bk = log2

(1 +

SNRMFB,k

Γ

)and SNRMFB,k is the MFB SNR given by

SNRMFB,k =Sxx(ejωk)

∣∣C(ejωk)∣∣2

Sηη(ejωk), ωk =

2πk

NDFT

Note that the MFB SNR is only a function of the statistics of the input and noise, as well as the

channel response. These are the only invariants of the communications system and as such, dictate

the maximum possible performance alone.

Prior to presenting our method for TEQ design, we briefly review previous work on the matter.

138

6.2.1 Overview of Previous Work on Channel Shortening Equalizers

1. Minimum Mean-Squared Error (MMSE) Methods: Among the first approaches for the design

of channel shortening equalizers were the minimum mean-squared error (MMSE) methods

proposed by Al-Dhahir and Cioffi [3, 4]. With these methods, the TEQ h(n) is chosen to be

an FIR Wiener filter [48] for the response b(n) ∗ x(n), where b(n) is a target impulse response

(TIR), which is some FIR system of length Ld. In other words, referring to Fig. 6.1, the

equalizer h(n) is chosen to minimize

ξ � E[|y(n) − b(n) ∗ x(n)|2

]For a fixed TIR b(n), the TEQ coefficients of h(n) can be found using the orthogonality

principle [48].

Early methods for the design of the TIR b(n) consisted of choosing it such that b(n) satisfied

a unit energy or unit tap constraint as shown below [3].∑n

|b(n)|2 = 1 (Unit Energy Constraint (UEC))

b(n0) = 1 for some n0 (Unit Tap Constraint (UTC))

Though these choices of b(n) are easy to obtain and compute, they often perform poorly with

respect to overall observed bit rate. Furthermore, once b(n) has been appropriately computed,

the equalizer coefficients of h(n) must still be found using the orthogonality principle.

Another method for the design of the TIR b(n) perhaps better suited for DMT systems is the

geometric SNR (GSNR) method of [4]. In it, the authors obtained an approximate expression

for the bit rate and found that increasing the product of∣∣B(ejω)

∣∣2 at the DFT frequencies

ωk = 2πkNDFT

would increase the bit rate. Formally, the GSNR method consists of maximizing

this product, which is equivalent to maximizing the geometric mean of∣∣B(ejω)

∣∣2 at the DFT

frequencies, subject to a mean-squared error constraint ξ ≤ ξmax for some error threshold

value ξmax.

Unfortunately, the GSNR method suffers from several shortcomings. First of all, in their

expression for the bit rate, the authors of [4] did not take into account the effects due to

intersymbol interference (ISI) (i.e., the effects due to the channel length). Furthermore, they

treated the target response B(ejω) and equalizer H(ejω) as independent quantities, when in

fact h(n) is related to b(n) via the orthogonality principle. Finally, finding the TIR b(n)

139

involves solving a constrained nonlinear optimization problem, and as such, is very high in

computational complexity. Though the GSNR method often outperforms the UEC and UTC

methods of [3], other methods developed later not only surpass it, but also do so with less

computational complexity. Despite this, the GSNR method represented the first attempt to

design a TEQ by maximizing the bit rate of a DMT system.

2. Minimum Shortening SNR (MSSNR) Method: In [33], Melsa, Younce, and Rohrs developed

a method for designing a channel shortening equalizer that delt directly with the TEQ coef-

ficients of h(n). The objective was to maximize the shortening SNR (SSNR) given by

SSNR � Edes

Eres

where Edes and Eres denote, respectively, the energy of the effective desired response cdes(n)

and residual response cres(n) (see Fig. 6.2), given by

Edes =∑

n

|cdes(n)|2 , Eres =∑

n

|cres(n)|2

To maximize the SSNR, the authors of [33] solved the constrained optimization problem

Minimize Eres, subject to Edes = 1.

They showed that this optimization problem could be solved using the eigenfilter technique

[74, 56], after computing an appropriate Cholesky decomposition [22] of a particular matrix.

Though this method performs relatively well with relatively low complexity, on account of

the fact that the design problem is an eigenfilter-type problem, it suffers from several short-

comings. First, the method does not account for any effects due to noise present in the

system. Furthermore, the Cholesky factor obtained from the above-mentioned Cholesky de-

composition depends on the delay parameter ∆. If we wish to vary ∆ over a certain region to

see which value performs the best, which is typically done in practice, then a new Cholesky

factor must be computed for each ∆. This results in an extra computational load for each

∆ considered. Despite these shortcomings, the MSSNR method of [33] represented the first

attempt to pose the channel shortening problem as an eigenfilter-type problem.

3. Maximum Bit Rate (MBR) and Minimum ISI (Min-ISI) Methods: Arslan, Evans, and Kiaei

[6] were the first to design a TEQ to directly maximize the bit rate given in (6.1). This

method, called the maximum bit rate (MBR) method, was shown to perform very well and

140

often came very close to the MFB maximum achievable bit rate given in (6.2). However, as

the bit rate given in (6.1) is a nonlinear, nonconvex function of the TEQ coefficients of h(n),

computationally intensive nonlinear optimization techniques had to be employed to maximize

the bit rate. Furthermore, these techniques were found to converge slowly to a desired local

extrema.

In order to mitigate the effects of this complexity, the authors of [6] proposed a suboptimal

method called the minimum ISI (min-ISI) method in which the objective was to minimize

a combination of the effects due to ISI as well as those due to noise. As with the MSSNR

method of [33], it was shown that the objective could be optimized using the eigenfilter

approach after an appropriate Cholesky factor was computed. This method, as it accounted

for both ISI and noise, was shown to perform nearly as well as the MBR method with much

lower computational complexity. However, as with the MSSNR method of [33], the Cholesky

factor needed in the min-ISI method of [6] depends on the delay parameter ∆.

4. Delay Spread Minimization Method: Schur and Speidel proposed a unique method for the

design of channel shortening equalizers in [44]. In it, the objective was to minimize the delay

spread of the effective channel ceff(n), defined as follows.

Jdelay �

∑n

(n − ∆)2 |ceff(n)|2∑n

|ceff(n)|2

As with the MSSNR method of [33] and the min-ISI method of [6], it was shown that this

problem could be solved using the eigenfilter approach after an appropriate Cholesky factor

was computed. However, unlike these methods, the Cholesky factor required for the delay

spread minimization method does not depend on the delay parameter ∆. Hence, the method

is very low in complexity as only one Cholesky factor must be computed for all ∆.

Despite this significant decrease in complexity, the method suffers from several shortcomings.

Though the method was shown to be less prone to synchronization errors than others, it

does not account for the desired channel length Ld = LCP + 1 or any knowledge of the noise

statistics. Here, we generalize the delay spread minimization method of [44] to account for

both of these things. Furthermore, we apply it to a more general model than that shown in

Fig. 6.1 which can be used for the design of FSEs for channel shortening.

141

� ↑ K � CK(z) � � HK(z) � ↓ K �

x(n)

Channel Equalizer

y(n)

η(n)

Noise

Figure 6.3: Discrete-time model of a K-fold oversampled FSE.

� C(z)K

K

KH(z) �x(n)

Channel Equalizer

y(n)

η(n)

Noise

Figure 6.4: SIMO-MISO channel/equalizer model.

6.3 Proposed Eigenfilter Design Method

Consider the system shown in Fig. 6.3. This system is the discrete-time model of an FSE with

oversampling ratio K [61]. Here, CK(z) and HK(z) represent, respectively, a K-fold oversampled

version of the original channel C(z) and equalizer H(z) from Fig. 6.1. Similarly, η(n) from Fig.

6.3 represents a K-fold oversampled version of the noise process η(n) in Fig. 6.1. In contrast to

the symbol spaced equalizer (SSE) shown in Fig. 6.1, the FSE of Fig. 6.3 has a redundancy of a

factor of K. As a result of this, channel equalization/shortening and noise suppression with FSEs

often become more well conditioned than with SSEs. For example, under certain mild conditions,

we can achieve exact channel equalization with FIR filters [76, 78].

If the K-fold oversampled channel CK(z) and equalizer HK(z) have the following polyphase

decompositions,

CK(z) =K−1∑k=0

z−kEk

(zK)

(Type I)

HK(z) =K−1∑k=0

zkRk

(zK)

(Type II)

(6.3)

then the FSE system of Fig. 6.3 can be redrawn as in Fig. 6.4 (see the Appendix), where we have

[C(z)]k,0 = Ek(z) , [H(z)]0,k = Rk(z) , [η(n)]k,0 = η(Kn + k) , 0 ≤ k ≤ K − 1 (6.4)

Hence, the FSE of Fig. 6.3 is equivalent to a single-input multiple-output (SIMO) channel and a

multiple-input single-output (MISO) equalizer model. Note that the noise process η(n) is simply

the blocked version of the noise process η(n) from Fig. 6.3.

142

The eigenfilter design method for channel shortening that we propose here will be for the SIMO-

MISO channel/equalizer model of Fig. 6.4. We make the following assumptions here.

• The channel C(z) is a known causal FIR filter of length Lc.

• The equalizer H(z) is a causal FIR filter of length Le.

• The input x(n) is zero mean and white with variance σ2x.

• The noise η(n) is a zero mean WSS random process with autocorrelation Rηη(k).

• The processes x(n) and η(n) are uncorrelated.

Note that the output y(n) can be expressed as y(n) = xf (n) + q(n), where xf (n) and q(n) are,

respectively, the filtered signal and noise processes given by

xf (n) = ceff(n) ∗ x(n) , q(n) = h(n) ∗ η(n)

and ceff(n) is the effective channel given by ceff(n) � h(n) ∗ c(n).

We want the equalizer to shorten the effective channel ceff(n) and minimize the noise power

σ2q with respect to the signal power σ2

xf. To that end, we propose to choose h(n) to minimize the

objective function

J � αJshort + (1 − α)Jnoise , 0 ≤ α ≤ 1 (6.5)

where Jshort and Jnoise are defined as follows.

Jshort �

∑n

f(n − ∆)|ceff(n)|2∑n

|ceff(n)|2, Jnoise �

σ2q

σ2xf

=σ2

q

σ2x

∑n

|ceff(n)|2(6.6)

Here, Jshort is a channel shortening objective, Jnoise is the noise-to-signal ratio, and α is a tradeoff

parameter between these two objectives. Also, ∆ is the delay of the shortened channel as before and

f(n) is a “penalty” function which is any nonnegative function used to penalize different values of

ceff(n). The special case where K = 1, α = 1, and f(n) = n2 corresponds to the objective analyzed

by Schur and Speidel in [44].

Though α is arbitrary, one heuristic choice which is well suited for the design of a channel

shortening equalizer for DMT systems (see Sec. 6.4) is the value

α = α0 � PISI

PISI + Pnoise(6.7)

143

Here, PISI is the intersymbol interference (ISI) power present in c(n) before equalization, which is

σ2x times the difference of the energy of c(n) and the energy of the (LCP +1) length window of c(n)

with maximum energy, where LCP is the cyclic prefix length. Also, Pnoise is the input noise power,

namely, Tr [Rηη(0)]. Finding the optimal choice of α for a given criterion is still an open problem.

However, for maximizing bit rate, it was found (see Sec. 6.4) that α0 in (6.7) yielded good results.

To further incorporate the cyclic prefix length in the design, in the simulation results presented

in Sec. 6.4, we chose the penalty function to be

f(n) = fCP(n) �

0 , 0 ≤ n ≤ LCP

1 , otherwise(6.8)

Note that fCP(n − ∆) penalizes uniformly any samples of ceff(n) outside of n ∈ [∆, ∆ + LCP].

6.3.1 Analysis of the Objective Function J

Note that from the above assumptions, the effective channel ceff(n) = h(n)∗c(n) is causal and FIR

of length Leff � Lc + Le − 1. To proceed with analyzing the objective function J from (6.5), let us

define the following matrix quantities.

h �[

h(0) h(1) · · · h(Le − 1)]

(1 × KLe)

C �

c(0) c(1) · · · c(Lc − 1) 0 · · · 0

0 c(0) c(1) · · · c(Lc − 1). . .

......

. . . . . . . . . . . . 0

0 · · · 0 c(0) c(1) · · · c(Lc − 1)

(KLe × Leff)

ceff �[

ceff(0) ceff(1) · · · ceff(Leff − 1)]

(1 × Leff)

Λ∆ � diag (f(0 − ∆), f(1 − ∆), . . . , f((Leff − 1) − ∆)) (Leff × Leff)

Rη �

Rηη(0) Rηη(1) · · · Rηη(Le − 1)

Rηη(−1) Rηη(0). . .

......

. . . . . . Rηη(1)

Rηη(−(Le − 1)) · · · Rηη(−1) Rηη(0)

(KLe × KLe)

Note that all of the degrees of freedom in the design problem reside in the choice of the row vector

h here. From the convolution equation ceff(n) = h(n) ∗ c(n), note that we have

ceff = hC (6.9)

144

Also, from (6.6), we have

Jshort =ceffΛ∆c†effceffc

†eff

, Jnoise =σ2

q

σ2xceffc

†eff

(6.10)

Substituting (6.9) into (6.10), we get

Jshort =hCΛ∆C†h†

hCC†h† , Jnoise =σ2

q

σ2xhCC†h† (6.11)

To simplify σ2q from (6.11), recall that we have

σ2q =

12π

∫ 2π

0Sqq(ejω) dω

As q(n) = h(n) ∗ η(n), we have [67]

Sqq(z) = H(z)Sηη(z)H(z) =∑m,n

h(m)[Sηη(z)zn−m

]h†(n)

and so we get

σ2q =

∑m,n

h(m)[

12π

∫ 2π

0Sηη(ejω)ejω(n−m) dω

]︸︷︷︸

Rηη (n−m)

h†(n)

In light of the FIR assumption on h(n), we can express σ2q in terms of the row vector h as follows.

σ2q = hRηh† (6.12)

Combining (6.12) with (6.11), we get

Jshort =hCΛ∆C†h†

hCC†h† , Jnoise =hRηh†

σ2xhCC†h† (6.13)

Hence, using (6.13) in (6.5), we get

J =h[αCΛ∆C† + (1 − α) 1

σ2xRη

]h†

hCC†h† (6.14)

To show that J can be optimized via the eigenfilter approach, we proceed as follows. Assuming C

has a full rank of KLe, then A � CC† is strictly positive definite [22]. As such, it has a Cholesky

decomposition [22] of the form A = G†G, where G is a nonsingular KLe × KLe matrix1. Define

the KLe × 1 column vector v as v � Gh†. Then, as G is nonsingular, we have h = v† (G−1)†, and

1If C is rank deficient, then we can still optimize the objective function J using a Cholesky decomposition, butthe details become more complicated [22]. For all of the practical examples considered here in simulations, C wasalways of full rank.

145

so finding the optimal h is equivalent to finding the optimal v. Substituting v = Gh† into (6.14)

yields

J =v†Tvv†v

, where T � α[(

G−1)† CΛ∆C† (G−1

)]+ (1 − α)

[1σ2

x

(G−1

)† Rη

(G−1

)]As T is Hermitian, it follows by Rayleigh’s principle [22] that the minimum value of J is λmin

where λmin denotes the smallest eigenvalue of T. Furthermore, J = λmin iff v lies in the eigenspace

corresponding to λmin. Thus, the optimization problem here can be solved using the eigenfilter

approach [74, 56]. If vmin denotes any nonzero vector in the eigenspace corresponding to λmin,

then the optimal equalizer coefficient vector hopt and corresponding optimal objective value Jopt

are given by

hopt = v†min

(G−1

)†Jopt = λmin

One important point to note is that the Cholesky factor G does not depend on the delay

parameter ∆. Other eigenfilter methods, such as the MSSNR method of [33] and the min-ISI

method of [6], require Cholesky factors that do depend on ∆. If we wish to perform an exhaustive

search over a range of possible parameters ∆ to see which one yields the best performance in terms

of bit rate, which is typically done in practice, then these methods are much higher in complexity

than the proposed one, which only requires one Cholesky decomposition for all values of ∆.


For the reasons mentioned in Sec. 6.2, we opted to compare our proposed TEQ design method with

others on the basis of observed bit rate (see (6.1)). Data for the channel and noise was obtained

from the Matlab DMTTEQ Toolbox [5]. We used the following typical asymmetric DSL (ADSL)

input parameters.

• Analog sampling frequency (fs) – 2.208 MHz, DFT size (NDFT) – 512, Cyclic prefix length

(LCP) - 32, SNR gap (Γ) – 9.8 dB

• K = 1, Lc = 512, Le = 16, σ2x = 21 dBm

• NEXT noise model with eight disturbers [46] plus additive white Gaussian noise (AWGN)

with power density −110 dBm/Hz (see Fig. 6.5 for a plot of the noise psd)

146

0 0.1 0.2 0.3 0.4 0.5

−33.1

−33.05

−33

−32.95

−32.9

−32.85

−32.8

f = ω/2π

Mag

nitu

de (

dBm

)

Figure 6.5: Noise psd Sηη(ejω) corresponding to NEXT noise with eight disturbers [46] plus AWGNwith power density −110 dBm/Hz.

0 100 200 300 400 500

−1

−0.5

0

0.5

1

n

Original ChannelEqualized Channel

Figure 6.6: Original and equalized channel impulse responses using the proposed eigenfilter methodfor CSA loop #1.

0 0.2 0.4 0.6 0.8 10

20

40

60

80

100

α

Bit

rate

(%

of M

FB

)

Figure 6.7: Observed bit rate (as a percentage of the MFB bit rate of (6.2)) as a function of thetradeoff parameter α for CSA loop #1.

147

CSAloop #

Observed bit rate as a % of the MFB maximum bit rateMFB

maximumachievablebit rate

MMSE-UTC [3]

MMSE-UEC [3]

GSNR[4]

MSSNR[33]

min-ISI[6]

Delay SpreadMinimization [44]

ProposedMethod

1 82.4% 83.3% 84.8% 92.6% 95.0% 77.4% 91.1% 2.844 Mbps

2 81.1% 86.6% 89.3% 86.4% 95.3% 66.8% 91.1% 3.068 Mbps

3 80.3% 81.1% 83.1% 90.4% 95.3% 80.2% 92.3% 2.643 Mbps

4 62.7% 74.9% 74.2% 88.6% 93.2% 51.9% 92.9% 1.933 Mbps

5 79.7% 72.6% 75.2% 81.9% 93.7% 68.5% 89.5% 2.839 Mbps

6 77.8% 80.7% 80.6% 89.5% 94.3% 80.6% 92.8% 2.350 Mbps

7 64.1% 78.2% 79.6% 90.3% 93.9% 72.2% 89.8% 2.519 Mbps

8 69.1% 78.1% 80.7% 88.6% 91.8% 79.0% 90.6% 2.325 Mbps

Table 6.1: Observed bit rates for CSA loops #1-8 using various TEQ design methods. (Bit ratesare expressed as a percentage of the MFB maximum achievable bit rate of (6.2) for each loop.)

In Fig. 6.6, we have plotted the original and equalized channel impulse responses for carrier

service area (CSA) loop #1 designed using our proposed method. Here, we chose α = α0 as in (6.7)

and f(n) = fCP(n) as in (6.8). We varied the delay parameter ∆ from 0 to 40 and chose the one

which yielded the best bit rate. As we can see, our method shortened the channel quite well. In

Fig. 6.7, we have plotted the observed bit rate (as a percentage of the MFB maximum achievable

bit rate from (6.2)) as a function of the tradeoff parameter α. Here, α0 = 0.9977 and yielded a

percentage of 91.0819, whereas the optimum α was 0.998 with a percentage of 91.0852. Clearly the

heuristic choice of α = α0 from (6.7) yielded nearly optimal results as desired. From Figure 6.7, we

can see that performance remained relatively constant for 0.1 ≤ α ≤ 1, which heuristically means

that for the simulation parameters chosen here, ISI is more of a problem than noise.

In Table 6.1, we have tabulated the observed bit rates from (6.1) for CSA loops #1-8 using the

above parameters as a percentage of the MFB bit rate given in (6.2). For each method considered

except for the geometric SNR method (GSNR) [4], which requires nonlinear optimization, we varied

the delay parameter ∆ from 0 to 40 and chose the value that yielded the best bit rate. The optimum

MMSE-UEC (unit energy constraint) method of [3] was used as the initial condition for the GSNR

method. As was done in [6], the mean-squared error (MSE) parameter used was set to be 2 dB

above the MSE obtained from the optimal MMSE-UEC equalizer. From Table 6.1, we can see that

148

our proposed method comes very close to the MFB maximum bit rate and is comparable to the min-

ISI method of [6]. However, we should note that our proposed method requires less computational

load, as we only require one Cholesky decomposition for all values of ∆, as opposed to the min-ISI

method which requires a different such decomposition for each ∆. The MISO equalizers for FSEs

designed using our method offer a further improvement over all the methods considered here (see

[51] for more details). This improvement, however, comes at the expense of a dramatic increase in

redundancy and so for sake of fair comparison, these results have been omitted.


In this chapter, we generalized the delay spread minimization method of [44] to account for the

cyclic prefix length as well as the noise encountered in the system. Furthermore, we showed how

the method can be applied to the design of FSEs for channel shortening. In addition, we showed

that the proposed eigenfilter method is less complex to implement than other common eigenfilter

TEQ design methods in that it only requires one Cholesky decomposition for all delay parameter

values. From our simulation results, it was observed that our method came close to MFB maximum

bit rate for all CSA loops considered, showing the merit of the proposed TEQ design method.

The proposed eigenfilter channel shortening method not only applies to the SIMO-MISO chan-

nel/equalizer model of Fig. 6.4, but also to a more general MIMO model (see [51] for more details).

However, as the results in this case are less intuitive and the practical applications are not as clear

as for the SIMO-MISO model of Fig. 6.4, we have chosen to omit them here for sake of brevity and

clarity. The interested reader is referred to [2, 51] for more details regarding the design of channel

shortening equalizers for the general MIMO case.

Appendix: Equivalence between FSEs and the SIMO-MISO

Channel/Equalizer Model

Here, we show the equivalence between the discrete time model of the K-fold oversampled FSE

shown in Fig. 6.3 and the SIMO-MISO channel/equalizer model of Fig. 6.4. In particular, we will

show that if CK(z) and HK(z) have the polyphase decompositions given in (6.3), then the system

of Fig. 6.3 can be redrawn as in Fig. 6.4 where the components of C(z), H(z), and η(n) are as in

(6.4). This can be done with the help of the noble identities (see Sec. 1.1.3.1).

149

� � E0(z)

�

� E1(z)

�

�

� EK−1(z)

�

�

�

↑ K

↑ K

↑ K

z−1

z−1

z−1

�

�

�

�

η(n)

�

�z

�z

�z

x(n)�

�

�

↓ K

↓ K

↓ K

�

�

�

R0(z)

R1(z)

RK−1(z)

�

�

�

�y(n)

Figure 6.8: Equivalent form of Fig. 6.3 upon using the noble identities.

� � E0(z)

�

� E1(z)

�

�

� EK−1(z)

�

�

�

↑ K

↑ K

↑ K

z−1

z−1

z−1

�

�

�

�

�z

�z

�z

x(n)�

�

�

↓ K

↓ K

↓ K

�

�

�

�

�

�

R0(z)

R1(z)

RK−1(z)

�

�

�

�y(n)

η(Kn)

η(Kn + 1)

η(Kn + (K − 1))Identity System

Figure 6.9: Equivalent form of Fig. 6.8 upon moving the noise process η(n) past the blocking system.

If CK(z) and HK(z) have the polyphase decompositions given in (6.3), then using the noble

identities, we can redraw the system of Fig. 6.3 as in Fig. 6.8. It can be seen that just before the

noise is added, the signal is unblocked by a factor of K, whereas just after the noise is added, the

signal is blocked by a factor of K (see Sec. 1.1.2).

Note that instead of adding the noise prior to blocking the signal, we can add the noise after

the signal has been blocked. In other words, the system of Fig. 6.8 can be equivalently redrawn as

in Fig. 6.9. Now note that the system comprised of K-fold unblocking followed by K-fold blocking

is an identity system, and so we can remove the expanders and decimators from the model entirely

as desired. Doing so, we get the system shown in Fig. 6.10.

From Fig. 6.10, it is clear that the original FSE system of Fig. 6.3 is equivalent to the SIMO-

MISO model shown in Fig. 6.4, where the quantities C(z), H(z), and η(n) are as in (6.4).

150

� � E0(z)

�

� E1(z)

�

�

� EK−1(z)

x(n) �

�

�

�

�

�

R0(z)

R1(z)

RK−1(z)

�

�

�

� y(n)

η(Kn)

η(Kn + 1)

η(Kn + (K − 1))

Figure 6.10: Equivalent form of Fig. 6.9 upon removing the unblocking/blocking systems.

151

Chapter 7

Conclusion

In this thesis we presented a wide variety of optimization algorithms for the design of realizable

signal-adapted filter banks. Our focus in the first part of the thesis was specifically on the design

of FIR PU signal-adapted filter banks. One of the major contributions was to bridge the gap

between two theoretically optimal PU filter banks, namely, the zeroth-order PCFB (KLT) and the

unconstrained or infinite-order PCFB. This link has previously not been shown in the literature.

As opposed to traditional FIR PU signal-adapted filter bank design methods, in which the filter

bank is chosen to optimize a specific objective for which the PCFB is optimal, the design goals of

the methods presented here were quite different and novel. In particular, in one of the methods

proposed, the design objective was to approximate the infinite-order PCFB itself in the least-

squares sense with a realizable FIR PU filter bank. Using an important complete characterization

of FIR PU systems in terms of degree-one Householder-like building blocks, we showed how to

optimize each parameter seperately, which in turn led to an iterative greedy algorithm for solving

the original least-squares problem. Simulation results for this and other methods presented here

showed that the FIR PU filter banks designed exhibited PCFB-like behavior. The FIR filter banks

designed were shown to monotonically behave more and more like the infinite-order PCFB as the

FIR order increased. This monotonic behavior was shown in terms of numerous objectives for

which the PCFB is optimal. These results served to bridge the gap between the zeroth-order KLT

and infinite-order PCFB which previously had not been reported in the literature.

In the second part of the thesis, we focused on the design of a signal-adapted filter bank in which

the analysis and synthesis filters were FIR but otherwise unconstrained. The model considered was

a uniform filter bank with scalar quantizers in the subbands. Under the high bit rate assumption, we

showed how to obtain either the optimal analysis or synthesis bank using the trick of completing the

square. This in turn led to another iterative greedy algorithm in which the analysis and synthesis

152

banks were alternately optimized. The main contribution of the algorithm here was shown through

simulations presented here. In particular, the FIR filter banks designed exhibited a monotonic

tendency toward information theoretic bounds on distortion and coding gain as the FIR filter

orders increased. Though this result is intuitive, it has not formally been shown in the literature

until now.

In the third part of the thesis, we showed how some of the optimization techniques used for

the design of signal-adapted filter banks could be used for designing channel shortening equalizers.

Here, we showed how the eigenfilter method could be used for the design of good TEQs for DMT

systems. As opposed to other eigenfilter TEQ design methods which require a different Cholesky

factor for every delay parameter considered, the proposed method only required one such factor for

all delays. In addition to being low in computational complexity, the proposed method was shown

to perform nearly optimally in terms of observed bit rate in simulations presented here.

Matlab code for several of the algorithms presented in this thesis will soon be available at [60].

7.1 Open Problems

Despite the contributions related to the design of realizable signal-adapted filter banks presented

here, a number of problems still remain open. First of all, note that all of the algorithms proposed

here apply only to uniform filter banks. In several wavelet-based lossy data compression schemes,

a nonuniform filter bank is commonly used to achieve compression [41, 39]. At this time, it is not

known how to generalize the proposed optimization algorithms to nonuniform filter banks. It should

be noted that any nonuniform filter bank can be equivalently redrawn as a uniform filter bank with

restrictions on the polyphase matrices [67]. Hence, in order to use the algorithms proposed here,

we would have to account for these restrictions along with inherent FIR assumption and any other

constraints such as the PU condition.

Another problem that also remains open is generalizing the FIR PU design algorithms to the

multidimensional case. This may be useful, for example, in image or video processing if we seek an

optimal nonseparable filter bank for compression. The problem in the general multidimensional case

is that there is no known complete parameterization of FIR PU systems in terms of Householder-like

degree-one building blocks as in the one-dimensional case. Though FIR PU factorizations similar

to those given in Sec. 3.2 exist for the multidimensional case, they do not cover all such systems

and are as such incomplete.

153

On a practical note, one issue which remains unresolved is how the iterative greedy algorithms

will perform when the input signal statistics rapidly fluctuate and must be adaptively updated.

This phenomenon commonly occurs in audio signal processing where the assumed stationarity is

only valid for short periods of time.

Another issue which remains unanswered at this point in time is whether the algorithms pro-

posed here can be used in applications outside of source coding and data compresson. In particular,

it is not known if the design methods proposed here can be used for channel coding applications

in digital communications. Typically, in digital communications, a redundant transmultiplexer is

used and the design objectives are quite different in nature than those considered for data compres-

sion. Nevertheless, it would be interesting to see whether the proposed design algorithms could be

used for this application. For the FIR PU design algorithms proposed here, this widely depends on

whether the theory of PCFBs can be applied to redundant transmultiplexers. The theory of PCFBs,

which has been found useful for nonredundant transmultiplexers, has not yet been extended for the

redundant case.

154

Bibliography

[1] S. Akkarakaran and P. P. Vaidyanathan, “Filterbank optimization with convex objectives and

the optimality of principal component forms,” IEEE Trans. Signal Processing, vol. 49, no. 1,

pp. 100–114, Jan. 2001.

[2] N. Al-Dhahir, “FIR channel-shortening equalizers for MIMO ISI channels,” IEEE Trans. Com-

mun., vol. 49, no. 2, pp. 213–218, Feb. 2001.

[3] N. Al-Dhahir and J. M. Cioffi, “Efficiently computed reduced-parameter input-aided MMSE

equalizers for ML detection: a unified approach,” IEEE Trans. Inform. Theory, vol. 42, no. 3,

pp. 903–915, May 1996.

[4] ——, “A bandwidth-optimized reduced-complexity equalized multicarrier transceiver,” IEEE

Trans. Commun., vol. 45, no. 8, pp. 948–956, Aug. 1997.

[5] G. Arslan, M. Ding, B. Lu, M. Milosevic, Z. Shen, and B. L. Evans. (2003)

Matlab DMTTEQ Toolbox. [Online]. Available: http://www.ece.utexas.edu/ be-

vans/projects/adsl/dmtteq/dmtteq.html

[6] G. Arslan, B. L. Evans, and S. Kiaei, “Equalization for discrete multitone transceivers to

maximize bit rate,” IEEE Trans. Signal Processing, vol. 49, no. 12, pp. 3123–3135, Dec. 2001.

[7] R. E. Blahut, Principles and Practice of Information Theory. Reading, MA: Addison-Wesley,

1987.

[8] H. Caglar, Y. Liu, and A. N. Akansu, “Statistically optimized PR-QMF design,” in Proc. SPIE

1605, Wavelet Appl. Signal Image Process., San Diego, CA, 1991, pp. 86–94.

[9] P. R. Chevillat and G. Ungerboeck, “Optimum FIR transmitter and receiver filters for data

transmission over band-limited channels,” IEEE Trans. Commun., vol. COM-30, no. 8, pp.

1909–1915, Aug. 1982.

155

[10] T. M. Cover and J. A. Thomas, Elements of Information Theory. New York, NY: John Wiley

& Sons, 1991.

[11] P. Delsarte, B. Macq, and D. T. M. Slock, “Signal-adapted multiresolution transform for image

coding,” IEEE Trans. Inform. Theory, vol. 38, no. 2, pp. 897–904, Mar. 1992.

[12] I. Djokovic and P. P. Vaidyanathan, “Results on biorthogonal filter banks,” Journal of Applied

and Computational Harmonic Analysis, vol. 1, pp. 329–343, 1994.

[13] N. J. Fliege, Multirate Digital Signal Processing: Multirate Systems - Filter Banks - Wavelets.

Chichester, England: John Wiley & Sons, 1994.

[14] G. B. Giannakis, “Filterbanks for blind channel identification and equalization,” IEEE Signal

Processing Lett., vol. 4, no. 6, pp. 184–187, June 1997.

[15] G. B. Giannakis, Y. Hua, P. Stoica, and L. Tong, Signal Processing Advances in Wireless and

Mobile Communications, Volume 1: Trends in Channel Estimation and Equalization. Upper

Saddle River, NJ: Prentice-Hall, 2000.

[16] ——, Signal Processing Advances in Wireless and Mobile Communications, Volume 2: Trends

in Single- and Multi-User Systems. Upper Saddle River, NJ: Prentice-Hall, 2000.

[17] G. B. Giannakis, Z. Wang, A. Scaglione, and S. Barbarossa, “AMOUR-generalized multicarrier

CDMA irrespective of multipath,” in Proc. Global Telecommunications Conference, vol. 1B,

Rio de Janeireo, Brazil, Dec. 1999, pp. 965–969.

[18] ——, “AMOUR-generalized multicarrier transceivers for blind CDMA regardless of multi-

path,” IEEE Trans. Commun., vol. 48, no. 12, pp. 2064–2076, Dec. 2000.

[19] A. Hjørungnes, H. Coward, and T. A. Ramstad, “Minimum mean square error FIR filter banks

with arbitrary filter lengths,” in Proc. Int. on Image Processing, vol. 1, Kobe, Japan, Oct. 1999,

pp. 619–623.

[20] A. Hjørungnes and T. A. Ramstad, “Algorithm for jointly optimized analysis and synthesis

FIR filter banks,” in Proc. IEEE Int. Conf. on Electron., Circuits, and Systems, vol. 1, Pafos,

Cyprus, Sept. 1999, pp. 369–372.

156

[21] A. Hjørungnes and T. Saramaki, “Minimum mean square error nonuniform FIR filter banks,”

in Proc. IEEE Int. Conf. on Acoust., Speech, and Signal Processing, vol. 6, Salt Lake City,

UT, May 2001, pp. 3609–3612.

[22] R. A. Horn and C. R. Johnson, Matrix Analysis. Cambridge, U.K.: Cambridge University

Press, 1985.

[23] Y. Huang and P. M. Schultheiss, “Block quantization of correlated Gaussian random variables,”

IEEE Trans. Commun. Syst., vol. C-10, pp. 289–296, Sept. 1963.

[24] O. S. Jahromi, B. A. Francis, and R. H. Kwong, “Algebraic theory of optimal filterbanks,”

IEEE Trans. Signal Processing, vol. 51, no. 2, pp. 442–457, Feb. 2003.

[25] N. S. Jayant and P. Noll, Digital Coding of Waveforms: Principles and Applications to Speech

and Video. Englewood Cliffs, NJ: Prentice-Hall, 1984.

[26] I. Kalet, “The multitone channel,”IEEE Trans. Commun., vol.37, no.2, pp. 119–124, Feb.1989.

[27] A. Kırac and P. P. Vaidyanathan, “On existence of FIR principal component filter banks,” in

Proc. IEEE Int. Conf. on Acoust., Speech, and Signal Processing, vol. 3, Seattle, WA, May

1998, pp. 1329–1332.

[28] ——, “Optimality of orthonormal transforms for subband coding,” in Proc. IEEE DSP Work-

shop, Bryce Canyon, UT, Aug. 1998.

[29] ——, “Theory and design of optimum FIR compaction filters,” IEEE Trans. Signal Processing,

vol. 46, no. 4, pp. 903–919, Apr. 1998.

[30] J. Kovacevic and M. Vetterli, “Perfect reconstruction filter banks with rational sampling fac-

tors,” IEEE Trans. Signal Processing, vol. 41, no. 6, pp. 2047–2066, June 1993.

[31] B. Maison and L. Vandendorpe, “About the asymptotic performance of multiple-

input/multiple-output linear prediction of subband signals,” IEEE Signal Processing Lett.,

vol. 5, no. 12, pp. 315–317, Dec. 1998.

[32] S. Mallat, AWavelet Tour of Signal Processing, 2nd ed. San Diego, CA: Academic Press, 1999.

[33] P. J. W. Melsa, R. C. Younce, and C. E. Rohrs, “Impulse response shortening for discrete

multitone transceivers,” IEEE Trans. Commun., vol. 44, no. 12, pp. 1662–1672, Dec. 1996.

157

[34] T. K. Moon and W. C. Stirling, Mathematical Methods and Algorithms for Signal Processing.

Upper Saddle River, NJ: Prentice-Hall, 2000.

[35] P. Moulin, M. Anitescu, K. O. Kortanek, and F. A. Potra, “The role of linear semi-infinite

programming in signal-adapted QMF bank design,” IEEE Trans. Signal Processing, vol. 45,

no. 9, pp. 2160–2174, Sept. 1997.

[36] P. Moulin, M. Anitescu, and K. Ramchandran, “Theory of rate-distortion-optimal, constrained

filterbanks–application to IIR and FIR biorthogonal designs,” IEEE Trans. Signal Processing,

vol. 48, no. 4, pp. 1120–1132, Apr. 2000.

[37] P. Moulin and M. K. Mıhcak, “Theory and design of signal-adapted FIR paraunitary filter

banks,” IEEE Trans. Signal Processing, vol. 46, no. 4, pp. 920–929, Apr. 1998.

[38] J. G. Proakis, Digital Communications, 4th ed. New York, NY: McGraw-Hill, 2000.

[39] D. Salomon, A Guide to Data Compression Methods. New York, NY: Springer-Verlag, 2002.

[40] V. P. Sathe and P. P. Vaidyanathan, “Effects of multirate systems on the statistical properties

of random signals,” IEEE Trans. Signal Processing, vol. 41, no. 1, pp. 131–146, Jan. 1993.

[41] K. Sayood, Introduction to Data Compression, 2nd ed. San Diego, CA: Academic Press, 2000.

[42] A. Scaglione, G. B. Giannakis, and S. Barbarossa, “Redundant filterbank precoders and equal-

izers: Part I: unification and optimal designs,” IEEE Trans. Signal Processing, vol. 47, no. 7,

pp. 1988–2006, July 1999.

[43] ——, “Redundant filterbank precoders and equalizers: Part II: blind channel estimation, syn-

chronization, and direct equalization,” IEEE Trans. Signal Processing, vol. 47, no. 7, pp.

2007–2022, July 1999.

[44] R. Schur and J. Speidel, “An efficient equalization method to minimize delay spread in

OFDM/DMT systems,” in Proc. IEEE Int. Conf. on Communications, vol. 5, Helsinki, Fin-

land, June 2001, pp. 1481–1485.

[45] A. K. Soman and P. P. Vaidyanathan, “On orthonormal wavelets and paraunitary filter banks,”

IEEE Trans. Signal Processing, vol. 41, no. 3, pp. 1170–1183, Mar. 1993.

158

[46] T. Starr, J. M. Cioffi, and P. J. Silverman, Understanding Digital Subscriber Line Technology.

Upper Saddle River, NJ: Prentice-Hall, 1999.

[47] G. Strang and T. Nguyen, Wavelets and Filter Banks. Wellesley, MA: Wellesley-Cambridge

Press, 1996.

[48] C. W. Therrien, Discrete Random Signals and Statistical Signal Processing. Upper Saddle

River, NJ: Prentice-Hall, 1992.

[49] A. Tkacenko and P. P. Vaidyanathan, “Iterative gradient based greedy algorithm for the design

of optimal FIR compaction filters and signal-adapted paraunitary filter banks,” submitted to

IEEE Trans. Signal Processing.

[50] ——, “Iterative greedy algorithm for solving the FIR paraunitary approximation problem,”

submitted to IEEE Trans. Signal Processing.

[51] ——, “Eigenfilter design of MIMO equalizers for channel shortening,” in Proc. IEEE Int. Conf.

on Acoust., Speech, and Signal Processing, vol. 3, Orlando, FL, May 2002, pp. 2361–2364.

[52] ——, “A new eigenfilter based method for optimal design of channel shortening equalizers,”

in Proc. IEEE Int. Symp. on Circuits and Systems, vol. 2, Scottsdale, AZ, May 2002, pp.

504–507.

[53] ——, “Noise optimized eigenfilter design of time-domain equalizers for DMT systems,” in Proc.

IEEE Int. Conf. on Communications, vol. 1, New York, NY, Apr./May 2002, pp. 54–58.

[54] ——, “Iterative eigenfilter method for designing optimum overdecimated orthonormal FIR

compaction filter banks,” in Proc. 37th Asilomar Conference on Signals, Systems and Com-

puters, Pacific Grove, CA, Nov. 2003.

[55] ——, “A low-complexity eigenfilter design method for channel shortening equalizers for DMT

systems,” IEEE Trans. Commun., vol. 51, no. 7, pp. 1069–1072, July 2003.

[56] ——, “On the eigenfilter design method and its applications: a tutorial,” IEEE Trans. Circuits

Syst. II, vol. 50, no. 9, pp. 497–517, Sept. 2003.

[57] ——, “On the least squares signal approximation model for overdecimated rational nonuni-

form filter banks and applications,” in Proc. IEEE Int. Conf. on Acoust., Speech, and Signal

Processing, vol. 6, Hong Kong, China, Apr. 2003, pp. 481–484.

159

[58] ——, “Iterative algorithm for the design of optimal FIR analysis/synthesis filters for overdec-

imated filter banks,” accepted to IEEE Int. Symp. on Circuits and Systems, May 2004.

[59] ——, “Iterative gradient technique for the design of least squares optimal FIR magnitude

squared nyquist filters,” accepted to IEEE Int. Conf. on Acoust., Speech, and Signal Processing,

May 2004.

[60] A. Tkacenko. (2004) Matlab Signal-Adapted Filter Bank Optimization Algorithms. [Online].

Available: http://www.systems.caltech.edu/dsp/students/andre/index.html

[61] J. R. Treichler, I. Fijalkow, and J. C. R. Johnson, “Fractionally spaced equalizers: How long

should they really be?” IEEE Signal Processing Mag., vol. 13, no. 3, pp. 65–81, May 1996.

[62] M. K. Tsatsanis and G. B. Giannakis, “Principal component filter banks for optimal multires-

olution analysis,” IEEE Trans. Signal Processing, vol. 43, no. 8, pp. 1766–1777, Aug. 1995.

[63] D. W. Tufts and J. T. Francis, “Designing digital lowpass filters: comparison of some methods

and criteria,” IEEE Trans. Audio Electroacoust., vol. AU-18, pp. 487–494, Dec. 1970.

[64] J. Tuqan and P. P. Vaidyanathan, “A state space approach to the design of globally optimal

FIR energy compaction filters,” IEEE Trans. Signal Processing, vol. 48, no. 10, pp. 2822–2838,

Oct. 2000.

[65] M. Unser, “An extension of the KLT for wavelets and perfect reconstruction filter banks,” in

Proc. SPIE 2034, Wavelet Appl. Signal Image Process., San Diego, CA, 1993, pp. 45–56.

[66] ——, “On the optimality of ideal filters for pyramid and wavelet signal approximation,” IEEE

Trans. Signal Processing, vol. 41, no. 12, pp. 3591–3596, Dec. 1993.

[67] P. P. Vaidyanathan, Multirate Systems and Filter Banks. Englewood Cliffs, NJ: Prentice-Hall,

1993.

[68] ——, “Theory of optimal orthonormal subband coders,” IEEE Trans. Signal Processing,

vol. 46, no. 6, pp. 1528–1543, June 1998.

[69] P. P. Vaidyanathan and S. Akkarakaran, “A review of the theory and applications of optimal

subband and transform coders,” Journal of Applied and Computational Harmonic Analysis,

vol. 10, pp. 254–289, 2001.

160

[70] P. P. Vaidyanathan and P. Q. Hoang, “Lattice structures for optimal design and robust imple-

mentation of two-channel perfect-reconstruction QMF banks,” IEEE Trans. Acoust., Speech,

Signal Processing, vol. 36, no. 1, pp. 81–94, Jan. 1988.

[71] P. P. Vaidyanathan and A. Kırac, “Cyclic LTI systems and the paraunitary interpolation

problem,” in Proc. IEEE Int. Conf. on Acoust., Speech, and Signal Processing, vol. 3, Seattle,

WA, May 1998, pp. 1445–1448.

[72] ——, “Results on optimal biorthogonal filter banks,” IEEE Trans. Circuits Syst. II, vol. 45,

no. 8, pp. 932–947, Aug. 1998.

[73] P. P. Vaidyanathan, Y.-P. Lin, S. Akkarakaran, and S.-M. Phoong, “Discrete multitone mod-

ulation with principal component filter banks,” IEEE Trans. Circuits Syst. I, vol. 49, no. 10,

pp. 1397–1412, Oct. 2002.

[74] P. P. Vaidyanathan and T. Q. Nguyen, “Eigenfilters: a new approach to least-squares FIR filter

design and applications including Nyquist filters,” IEEE Trans. Circuits Syst., vol. CAS-34,

no. 1, pp. 11–23, Jan. 1987.

[75] P. P. Vaidyanathan, T. Q. Nguyen, Z. Doganata, and T. Saramaki, “Improved technique for

design of perfect reconstruction FIR QMF banks with lossless polyphase matrices,” IEEE

Trans. Acoust., Speech, Signal Processing, vol. 37, no. 7, pp. 1042–1056, July 1989.

[76] P. P. Vaidyanathan and B. Vrcelj, “Biorthogonal partners and applications,” IEEE Trans.

Signal Processing, vol. 49, no. 5, pp. 1013–1027, May 2001.

[77] B. Vrcelj and P. P. Vaidyanathan, “Least squares signal approximation using multirate systems:

multichannel nonuniform case,” in Proc. 35th Asilomar Conference on Signals, Systems and

Computers, vol. 1, Pacific Grove, CA, Nov. 2001, pp. 553–557.

[78] ——, “MIMO biorthogonal partners and applications,” IEEE Trans. Signal Processing, vol. 50,

no. 3, pp. 528–542, Mar. 2002.

[79] B. Xuan and R. H. Bamberger, “FIR principal component filter banks,” IEEE Trans. Signal

Processing, vol. 46, no. 4, pp. 930–940, Apr. 1998.