14-Bayesian Wavelet-Based Image Denoising Using the Gauss–Hermite Expansion.pdf

8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

1/17

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008 1755

Bayesian Wavelet-Based Image Denoising Using theGaussHermite Expansion

S. M. Mahbubur Rahman, M. Omair Ahmad, Fellow, IEEE, and M. N. S. Swamy, Fellow, IEEE

AbstractThe probability density functions (PDFs) of thewavelet coefficients play a key role in many wavelet-based imageprocessing algorithms, such as denoising. The conventional PDFsusually have a limited number of parameters that are calculatedfrom the first few moments only. Consequently, such PDFs cannotbe made to fit very well with the empirical PDF of the waveletcoefficients of an image. As a result, the shrinkage functionutilizing any of these density functions provides a substandarddenoising performance. In order for the probabilistic model of theimage wavelet coefficients to be able to incorporate an appropriatenumber of parameters that are dependent on the higher ordermoments, a PDF using a series expansion in terms of the Hermite

polynomials that are orthogonal with respect to the standardGaussian weight function, is introduced. A modification in theseries function is introduced so that only a finite number of termscan be used to model the image wavelet coefficients, ensuring atthe same time the resulting PDF to be non-negative. It is shownthat the proposed PDF matches the empirical one better thansome of the standard ones, such as the generalized Gaussian orBessel K-form PDF. A Bayesian image denoising technique is thenproposed, wherein the new PDF is exploited to statistically modelthe subband as well as the local neighboring image wavelet coef-ficients. Experimental results on several test images demonstratethat the proposed denoising method, both in the subband-adaptiveand locally adaptive conditions, provides a performance betterthan that of most of the methods that use PDFs with limitednumber of parameters.

Index TermsBayesian image denoising, GaussHermite,higher order moments, orthogonal series expansion, probabilitydensity function, wavelet coefficient.

I. INTRODUCTION

DENOISING of images corrupted by additive white

Gaussian noise (AWGN) is a classical problem in

signal-processing. The distortion of images by noise is

common during its acquisition, processing, compression,

storage, transmission, and reproduction. The aim of denoising

is to remove the noise while keeping the signal features as muchas possible. Traditional algorithms perform image denoising

Manuscript receivedNovember 9, 2007; revisedMay 5, 2008. Current versionpublished September 10, 2008. This work was supported in part by the NaturalSciences and Engineering Research Council (NSERC) of Canada and in partby the Regroupement Stratgique en Microlectronique du Qubec (ReSMiQ).The associate editor coordinating the review of this manuscript and approvingit for publication was Dr. Michael Elad.

The authors are with the Center for Signal Processing and Communications,Department of Electrical and Computer Engineering, Concordia University,Montral, QC, H3G 1M8 Canada (e-mail: [email protected];[email protected]; [email protected]).

Color versions of one or more of the figures in this paper are available onlineat http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TIP.2008.2002163

in the pixel domain. However, the application of the wavelet

transform in image denoising, compression, etc. has shown

remarkable success over the last decade. Wavelet shrinkage

method proposed by Donoho [1][3] is the pioneering work

for signal denoising using the wavelet transform. The method

described in [3] provides a mini-max optimal solution. Another

criterion commonly used is the mean squared error (MSE),

even though it does not match well with the characteristics of

the human visual system [4]. However, since it is a measure of

the noise power [5], and is mathematically convenient in the

context of optimization, it is employed as a denoising criterionin many image denoising applications. The discrete wavelet

transform (DWT) used in image denoising can be of many

types, such as orthogonal/biorthogonal, real/complex-valued,

separable/nonseparable, or decimated/nondecimated. Due

to the shift-invariance property, the nondecimated separable

DWT [6] improves the image denoising performance in MSE

approximately by 1 dB as compared to that of the decimated

representation [7][10]. To reduce the computational load of

the nondecimated wavelet transform as well as the Gibbs arti-

facts, cycle-spinning has been proposed [11]. Image denoising

by using transforms with improved directional selectivity,

such as the dual-tree complex wavelet transform (DT-CWT)[12], ridgelet [13], curvelet [14], contourlet [15], and steerable

pyramid [16], [17] have also been introduced.

There are two major approaches for statistical wavelet-based

denoising [18], [19]. The first approach is to design a sta-

tistically optimal threshold parameter for some nonlinear

thresholding or shrinkage functions. Image denoising methods

using such an approach include the VisuShrink [2], SureShrink

[3], BayesShrink [20], amplitude-scale-invariant Bayes esti-

mator [21], NeighCoeff [22], Spatial-correlation thresholding

[23], and empirical Bayes thresholding (EBayesThresh) [24].

Standard choices for nonlinear thresholding or shrinkage

functions are soft- and hard-thresholding [1], firm-shrinkage

[25], and non-negative garrote shrinkage [26]. Methods usingsuch shrinkage functions are computationally simple, but have

certain drawbacks in view of the fact that the functions repre-

senting the nonlinearity are arbitrarily chosen. For example,

the soft-thresholding technique yields a biased estimate with

a moderate variance, whereas the hard-thresholding technique

yields a less biased estimate but with a higher variance [26]. In

the second approach, instead of using an arbitrary function to

represent the nonlinearity, the shrinkage function is designed

by minimizing a Bayesian risk, typically under the minimum

mean squared error (MMSE) criterion [27], minimum mean

absolute error criterion [28] or maximum a posteriori (MAP)

criterion [29], [30]. The development of an optimal threshold

1057-7149/$25.00 2008 IEEE


2/17

1756 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008

or shrinkage function is usually based on the probability den-

sity function (PDF) or simply the prior function of the image

wavelet coefficients. By exploiting the nature of the histogram

of the wavelet coefficients, different noise reduction algorithms

try to incorporate the prior model that best matches the true

image coefficients. The marginal PDF of a true image and its

self-similar subband wavelet coefficients are unimodal andsharply peaked in nature [31], [32]. Therefore, higher order

statistics are necessary to describe the sharp central cusp of

the density function of the image coefficients. The parameters

of a PDF and their estimation play a significant role in the

Bayesian denoising technique, since the computational com-

plexity of a subband-adaptive Bayesian denoising is ,

being the number of parameters of the PDF and

the size of the image. Subsequent to the work of Mallat [33],

the generalized Gaussian (GG) distribution with the shape

parameter has been commonly used to model the

image wavelet coefficients [8][10], [20], [27], [34][36]. For

example, the threshold factor in the BayesShrink method [20]

and the parametersignal of interestin the ProbShrink method[10] assume that the wavelet coefficients in each subband are

independent and identically distributed (i.i.d.) random variables

having a GG distribution. Under the simplest prior GG with

, and the i.i.d. assumption, both the MMSE and MAP

estimators provide the same closed-form solution, the well

known Wiener filter. A general class of prior functions used

for the purpose of denoising is the Gaussian scale mixture

(GSM), wherein the wavelet coefficients are modelled as a

zero-mean Gaussian function multiplied by random positive

state variables [7], [16], [37][42]. For a simple two-state GSM

prior, an expectation maximization (EM) algorithm can be used

to estimate the multipliers [7], [41]. In [43], the EM algorithmhas been extended for a finite number of state variables and

it has been shown that four to eight state variables of the

GSM density function are sufficient for a good approximation

of the prior function of the subband wavelet coefficients of

images. In general, the estimation of the GSM prior requires

another PDF, such as the log-normal density function [40]

or Jeffreys noninformative prior [16], to estimate the set of

multipliers. In order to obtain a Bayes estimator for image

denoising, Figueiredo and Nowak [21] have used Jeffreys

prior itself as a statistical model for the wavelet coefficients,

since this prior is free of any parameter. In [44] and [45], a

class of heavy-tailed densities, namely, the -stable family with

characteristic exponent has been proposed as a

prior of the image coefficients. This PDF has no closed-form

expression even though its characteristic function has one, and

its parameter estimation is poor, specially in the presence of

noise [43]. Another family of sharply peaked PDFs, called the

Bessel K-form (BK), introduced to model the filtered image in

[32], has been applied in the wavelet-domain MMSE estimator

by Fadili and Boubchir [46]. The wavelet coefficients of a

log-transformed image (for example, synthetic aperture radar

or medical ultrasound image) are modelled by the normal

inverse Gaussian PDF for removing the multiplicative noise

using a MAP estimator in the homomorphic approach [47],

[48]. The parameters of a PDF can be estimated using severalstrategies, such as the method of moments [49], maximum

likelihood method [50], and entropy-matching method [51].

Since the image wavelet coefficients of a subband can be treated

as a data set having a large sample size, the moment-based

parameter estimation provides a consistent result [52]. In order

to capture the higher order statistics, most of the PDFs used in

modeling the image wavelet coefficients use afinite number of

parameters that are usually derived from the second and fourthorder moments of the image coefficients. As a result, in most

of the cases these density functions become much more lep-

tokurtic than the empirical one. Since the wavelet coefficients

are locally stationary [53], the parameters of some of the PDFs

are refined with respect to the local spatial context within a sub-

band [8], [16], [20], [22], [40], [54][57]. In such a case, when

the denoising technique is locally adaptive within a subband,

the computational complexity increases to ,

being the size of a squared-shape local window. A few of the

PDFs have also been extended to the bivariate [29], [58] [60]

and multivariate cases [36], [42] or refined in the hidden

Markov tree framework [7], [41] to take into consideration the

intersubband dependency. In such a case, the computationalload of a denoising technique increases exponentially with the

number of subbands, and this is substantially higher than the

corresponding locally adaptive method.

In this paper, a novel marginal statistical model is proposed

for the wavelet coefficients of images by using the GaussHer-

mite (GH) expansion. This expansion is used in view of the fact

that it allows higher order moments to be incorporated in the

probabilistic modeling of the wavelet coefficients. A method of

choosing a finite number of terms in the GH series such that

the resulting PDF matches well with the empirical PDF of the

wavelet coefficients of the images and at the same time remains

non-negative, is proposed. A closed-from expression is derivedfor the parameters of the modified PDF in terms of the higher

order sample-moments. The proposed PDF is then applied as

a prior function in the subband-adaptive and locally adaptive

Bayesian MMSE-based noise reduction algorithms, in view of

the fact that they offer lower computational burden as com-

pared to the method using intersubband dependency. It is shown

that the introduction of the more accurate prior in the proposed

noise reduction technique results in a performance better than

that of the subband-adaptive and locally adaptive wavelet-based

image denoising algorithms that use standard PDFs having lim-

ited number of parameters, in terms of both the visual perception

and standard indices.

The paper is organized as follows. In Section II, the modifica-

tion of the GH expansion in the context of statistical modeling

of the wavelet coefficients of images is presented. A comparison

with some of the standard PDFs on a number of test images from

a database is also given. A Bayesian image denoising method

that exploits the proposed PDF is described in Section III. In

Section IV, the performance of the method is evaluated and

compared with some of the existing denoising methods. Finally,

Section V gives some conclusions.

II. GAUSSHERMITE EXPANSION FOR PDF OF IMAGE

WAVELET COEFFICIENTS

This section introduces a probabilistic model for the crit-ically sampled real DWT coefficients as well as for the real


3/17

RAHMANet al.: BAYESIAN WAVELET-BASED IMAGE DENOISING USING THE GAUSSHERMITE EXPANSION 1757

TABLE I

WEIGHTFUNCTIONS ANDREGIONS OFSUPPORT FOR THE SERIESEXPANSIONSOFSOME ORTHOGONAL POLYNOMIALS

and imaginary components of the DT-CWT coefficients of im-

ages. To derive the PDF of the image wavelet coefficients, we

use a series expansion so that the derived PDF enables one to

use an arbitrary number of parameters that are dependent on

higher order moments. In practice, the weight functions for the

various expansions are represented in terms of standard PDFs.

Table I lists a few orthogonal polynomials along with their cor-

responding weight functions and regions of support. Among

the various possible series expansions, we choose the one that

uses the Hermite polynomials, which are orthogonal with re-

spect to the standard Gaussian function. There are two major

reasons for such a choice. First, this expansion is valid for the

image wavelet coefficients that do not have any compact sup-

port. Second, the Gaussian weight function resembles the em-

pirical PDF, and hence there is a high likelihood that the ex-

panded PDF would have a better match with the empirical one

[61]. There are some other advantages in using the Hermite se-

ries expansion: it has a uniform mean square convergence [62],

the parameters can be expressed in terms of higher order mo-

ments in closed-form (as seen later in this section), and the poly-

nomials can be estimated recursively.

Let represent a wavelet coefficient at spatial locationof a subband in the decomposition level with a transfor-

mation matrix applied on the true image. Let represent the

vector containing all the coefficients of a subband, wherein each

coefficient is a sample of that subband. Let represent

the PDF of the random variable . This PDF may be expressed

in terms of the Hermite polynomials, which are orthogonal with

respect to a Gaussian weight function, in the form [61]

(1)

where is the standard deviation of and is the Her-mite polynomial of order that satisfies the recurrance relation

[63]

(2)

The resulting PDF will be called as the GH PDF. A closed-form

expression of the series coefficients can be obtained in terms

of the th order moment as (see Appendix A)

(3)

where denotes the largest integer contained in . The se-

ries coefficients are determined only by the even-order mo-

ments for even , and the odd-order moments for odd . Hence,

the parameter estimation of the GH PDF becomes relatively

simple and straight forward. Although the PDF of the image

coefficients described in (1) is an infinite series expression, in

practice, only thefirst few terms are used for a reasonably goodapproximation. For example, if the highest value of order in

(1) is chosen as 4, the resulting density function reduces to the

GramCharlier (GC) PDF [57], [64], [65]

(4)

where and , respectively, are the

skewness and kurtosis of . However, the approximation of the

right side of (1) by only three terms may introduce some er-rors in , and the error can be reduced by including more

number of terms in the series. In order to include a larger number

of terms in the series, higher order moments are necessary for

estimating the series coefficients . But, the deviation between

the estimated sample-moments and the true-moments is not in-

significant when the order becomes high [64]. Therefore, the

error introduced by the estimation of is not negligible when

a large number of terms of the series is included. In addition,

the decay rate of the density function is proportional

to or simply for suf-

ficiently large . The larger the order , the slower the decay of, thus making it a challenge to fit the tail part of

with that of the empirical PDF of the image coefficients. As-

suming the largest value of in (1) to be , the right side of (1)

may introduce real zeros in the expansion of , the max-

imum number of such zeros being . In essence, the PDF defined

by the truncated GH series may not be always non-negative. For

example, the symmetric (i.e., the skewness is zero) GC PDF is

non-negative only if [66].

Table II shows the average values of the magnitude of the

skewness and the kurtosis of the DWT coefficients in thefirst

three levels for some test images. It can be observed that the

skewness of the image wavelet coefficients is insignificant. It

has also been observed that other odd-order moments of interest

are also insignificant. Hence, the PDF of the image coefficients

can be treated as symmetric. As a result, all in the infinite

series of (1) for odd disappear and the truncated GH PDF for

the image wavelet coefficients can be written as

(5)

The kurtosis of the subband wavelet coefficients of images isusually more than 7 (see Table II); therefore, cannot


4/17


model these coefficients sufficiently well. In addition, each sub-

band of the image wavelet coefficients usually has a sufficiently

large number of data samples, and hence, the higher order mo-

ments would be useful for modeling. Recall that the truncated

GH PDF with a sufficiently large value of is not ensured to be

non-negative. However, the model of the subband coefficients

defined by the truncated GH PDF in (5) is an even function,and therefore, the real zeros are symmetrically situated around

the mode. Fig. 1 shows the statistical modeling of the HL1band

DWT coefficients by the empirical PDF and by the truncated GH

PDF with varying from 4 to 14 for the imagesLenaand Bar-

bara. The empirical PDF is obtained from the histogram using

a nonparametric density estimation technique, wherein the bin

width of the histogram plays a significant role. If the bin width is

too small, then the estimated PDF suffers from undersmoothing,

i.e., the density function has a small bias but a large variability.

On the other hand, if it is too large, then the estimated PDF suf-

fers from oversmoothing, i.e., the density function has a small

variability but a large bias. In order to obtain a tradeoff be-

tween the bias and variance, an appropriate choice of the binwidth is necessary. The bin width can be fixed or variable in

size. Variable width may be better, when the density function

is asymmetric and multimodal in nature. Since the PDF of the

image wavelet coefficients is unimodal and symmetric in na-

ture, afixed bin width is chosen for the histogram. The value of

this bin width is chosen to be , being the

interquartile range1 and the length of the data, since such a

choice asymptotically minimizes the mean integrated squared

error of the empirical PDF that is non-Gaussian [67]. It can be

observed from Fig. 1 that, as increases, the kurtotic behavior

of the truncated GH PDF approaches that of the empirical one,

even though it results in zeros in the tail part of the PDF. Letbe the two zeros closest to the mode for a particular value

of . The value of may be found by using any rootfinding

routine, but we calculate it as

(6)

where is a unit step function and is logicalequal

operation. It is evident from Fig. 1 that when the coefficients lie

in the interval , then t hese c oef ficients can be modelled

well by the proposed GH PDF with an appropriate choice of .

Outside this range, i.e., in the tail part, such a truncated GH PDF

introduces negative probability, which is not possible. Hence, inthe tail part of the truncated GH PDF, a modification is necessary

1The IQR is the difference between the third and first quartiles of the data.

so that the probability becomes non-negative and at the same

time the modified GH PDF fits well with the empirical PDF.

For such a modification, the use of a GH PDF is preferred to

that of using a different PDF, since the estimated parameters

and Hermite polynomials can also be used for the tail part of the

PDF. Examples of the GH PDFs with non-negative probability

are the Gaussian PDF and the symmetric GC PDF having. Since the latter has a longer tail than the former and

the PDFs of the image wavelet coefficients are known to have

long-tails, we choose the GC PDF as the modified tail. In order

to ensure that the total area under the curve of the modified GH

PDF is unity, we multiply the GC PDF by a weight parameter

. Henceforth, this density function will be referred to as the

modified GH (MGH) PDF. It can be mathematically formulated

as (7), shown at the bottom of the page, where is the

weight parameter, and . Since the

total area under the MGH PDF is unity, the parameter satisfies

(8)

After some manipulations, the value of can be obtained as (see

Appendix B)

(9)

where

(10)

and represents the error function. In summary, the MGH

PDF for a particular subband is estimated using the following

steps.1) Select as the highest-order even moment. Estimate the

moments .

if

if

(7)


5/17


Fig. 1. Probability density function of theHL1 band DWT coefficients in terms of the GH series expansion with increasing value of . Test images are (a)Lenaand (b) Barbara.

TABLE IIAVERAGEVALUES OF THE MAGNITUDE OF THESKEWNESS

AND THE

KURTOSIS

OF THEDWT COEFFICIENTS FORSOMETESTIMAGES

2) Estimate the series parameters by

using (3).

3) Select the bin width . Set the dis-

crete points , where and

, respectively, are the minimum and maximum values

of .

4) Estimate the Hermite polynomials

by using (2) for each .

5) Estimate using (5) and (6).

6) Estimate using (9) and (10).7) Obtain the MGH PDF using (7).

The proposed MGH PDF requires the estimation of ,

and , and the computational complexities of these are

, and , respectively, where is

total number of discrete points resulting from the selected bin.

Since and , the computational complexity

of the proposed MGH PDF is . Fig. 2 shows the

modeling of theHL1band DWT coefficients by the MGH PDF

in the linear-scale using different values of for theLena and

Barbara images. It can be observed from this figure that the

MGH PDF is ensured to be non-negative and for a sufficiently

large value of , the tail part of the MGH PDF matches very

well with that of the empirical PDF. Therefore, in the proposedMGH PDF the choice of is of importance, since it determines

the highest-order of the Hermite polynomials and of the mo-

ments used in the parameter estimation. In order to make a good

choice of , the MSE between the empirical and proposed MGH

PDFs is plotted as a function of the highest-order moments used

in (7) for theHL1andHL2 band DWT coefficients for some test

images, and shown in Fig. 3. It can be observed from thisfigure

that the inclusion of an increased number of higher order mo-

ments in the parameter estimation of the proposed MGH PDF

lowers the MSE, and a value of 12, 14, or 16 is a good choice

for . In the log-scale, Fig. 4 shows the modeling performanceof three PDFs, namely, the GG, the BK and the MGH that

uses for theHL1 band DWT coefficients of the Lena

and Barbara images. The closeness of the fit of the proposed

PDF with the empirical one, both in its peak and tail parts, is

obvious as compared to the others. Moreover, to quantify the

performance of the PDFs, we use, the Kolmogorov-Smirnov

distance (KSD), , and the Kull-

back-Leibler divergence (KLD), ,

as the two metrics. Table III shows the results concerning

the metrics KSD and KLD for the GG, BK, and MGH PDFs

of the first and second level image DWT coefficients. The

metrics are calculated by averaging the results obtained using96 images, each of size 512 512, from the database given in

[68]. Standard deviations of the KLD metric are also included

in the table. For the results shown here, the MGH PDF uses

the even-order moments from 2 up to 14, whereas the GG and

BK PDFs use only the second and fourth order moments in

view of their limited number of parameters. From the table, it

is evident that in most of the cases the MGH PDF, compared to

the GG and BK PDFs, has the lowest distance and divergence

from the empirical one. It should be mentioned that similar

kind of results have been obtained from the prior modeling of

the DT-CWT coefficients and are not included here for lack

of space. This high degree of matching between the empirical

and MGH PDFs is mainly due to the fact that the proposedPDF can incorporate more number of higher order moments


6/17


Fig. 2. Modified GH probability density function of the HL1 band DWT coefficients with increasing value of . Test images are (a)Lenaand (b)Barbara.

Fig. 3. Mean squared error between the empirical and the modified GH probability density functions for the DWT coefficients as a function of the highest-orderof the moments that has been used for the parameter estimation. The subbands are (a)HL1 and (b)HL2.

in its parameter estimation process as compared to the others.This feature, along with the modification proposed through the

introduction of the parameter , results in the proposed MGH

PDF to match the empirical PDF very well both in the peak and

tail segments. In order to demonstrate the effectiveness of the

proposed PDF, we next consider the case of Bayesian denoising

with the MGH prior function.

III. DENOISINGALGORITHM

In this section, we assume that the image pixels are corrupted

by AWGN with a known variance . If is unknown, it may

be estimated by applying the median-absolute-deviation method

[3] in the highest frequency subband of the noisy wavelet coeffi-cients. Since the noise is uncorrelated with the true image signal,

the wavelet coefficient of the noisy image at the spatial locationof a subband is given by

(11)

where is the wavelet coefficient of the additive noise at

the reference location. We propose an MMSE-based denoising,

wherein the Bayes-risk estimator uses the proposed MGH PDF.

In a Bayesian framework, , , and in (11) are consid-

ered as samples of the independent random variables , , and

, respectively. The signal component is modelled as the i.i.d.

MGH random variable with subband-adaptive parameters. The

random variable of the noise component is modelled as a zero-

mean Gaussian random variable with a subband-adaptive vari-ance , which depends on , , and the arithmeticoperations


7/17


Fig. 4. Modified GH probability density function in log-scale for the HL1band DWT coefficients. Hermite polynomials are up to the fourteenth order. Test imagesare (a)Lena, where

and (b)Barbara, where

.

TABLE IIIVALUES OF THE METRICSKSD AND KLD FOR THE PRIOR FUNCTION

MODELING OF THE IMAGEDWT COEFFICIENTS. T HE AVERAGEVALUES OFKLD (STANDARDDEVIATIONS INPARENTHESES) AND KSD ARE OBTAINED

FROM A96 IMAGE DATABASE

of the corresponding wavelet-domain. In this case, we assume

that is known; however, if is unknown, it may be esti-

mated using the method given in [16]. The Bayes-risk estimator

for the denoised wavelet coefficient under a quadratic cost

function that minimizes the MSE is the conditional mean given

by

(12)

where is the posterior density function conditioned

on the subband coefficients under consideration. According to

the Bayesrule, the prior function of the image wavelet

coefficients can be used to compute the posterior density func-

tion as

(13)

where is the density function of the wavelet coeffi-

cients corresponding to the noise component. It can be seen

from (12) and (13) that the MMSE-based shrinkage functionis independent of the weight parameter that has been

estimated in Section II. An analytical expression of the pro-

posed MMSE-based shrinkage function can be obtained as

in (14), shown at the bottom of the page (see Appendix C

for proof), where ,

, and is the parabolic

cylinder function [69]. It is to be noted that for ,

and . The shrinkage function

given by (14) may be directly used for estimating each of

the noisy wavelet coefficients. The computational complexity

of this function is, where is the

computational complexity of . Since the computational

load increases exponentially with , an implementation of such

(14)


8/17


a function may not be practical in view of the excessive com-

putational effort involved. Hence, we represent the shrinkage

function in terms of a linear convolution as

(15)

where is the convolution operator. Taking the computa-

tional advantage of the discrete Fourier transform (DFT), the

shrinkage function can be expressed as

(16)

where is the forward DFT and is the inverse DFT. Using

a fast implementation of the DFT, the complexity of computing

the shrinkage function can be reduced to .The function given in (16) provides the estimates of the

noisy wavelet coefficients at the discrete points , where the

probabilities of the MGH PDF are estimated. Therefore, an

interpolation function is required to estimate each of the noisy

wavelet coefficients. In our case, we choose the cubic Hermite

spline [70] as an interpolation function so that the estimated

Hermite polynomials for the MGH PDF can also be used

for interpolation. This spline function requires the estimation

of four parameters and has a computational complexity of

. Since and , the compu-

tational complexity of the shrinkage function given by (16) for

subband-adaptive denoising is . From simulationresults, we have observed that the denoising performances

using the two shrinkage functions, given by (14) and (16), are

essentially the same. Hence, we prefer to use (16) for the pro-

posed Bayesian denoising in view of its lower computational

complexity.

The proposed MMSE estimation requires that the parameters

of the proposed prior function be estimated as accurately as pos-

sible from the noisy observation. Since the parameters of the

MGH prior function are expressed in terms of , the esti-

mation of the th order moment of the signal coefficients from

the noisy observation is sufficient for such a parameter esti-

mation. The noise is independent of the signal and additive in

nature, since it is AWGN. Therefore, the th order moment of

the signal coefficients is estimated as by utilizing,

and , which are, respectively, the th order moments of the

noisy signal and noise coefficients. Since the noise is zero-mean

Gaussian distributed, the odd-order moments of the noise coef-

ficients are zero and the even-order moments can be expressed

in terms of the noise standard deviation . In general, the th

order moment of the noise coefficients is [71]

for odd

for even. (17)

Therefore, the th order moment of the signal coefficients is

evaluated as (18), shown at the bottom of the page. The param-

eter estimation of the MGH prior function is relatively simple,

since these can be substituted in (3) to obtain . On the

other hand, the parameter estimation of traditional prior func-

tions that have fewer number of parameters are often not so

simple. For example, the shape and width parameters of the GG

prior function are estimated numerically that involves iteration.

In practice, the convergence of the iterative procedure depends

on the initial choice of the parameter. Fig. 5 shows an example

of the nonlinear shrinkage function obtained from the pro-

posed denoising technique that uses three different values of

in the MGH prior function. It can be observed from thisfigure

that the amount of shrinkage can be controlled by changing .This feature of being able to vary the amount of shrinkage may

be useful, in particular, in producing a denoised image that is

visually acceptable from the users perspective.

Since the wavelet coefficients of images are locally stationary,

an improved performance of the Bayesian denoising method can

be obtained by considering the proposed prior function to be lo-

cally adaptive during the MMSE estimation. The higher order

moments of the MGH PDF, however, will not be effective for

modeling the local neighboring coefficients as compared to that

of the subband coefficients, since the error introduced by the

estimated higher order sample-moments in the former is not in-

significant as in the case of the latter. This is due to the factthat the number of data samples for the local neighboring co-

efficients is much less than that of the subband coefficients. In

our earlier work [57], it has been shown that the symmetric GC

PDF, which is a particular case of the MGH PDF having ,

performs better than the GG PDF for modeling the local neigh-

boring wavelet coefficients for images. A better quality MMSE

estimator for image denoising can, therefore, be obtained by

using the symmetric GC prior function as the prior function in

the proposed shrinkage function. It was observed from exten-

sive simulations that the difference in denoising performance in

terms of the MSEs of the MMSE estimators designed using the

MGH PDF with and is n o more than 0.25%. H ence,

in order to savethe computational effort, MGH PDF with ,

i.e., the GC PDF, is chosen as the prior function for the case of

locally adaptive denoising. In such a case, the number of pa-

rameters of the MGH prior function is the same as that of the

conventional prior functions, namely, GG and BK. The compu-

tational complexity of the locally adaptive version of the pro-

for odd

for even. (18)


9/17


Fig. 5. Shrinkage function obtained from the modified GH probability density function for the LH2 subband DWT coefficients as a function of with .The test images are (a) Lena and (b)Boat.

Fig. 6. Output PSNR as a function of the input PSNR for different subband-adaptive DWT-based Bayesian denoising algorithms using the test image Lena

( for the proposed method).

posed denoising method is , where is the

average number of discrete points in a window obtained from

the selected bin.

IV. SIMULATION RESULTS

Extensive experimentations have been carried out in order

to compare the performance of the proposed denoising method

with that of the other methods. Due to space limitation, how-

ever, we give in this section results concerning five 512 512

grayscale images, namely,Lena,Barbara,Boat,Bridge, andBa-

boon and two types of wavelet representations, namely, the deci-

mated DWT and redundant DT-CWT. Thefirst three test imagesare obtained from the same sources as mentioned in [16] and

Fig. 7. Output PSNR as a function of the input PSNR for different subband-adaptive DWT-based Bayesian denoising algorithms using the test image Bar-bara(

for the proposed method).

the last two from [68]. The DWT-based experiments use the or-

thogonal wavelet filter Symmlet-8, whereas the DT-CWT-based

ones use the -tap biorthogonal filters at level ,

and 6-tap Q-shift orthogonalfilters at , as have been pro-

posed in [72]. The proposed algorithm is applied to the 4-level

decomposition of the DWT and the 3-level decomposition of the

DT-CWT, since any further decomposition beyond these levels

does not produce a significant increase in the denoising perfor-

mance. The performance of the Bayesian denoising method is

tested using two sets of experiments. In the first set of experi-

ments, the MGH prior function is used for the subband-adaptive

denoising. In the second set of experiments, the symmetric GC

prior function, i.e., MGH prior function with , is used forlocally adaptive denoising.


10/17


Fig. 8. Output PSNR as a function of the input PSNR for different subband-adaptive DWT-based Bayesian denoising algorithms using the test image Boat

(


Fig. 9. Output PSNR as a function of the input PSNR for different subband-adaptive DWT-basedBayesian denoising algorithms usingthe testimageBridge(


A. Subband-adaptive Bayesian Denoising

Seven different wavelet-based Bayesian denoising methods,

which are subband-adaptive in nature, have been considered in

this set of experiments. These are the classical Wiener filter,

SureShrink [3], BayesShrink [20], EBayesThresh [24], Prob-

Shrink [10], BKShrink [46] and proposed method. As pointed

out in Section I, these methods enjoy a computational com-

plexity that is lower than that of the corresponding locally adap-

tive one by a factor of ; and that the computational com-

plexity of those methods that employ intersubband dependency

is even higher than those of the corresponding locally adap-

tive ones. To the best of authorsknowledge the BKShrink and

ProbShrink methods provide the best results amongst the sub-

band-adaptive denoising methods. The denoising performanceof the various methods are evaluated using two indices, namely,

Fig. 10. Output PSNR as a function of the input PSNR for different subband-

adaptive DWT-based Bayesian denoising algorithms using the test image Ba-

boon ( for the proposed method).

Fig. 11. Output PSNR of subband-adaptive Bayesian denoising algorithmsusing redundant DWT, wherein test images are corrupted with ( for the proposed method).

the peak signal-to-noise ratio (PSNR) and the structural simi-larity (SSIM) [4]. In this context, it is to be noted that the authors

in [4] have claimed that SSIM can assess the similarity between

the original and processed images better than the PSNR. De-

noising performance of the proposed method is tested by using

the two shrinkage functions, given by (14) and (16) in Sec-

tion III. The evaluation of the parabolic cylinder function for

the analytic expression in (14) is performed by the same source

code as mentioned in [46]. It is to be noted that the processing

time of (16) is very fast as compared to that of (14). For example,

in a 3 GHz and 512 MB machine, the processing time of (16)

using the MATLAB code is only 6.6 seconds for and the

time is virtually the same for values of . On the other hand, the

processing time of (14) is 1 hour and 24 minutes for andthe time increases exponentially with . It is also observed that


11/17


Fig. 12. Comparison of the denoised images obtained from the different subband-adaptive Bayesian DWT-based denoising algorithms on Lenawith :(a) original image, (b) noisy image (

dB,

), and denoised images using (c) BayesShrink method [20] (

dB,

), (d) ProbShrink method [10] ( dB, ), (e) BKShrink method [46] ( dB, ),and (f) proposed method with ( dB, ).

for a given value of , the denoising performances of both (14)

and (16) are essentially the same. Hence, we provide the results

of the proposed method that uses the shrinkage function given

by (16). From our experiments, we have observed that

provides the best performance in terms of the PSNR, whereas

provides the best performance in terms of the SSIM, for

the proposed method. These values of are consistent with thevalues of mentioned in Section II as providing the best statis-

tical modeling of the image wavelet coefficients by the MGH

PDF.

The output PSNR as a function of the input PSNR obtained

from various DWT-based denoising methods are shown in

Figs. 610, respectively, for the test images Lena, Barbara,

Boat, Bridge, and Baboon. From these figures, it can be

observed that the proposed denoising method with

that employs the DWT provides the highest output PSNR

consistently as compared to the other DWT-based methods.

According to the output PSNR values, the improvement of the

denoising performance provided by the proposed method is

substantial, when the input PSNR is low, i.e., the noise strengthis significant. Fig. 11 shows a comparison between the output

PSNR values obtained from the SureShrink [3], BayesShrink

[20], EBayesThresh [24], ProbShrink [10], and proposed

methods employing the redundant form of the DWT when the

images are corrupted with . From thisfigure, it can also

be seen that the output PSNR values of the proposed method

are consistently higher than that of the other methods. Table IV

shows the SSIM values obtained from the various DWT-baseddenoising methods for three different noise standard deviations.

It can be observed from the table that, except for two values of

in the case of the Barbaraimage, the proposed method with

provides the best SSIM compared to that provided by

the other denoising methods. It should be noted that the PSNR

values for are about the same as those for , the

difference being not more than 0.1 dB, and that these values are

at least as good as those provided by the other methods. Fig. 12

shows the original cropped image Lena, its noisy version with

, and the DWT-based denoised versions provided

by the BayesShrink method [20], ProbShrink method [10],

BKShrink method [46], and proposed method with .

From this figure, it can be observed that the proposed methodcan reduce the noise significantly with the least amount of


12/17


TABLE IV

VALUES OF THE INDEXSSIM FOR VARIOUSSUBBAND-ADAPTIVEBAYESIANDENOISINGMETHODSUSING THEDWT

artifacts (e.g., in the eye and the edge of the hat) as compared

to the other competing methods.

B. Locally Adaptive Bayesian Denoising

Six different wavelet-based denoising methods, which are lo-

cally adaptive in nature, have been considered in this set of ex-

periments. These are the BivariateShrink [58], locally adaptive

window-based MAP (LAWMAP) [54], NeighShrink [22], Prob-

Shrink [10], GSM method [16] and proposed method. In the

first method, both the inter and intrasubband dependencies are

considered. The second, third, and fourth methods inherently

use only the intrasubband dependency. In the case of the fifth,

we provide the results obtained from the intrasubband depen-

dency only in order to make a fair comparison with our pro-

posed method. The parameters of the symmetric GC prior func-

tion for the proposed denoising method is refined locally using

different sizes of square-shaped neighborhood regions. Fig. 13

shows the improvement in the PSNR (IPSNR) values2 asa func-

tion of the window size with a reference of 3 3 window, i.e.,

as a function of , for de-

noising different test imagesbeing corrupted with a noise having

. From this figure, it can be observed that as the window

size is increased from 3 3, the IPSNR value increasesfirst and

then drops slightly when the window exceeds a certain size. Thisisdue tothe fact thata sufficient number of samples are required

for improved parameter estimation of the prior function, but, if

the neighborhood region becomes relatively large, the estima-

tion process loses its local contextual information. The experi-

ments reveal that a 9 9 window provides the highest IPSNR

values in most cases. Considering this fact, we report the re-

sults provided by our proposed denoising method using a 9 9

window.

Table V shows the output PSNR values obtained from the

various locally adaptive denoising methods for the test images

considered here. From the tabular results, it can be observed that

among all the DWT-based denoising methods, the proposed one2IPSNR is the difference between output PSNR and input PSNR.

Fig. 13. IPSNR values in dB as a function of the window size relative to theIPSNR values corresponding to a 3

2

3 window.

provides the highest output PSNR values. The PSNR perfor-

mance of the proposed method improves further, if the DT-CWT

is employed. Fig. 14 shows the original cropped image Bar-

bara, its noisy version with , and the DWT-based de-

noised versions provided by four different denoising methods,

namely, the BivariateShrink method [58], ProbShrink method

[10], GSM method [16] and proposed method. From this figure

also, it can be observed that the proposed method can reduce the

noise significantly with the least amount of artifacts (e.g., in the

eyes and scarf ofBarbara) as compared to the other competing

methods.

V. CONCLUSION

The traditional probability density functions (PDFs) of the

wavelet transform coefficients of images have a limited number

of parameters, which are calculated by using only thefirst few

moments. As a result, these PDFs cannot provide a sufficiently

good statistical match for the empirical PDF of the waveletcoefficients of a true image. In this paper, a new PDF, wherein


13/17


Fig. 14. Comparison of the denoised images obtained from the different locally adaptive DWT-based denoising algorithms onBarbarawith

: (a) originalimage, (b) noisy image dB , and denoised images using the (c) BivariateShrink method [58] dB , (d) ProbShrink method [10] dB , (e) GSM method [16] dB , and (f) proposed method dB .

a more appropriate number of parameters can be incorporated,

has been proposed in order to overcome this shortfall. This PDF

uses a series expansion in terms of the Hermite polynomials,

which are orthogonal with respect to the standard Gaussian

weight function. Such an orthogonal series expansion has been

utilized, since the Gaussian density function bears a resem-

blance with the empirical PDF of the wavelet coefficients of a

true image. A closed-form expression for the parameters of thePDF (which has been referred to in this paper as the GH PDF)

has been derived in terms of the higher order sample-moments.

A modification has been proposed in the GH PDF so that an

appropriate number of parameters can be used to model the

true image wavelet coefficients while keeping the resulting

PDF to be non-negative. It has been shown that this modified

GH (MGH) PDF matches the empirical ones of the test images

used, more closely than some of the standard PDFs such as the

generalized Gaussian and Bessel K-form.

In order to demonstrate the efficacy of the proposed MGH

PDF, we have designed and tested the minimum mean squared

error-based shrinkage function for its use in an image noise re-

duction technique as a prior function of the subband as well aslocal neighboring wavelet coefficients. The parameters required

for the prior function have been estimated from the noisy obser-

vation. The simulation results using a number of standard test

images have shown that the performance of the proposed de-

noising method, both in subband-adaptive and locally adaptive

conditions, is better than that of most of the methods that use

PDFs with limited number of parameters for statistical modeling

of the wavelet coefficients.

APPENDIXA

PARAMETERS OF PDF IN TERMS OF THE

GAUSSHERMITEEXPANSION

Let be the normalized version of with the relation

and its PDF be . Since the Hermite polynomials are

orthogonal with respect to the zero-mean unit variance Gaussian

weight function, can be written as [64]

(19)


14/17


TABLE IV

OUTPUTPSNR, IN DB, WHERE IS THEERROR STANDARD

DEVIATION, FOR DIFFERENTLOCALLYADAPTIVEDENOISINGMETHODS

where are the series coefficients. Multiplying both sides of

(19) by and integrating from to , can be eval-

uated as

(20)

Using the fact that has a zero mean and unit variance, one can

obtain and . Finally, changing the variable

of to in (19) yields [61]

(21)

The Hermite polynomials can be expressed as

(22)

Using the identity and the expression of , the series coefficients in (20) can be obtained in a

generalized form

(23)

where is the th order moment of .

APPENDIXB

PARAMETER FOR THEMODIFIEDGAUSS-HERMITEPDF

Using the normalized variable , we can solve (8) as

, where

(24)

Using the identity and the

recurrance relation in (2), we obtain

(25)

Hence, for

(26)

(27)

Using the above, and , and hence, the parameter can be

expressed as

(28)


15/17


16/17


[11] R. R. Coifman and D. L. Donoho,Translation-invariant de-noising,in Wavelets and Statistics, A. Antoniadis and G. Oppenheim, Eds.Berlin, Germany: Springer-Verlag, 1995.

[12] N. G. Kingsbury, Image processing with complex wavelets, Phil.Trans. R. Soc. London A, vol. 357, no. 1760, pp. 25432560, 1999.

[13] G. Y. Chen and B. Kgl, Image denoising with complex ridgelets,Pattern Recognit., vol. 40, pp. 578585, 2007.

[14] J. L. Starck, E. J. Cands, and D. L. Donoho, The curvelet transform

for image denoising,IEEE Trans. Image Process., vol. 11, no. 6, pp.670684, Jun. 2002.[15] R. Eslami and H. Radha,Translation-invariant contourlet transform

and its application to image denoising,IEEE Trans. Image Process.,vol. 15, no. 11, pp. 33623374, Nov. 2006.

[16] J. Portilla, V. Strela, M. Wainwright, and E. P. Simoncelli, Imagedenoising using scale mixtures of Gaussians in the wavelet domain,

IEEE Trans. Image Process., vol. 12, no. 11, pp. 13381351, Nov.2003.

[17] A. A. Bharath and J. Ng,A steerable complex wavelet constructionand its application to image denoising,IEEE Trans. Image Process.,vol. 14, no. 7, pp. 948959, Jul. 2005.

[18] B. Vidakovic, Statistical Modeling by Wavelets, 1st ed. New York:Wiley, 1999.

[19] A. Antoniadis and J. Bigot,Wavelet estimators in nonparametric re-gression: A comparative simulation study,J. Statist. Softw., vol. 6, no.6, 2001.

[20] S. G. Chang, B. Yu, and M. Vetterli,Adaptive wavelet thresholdingfor image denoising and compression, IEEE Trans. Image Process.,vol. 9, no. 9, pp. 15321546, Sep. 2000.

[21] M. A. T. Figueiredo and R. D. Nowak, Wavelet-based image esti-mation: An empirical Bayes approach using Jeffreysnoninformativeprior,IEEE Trans. Image Process., vol. 10, no. 9, pp. 13221331, Sep.2001.

[22] T. Cai and B. Silverman,Incorporating information on neighboringcoefficients into wavelet estimation, Sankhya: The Indian J. Statist.,vol. 63, pp. 127148, 2001.

[23] L. Zhang and P. Bao,Denoising by spatial correlation thresholding,IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 6, pp. 535538,Jun. 2003.

[24] I. M. Johnstone and B. W. Silverman,Empirical Bayes selection ofwavelet thresholds,Ann. Statist., vol. 33, no. 4, pp. 17001752, 2005.

[25] H.-Y. Gao and A. G. Bruce,Waveshrink withfirm shrinkage,Statist.Sinica, vol. 7, no. 4, pp. 855874, 1997.

[26] H.-Y. Gao,Wavelet shrinkage denoising using the non-negative gar-rote,J. Comput. Graph. Statist., vol. 7, no. 4, pp. 469488, 1998.

[27] E. P. Simoncelli and E. Adelson,Noise removal via Bayesian waveletcoring,inProc. IEEE Int. Conf. Image Processing, Lusanne, Switzer-land, 1996, vol. 1, pp. 279382.

[28] M. I. H. Bhuiyan, M. O. Ahmad, and M. N. S.Swamy, Spatially adap-tive wavelet-based method using the Cauchy prior for denoising theSAR images, IEEE Trans. Circuits Syst. Video Technol., vol. 17, no.4, pp. 500507, Apr. 2007.

[29] L.Sendur and I. W. Selesnick, Bivariate shrinkage functions forwavelet-based denoising exploiting interscale dependency, IEEETrans. Signal Process., vol. 50, no. 11, pp. 27442756, Nov.2002.

[30] S. M. M. Rahman, M. O. Ahmad, and M. N. S. Swamy,Video de-noising based on inter-frame statistical modeling of wavelet coeffi-cients, IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 2, pp.

187198, Feb., 2007.[31] J. Huang,Statistics of Natural Images and Models, Ph.D. disserta-

tion, Brown Univ., Providence, RI, 2000.[32] A. Srivastava, A. B. Lee,E. P.Simoncelli, and S. C. Zhu, On advances

in statistical modeling of natural images,J. Math. Imag. Vis., vol. 18,pp. 1733, 2003.

[33] S. Mallat, A theory for multiresolution signal decomposition: Thewavelet representation, IEEE Trans. Pattern Anal. Machine Intell.,vol. 11, no. 7, pp. 674693, 1989.

[34] A. D. Stefano, P. R. White, and W. B. Collis, Selection of thresh-olding scheme for image noise reduction on wavelet components usingBayesian estimation,J. Math. Imag. Vis., vol. 21, pp. 225233, 2004.

[35] P. Moulin and J. Liu,Analysis of multiresolution image denoisingschemes using generalized Gaussian and complexity priors, IEEETrans. Inf. Theory, vol. 55, no. 3, pp. 909919, Apr. 1999.

[36] D. Cho and T. D. Bui,Multivariate statistical modeling for image de-noising using wavelet transforms, Signal Process.: Image Commun.,vol. 20, pp. 7789, 2005.

[37] H. A. Chipman, E. D. Kolaczyk, and R. E. McCulloch, AdaptiveBayesian wavelet shrinkage,J. Amer. Statist. Assoc., vol. 92, no. 440,pp. 14131421, 1997.

[38] F. Abramovich, T. Sapatinas, and B. W. Silverman,Wavelet thresh-olding via a Bayesian approach,J. Roy. Statist. Soc. B, vol. 60, no. 4,pp. 725749, 1998.

[39] M. Clyde and E. I. George,Flexible empirical Bayes estimation forwavelets,J. Roy. Statist. Soc. B, vol. 62, no. 4, pp. 681698, 2000.

[40] J. Portilla, V. Strela, M. Wainwright, and E. P. Simoncelli, Adap-tive Wiener denoising using a Gaussian scale mixture model,inProc.IEEE Int. Conf. Image Processing, Thessaloniki, Greece, 2001, vol. 2,pp. 3740.

[41] M. S. Crouse, R. D. Nowak, and R. G. Baraniuk,Wavelet-based sta-tistical signal processing using hidden Markov models,IEEE Trans.Signal Process., vol. 46, no. 4, pp. 886902, Apr. 1998.

[42] P. Scheunders and S. D. Backer,Wavelet denoising of multicompo-nent images using Gaussian scale mixture models and a noise-freeimage as priors, IEEE Trans. Image Process., vol. 16, no. 7, pp.18651872, Jun. 2007.

[43] L. Boubchir and J. M. Fadili,A closed-form nonparametric Bayesianestimator in the wavelet domain of images using an approximate

-stable prior,Pattern Recognit. Lett., vol. 27, pp. 13701382, 2006.[44] A. Achim, A. Bezerianos, and P. Tsakalides,Novel Bayesian multi-

scale method for speckle removal in medical ultrasound images,IEEETrans. Med. Imag., vol. 20, no. 8, pp. 772783, Aug. 2001.

[45] A. Achim, P. Tsakalides, and A. Bezerianos,SAR image denoisingvia Bayesian wavelet shrinkage basedon heavy-tailed modeling,IEEETrans. Geosci. RemoteSens., vol.41,no.8, pp. 17731784, Aug. 2003.

[46] J. M. Fadili and L. Boubchir,Analytical form for a Bayesian waveletestimator of images using the Bessel K-form densities, IEEE Trans.

Image Process., vol. 14, no. 2, pp. 231240, Feb. 2005.[47] S. Solbo and T. Eltoft, Homomorphic wavelet-based statistical de-

speckling of SAR images,IEEE Trans. Geosci. Remote Sens., vol. 42,no. 4, pp. 711721, Apr. 2004.

[48] M. I. H. Bhuiyan, M. O. Ahmad, and M. N. S. Swamy,Wavelet-baseddespeckling of ultrasound images with the symmetric normal inverse

Gaussian prior, inProc. IEEE Int. Conf. Acoustics, Speech, and SignalProcess., 2007, vol. 1, pp. 721724.

[49] R. V. Hogg and A. T. Craig, Introduction to Mathematical Statistics,5th ed. Englewood Cliffs, NJ: Prentice-Hall, 1995.

[50] M. N. Do and M. Vetterly,Wavelet-based texture retrieval using gen-eralized Gaussian density and Kullback-Leibler distance,IEEE Trans.

Image Process., vol. 11, no. 2, pp. 146158, Feb. 2002.[51] B. Aiazzi, L. Alparone, and S. Baronti,Estimation based on entropy

matching for generalized Gaussian PDF modeling, IEEE SignalProcess. Lett., vol. 6, no. 6, pp. 138140, Jun. 1999.

[52] L. P. Hansen,Large sample properties of generalized method of mo-ments estimator,Econometrica, vol. 50, no. 4, pp. 10291054, 1982.

[53] J. Liu and P. Moulin,Information-theoretic analysis of interscale andintrascale dependencies between image wavelet coefficients, IEEETrans. Image Process., vol. 10, no. 11, pp. 16471658, Nov. 2001.

[54] M. K. Mihak, I. Kozintsev, K. Ramchandran, and P. Moulin,Low-complexity image denoising based on statistical modeling of wavelet-coefficients,IEEE Signal Process. Lett., vol. 6, no. 12, pp. 300303,Dec. 1999.

[55] S. M. M. Rahman and M. K. Hasan,Wavelet-domain iterative centerweighted medianfilter for image denoising,Signal Process., vol. 83,pp. 10011012, 2003.

[56] M. Kazubek,Wavelet domain image denoising by thresholding andWiener filtering, IEEE Signal Process. Lett., vol. 10, no. 11, pp.324326, Nov. 2003.

[57] S. M. M. Rahman, M. O. Ahmad, and M. N. S. Swamy, Locallyadaptive wavelet-based imagedenoising using the GramCharlier priorfunction, in Proc. IEEE Int. Conf. Image Processing, San Antonio,TX, 2007, vol. 3, pp. 549552.

[58] L.Sendur and I. W. Selesnick,Bivariate shrinkage with local varianceestimation, IEEE Signal Process. Lett., vol. 9, no. 12, pp. 438441,Dec. 2002.

[59] A. Achim and E. E. Kuruoglu, Image denoising using bivariate -stable distributions in the complex wavelet domain,IEEE Signal Pro-cessing Lett., vol. 12, no. 1, pp. 1720, Jan. 2005.

[60] L. Zhang, P. Bao, and X. Wu,Multiscale LMMSE-based image de-noising with optimal wavelet selection, IEEE Trans. Circuits Syst.Video Technol., vol. 15, no. 4, pp. 469481, Apr. 2005.

[61] A. Zayezdny, D. Tabak, and D. Wulich, Engineering Applications ofStochastic Processes, 1st ed. New York: Wiley, 1989.


17/17


[62] G. G. Walter,Properties of Hermite series estimation of probabilitydensity,Ann. Stat., vol. 5, no. 6, pp. 12581264, 1977.

[63] M. Abramowitz and I. E. Stegun, Handbook of Mathematical Func-tions with Formulas, Graphs and Mathematical Tables, 10th ed. NewYork: Dover, 1965.

[64] M. G. Kendall and A. Stuart, The Advanced Theory of Statistics, 4thed. London, U.K.: Charles Griffin, 1977, vol. 1, Distribution Theory.

[65] E. Jondeau and M. Rockinger, GramCharlier densities, J. Econ.

Dyn. C ontrol, vol. 25, pp. 14571483, 2001.[66] D. E. Barton and K. E. Dennis, The conditions under whichGramCharlier and edgeworth curves are positive definite and uni-modal,Biometrika, vol. 39, no. 3/4, pp. 425427, 1952.

[67] J. S. Simonoff, Smoothing Methods in Statistics, 1st ed. New York:

Springer-Verlag, 1996.[68] [Online]. Available: http://decsai.ugr.es/cvg/dbimagenes/index.php[69] I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series and Prod-

ucts, A. Jeffrey, Ed., 5th ed. New York: Academic, 1994.[70] C. D. Boor, A Practical Guide to Splines, F. John, L. Sirovich,

J. P. Lasalle, and G. B. Whitham, Eds., 1st ed. New York:Springer-Verlag, 1978.

[71] A. Papoulis, Probability, Random Variables, and Stochastic Processes,3rd ed. New York: McGraw-Hill, 1991.

[72] N. G. Kingsbury,Complex wavelets for shift invariance analysis andfiltering of signals,Appl. Comput. Harmon. Anal., vol. 10, no. 3, pp.234253, 2001.

S. M. Mahbubur Rahman received B.Sc. andM.Sc.degrees in electrical and electronic engineering fromthe Bangladesh University of Engineering andTechnology (BUET), Dhaka, Bangladesh, in 1999

and 2002, respectively. He is currently pursuingthe Ph.D. degree in the Department of Electricaland Computer Engineering, Concordia University,

Montreal, QC, Canada.He was a Lecturer from 1999 to 2002 and an As-

sistant Professor from 2002 to 2003 at BUET. Hisresearch interests are in the areas of statistical signal

processing, digital image/video processing, and waveletfilters.Mr. Rahman hasreceivedseveral awardsduringhis Ph.D. program, including

the Concordia University Graduate Fellowship, the Faculty of Engineering andComputer Science Graduate Scholarship, the Carolyn and Richard Renaud

Teaching Assistantship, and the Doctoral Teaching Assistantship.

M. Omair Ahmad (S69M78SM83F01) re-ceived the B.Eng. degree from Sir George Williams

University, Montreal, QC, Canada, and the Ph.D.degree from Concordia University, Montreal, both

in electrical engineering.During 19781979, he was a member of the Fac-

ulty of the New York University College, Buffalo. InSeptember 1979, he joined the Faculty of ConcordiaUniversity, where he was an Assistant Professor ofcomputer science. Subsequently, he joined the De-partment of Electrical and Computer Engineering of

thesame university, where he wasthe Chair of thedepartment from June 2002 toMay 2005 and where he is presently a Professor and holds the Tier I Concordia

University Research Chair in Multimedia Signal Processing. He has publishedextensively in the area of signal processing and holds four patents. His current

research interests include the areas of multidimensionalfilter design, speech,image and video processing, nonlinear signal processing, communication DSP,artificial neural networks, and VLSI circuits for signal processing. He was afounding member of Micronet from its inception in 1990 as a Canadian Net-work of Centers of Excellence until its expiration in 2004.

Dr. Ahmad was anAssociate Editor of the IEEET RANSACTIONS ON CIRCUITSANDSYSTEMSPARTI: FUNDAMENTALTHEORY ANDAPPLICATIONS from June1999 to December 2001. He was the Local Arrangements Chairman of the 1984IEEE International Symposium on Circuits and Systems. During 1988, he wasa member of the Admission and Advancement Committee of the IEEE. He hasalso served as the Program Co-Chair for the 1995 IEEE International Con-ference on Neural Networks and Signal Processing and the 2004 IEEE Inter-

national Midwest Symposium on Circuits and Systems. Presently, he is theChairman of the IEEE Circuits and Systems Chapter (Montreal Section). He

is a recipient of the Wighton Fellowship from the Sandford Fleming Founda-tion.

M. N. S. Swamy (S59M62SM74F80) re-ceived the B.Sc. (Hons.) degree in mathematics fromMysore University, India, in 1954, the Diploma inElectrical Communication Engineering from theIndian Institute of Science, Bangalore, in 1957, theM.Sc. and Ph.D. degrees in electrical engineeringfrom the University of Saskatchewan, Saskatoon,SK, Canada, in 1960 and 1963, respectively, and the

Doctor of Science degree in engineering (HonorisCausa) from Ansted University In recognitionof his exemplary contributions to the research in

Electrical and Computer Engineering and to Engineering Education, as wellas his dedication to the promotion of Signal Processing and CommunicationsApplications in August 2001. As a recognition of his research contributions,as well as his contributions to Concordia University, the University has recentlyestablished the M. N. S. Swamy Chair in Electrical Engineering.

He is presently a Research Professor and the Director of the Center for SignalProcessing and Communications, Department of Electrical and Computer En-gineering, Concordia University, Montreal, QC, Canada, where he served asthe Chair of the Department of Electrical Engineering from 1970 to 1977 andDean of Engineering and Computer Science from 1977 to 1993. Since July2001, he has held the Concordia Tier 1 Research Chair in Signal Processing. Hehas also taught in the Electrical Engineering Department, Technical Universityof Nova Scotia, Halifax, NS, Canada, and the University of Calgary, Calgary,AB, Canada, as well as in the Department of Mathematics at the University

of Saskatchewan. He has published extensively in the areas of number theory,circuits, systems and signal processing, and holds five patents. He is the co-au-thor of two book chapters and four books: Graphs, Networks and Algorithms

(Wiley, 1981),Graphs: Theory and Algorithms(Wiley, 1992),Switched Capac-itor Filters: Theory, Analysis and Design (Prentice Hall International U.K. Ltd.,1995), andNeural Networks in a Softcomputing Framework(Springer, 2006). ARussian translation of the first book was published by Mir Publishers, Moscow,Russia, in 1984, while a Chinese version was published by the Education Press,Beijing, China, in 1987. He was a founding member of Micronet from its incep-tion in 1990 as a Canadian Network of Centers of Excellence until its expirationin 2004, as well as its Coordinator for Concordia University.

Dr. Swamy is a Fellow of the Institute of Electrical Engineers (U.K.), the En-gineering Institute of Canada, the Institution of Engineers (India), and the Insti-tution of Electronic and Telecommunication Engineers (India). He has servedthe IEEE in various capacities such as the President-Elect in 2003, President in2004, Past-President in 2005, Vice President (Publications) during 20012002,VicePresident in 1976, Editor-in-Chief of theT RANSACTIONS ON CIRCUITS AND

SYSTEMSPART I: FUNDAMENTAL THEORY AND APPLICATIONS fromJune 1999to December 2001, Associate Editor of the TRANSACTIONS ON CIRCUITS AND

SYSTEMSfrom June 1985May 1987, Program Chair for the 1973 IEEE CASSymposium, General Chair for the 1984 IEEE CAS Symposium, Vice-Chair forthe 1999 IEEE CAS Symposium, and a member of the Board of Governors ofthe CAS Society. He is the recipient of many IEEE-CAS Society awards in-cluding the Education Award in 2000, Golden Jubilee Medal in 2000, and the1986 Guillemin-Cauer Best Paper Award.

14-Bayesian Wavelet-Based Image Denoising Using the Gauss–Hermite Expansion.pdf

Documents

Transcript of 14-Bayesian Wavelet-Based Image Denoising Using the Gauss–Hermite Expansion.pdf