A NEW APPROACH FOR DENOISING IMAGES

BASED ON WEIGHTS OPTIMIZATION

Qiyu Jin

1,2

, Ion Grama

1,2

and Quansheng Liu

1,2

UMR 6205, Laboratoire de Mathmatiques de Bretagne Atlantique, Universit´e de Bretagne-Sud,

Campus de Tohanic, BP 573, 56017 Vannes, France

Universit´e Europ´eenne de Bretagne, Rennes, France

Keywords:

Non-local Means, Image Denoising, Optimization of Weights, Oracle, Statistical Estimation.

Abstract:

We propose a new algorithm to restore an image contaminated by the Gaussian white noise. Our approach

is based on the weighted average of the observations in a neighborhood as in the case of the Non-Local

Means Filter. But in contrast to the Non-Local Means Filter, we choose the weights by minimizing a tight

upper bound of the Mean Square Error. Our theoretical results show that some ”oracle” weights deﬁned by

a triangular kernel are optimal. To construct a computable ﬁlter the ”oracle” weights are replaced by some

estimates. The implementation of the proposed algorithm is straightforward. The simulations show that our

approach is very competitive.

1 INTRODUCTION

We deal with the additive Gaussian noise model:

Y(x) = f(x) + ε(x), x ∈ I, (1)

where I is the uniform N×N grid of pixels on the unit

square, Y = (Y (x))

x∈I

is the observed image bright-

ness, f : [0,1]

→R

is an unknown target regression

function and ε = (ε(x))

x∈I

are independent and iden-

tically distributed (i.i.d.) Gaussian random variables

with mean 0 and standard deviation σ > 0.

Important denoising techniques for the model (1)

have been developed in recent years. A very sig-

niﬁcant step in these developments was the intro-

duction of the Non-Local Means Filter by (Buades

et al., 2005). For closely related works, see for ex-

ample (Polzehl and Spokoiny, 2006; Kervrann and

Boulanger, 2008; Buades et al., 2009; Katkovnik

et al., 2010; Lou et al., 2010).

The basic idea of the ﬁlters by weighted means is

to estimate the unknown image f(x

) by a weighted

average of observationsY(x) of the form

) =

∑

x∈U

w(x)Y(x), (2)

where for each x

and h > 0, U

denotes a square

window with center x

and width 2h, w(x) are some

non-negative weights satisfying

∑

x∈U

w(x) = 1.

The choice of the weights w(x) are usually based on

two criteria: a spatial criterion so that w(x) is a de-

creasing function of the distance between x and x

and a similarity criterion so that w(x) is also a de-

creasing function of the brightness difference |Y(x)−

Y(x

)| (see e.g. (Yaroslavsky,1985; Tomasi and Man-

duchi, 1998)), which measures the similarity between

the pixels x and x

. In the Non-Local Means Filter,

h > 0 can be chosen relatively large, and the weights

w(x) are calculated according to the similarity be-

tween data patches Y

x,η

= (Y(y) : y ∈ U

x,η

) (identi-

ﬁed as a vector whose composants are ordered lexico-

graphically) and Y

,η

= (Y(y) : y ∈ U

,η

), instead of

the similarity between just the pixels x and x

. Here

η > 0 is the size parameter of data patches.

In this paper we address the problem of choosing

the weights w in (2) in some optimal way. Generally,

the weights w are deﬁned through some priory ﬁxed

kernels, often the Gaussian one. The important prob-

lem of the choice of the kernel has not been addressed

so far. Although the choice of the Gaussian kernel

yields good numerical performance, there is no par-

ticular reason to restrict ourselves to this kernel. Our

theoretical results and simulations show that another

kernel is preferred; this kernel leads to us an improved

Non-Local Means Filter which also has the advantage

that it is parameter free in the sense that it automati-

cally calculates the bandwidth of the smoothing ker-

nel.

Our main idea is to produce a very tight upper

112

Jin Q., Grama I. and Liu Q..

A NEW APPROACH FOR DENOISING IMAGES BASED ON WEIGHTS OPTIMIZATION.

DOI: 10.5220/0003846001120117

In Proceedings of the International Conference on Computer Vision Theory and Applications (VISAPP-2012), pages 112-117

ISBN: 978-989-8565-03-7

 2012 SCITEPRESS (Science and Technology Publications, Lda.)

bound of the Mean Square Error



)



= E



) − f(x

)



in terms of the bias and variance and to minimize

this upper bound in w under the constraints w ≥ 0

and

∑

x∈U

w(x) = 1. We ﬁrst obtain an explicit for-

mula for the optimal weights w

∗

in terms of the un-

known function f. In order to get a computable ﬁlter,

we estimate w

∗

by some adaptive weights bw based on

data patches from the observed image Y. We thus ob-

tain a new ﬁlter, which we call Optimal Weights Fil-

ter. Numerical results show that the new ﬁlter outper-

forms the typical Non-Local Means Filter, thus giving

a practical justiﬁcation that the optimal choice of the

kernel improves the denoising quality.

We would like to point out that related optimiza-

tion problems for non parametric signal and density

recovering have been proposed earlier in (Sacks and

Ylvisaker, 1978; Nazin et al., 2008). In these papers

the weights are optimized over a given class of regular

functions and thus depend only on some parameters

of the class. The novelty of our work is to deal with

optimal weights depending on the image f at hand.

Results of this type are related to the ”oracle” concept

developed in (Donoho and Johnstone, 1994).

2 OPTIMAL WEIGHTS FILTER

In this section, we present our new ﬁlter called Op-

timal Weights Filter, and explain the idea behind its

construction.

We begin with some mathematical notations that

will be used throughout the paper. For a vector

x = (x

,...,x

) ∈ R

, we denote by kxk



∑

i=1



its Euclidean norm and by kxk

∞

= max

1≤i≤d

| its

supremum norm. The cardinality of a set A is denoted

by cardA. For a positive integer N the uniform N ×N

grid on the unit square is deﬁned by

I =



,··· ,

N −1



Each element x of the grid I will be called pixel. The

number of pixels is n = N

. For any pixel x

∈ I and

a given h > 0, the square window of pixels U

{x ∈ I : kx −x

∞

≤ h} will be called search window

at x

. We naturally take h as a multiple of

(h =

for some k ∈ {1,2,··· ,N}). The size of the square

search window U

is the positive integer number

M = (2Nh+ 1)

= card U

. (3)

For any pixel x ∈ U

and a given η > 0 a second

square window of pixels U

x,η

will be called patch at

x. Like h, the parameter η is also taken as a multiple

. The size of the patch U

x,η

is the positive integer

m = (2Nη+ 1)

= card U

,η

. (4)

The vector Y

x,η

= (Y (y))

y∈U

x,η

formed by the val-

ues of the observed noisy image in the patch U

x,η

the lexicographical order will be called data patch (or

similarity patch) at x ∈U

. Finally, the positive part

of a real number a is denoted by a

: a

= a if a ≥ 0

and a

= 0 if a < 0.

Let h > 0 be ﬁxed. For any pixel x

∈I consider a

family of weighted estimates

h,w

) of the form

h,w

) =

∑

x∈U

w(x)Y(x), (5)

where the unknown weights satisfy

w(x) ≥ 0 and

∑

x∈U

w(x) = 1. (6)

The usual bias plus variance decomposition of the

Mean Square Error gives



h,w

) − f(x

)



= Bias

+Var, (7)

with

Bias





∑

x∈U

w(x)( f (x) − f(x

))





and

Var = σ

∑

x∈U

w(x)

The decomposition (7) is commonly used to construct

asymptotically minimax estimators over some given

classes of functions in the nonparametric function es-

timation. With our approach the bias term Bias

will

be bounded in terms of the unknown function f it-

self. As a result we obtain some ”oracle” weights

w adapted to the unknown function f at hand, which

will be estimated further using data patches of the im-

age Y.

First, we address the problem of determining the

”oracle” weights. With this aim denote

f,x

(x) ≡ |f(x) − f(x

)|. (8)

Note that the value ρ

f,x

(x) characterizes the varia-

tion of the image brightness of the pixel x with respect

to the pixel x

. From the decomposition (7), we easily

obtain a tight upper bound in terms of ρ

f,x



) − f(x

)



≤ g

f,x

(w), (9)

A NEW APPROACH FOR DENOISING IMAGES BASED ON WEIGHTS OPTIMIZATION

113

where

f,x

(w) =





∑

x∈U

w(x)ρ

f,x

(x)





+σ

∑

x∈U

w(x)

(10)

From the following theorem we can obtain the

form of the weights w which minimize the func-

tion g

f,x

(w) under the constraints (6) in terms of

f,x

(x). Introduce the strictly increasing function

f,x

(t) =

∑

x∈U

f,x

(x)(t −ρ

f,x

(x))

, t ≥ 0.

Let K

be the usual triangular kernel:

(t) = (1−|t|)

, t ∈ R

. (11)

Theorem 1. Assume that ρ

f,x

(x), x ∈U

, is a non-

negative function. Then the unique weights which

minimize g

f,x

(w) subject to (6) are given by

f,x

(x) =

(

f,x

(x)

)

∑

y∈U

(

f,x

(x)

)

, x ∈ U

, (12)

where the bandwidth a > 0 is the unique solution in

(0,∞) of the equation

f,x

(a) = σ

. (13)

Remark 1. The value of a > 0 can be calculated as

follows. We sort the set {ρ

f,x

(x)|x ∈ U

} in the

ascending order 0 = ρ

≤ ρ

≤ ··· ≤ ρ

< ρ

M+1

+∞, where M = CardU

. Let

∑

i=1

∑

i=1

, 1 ≤k ≤ M, (14)

and

∗

= max{1 ≤ k ≤ M|a

≥ ρ

}

= min{1 ≤ k ≤ M|a

< ρ

}−1, (15)

with the convention that a

= ∞ if ρ

= 0 and that

min∅ = M + 1. Then the solution a > 0 of (13) can

be expressed as a = a

∗

; moreover, k

∗

is the unique

integer k ∈ {1, ··· ,M} such that a

≥ ρ

and a

k+1

if k < M.

Let x

∈ I. Using the optimal weights given by

Theorem 1, we ﬁrst introduce the following non com-

putable approximation of the true image, called ”ora-

cle”:

∗

) =

∑

x∈U

(

f,x

(x)

)Y(x)

∑

y∈U

(

f,x

(x)

)

, (16)

where the bandwidth a is the solution of the equation

f,x

(a) = σ

. A computable ﬁlter can be obtained

by estimating the unknown function ρ

f,x

(x) and the

bandwidth a from data pathes.

Let h > 0 and η > 0 be ﬁxed numbers. For

any x

∈ I and any x ∈ U

consider the distance

between the data patches Y

x,η

= (Y (y))

y∈U

x,η

and

,η

= (Y (y))

y∈U

,η

deﬁned by



x,η

,η





x,η

−Y

,η



where m = card U

x,η

, and kY

x,η

−Y

,η

∑

kzk

∞

≤h

(Y(x+ z) −Y(x

+ z))

which measures the similar-

ity between the data patches Y

x,η

and Y

,η

. Our

simulations show that a convenient approximation of

f,x

(x) is given by

(x) =



d(Y

x,η

,η

) −

√

2σ



. (17)

A theoretical justiﬁcation for this choice is given in a

convergence theorem that is not presented here.

Thus our Optimal Weights Filter is deﬁned by

f(x

) =

h,η

) =

∑

x∈U

(

(x)

)Y(x)

∑

y∈U

(

(x)

)

, (18)

where the bandwidth ba > 0 is the solution of the equa-

Algorithm 1: Optimal weights ﬁlter.

Repeat for each x

∈ I:

give an initial value of ba: ba = 1 (it can be an

arbitrary positive number).

compute {

(x)|x ∈ U

} by (17)

compute the bandwidth ba at x

reorder {

(x)|x ∈ U

} as increasing se-

quence, say

) ≤

) ≤ ··· ≤

)

loop from k = 1 to M

∑

i=1

) > 0

∑

i=1

)

∑

i=1

)

≥

ρ(x

)

then ba =

∑

i=1

)

∑

i=1

)

else quit loop

else continue loop

end loop

/compute the estimated weights bw at x

compute bw(x

) =

(1−

)/ba)

∑

∈U

(1−

)/ba)

/compute the ﬁlter

f at x

compute

f(x

) =

∑

∈U

bw(x

)Y(x

VISAPP 2012 - International Conference on Computer Vision Theory and Applications

114

tion M

(ba) = σ

, which can be calculated as in Re-

mark 1 with ρ

f,x

(x) and a replaced by

(x) and ba

respectively. We end this section by giving an algo-

rithm for computing the ﬁlter (18). The input values

of the algorithm are the image Y (x), x ∈ I , the stan-

dard derivation σ of the Gaussian noise and two num-

bers m and M representing the sizes of data patches

and search windows respectively (cf. (3) and (4)).

To avoid the undesirable border effects in simula-

tions, we mirror the image outside the image limits,

that is, we extend the image outside the image lim-

its symmetrically with respect to the border. At the

corners, the image is extended symmetrically with re-

spect to the corner pixels.

The implementation of the proposed algorithm is

straightforward. Notice that an important issue in the

Non-Local Means Filter is the choice of the band-

width parameter in the Gaussian kernel; our algorithm

has the advantage that it automatically calculates the

bandwidth.

A detailed analysis of the performance of our ﬁl-

ter is given in Section 3 where the numerical simu-

lations show that our ﬁlter outperforms the classical

Non-Local Means Filter.

0 5 10 15 20 25 30 35 40 45

31.2

31.4

31.6

31.8

32.2

32.4

32.6

32.8

The width of the similar patch

PSNR

σ = 20

Figure 1: The evolution of PSNR value as a function of the

size of data patches.

3 SIMULATIONS

In this section we show the numerical performance of

the Optimal Weights Filter by simulation results.

The performance of the Optimal Weights Filter

h,η

) is measured by the usual Peak Signal-to-

Noise Ratio (PSNR) in decibels (db) deﬁned as

PSNR = 10log

255

MSE

MSE =

cardI

∑

x∈I

( f(x) −

h,η

(x))

where f is the original image, and

h,η

the estimated

one.

(a) Original image ”Lena”. (b) Noisy image with σ = 20

PSNR = 22.11db.

PSNR= 32.68db.

(e) Restored with NLMF. (f) Square error with NLMF

PSNR= 31.51db.

Figure 2: Results of denoising ”Lena” 512 ×512 image.

Comparing (d) and (f) we see that the Optimal Weights Fil-

ter (OWF) captures more details than the Non-Local Means

Filter (NLMF).

In the simulations, we sometimes use the

smoothed version of the estimate of brightness varia-

tion d

x,η

,η

) instead of the non smoothed one

d (Y

x,η

,η

), deﬁned by

x,η

,η

) =

kK(y) ·(Y

x,η

−Y

,η

∑

′

∈U

,η

K(y

′

)

where K(y) are some weights deﬁned on U

,η

The corresponding estimate of brightness variation

f,x

(x) is given by

K,x

(x) =



x,η

,η

) −

√

2σ



. (19)

With the rectangular kernel

(y) =



1, y ∈ U

,η

0, otherwise,

(20)

A NEW APPROACH FOR DENOISING IMAGES BASED ON WEIGHTS OPTIMIZATION

115

Table 1: Performance of denoising algorithms when applied to test noisy (WGN) images.

Images Lena Barbara Boat House Peppers

Sizes 512×512 512×512 512×512 256×256 256×256

σ Method PSNR PSNR PSNR PSNR PSNR

Our method(M = 13×13,m = 27×27) 33.93db 32.31db 31.64db 34.09db 31.93db

(Buades et al., 2005) 32.72db 31.67db 30.39db 33.82db 30.97db

15 (Foi et al., 2004) 32.72db 29.61db 30.93db 33.18db 31.78db

(Roth and Black, 2009) 33.29db 30.16db 31.27db 33.55db 32.06db

(Hirakawa and Parks, 2006) 33.97db 32.55db 31.59db 33.82db 31.61db

(Kervrann and Boulanger, 2008) 33.70db 31.80db 31.44db 34.08db 32.13db

(Hammond and Simoncelli, 2008) 34.04db 32.25db 31.72db 33.72db 31.82db

(Aharon et al., 2006) 33.71db 32.41db 31.77db 34.25db 32.20db

(Dabov et al., 2007) 34.27db 33.00db 32.14db 34.94db 32.70db

Our method(M = 13×13,m = 27×27) 32.68db 31.04db 30.30db 32.83db 30.61db

(Buades et al., 2005) 31.51db 30.38db 29.32db 32.51db 29.73db

20 (Foi et al., 2004) 31.43db 27.90db 39.61db 31.84db 30.30db

(Roth and Black, 2009) 31.89db 28.28db 29.86db 32.29db 30.47db

(Hirakawa and Parks, 2006) 32.69db 31.06db 30.25db 32.58db 30.21db

(Kervrann and Boulanger, 2008) 32.64db 30.37db 30.12db 32.90db 30.59db

(Hammond and Simoncelli, 2008) 32.81db 30.76db 30.41db 32.52db 30.40db

(Aharon et al., 2006) 32.39db 30.84db 30.39db 33.10db 30.80db

(Dabov et al., 2007) 33.05db 31.78db 30.88db 33.77db 31.29db

Our method(M = 13×13,m = 27×27) 31.59db 29.92db 29.16db 31.95db 29.40db

(Buades et al., 2005) 30.36db 29.19db 28.38db 31.16db 28.60db

25 (Foi et al., 2004) 30.43db 26.62db 28.60db 30.75db 29.16db

(Roth and Black, 2009) 30.57db 26.84db 28.57db 31.05db 29.17db

(Hirakawa and Parks, 2006) 31.69db 29.89db 29.21db 31.60db 29.06db

(Kervrann and Boulanger, 2008) 31.73db 29.24db 29.20db 32.22db 29.73db

(Hammond and Simoncelli, 2008) 31.83db 29.58db 29.40db 31.54db 29.29db

(Aharon et al., 2006) 31.36db 29.58db 29.32db 32.07db 29.67db

(Dabov et al., 2007) 32.08db 30.72db 29.91db 32.86db 30.16db

we obtain exactly the distance d (Y

x,η

,η

) and the

ﬁlter described in Section 2. Other smoothing kernels

K(y) used in the simulations are the Gaussian kernel

(y) = exp



−

ky−x



, (21)

where h

is the bandwidth parameter, and the follow-

ing kernel: for y ∈ U

,η

(y) =

Nη

∑

k=max(1, j)

(2k+ 1)

(22)

if ky−x

∞

for some j ∈ {0, 1,··· ,Nη}.

The best numerical results are obtained using

K(y) = K

(y) in the deﬁnition of

K,x

. The values

m = 27×27 and M = 13×13 are appropriate in most

cases and a smaller data patch size m can be consid-

ered for processing piecewise smooth images. The

comparison with several ﬁlters is given in Table 1.

The PSNR values show that our approach is as good

as more sophisticated methods, like (Hirakawa and

Parks, 2006; Kervrann and Boulanger, 2008; Ham-

mond and Simoncelli, 2008; Aharon et al., 2006), and

is better than the ﬁlters proposed in (Foi et al., 2004;

Roth and Black, 2009). Furthermore, our method is

as simple as the Non-Local Means Filter and, with

K(y) = K

(y), has only two parameters M and m

which are the sizes of data patches and search win-

dows. The proposed approach gives a denoising qual-

ity which is competitivewith that of the recent method

BM3D (Dabov et al., 2007).

The behavior of the PSNR in function of the size

m of data patches is displayed in Figure 1 for ”Lena”

image. We ﬁx M = 13×13. For σ = 20, Figure 1 il-

lustrates that the PSNR value increases as m varies be-

tween 3×3 and 41×41 (for which PSNR= 32.71db),

and that it just changes slightly when m is sufﬁciently

large (e.g. PSNR= 32.68db when m = 27 ×27).

In our experimental results (cf. Table 1) we prefer

m = 27 ×27 as the choice m = 41 ×41 is computa-

tionally expensive.

The potential of the estimation method is illus-

trated with the 512×512 image ”Lena” (Figure 2(a))

corrupted by an additivewhite Gaussian noise (Figure

2(b), PSNR = 22.10db, σ = 20). We used the kernel

(y) for computing the estimated brightness varia-

tion function

K,x

, which corresponds to the Opti-

mal Weights Filter as deﬁned in Section 2. In Figure

2(c), we can see that the noise is reduced in a natural

manner and signiﬁcant geometric features, ﬁne tex-

VISAPP 2012 - International Conference on Computer Vision Theory and Applications

116

tures, and original contrasts are visually well recov-

ered with no undesirable artifacts (PSNR= 32.68db

for ”Lena”). To better appreciate the accuracy of the

restoration process, the square of the difference be-

tween the original image and the recovered image is

shown in Figure 2(d), where the dark values corre-

spond to a high-conﬁdence estimate. As expected,

pixels with a low level of conﬁdence are located in

the neighborhood of image discontinuities. For com-

parison, we show the image denoised by Non-Local

Means Filter in Figures 2(e),(f). The overall visual

impression and the numerical results are improvedus-

ing our algorithm.

The Optimal Weights Filter seems to provide a

feasible and rational method to detect automatically

the details of images and take the proper weights for

every possible geometric conﬁguration of the image.

The distribution of the weights inside the search win-

dow U

depends on the estimated brightness vari-

ation function

K,x

(x), x ∈ U

. If the estimated

brightness variation

K,x

(x) is less than ba (see Theo-

rem 1), the similarity between patches is measured by

a linear decreasing function of

K,x

(x); otherwise it

is zero. Thus ba acts as an automatic threshold.

4 CONCLUSIONS

We have proposed a new ﬁlter to remove Gaus-

sian noise, based on optimization of weights in the

weighted means approach. Our analysis shows that

a triangular kernel is preferred rather then the Gaus-

sian kernel. The proposed ﬁlter improves the usual

Non-Local Means Filter both numerically and visu-

ally in denoising performance; it also has the advan-

tage to be adaptive in the sense that it calculates au-

tomatically the good bandwidth of the triangular ker-

nel (while in the Non-Local Means Filter the choice

of the bandwidth parameter in the Gaussian kernel is

delicate). We hope that the optimal weights that we

deduced can also bring similar improvements for re-

cently developed algorithms where the basic idea of

the Non-Local means ﬁlter is used.

ACKNOWLEDGEMENTS

The authors are grateful to the reviewers for helpful

comments and remarks. The work has been partially

supported by the National Natural Science Founda-

tion of China, Grant No. 11101039 and Grant No.

11171044.

REFERENCES

Aharon, M., Elad, M., and Bruckstein, A. (2006). rmk-

svd: An algorithm for designing overcomplete dictio-

naries for sparse representation. IEEE Trans. Signal

Process., 54(11):4311–4322.

Buades, A., Coll, B., Morel, J., et al. (2005). A re-

view of image denoising algorithms, with a new one.

SIAM Journal on Multiscale Modeling and Simula-

tion, 4(2):490–530.

Buades, T., Lou, Y., Morel, J., and Tang, Z. (2009). A note

on multi-image denoising. In Int. workshop on Local

and Non-Local Approximation in Image Processing,

pages 1–15.

Dabov, K., Foi, A., Katkovnik, V., and Egiazarian, K.

(2007). Image denoising by sparse 3-D transform-

domain collaborative ﬁltering. IEEE Trans. Image

Process., 16(8):2080–2095.

Donoho, D. and Johnstone, J. (1994). Ideal spatial adapta-

tion by wavelet shrinkage. Biometrika, 81(3):425.

Foi, A., Katkovnik, V., Egiazarian, K., and Astola, J.

(2004). A novel anisotropic local polynomial estima-

tor based on directional multiscale optimizations . In

Proc. 6th IMA int. conf. math. in signal processing,

pages 79–82.

Hammond, D. and Simoncelli, E. (2008). Image modeling

and denoising with orientation-adapted gaussian scale

mixtures. IEEE Trans. Image Process., 17(11):2089–

2101.

Hirakawa, K. and Parks, T. (2006). Image denoising us-

ing total least squares. IEEE Trans. Image Process.,

15(9):2730–2742.

Katkovnik, V., Foi, A., Egiazarian, K., and Astola, J.

(2010). From local kernel to nonlocal multiple-model

image denoising. Int. J. Comput. Vis., 86(1):1–32.

Kervrann, C. and Boulanger, J. (2008). Local adaptivity

to variable smoothness for exemplar-based image reg-

ularization and representation. Int. J. Comput. Vis.,

79(1):45–69.

Lou, Y., Zhang, X., Osher, S., and Bertozzi, A. (2010). Im-

age recovery via nonlocal operators. J. Sci. Comput.,

42(2):185–197.

Nazin, A., Roll, J., Ljung, L., and Grama, I. (2008). Direct

weight optimization in statistical estimation and sys-

tem identiﬁcation. System Identiﬁcation and Control

Problems (SICPRO08), Moscow.

Polzehl, J. and Spokoiny, V. (2006). Propagation-separation

approach for local likelihood estimation. Probab. The-

ory Rel. Fields, 135(3):335–362.

Roth, S. and Black, M. (2009). Fields of experts. Int. J.

Comput. Vision, 82(2):205–229.

Sacks, J. and Ylvisaker, D. (1978). Linear estimation for

approximately linear models. Ann. Stat., 6(5):1122–

1137.

Tomasi, C. and Manduchi, R. (1998). Bilateral ﬁltering for

gray and color images. In Proc. Int. Conf. Computer

Vision, pages 839–846.

Yaroslavsky, L. P. (1985). Digital picture processing. An

introduction. In Springer-Verlag, Berlin.

A NEW APPROACH FOR DENOISING IMAGES BASED ON WEIGHTS OPTIMIZATION

117