ArticlePDF Available

Bayesian Image Denoising with Multiple Noisy Images

July 2019
The Review of Socionetwork Strategies 13(37)

July 2019
13(37)

DOI:10.1007/s12626-019-00043-3

License
CC BY 4.0

Authors:

Muneki Yasuda

Yamagata University

In this paper, we propose a fast image denoising method based on discrete Markov random fields and the fast Fourier transform. The purpose of the image denoising is to infer the original noiseless image from a noise corrupted image. We consider the case where several noisy images are available for inferring the original image and the Bayesian approach is adopted to create the posterior probability distribution of the denoised image. In the proposed method, the estimation of the denoised image is achieved using belief propagation and an expectation–maximization algorithm. We numerically verified the performance of the proposed method using several standard images.

Average computation time and peak signal-to-noise ratio over 10 trials versus various image sizes ( 128 × 128 , 256 × 256 , and 512 × 512 ) for K = 1 and K = 5 . The noise level applied in these experiments was = 15 . a Fig. 3a. b Fig. 3b. c Fig. 3c. d Fig. 3d

…

Average peak signal-to-noise ratio over 10 trials against K for = 15 and = 30 . a Fig. 3a. b Fig. 3b. c Fig. 3c. d Fig. 3d

…

Examples of the image denoising results for Fig. 3a. a Example of a noisy image when = 30 (peak signal-to-noise ratio (PSNR):18.61). b Denoised image obtained by the proposed method for K = 1 (PSNR:25.49). c Denoised image obtained by the proposed method for K = 5 (PSNR:30.33)

…

Average computation time over 10 trials with = 15 , K = 5 , and image size 128 × 128

…

Figures - available via license: Creative Commons Attribution 4.0 International

Content may be subject to copyright.

Available via license: CC BY 4.0

Content may be subject to copyright.

Vol.:(0123456789)

The Review of Socionetwork Strategies

https://doi.org/10.1007/s12626-019-00043-3

1 3

ARTICLE

Bayesian Image Denoising withMultiple Noisy Images

ShunKataoka1 · MunekiYasuda2

Received: 3 February 2019 / Accepted: 20 June 2019

Abstract

In this paper, we propose a fast image denoising method based on discrete Markov

random ﬁelds and the fast Fourier transform. The purpose of the image denoising is

to infer the original noiseless image from a noise corrupted image. We consider the

case where several noisy images are available for inferring the original image and

the Bayesian approach is adopted to create the posterior probability distribution of

the denoised image. In the proposed method, the estimation of the denoised image is

achieved using belief propagation and an expectation–maximization algorithm. We

numerically veriﬁed the performance of the proposed method using several standard

images.

Keywords Image denoising· Discrete Markov random ﬁeld· Belief propagation·

EM algorithm· FFT

1 Introduction

Bayesian image processing based on Markov random ﬁelds (MRFs) is an important

framework in the ﬁeld of image processing [1, 2]. An MRF is a undirected graph

representation of probability distribution, and many applications of MRFs exist

in the image processing and computer vision ﬁelds [3–5]. MRFs have also been

applied to other research ﬁelds, including traﬃc engineering [6, 7] and earth science

[8, 9]. In Bayesian image processing, the objective image can be inferred based on

the posterior probability distribution.

Recently, we proposed a fast image denoising method for the case where multiple

noisy images are available for inferring the original noiseless image that is based on

* Shun Kataoka

xskataoka@res.otaru‑uc.ac.jp

Muneki Yasuda

muneki@yz.yamagata‑u.ac.jp

1 Faculty ofCommerce, Otaru University ofCommerce, Otaru‑shi047‑8571, Japan

2 Graduate School ofScience andEngineering, Yamagata University, Yonezawa‑shi992‑8510,

Japan

The Review of Socionetwork Strategies

1 3

Gaussian MRFs [10]. However, in the study in [10], for ease of mathematical treat‑

ment, we made an unnatural assumption that pixel values are continuous. In general,

a pixel takes a discrete value from 0 to 255, and an additional framework is required

to treat pixel values as discrete instead of continuous values. Therefore, in this paper,

we focus on the Bayesian image denoising problem of inferring the original noiseless

image from multiple noisy images when the pixel values are treated as discrete values.

We created a probability model for image denoising based on the discrete MRF and

Bayesian perspective. A major disadvantage of an image processing model based on

discrete MRFs is the computational complexity. In fact, the inference problem from

discrete MRFs belongs to the NP‑hard class. Therefore, an approximate inference tech‑

nique is required to infer the objective image from a discrete MRF. Belief propagation

[11] is known as one such eﬀective technique. In this paper, we propose an eﬀective

image denoising algorithm for multiple noisy images that applies belief propagation.

The main contributions of this paper are that an MRF model for image denoising with

multiple noisy images is deﬁned and a fast eﬀective denoising algorithm based on our

discrete MRF model and the fast Fourier Transform (FFT) is proposed.

The remainder of this paper is organized as follows. In Sect. 2, we deﬁne a prob‑

ability model for image denoising with multiple noisy images based on the discrete

MRF and Baysian perspective. In Sect.3, we derive an image denoising algorithm

based on the posterior probability distribution deﬁned in Sect.2. In Sect.4, we describe

the framework for estimating the parameters in the posterior probability distribution.

We explain the implementation of our denoising method using the FFT in Sect. 5. In

Sect. 6, we describe the numerical veriﬁcation of the performance of the proposed

method. Finally, in Sect.7, we present our concluding remarks.

2 Framework ofBayesian Image Denoising Method

In this section, we brieﬂy explain the framework of the Bayesian image denoising

method for the case where multiple noisy images are available. Suppose that we have K

degraded images that are independently obtained by adding additive white Gaussian

noise (AWGN) to the original image. We assume that the images are composed of

N=h×w

pixels. Let

[

⋯x

N]T

and y(k)=

[

y(k)

1y(k)

⋯y(k)

be N dimen‑

sional vectors corresponding to the original image and the k‑th noisy image, respec‑

tively. Vectors

and

y(k)

can be easily obtained by raster scanning the images. We

assume that

xi(i=1, 2, …N)

takes L discrete values from 0 to

L−1

The purpose of the image denoising is to infer the original noiseless image

from K

noisy images

{

(

)

(

)

,…,y

(

)}

. In the Bayesian framework, the original image

can be inferred using the posterior probability distribution

P(x|Y)

that is expressed as

where

∑x

denotes the multiple summations over all the possible

states of

. The

framework of the proposed Bayesian image denoising method is illustrated in Fig.1.

From the deﬁnition of

y(k)

, the probability density function

P(Y|x)

is expressed as

(1)

�

Y)=

P(Y�

)P(

)

∑x

P(Y

�

x)P(x)

1 3

The Review of Socionetwork Strategies

where

V={1, 2, …,N}

and

𝜎2

is the variance of the AWGN. We express the param‑

eters of the probability model by its arguments after the semicolon as Eq. (2). We

deﬁne the prior probability distribution as

where E is a set of edges of the

h×w

lattice graph and

𝜙(x)

is a downward convex

even function taking its minimum at

x=0

. In this study, we assumed the periodic

boundary condition on the graph structure E, as demonstrated in Fig.2.

𝛼>0

is the

parameter of the prior probability distribution; if

𝛼

is set to a large value, neighbor‑

ing

and

tend to take close values.

Zprior(𝛼)

is a normalization constant deﬁned as

By substituting Eqs. (2) and (3) into Eq. (1), the posterior probability distribution

P(x|Y)

is expressed as

(2)

Yx;𝜎2=



k=1



y(k)x;𝜎2





k=1



i∈V

2𝜋𝜎

exp



−1

2𝜎2



y(k)

i−xi



,

(3)

(x;𝛼)=1

Zprior(𝛼)exp



−𝛼



ij∈E

𝜙



xi−xj

,

(4)

prior(𝛼)=



exp



−𝛼



ij∈E

𝜙



xi−xj

.

(5)

P

xY;𝛼,𝜎

2

Zpost



𝛼,𝜎2



exp



−1

2𝜎2



i∈V

𝜓i





−𝛼



ij∈E

𝜙



xi−xj

,

Fig. 1 Illustration of the proposed Bayesian image denoising method

The Review of Socionetwork Strategies

1 3

where

and

respectively.

3 Inference Algorithm Based onBelief Propagation

The image denoising is achieved by ﬁnding the image

that maximizes the posterior

probability distribution in Eq. (5):

However, the problem of determining such an image is intractable, because this

maximization problem belongs to the NP‑hard class. Therefore, we need an approxi‑

mate inference method to ﬁnd

. In this section, we explain an eﬀective approximate

inference method called belief propagation for inferring the denoised image

Belief propagation is a method of computing the approximate marginal distributions

bi(

and

bij(

for each

i∈V

and

ij ∈E

. In the belief propagation framework, the

approximate marginal distributions

bi(

and

bij(

are given by

(6)

𝜓







k=1

xi−y(k)



(7)

post



𝛼,𝜎2





exp



−1

2𝜎2



i∈V

𝜓i





−𝛼



ij∈E

𝜙



xi−xj

,

(8)

x=argmax

(

Y;𝛼,𝜎

2).

(9)





exp



−

2𝜎2𝜓i





k∈𝜕i

Mpost

k→i



,

(10)



exp



−

2𝜎2𝜓i





k∈𝜕i

Mpost

k→i



,

Fig. 2 Periodic boundary condi‑

tion for

h×w

lattice graph E,

when

h=4

and

w=4

1 3

The Review of Socionetwork Strategies

and

respectively, where

𝜕i={k∈V|ik ∈E}

is a set of all the neighboring pixels of pixel

Mpost

k→i(

in Eqs. (9) and (10) is a message from pixel k to pixel i and is obtained

by the convergence point of the message update rule

where

→

is a normalization constant to ensure algorithmic stability. The estima‑

tion of the denoised image

{̂

i∈V

}

is achieved by ﬁnding

for

i∈V

in the belief propagation framework.

4 Parameter Estimation Using Expectation–Maximization Algorithm

In the preceding section, we explained the method for inferring the denoised image

based on belief propagation. In our framework, the denoised image is inferred from the

posterior probability distribution in Eq. (5), which has two parameters,

𝛼

and

𝜎2

. It is

obvious that the inferred denoised image

depends on these parameters. In this sec‑

tion, we explain the method for determining these parameters from degraded images Y

based on the expectation–maximization (EM) algorithm [12].

The EM algorithm is a statistical inference method to infer the maximum likelihood

estimates

by an iterated method. In the EM algorithm framework, the parameters

𝛼

and

𝜎2

are

estimated by iterative maximization of the Q function deﬁned as

(11)

(

xi,xj

)

exp

(

−𝛼𝜙

(

xi−xj

))

mpost

�j→i

(

)

mpost

�i→j

(

(12)

ij =

∑

exp

(

−𝛼𝜙

(

xi−xj

))

post

�j→i

(

)

post

�i→j

(

(13)

post

j→i

(

)

Zj→i

∑

exp

(

−𝛼𝜙

(

xi−xj

))

mpost

�i→j

(

(14)

post

�j→i





=exp



−

2𝜎2𝜓i





k∈𝜕i�{j}

Mpost

k→i



,

(15)

̂x

i=argmax

(

(16)

̂𝛼

,̂𝜎 2=argmax

𝛼,𝜎

∑

(

x,Y;𝛼,𝜎2

The Review of Socionetwork Strategies

1 3

where

𝛼t

and

𝜎2

are estimates of the parameters at the t‑th iteration and

Using belief propagation, we can approximate the expectations in Eq. (17) as

and

respectively, where

b(t)

and

b(t)

ij (

xi,xj

)

are the approximate marginal distributions

of the posterior probability distribution

Y;𝛼

,𝜎2

computed using Eqs. (9) and

(11). The parameter update at iteration t is given by

and the maximum likelihood estimates in Eq. (16) are given as the convergence point

of the above iterative estimation. By diﬀerentiating the Q function with respect to

𝛼

and

𝜎2

and considering the conditions for the extremal value, the updated parameter

𝛼t+1

in Eq. (21) is expressed as the solution of the equation

and the updated parameter

𝜎2

t+1

is calculated as

where



𝜙



xi−xj



;𝛼

prior



x𝜙



xi−xj



P(x;𝛼

)

; this expectation can also be com‑

puted approximately using belief propagation similarly to Eq. (20). Using the bisec‑

tion method, we can easily ﬁnd

𝛼t+1

that satisﬁes Eq. (22).

(17)

𝛼,𝜎2;𝛼t,𝜎2

)

∑

(

x|Y;𝛼t,𝜎2

)

log P

(

x,Y;𝛼,𝜎2

)

=− 1

2𝜎2∑

i∈V

⟨𝜓i(xi);𝛼t,𝜎2

t⟩post −NK

2log 𝜎2

−𝛼∑

ij∈E

⟨𝜙(xi−xj);𝛼t,𝜎2

t⟩post −log Zprior(𝛼

)

+Const.,

(18)

⟨

f(x);𝛼t,𝜎2

⟩

post =

∑

f(x)P

(

Y;𝛼t,𝜎2

(19)

⟨

𝜓i

(

)

;𝛼t,𝜎2

⟩

post =

∑

𝜓i

(

)

(t)

(

(20)

⟨

𝜙

(

xi−xj

)

;𝛼t,𝜎2

⟩

post =

∑

𝜙

(

xi−xj

)

(t)

(

xi,xj

(21)

𝛼

t+1,𝜎

t+1=argmax

𝛼,𝜎

(

𝛼,𝜎

;𝛼t,𝜎

(22)

∑

ij∈E

⟨

𝜙

(

xi−xj

)

;𝛼t,𝜎2

⟩

post =

∑

ij∈E

⟨

𝜙

(

xi−xj

)

;𝛼

⟩

prior

(23)

𝜎

t+1=

∑

i∈V

⟨

𝜓i

(

)

;𝛼t,𝜎2

⟩

post

1 3

The Review of Socionetwork Strategies

5 Proposed Algorithm: Fast Implementation Based ontheFast

Fourier Transform

The image denoising algorithm based on our probabilistic model in Eq. (5) described

in Sects. 2–4 is summarized in Algorithm 1, together with the diﬀerences in the

computation times of the naive implementation and the proposed method, which is

explained in this section. The worst computation time of the naive implementation

of this algorithm is

, where

TEM

and

TBP

are the maximum number of

updates for the parameter update in Eq. (21) and the message update in Eq. (13),

respectively. In Algorithm 1, we terminate the parameter and message updates in

iteration

TEM

and

TBP

, respectively. Because we assume the periodic boundary con‑

dition for the graph structure E, the number of edges is

|E|=2N

. Therefore, the

number of messages passing each edge is O(N). The computation time of the naive

message update from pixel j to pixel i is

, because Eq. (13) must be computed

for each

xi=0, 1, …,L−1

to update a message

Mpost

→

(xi

)

Algorithm 1:Image DenoisingAlgorithm

Require:

Y=y(1),y(2) ,...,y(K)

1: Initializeα0andσ2

2: fort=0,1,2,...,TEM −1do

3: initializeall messagesMj→i(xi)

4: fort=0,1,2,...,TBP −1do

5: forall j→ido

6: update Mj→i(xi)using Eq.(13) (naive:O L2→propose: O(LlogL))

7: if allmessagesMj→i(xi)are convergedthen break

8: forall i∈Vdo

9: computeψi(xi);αt,σ

tpost usingEq. (19)

10:

forall ij ∈Edo

11:

computeφ(xi−xj);αt,σ

tpost usingEq. (20) (naive:OL2→prop

ose:

O(LlogL))

12:

computeαt+1 usingEq. (22) andbisectionmethod(naive: O L2→prop

ose:

O(LlogL))

13:

computeσ2

t+1 usingEq. (23)

14:

if αandσ2areconverged then break

15:

initializeall messagesMj→i(xi)

16:

fort=0,1,2,...,T

BP −1do

17:

forall j→ido

18:

update Mj→i(xi)using Eq.(13) (naive:O L2→propose: O(LlogL))

19:

if allmessagesMj→i(xi)are convergedthen break

20:

forall i∈Vdo

21:

computebi(xi)using Eq.(9)

22:

computexiusingEq. (15)

It should be noted that the computation time of the message update can be

reduced to

O(Llog L)

using the FFT [13]. It has been conﬁrmed that this FFT‑based

method in fact accelerates the message computation for the probabilistic image

denoising model in Eq. (5) for the case where

K=1

[14]. However, there exist

The Review of Socionetwork Strategies

1 3

additional

O(L2)

computation terms in Algorithm1: Steps 11 and 12. In this section,

we show that these

O(L2)

computation terms can also be computed in

O(Llog L)

using the FFT. Therefore, we can reduce the worst computation time of Algorithm1

NL log L

)

The key idea for accelerating the message updates in Eq. (13) is to consider the

update rule a convolution calculation. If we deﬁne the function

f(x;𝛼)

we can reformulate the message update rules as

The calculation of

mpost

→

)

in Eq. (25) is a convolution calculation. Therefore, we

can calculate

mpost

→

)

for

xi=0, 1, …,L−1

O(Llog L)

computation time using

the FFT, and the computation time of M

post

→

)

in Eq. (26) is linear with respect to

L. Therefore, we can update a message

Mpost

→

)

O(Llog L)

computation time.

Now, we show that the expectation in Eq. (20) can be calculated in

O(Llog L)

computation time by using the FFT. By substituting Eqs. (11) into (20), the expecta‑

tion calculation can be expressed as

If we deﬁne functions

g(xi)

and

h(xi)

respectively, we can reformulate the expectation calculation as

(24)

f(x;𝛼)=exp (−𝛼𝜙(x)),

(25)

mpost

j→i

(

)

∑

f(xi−xj;𝛼)m

post

�i→j

(

(26)

post

j→i

(

)

=mpost

j→i

(

)/L−1

∑

l=0

mpost

j→i(l)

(27)

⟨

𝜙

(

xi−xj

)

;𝛼,𝜎2

⟩

post

(28)

∑

𝜙

(

xi−xj

)

f(xi−xj;𝛼)m

post

�i→j

(

)

post

�j→i

(

(29)

=Zij =

∑

f(xi−xj;𝛼)m

post

�i→j

(

)

post

�j→i

(

(30)

(xi)=

∑

𝜙

(

xi−xj

)

f(xi−xj;𝛼)m

post

�i→j

(

(31)

(xi)=

∑

f(xi−xj;𝛼)m

post

�i→j

(

(32)

∑

g(xi)m

post

�j→i

(

1 3

The Review of Socionetwork Strategies

and

respectively. Therefore, the computation time of the expectation in Eq. (27) is

O(Llog L)

, because the total computation time to calculate convolutions

g(xi)

and

h(xi)

in Eqs. (30) and (31) for all

xi=0, 1, …,L−1

O(Llog L)

, and Eqs. (32) and

(33) can be computed in

O(L)

computation time.

In Eq. (22), we need to calculate the messages and expectation of prior probability

distribution

x;𝛼

to ﬁnd

𝛼t+1

that satisﬁes this equation using the bisection method.

It should be noted that, because we assume the periodic boundary condition for the

graph structure E, we can calculate the messages and expectation of prior probabil‑

ity distribution faster than those of posterior probability distribution by considering the

translational symmetry assumption. If we assume both a periodic boundary condition

and translational symmetry, the messages and expectation of prior probability distribu‑

tion become not dependent on the position of the edges

ij ∈E

. Therefore, the message

update rule and expectation calculation for prior probability distribution are expressed

and

respectively, where

Mprior(

is a message of the prior probability distribution. The

calculation of the message and the expectation of the prior probability distribution

in Eqs. (34) and (36) can be computed in

O(Llog L)

computation time by the same

calculation method as Eqs. (25, 26) and Eqs. (27)–(33), respectively.

(33)

∑

h(xi)m

post

�j→i

(

(34)

prior

(

)

Zprior

∑

𝜙

(

xi−xj

)

f(xi−xj;𝛼)

(

Mprior

(

))

(35)

prior =

∑

exp

(

−𝛼𝜙

(

xi−xj

))(

Mprior

(

))3,

(36)

∑

ij∈E

⟨

𝜙

(

xi−xj

)

;𝛼

⟩

prior =2NG

prior

Hprior

(37)

prior =

∑

𝜙

(

xi−xj

)

f(xi−xj;𝛼)

(

Mprior

(

))3(

Mprior

(

))3,

(38)

prior =

∑

f(xi−xj;𝛼)

(

Mprior

(

))3(

Mprior

(

))3,

The Review of Socionetwork Strategies

1 3

6 Numerical Experiments

In this section, we describe the numerical veriﬁcation of the proposed method. We

used the standard images in Fig.3, which are widely used in the image processing

research ﬁeld. The pixel values of these images take

L=256

diﬀerent values. All

the experiments were implemented using C++ and were run single‑threaded on an

Ubuntu 18.04.1 LTS (64 bit) machine with an Intel Core i7‑6850K CPU running at

3.60 GHz and 128 GB RAM. In this experiment, we deﬁned the function

𝜙(x)

and set the parameters of the proposed method as follows. The initial parameter

𝛼0

was set at 0.005 and

𝜎2

was set at the sample variance calculated from Y. The maxi‑

mum numbers of iterations

TEM

and

TBP

were set at 100 and 1000, respectively. We

considered that the messages converged if the absolute value of the average change

in the messages was smaller than

10−4

; the same applied to the parameters

𝛼

and

𝜎2

We set the search interval of the bisection method for computing

𝛼t+1

[0, 2𝛼t]

First, we compared the computation time of the proposed method with that of the

previous methods (belief propagation FFT (BP‑FFT) and Naive). The Naive method

is the naive implementation version of Algorithm 1; the worst computation time is

. BP‑FFT is the method used in the study presented in [14], where only

the message updates in Eqs. (13) and (14) were speeded‑up by using the FFT. Tables1

and 2 show the average computation times over 10 trials for each method where K

noisy images Y were generated by adding an AWGN of

𝜎=15

to the original noise‑

less images. According to the results, the proposed method was faster than the other

methods. It should be noted that the diﬀerence between the three methods is in whether

Algorithm1 is implemented using the FFT. Therefore, the image denoising results of

these method are all the same.

(39)

𝜙(x)=x2,

Fig. 3 Gray scale standard images used in experiments (

L=256

)

Table 1 Average computation

time over 10 trials with

𝜎=15

K=1

, and image size

128 ×128

Fig.3a Fig.3b Fig.3c Fig.3d

Proposed (min) 15.24 15.24 11.56 15.74

BP‑FFT (min) 45.14 45.10 49.79 47.55

Naive (min) 275.17 249.64 230.18 263.57

1 3

The Review of Socionetwork Strategies

Figure4 shows the average computation time versus image size for

K=1

and

K=5

over 10 trials, where the noise level of AWGN was

𝜎=15

. The computation time of

our denoising methods grows approximately linearly with the increase in the image

size (it dose not grow strictly linearly, because we break oﬀ the message and parameter

updates according to the convergence condition).

Figure5 shows the denoising performance of the proposed method versus various

values of K for two levels of AWGN (

𝜎=15

and

𝜎=30

). We evaluated the perfor‑

mance of the method according to the average peak signal‑to‑noise ratio (PSNR) over

10 trials. The PSNR is deﬁned as

(40)

PSNR

=10 log10

255

MSE ,

Table 2 Average computation

time over 10 trials with

𝜎=15

K=5

, and image size

128 ×128

Fig.3a Fig.3b Fig.3c Fig.3d

Proposed (min) 4.86 6.46 4.22 6.05

BP‑FFT (min) 14.02 18.92 16.24 16.18

Naive (min) 100.92 119.43 97.54 103.63

Fig. 4 Average computation time and peak signal‑to‑noise ratio over 10 trials versus various image sizes

(

128 ×128

256 ×256

, and

512 ×512

) for

K=1

and

K=5

. The noise level applied in these experiments

was

𝜎=15

. a Fig.3a. b Fig.3b. c Fig.3c. d Fig.3d

The Review of Socionetwork Strategies

1 3

where MSE is the mean squared error between the original noiseless image and

the inferred denoised image

. Figure 5 conforms that the image denoising results

improve as the value of K is increased. Example of image denoising results for

Fig.3a are shown in Fig.6 for

K=1

and

K=5

, respectively.

Fig. 5 Average peak signal‑to‑noise ratio over 10 trials against K for

𝜎=15

and

𝜎=30

. a Fig. 3a. b

Fig.3b. c Fig.3c. d Fig.3d

Fig. 6 Examples of the image denoising results for Fig.3a. a Example of a noisy image when

𝜎=30

(peak signal‑to‑noise ratio (PSNR):18.61). b Denoised image obtained by the proposed method for

K=1

(PSNR:25.49). c Denoised image obtained by the proposed method for

K=5

(PSNR:30.33)

1 3

The Review of Socionetwork Strategies

7 Concluding Remarks

In this paper, we deﬁned a discrete MRF model for the Bayesian image denoising

problem with multiple noisy images. We proposed a fast denoising algorithm for

inferring a denoised image in

NL log L

)

‑time by using belief propagation

and an EM algorithm based on our MRF model and FFT. We numerically veriﬁed

the proposed denoising method using standard images. The results show that the

proposed algorithm inferred the denoised image faster than previous implementation

methods that use belief propagation.

We believe that the proposed method is the most fastest implementation of an

image denoising algorithm based on a discrete MRF model that uses belief propaga‑

tion and an EM algorithm. However, the method cannot yet be used for real‑time

processing. Therefore, we need to seek a further eﬀective fast approximate method

that preserves the restoration quality for the discrete MRF model. In our experiment,

we adopted the quadratic function as the form of function

𝜙(x)

. However, the pro‑

posed method is not restricted to the quadratic function: we can apply it to other

types of the function

𝜙(x)

, such as the Huber prior [15] and generalized sparse prior

[16]. Moreover, because it can be used in any discrete MRF with

𝜙(xi−xj)

potential

for

ij ∈E

interaction, it is expected that the proposed method is applicable to not

only image denoising but also other inference problems such as sparse modeling

[17, 18]. We intend to develop the method in these directions.

Acknowledgements This work was partially supported by JST CREST Grant no. JPMJCR1402 and JSPS

KAKENHI Grant nos. 18K18120, 18H03303, 18K11459, and 15H03699.

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 Interna‑

tional License (http://creat iveco mmons .org/licen ses/by/4.0/), which permits unrestricted use, distribution,

and reproduction in any medium, provided you give appropriate credit to the original author(s) and the

source, provide a link to the Creative Commons license, and indicate if changes were made.

References

1. Geman, S., & Geman, D. (1984). Stochastic relaxation, Gibbs distributions and the Bayesian resto‑

ration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6, 721–741.

2. Tanaka, K. (2002). Statistical‑mechanical approach to image processing. Journal of Physics A:

Mathematical and General, 35(37), R81–R150.

3. Li, S. Z. (2009). Markov random ﬁeld modeling in image analysis. Berlin: Springer.

4. Blake, A., Kohli, P., & Rother, C. (2011). Markov random ﬁelds for vision and image processing.

Cambridge: MIT Press.

5. Lezoray, O., & Grady, L. (Eds.). (2012). Image processing and analysis with graphs. Boca Raton:

CRC Press.

6. Kataoka, S., Yasuda, M., Furtlehner, C., & Tanaka, K. (2014). Traﬃc data reconstruction based on

Markov random ﬁeld modeling. Inverse Problem, 30(2), 025003.

7. Hara, Y., Suzuki, J., & Kuwahara, M. (2018). Network‑wide traﬃc state estimation using a mixture

Gaussian graphical model and graphical lasso. Transportation Research Part C, 86, 622–638.

8. Kuwatani, T., Nagata, K., Okada, M., & Toriumi, M. (2014). Markov random ﬁeld modeling for

mapping geoﬂuid distributions from seismic velocity structures. Earth, Planets and Space, 66(5),

1–9.

The Review of Socionetwork Strategies

1 3

9. Kuwatani, T., Nagata, K., Okada, M., & Toriumi, M. (2014). Markov‑random‑ﬁeld modeling for

linear seismic tomography. Physical Review E, 90, 042137.

10. Yasuda, M., Watanabe, J., Kataoka, S., & Tanaka, K. (2018). Linear‑time algorithm in Bayesian

image denoising based on Gaussian Markov random ﬁeld. In: IEICE Transactions on Information

and Systems, vol. E101.D, no. 6, pp. 1629‑1639.

11. Pearl, J. (1988). Probabilistic reasoning in intelligent system:networks of plausible inference. Burl‑

ington: Morgan Kaufmann.

12. Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximun likelihood from incomplete data

via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1),

1–38.

13. Felzenszwalb, P. F., & Huttenlocher, D. P. (2006). Eﬃcient belief propagation for early vision. Inter-

national Journal of Computer Vision, 70(1), 41–54.

14. Inoue, K. (2009). Kakuritisu denpan hou ni yoru gazou shori algorithm no kairyou ni kansuru ken‑

kyu (Study on an improvement of the image processing using belief propagation), Master Thesis in

Tohoku University (unpublished) (in Japanese).

15. Schulta, R. R., & Stevenson, R. L. (1994). A Bayesian approach to image expansion for improved

deﬁnition. IEEE Transactions on Image Processing, 3(3), 233–242.

16. Tanaka, K., Yasuda, M., & Titterington, D. M. (2012). Bayesian image modeling by means of gen‑

eralized sparse prior and loopy belief propagation. Journal of the Physical Society of Japan, 81(11),

114802.

17. Krzakala, F., Mézard, M., Sausset, F., Sun, Y., & Zdeborová, L. (2012). Probabilistic reconstruction

in compressed sensing: Algorithms, phase diagrams, and threshold achieving matrices. Journal of

Statistical Mechanics: Theory and Experiment, 2012, P08009.

18. Elad, M. (2010). Sparse and redundant representations: From theory to applications in signal and

image processing. Berlin: Springer.

Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published

maps and institutional aﬃliations.

Research Trends and Case Studies of Deep Learning Applications in Geo-electric and Electromagnetic Surveys

Article

Aug 2022

Hierarchical Gaussian Markov Random Field for Image Denoising

Article

Mar 2022

In this study, Bayesian image denoising, in which the prior distribution is assumed to be a Gaussian Markov random field (GMRF), is considered. Recently, an effective algorithm for Bayesian image denoising with a standard GMRF prior has been proposed, which can help implement the overall procedure and optimize its parameters in O(n)-time, where n is the size of the image. A new GMRF-type prior, referred to as a hierarchical GMRF (HGMRF) prior, is proposed, which is obtained by applying a hierarchical Bayesian approach to the standard GMRF prior; in addition, an effective denoising algorithm based on the HGMRF prior is proposed. The proposed HGMRF method can help implement the overall procedure and optimize its parameters in O(n)-time, as well as the previous GMRF method. The restoration quality of the proposed method is found to be significantly higher than that of the previous GMRF method as well as that of a non-local means filter in several cases. Furthermore, numerical evidence implies that the proposed HGMRF prior is more suitable for the image prior than the standard GMRF prior.

TEMDnet: A Novel Deep Denoising Network for Transient Electromagnetic Signal With Signal-to-Image Transformation

Article

Full-text available

Sep 2020

The considerable prospecting depth and accurate subsurface characteristics can be obtained by the transient electromagnetic method (TEM) in geophysics. Nevertheless, the time-domain TEM signal received by the coil is easily disturbed by environmental background noise, artificial noise, and electronic noise of the equipment. Recently, deep neural networks (DNN) have been used to solve the TEM denoising problem and have achieved better performance than traditional methods. However, the existing denoising method with DNN adopts fully connected neural networks, and is therefore not flexible enough to deal with various signal scales. To address these issues, a novel denoising framework with deep convolutional neural networks (CNN) of transforming the TEM signal denoising task into an image denois-ing task (namely, TEMDnet) is proposed in this paper. Specifically , a novel signal-to-image transformation method is developed first to preserve the structural features of TEM signals. Then, a novel deep CNN-based denoiser is proposed to further perform feature learning, in which the residual learning mechanism is adopted to model the noise estimation image for different signal features. Extensive experiments demonstrate that the proposed framework can achieve much better performance compared with other state-of-the-art approaches on both simulated signals and real-world signals from a landfill leachate treatment plant in Chengdu, Sichuan province, China. Models and code are available at https://github.com/tonyckc/TEMDnet demo.

An MR Image Segmentation Method Based on Dictionary Learning Preprocessing and Probability Statistics

Chapter

Jul 2022

Segmentation of brain MR images can help clinicians to extract regions of interest. At present, there are many studies on brain MR image segmentation, especially in deep learning methods. However, deep learning methods requires a very large quantity of datasets. With the popularity of deep learning, dictionary learning has been revived again. As a traditional machine learning segmentation method, dictionary learning requires less sample size, has high segmentation efficiency, and can describe the image well. This paper mainly proposes an MR image segmentation method based on posterior probability. The process of medical image acquisition is accompanied by the generation of noise which is limited by factors such as the equipment environment. We use the powerful classification function of the naive Bayes model to classify the noise data and sample label data in the images, and then provide the classification results to the dictionary learning sparse matrix for sparse expression. We achieve the effect of noise reduction and denoising through mathematical methods to improve the signal-to-noise ratio of medical image data. After getting the denoised image, we adopt region growth combined with the softmax function method to classify the pixels. The results of this paper provide technical support for the subsequent image segmentation, detection and other computer-assisted clinical diagnosis, so as to improve the efficiency of automated clinical diagnosis for clinical application.

Hybrid deep learning model for in-field pest detection on real-time field monitoring

Article

Feb 2022

The growth of important crops in agriculture can be affected and the production is reduced due to various pest attacks. The detection and recognition of these pests is a challenging task because of their identical look in the beginning level of plant growth. To overcome this challenge, deep learning-based real-time video detection models have been introduced for the segmentation and detection of different pests and pathogens. In this paper, a hybrid deep learning model is presented for the segmentation and detection of pests in various plants. The proposed technique is a four-stage model designed on the coordination of different deep learning networks. In the first stage, the image, as well as video frame, acquired images are denoised via the Bayesian image denoising framework. In the second stage, the denoised images are enhanced using LightenNet architecture. In the third stage, the image is semantically segmented with a context-guided residual network (ResNet) model. In the final stage, the segmented images are fed into the convolutional neural network to create a robust system for pest detection. The experiments are carried out on different benchmark datasets for performance assessment. The effectiveness of proposed method is verified in terms of structural similarity index measure (SSIM) and mean absolute error (MAE) and average precision (AP) as 0.99, < 0.2 and 89.67%, respectively. The qualitative performance evaluation of the proposed method indicates that it is apt for real-time monitoring and detection. © 2022, The Author(s) under exclusive licence to Deutsche Phytomedizinische Gesellschaft.

Network-wide traffic state estimation using a mixture Gaussian graphical model and graphical lasso

Article

Full-text available

Jan 2018
TRANSPORT RES C-EMER

This study proposes a model that estimates unobserved highway link speeds by a machine learning technique using historical probe vehicle data. For highway traffic monitoring, probe vehicle data is one of the most promising data source. However, since such data do not always cover an entire study area, we cannot measure traffic speeds on all links in a time-dependent manner; quite a few links are unobserved. To continuously monitor speeds on all links, it is necessary to develop a technique that estimates speeds on unobserved links from historical observed link speeds. For this purpose, we extend the current Gaussian graphical model so as to use two or more multivariate normal distributions to accurately estimate unobserved link speeds. In general, since the number of unknown model parameters (mean parameters and covariance matrices) is enormous and also unobserved links always exist, the EM algorithm and the graphical lasso technique are employed to determine the model parameters. Our proposed model was applied to the Bangkok city center in Thailand as well as to the Fujisawa city in Japan. We confirmed that the model can estimate the unobserved link speeds quite reasonably.

Markov random field modelling for fluid distributions from the seismic velocity structures

Article

Full-text available

Dec 2011
EARTH PLANETS SPACE

We applied the Markov random field model, which is a kind of a Bayesian probabilistic method, to the spatial inversion of the porosity and pore shape in rocks from an observed seismic structure. Gaussian Markov chains were used to incorporate the spatial continuity of the porosity and the aspect ratio of the pore shape. Synthetic inversion tests were able to show the effectiveness and validity of the proposed model by appropriately reducing the statistical noise from the observations. The proposed model was also applied to natural data sets of the seismic velocity structures in the mantle wedge beneath northeastern Japan, under the assumptions that the fluid was melted and the temperature and petrologic structures were uniformly distributed. The result shows a significant difference between the volcanic front and the forearc regions, at a depth of 40 km. Although the parameters and material properties will need to be determined more precisely, the Markov random field model presented here can serve as a basic inversion framework for mapping geofluids.

Traffic data reconstruction based on Markov random field modeling

Article

Full-text available

Jun 2013

We consider the traffic data reconstruction problem. Suppose we have the traffic data of an entire city that are incomplete because some road data are unobserved. The problem is to reconstruct the unobserved parts of the data. In this paper, we propose a new method to reconstruct incomplete traffic data collected from various traffic sensors. Our approach is based on Markov random field modeling of road traffic. The reconstruction is achieved by using mean-field method and a machine learning method. We numerically verify the performance of our method using realistic simulated traffic data for the real road network of Sendai, Japan.

Markov Random Field Modeling in Image Analysis

Book

Full-text available

Jan 2001

Stan Z Li

Edition 1 Since its beginning, image analysis research has been evolving from heuristic design of algorithms to systematic investigation of approaches. Researchers have realized: (1) The solution to a vision problem should be sought based on optimization principles, either explicitly or implicitly, and (2) contextual constraints are ultimately necessary for the understanding of visual information in images. Two questions follow: how to define an optimality criterion under contextual constraints and how to find its optimal solution. Markov random field (MRF), a branch of probability theory, provides a foundation for the characterization of contextual constraints and the derivation of the probability distribution of interacting features. In conjunction with methods from decision and estimation theory, MRF theory provides a systematic approach for deriving optimality criteria such as those based on the maximum a posteriori (MAP) concept. This MAP-MRF framework enables us to systematically develop algorithms for a variety of vision problems using rational principles rather than ad hoc heuristics. For these reasons, there has been increasing interest in modeling computer vision problems using MRF’s in recent years. This book provides a coherent reference to theories, methodologies, and recent developments in solving computer vision problems based on MRF’s, statistics, and optimization. It treats various problems in low- and high-level computational vision in a systematic and unified way within the MAP-MRF framework. The main issues of concern are how to use MRF’s to encode contextual constraints that are indispensable to image understanding; how to derive the objective function, typically the posterior distribution, for the optimal solution to a problem; and how to design computational algorithms for finding the optimal solution. As the first thorough reference on the subject, the book has four essential parts for solving image and vision analysis problems using MRF’s: (1) introduction to fundamental theories, (2) formulations of various image models in the MAP-MRF framework, (3) parameter estimation, and (4) optimization methods.

Linear-Time Algorithm in Bayesian Image Denoising based on Gaussian Markov Random Field

Article

Oct 2017

In this paper, we consider Bayesian image denoising based on a Gaussian Markov random field (GMRF) model, for which we propose an new algorithm. Our method can solve Bayesian image denoising problems, including hyperparameter estimation, in $O(n)$-time, where $n$ is the number of pixels in a given image. From the perspective of the order of the computational time, this is a state-of-the-art algorithm for the present problem setting. Moreover, the results of our numerical experiments we show our method is in fact effective in practice.

Markov-random-field modeling for linear seismic tomography

Article

Oct 2014

We apply the Markov-random-field model to linear seismic tomography and propose a method to estimate the hyperparameters for the smoothness and the magnitude of the noise. Optimal hyperparameters can be determined analytically by minimizing the free energy function, which is defined by marginalizing the evaluation function. In synthetic inversion tests under various settings, the assumed velocity structures are successfully reconstructed, which shows the effectiveness and robustness of the proposed method. The proposed mathematical framework can be applied to inversion problems in various fields in the natural sciences.

Markov random fields for vision and image processing

Article

Jan 2011

Bayesian Image Modeling by Means of a Generalized Sparse Prior and Loopy Belief Propagation

Article

Nov 2012

Bayesian image modeling is presented based on a generalized sparse prior probability distribution. Our prior includes sparsity in each interaction term between every pair of neighbouring pixels in Markov random fields. A new scheme for hyperparameter estimation is based on the conditional maximization of entropy in our generalized sparse prior. In addition, the criterion used for defining the optimal value for sparseness in interactions is that of the maximization of marginal likelihood. Our practical algorithm is based on loopy belief propagation.

Stochastic Relaxation, Gibbs Distributions and the Bayesian Resoration of Images

Article

Jun 1984

We make an analogy between images and statistical mechanics systems. Pixel gray levels and the presence and orientation of edges are viewed as states of atoms or molecules in a lattice-like physical system. The assignment of an energy function in the physical system determines its Gibbs distribution. Because of the Gibbs distribution, Markov random field (MRF) equivalence, this assignment also determines an MRF image model. The energy function is a more convenient and natural mechanism for embodying picture attributes than are the local characteristics of the MRF. For a range of degradation mechanisms, including blurring, nonlinear deformations, and multiplicative or additive noise, the posterior distribution is an MRF with a structure akin to the image model. By the analogy, the posterior distribution defines another (imaginary) physical system. Gradual temperature reduction in the physical system isolates low energy states (``annealing''), or what is the same thing, the most probable states under the Gibbs distribution. The analogous operation under the posterior distribution yields the maximum a posteriori (MAP) estimate of the image given the degraded observations. The result is a highly parallel ``relaxation'' algorithm for MAP estimation. We establish convergence properties of the algorithm and we experiment with some simple pictures, for which good restorations are obtained at low signal-to-noise ratios.

Statistical-mechanical approach to image processing

Article

Sep 2002
J Phys Math Gen

Kazuyuki Tanaka

The basic frameworks and techniques of the Bayesian approach to image restoration are reviewed from the statistical-mechanical point of view. First, a few basic notions in digital image processing are explained to convince the reader that statistical mechanics has a close formal similarity to this problem. Second, the basic formulation of the statistical estimation from the observed degraded image by using the Bayes formula is demonstrated. The relationship between Bayesian statistics and statistical mechanics is also listed. Particularly, it is explained that some correlation inequalities on the Nishimori line of the random spin model also play an important role in Bayesian image restoration. Third, the framework of Bayesian image restoration for binary images by means of the Ising model is reviewed. Some practical algorithms for binary image restoration are given by employing the mean-field and the Bethe approximations. Finally, Bayesian image restoration for a grey-level image using the Gaussian model is reviewed, and the Gaussian model is extended to a more practical probabilistic model by introducing the line state to treat the effects of edges. The line state is also extended to quantized values.

Bayesian Image Denoising with Multiple Noisy Images

Abstract and Figures

Recommended publications

A Bayesian Approach to Clustering Matting Components in Spectral Matting

Degradation data-drive approach for remaining useful life estimation

Bayesian Image Modeling by Means of a Generalized Sparse Prior and Loopy Belief Propagation

Probabilistic Solution of Inverse Problems