Scholarly article on topic 'Image Recovery Algorithm Based on Learned Dictionary'

Image Recovery Algorithm Based on Learned Dictionary Academic research paper on "Mathematics"

CC BY
0
0
Share paper
Academic journal
Mathematical Problems in Engineering
OECD Field of science
Keywords
{""}

Academic research paper on topic "Image Recovery Algorithm Based on Learned Dictionary"

Hindawi Publishing Corporation Mathematical Problems in Engineering Volume 2014, Article ID 964835, 6 pages http://dx.doi.org/10.1155/2014/964835

Research Article

Image Recovery Algorithm Based on Learned Dictionary

Xinghui Zhu and Fang Kui

College of Information Science & Technology, Hunan Agricultural University, Changsha 410128, China Correspondence should be addressed to Fang Kui; fk@hunau.net Received 28 May 2014; Accepted 25 July 2014; Published 12 August 2014 Academic Editor: Binxiang Dai

Copyright © 2014 X. Zhu and F. Kui. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

We proposed a recovery scheme for image deblurring. The scheme is under the framework of sparse representation and it has three main contributions. Firstly, considering the sparse property of natural image, the nonlocal overcompleted dictionaries are learned for image patches in our scheme. And, then, we coded the patches in each nonlocal clustering with the corresponding learned dictionary to recover the whole latent image. In addition, for some practical applications, we also proposed a method to evaluate the blur kernel to make the algorithm usable in blind image recovery. The experimental results demonstrated that the proposed scheme is competitive with some current state-of-the-art methods.

1. Introduction

Image recovery has been a widely studied issue in the past decades, which remains an active area in low-level image processing [1, 2]. Many factors may cause blur, such as the imperfection of devices, atmospheric turbulence (for remote sensing image), and motion between the camera and scene. By assuming the blur model is linear and space invariant, the observation can be expressed as

y = H ® x + n, (1)

where y and x denote the blurry image and the latent image, respectively, H is the blur kernel, and ® and n represent the convolution operation and the additive white Gaussian noise. The mission of deblurring is to obtain the latent image x from the degradation observation y, which is usually solved by the regularization method due to its ill-posed property. The key idea of the regularization is to penalize the prior knowledge of latent image. In the past decades, many kinds of prior knowledge have been studied in literatures [3-5]. In recent years, the sparse regularization based modeling is employed to solve the recovery problems, which is proven to be more effective than the conventional regularization. Jalobeanu et al. proposed to deblur the satellite image with complex wavelet packet, which established a Gaussian model for the sparse coefficients [6]. Cho et al. [7] proposed a regression model by exploiting the sparsity of natural image

to mitigate the ringing artifacts in the deblurred image, which can be seen as a natural generalization work of [8]. Based on the dictionary learning, Foi et al. proposed a shape adaptive DCT for high-quality image deblurring [9]. Recently, incorporating with the sparse representation, the NLM (nonlocal means) method is exploited in many image processing tasks successfully, such that [10,11] all achieve the impressive recovery results based on this model. In this paper, inspired by the sparse representation and NLM techniques, we proposed a novel scheme for image deblurring, which will train many subdictionaries to better present the patches with different features and recover the whole latent image based on sparse model.

The outline of the remainder of the paper is as follows. In Section 2, we review the regularization framework for image deblurring and propose a novel optimization based on sparse representation. In Section 3, we show an iterative method to solve the optimization in Section 2 with the alternating direction method (ADM), and, additionally, a dictionary learning method with NLM is developed to better present the image. The numerical experiments are shown in Section 4 and we concluded the paper in Section 5.

2. Deblurring Framework with Sparse Model

To begin, following the conventional notation in many literatures, we denote the latent image by X e Rn and

degradation image by Y as follows:

Rn; we show the blurry model 3. Iterative Algorithm for Image Deblurring

Y = H ® X + n,

where 8 indicates the convolution operator and H and n are the blurring kernel and additive white Gaussian noise, respectively.

Based on the regularization, we can formulate the optimization to obtain the solution for (2) as follows:

X = arg min {|| Y - H <

>X|£ +

Xp(X)},

where || • ||F denotes the Frobenius norm and the p(^) presents regularization for latent image X, such as the TV norm and smoothness prior.

To solve the optimization in (3), we still need to know the blur kernel. Nevertheless, in practice, it is unknown for blind image deblurring. A popular estimation method is developed in [12]; in their work, the authors proposed to estimate the blur kernel as follows:

H = arg min {|

—1|2 >xll +

In our scheme, we incorporate the model in (3) and (4) into a novel unified variational framework as follows:

(X, H) =

arg min

||Y - H ® X||

, + Ap(X)+W(H)}, (5)

where p(^) and <p(^) denote the regularization penalty term of desired latent sharp image and blur kernel, respectively, which can stabilize the solution. A and ^ are termed regular-ization parameters.

Furthermore, assuming the image patch of size -/n x -/n at location i by x; = .R;X (x; e £n, n < N) and the dictionary by D e RnxM (« ^ M), we can propose a novel framework based on the sparse model in the following form:

3.1. Local Dictionary Learning. In recent years, there has been a growing interest in studying the sparse model and its application to image processing. And a key technique for spare model is to design a dictionary to better fit the above model, which is generated in two main methods. The first one is combining some predefined bases to generate a comprehensive dictionary, such as wavelet and DCT. Nevertheless, by this method, the generated dictionary is not data independent. In other words, the dictionary can be termed "fixed." The other one is to learn a dictionary adaptive to a set of training examples from degradation image itself or some sample images. In our framework, we intend to adopt the second method, which will make the dictionary more adaptive and flexible. The recent method termed KSVD is introduced by Aharon et al. [14]. In their work, the author tended to learn a global dictionary and best presented the patches from the image. However, the training tends to generate a group of atoms that can present each patch in image, which leads to loss of some ability to present some local structure.

Note that the patches with similar structure in image may be presented by the same dictionary ideally. Motivated by the NLM technique, our learning scheme is proposed to train many local dictionaries for similar patches in the same cluster. For low complexity, we can measure the distance between two patches as follows:

Dist (y > y,-) = exP

-y - y 112 h

(X, H)

= arg min

where y and y denote the patch at locations i and j and h is the scale parameter to control the discrimination. And, then, we can take the distance in (7) as the measurement and cluster the patches by K-Mean algorithm.

||Y - H 8 X||p + £(||£jX--Da;||2 + Ai||ai||1) Next, given the train examples set C; = jyU

)'j= 1

sponding to clustering «; we can learn the local dictionary adaptive to this clustering by the following minimization:

where Rt is a matrix to extract the patch from the latent image at location «. The first term in (6) is called fidelity constraint, which guarantees that the latent image is consistent with the degradation observation. The second term constrains the sparsity of patches in image with the reasonable dictionary D. The third term is the /2-norm constraint on blur kernel to preserve the solution stability.

As for the optimization in (6), the recent alternating direction method (ADM) [13] can be employed to obtain the solution, which is the popular method used in multivariables optimization. In addition, the dictionary D is also a valuable issue to be studied, because the more reasonable the dictionary is, the more effective the restoration result is. So, in the next section, we will present an iterative algorithm based on ADM to solve the minimization in (6).

D = arg min

||c, -da|£ +

where A denotes the coding matrix and a.j is its jth column corresponding to patch y^.

3.2. Deblurring Algorithm. In this section, we will propose an iterative algorithm to solve the novel framework in (6). By the ADM method, we can solve the minimization alternately and divide the optimization into three major parts.

Firstly, by fixing all other variables without H, (6) can be reduced to the following form:

H = arg min { h u

SA-DCT Proposed

Figure 1: Recovery results of Cameraman.

The minimization (9) is a strictly convex problem, which leads to a closed-form solution as follows:

H = F-

conj (f(x)of(Y)}

. KX)C.

where F(^) and F-1(^) denote the fast Fourier transform (FFT) and its inverse. conj(^) denotes the complex conjugate operator, the presents the pointwise product, and I is a unit matrix.

Secondly, by fixing the code set {a^^and the estimation

blur kernel H, we can generate the second major part as follows:

X; = arg min

y, - H ® X|£ + ХУ, ||RX - о'«,!

where D* denotes the local learned dictionary for the patches in clustering i and footnote i presents the variable corresponding to clustering«.

Given the sparse coding set and the blur kernel, (11) can be solved with the fast deconvolution method [15] as follows:

X = F-

conj (F (h)) о F (y,) + F (^ y^jô'âj)

IHII2 +

At last, we coded the patches from X* with the local dictionary to update the sparse coding set for the next

iteration by the following minimization:

ос, = arg min {||.RX - D 'a

J a I-1 J

m + Ablli

The comprehensive deblurring scheme is summarized in Algorithm 1.

In Algorithm 1, the coding by (13) can be implemented by some recent sparse coding algorithms, such as "Basis Pursuit"

SA-DCT

Proposed

Figure 2: Recovery results of Parrots.

and "Orthogonal Matching Pursuit". With Algorithm 1, the proposed scheme solves our novel framework with the unified variational formulation, which shows some advantage in practice.

4. Numerical Results and Analysis

We conduct some numerical experiments to demonstrate the performance of our proposed scheme. The compared method includes the BM3D proposed in [16], SA-DCT method in [9], FISTA in [17], and our scheme. The sizes of test images are all 256 x 256 and the pixel value ranges to [0,1]. The Gaussian blur kernel with the standard deviation 1.5 was used in our experiments and the additive white noise with different standard deviation was also adopted. The maximum iteration number T = 10. For low computing complexity, the clustering number К is set to be 5 empirically. The zoomed local visual comparisons are shown in Figures 1 and 2, and we only show the deblurring results of Parrots and Cameraman with о = 1 due to the limited space. Also, the PSNR for test images is reported in Tables 1 and 2 as the objective evaluation.

Table 1: PSNR (dB) results of the recovered image with a = 1.

Method/image BM3D SADCT FISTA Proposed

Lena 31.2615 28.8271 28.8129 30.9176

Cameraman 27.9001 26.2613 25.4112 27.1056

Parrots 31.3120 28.3765 28.5020 30.8230

Peppers 29.6203 28.3270 28.1644 29.1026

From the visual comparison results, we can see that our novel scheme outperforms other methods in overcoming the staircase effect and also show some advance in recovering the texture and shaper edge. Also, the PSNR results also demonstrate that our method is competitive with the compared method and presents robustness with higher noise standard deviation. Though the BM3D obtains the highest PSNR, its visual results still leave some staircase effect in the part of smooth areas. The PSNR of our scheme is only a little lower than BM3D, but the visualization seems better, such as the texture, which means that our method shows synthesis advantage compared to BM3D. In addition, in the experiments, we found that the PSNR decreased with

Input: The patches set jy} = from the degradation image Y(0).

Initialization: A = 0.1, ^ = 1, the coarse estimation latent image X(0) by the method in [11];

For k = 0 : T

Learn the local dictionary {Dj}J=1 by (8) with train set from Y®; Estimate the new kernel H№) by (l0) with X№);

For i = 1: K (K means the maximum clustering number) Compute the latent image X(fc+1) by (12); Compute the coding set {a' 13

via the method in [12]; end

Update the Y№+1) = X№+1); end

Output the comprehensive latent image X.

Algorithm 1

Table 2: PSNR (dB) results of the recovered image with a = 2.

Method/image BM3D SADCT FISTA Proposed

Lena 30.3902 27.6494 26.8268 29.6937

Cameraman 26.9831 25.3297 24.0339 26.2203

Parrots 30.2611 27.1512 26.5932 29.3679

Peppers 28.7621 27.2626 26.3425 28.1023

the increasing iterations, which can guarantee the convergence of our scheme empirically.

5. Conclusions

In this paper, we addressed the image deblurring problem. We proposed a scheme based on the sparse representation. To better present the image, we learned many dictionaries via the patches with nonlocal self-similarity. And, then, we incorporated the kernel estimation and sparse model into a novel unified framework. Additionally, an iterative algorithm was provided to solve the novel framework with the ADM. The experimental results demonstrate that our proposed scheme outperforms some leading methods. For further work, many possible extension researches can be explored, such as extending the framework to video deblurring and considering some other kinds of noise.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

This project was supported in part by the National Key Technology R&D Program of China (2013BAD15B02 and 2012BAD35B07).

References

[1] F. Li, C. M. Shen, J. S. Fan, and C. L. Shen, "Image restoration combining a total variational filter and a fourth-order filter," Journal of Visual Communication and Image Representation, vol. 18, no. 4, pp. 322-330, 2007.

[2] J. Yang, Y. Zhang, and W. Yin, "An efficient TVL1 algorithm for deblurring multichannel images corrupted by impulsive noise," SIAM Journal on Scientific Computing, vol. 31, no. 4, pp. 28422865, 2009.

[3] A. M. Bruckstein, D. L. Donoho, and M. Elad, "From sparse solutions of systems of equations to sparse modeling of signals and images," SIAM Review, vol. 51, no. 1, pp. 34-81, 2009.

[4] X. Guo, F. Li, and M. K. Ng, "A fast 11-TV algorithm for image restoration," SIAM Journal on Scientific Computing, vol. 31, no. 3, pp. 2322-2341, 2009.

[5] S. Dai, M. Han, W. Xu, Y. Wu, Y. Gong, and A. K. Katsaggelos, "SoftCuts: a soft edge smoothness prior for color image superresolution," IEEE Transactions on Image Processing, vol. 18, no. 5, pp. 969-981,2009.

[6] A. Jalobeanu, L. Blanc-Feraud, and J. Zerubia, "Hyperparameter estimation for satellite image restoration using a MCMC maximum-likelihood method," Pattern Recognition, vol. 35, no. 2, pp. 341-352, 2002.

[7] T. S. Cho, N. Joshi, C. L. Zitnick, S. B. Kang, R. Szeliski, and W. T. Freeman, "A content-aware image prior," in Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '10), pp. 169-176, June 2010.

[8] A. Levin, R. Fergus, F. Durand et al., "Deconvolution using natural image priors," ACM Transactions on Graphics, vol. 26, no. 3, 2 pages, 2007.

[9] A. Foi, V. Katkovnik, and K. Egiazarian, "Pointwise shape-adaptive DCT for high-quality denoising and deblocking of grayscale and color images," IEEE Transactions on Image Processing, vol. 16, no. 5, pp. 1395-1411, 2007.

[10] K. Dabov, A. Foi, V. Katkovnik, and K. Egiazarian, "Image denoising by sparse 3-D transform-domain collaborative filtering," IEEE Transactions on Image Processing, vol. 16, no. 8, pp. 2080-2095, 2007.

[11] W. Dong, L. Zhang, and G. Shi, "Centralized sparse representation for image restoration," in Proceedings of the IEEE

International Conference on Computer Vision (ICCV '11), pp. 1259-1266, Barcelona, Spain, November 2011.

[12] S. Cho and S. Lee, "Fast motion deblurring," ACM Transactions on Graphics, vol. 28, no. 5, pp. 145-145, 2009.

[13] E. Esser, "Applications of lagrangian-based alternating direction methods and connections to split Bregman," CAM Rep 09-31, UCLA, 2009.

[14] M. Aharon, M. Elad, and A. Bruckstein, "K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation," IEEE Transactions on Signal Processing, vol. 54, no. 11, pp. 4311-4322, 2006.

[15] Q. Shan, J. Jia, and A. Agarwala, "High-quality motion deblurring from a single image," ACM Transactions on Graphics, vol. 27, no. 3, p. 73, 2008.

[16] K. Dabov, A. Foi, V Katkovnik, and K. Egiazarian, "Image restoration by sparse 3D transform-domain collaborative filtering," in Image Processing: Algorithms and Systems VI, vol. 6812 of Proceedings of SPIE, 2008.

[17] A. Beck and M. Teboulle, "Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems," IEEE Transactions on Image Processing, vol. 18, no. 11, pp. 2419-2434, 2009.

Copyright of Mathematical Problems in Engineering is the property of Hindawi Publishing Corporation and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use.