Search | arXiv e-print repository

Learning a Gaussian Mixture for Sparsity Regularization in Inverse Problems

Authors: Giovanni S. Alberti, Luca Ratti, Matteo Santacesaria, Silvia Sciutto

Abstract: In inverse problems, it is widely recognized that the incorporation of a sparsity prior yields a regularization effect on the solution. This approach is grounded on the a priori assumption that the unknown can be appropriately represented in a basis with a limited number of significant components, while most coefficients are close to zero. This occurrence is frequently observed in real-world scena… ▽ More In inverse problems, it is widely recognized that the incorporation of a sparsity prior yields a regularization effect on the solution. This approach is grounded on the a priori assumption that the unknown can be appropriately represented in a basis with a limited number of significant components, while most coefficients are close to zero. This occurrence is frequently observed in real-world scenarios, such as with piecewise smooth signals. In this study, we propose a probabilistic sparsity prior formulated as a mixture of degenerate Gaussians, capable of modeling sparsity with respect to a generic basis. Under this premise, we design a neural network that can be interpreted as the Bayes estimator for linear inverse problems. Additionally, we put forth both a supervised and an unsupervised training strategy to estimate the parameters of this network. To evaluate the effectiveness of our approach, we conduct a numerical comparison with commonly employed sparsity-promoting regularization techniques, namely LASSO, group LASSO, iterative hard thresholding, and sparse coding/dictionary learning. Notably, our reconstructions consistently exhibit lower mean square error values across all $1$D datasets utilized for the comparisons, even in cases where the datasets significantly deviate from a Gaussian mixture model. △ Less

Submitted 29 January, 2024; originally announced January 2024.

arXiv:2303.15244 [pdf, other]

Manifold Learning by Mixture Models of VAEs for Inverse Problems

Authors: Giovanni S. Alberti, Johannes Hertrich, Matteo Santacesaria, Silvia Sciutto

Abstract: Representing a manifold of very high-dimensional data with generative models has been shown to be computationally efficient in practice. However, this requires that the data manifold admits a global parameterization. In order to represent manifolds of arbitrary topology, we propose to learn a mixture model of variational autoencoders. Here, every encoder-decoder pair represents one chart of a mani… ▽ More Representing a manifold of very high-dimensional data with generative models has been shown to be computationally efficient in practice. However, this requires that the data manifold admits a global parameterization. In order to represent manifolds of arbitrary topology, we propose to learn a mixture model of variational autoencoders. Here, every encoder-decoder pair represents one chart of a manifold. We propose a loss function for maximum likelihood estimation of the model weights and choose an architecture that provides us the analytical expression of the charts and of their inverses. Once the manifold is learned, we use it for solving inverse problems by minimizing a data fidelity term restricted to the learned manifold. To solve the arising minimization problem we propose a Riemannian gradient descent algorithm on the learned manifold. We demonstrate the performance of our method for low-dimensional toy examples as well as for deblurring and electrical impedance tomography on certain image manifolds. △ Less

Submitted 12 June, 2024; v1 submitted 27 March, 2023; originally announced March 2023.

arXiv:2302.03577 [pdf, ps, other]

Compressed sensing for inverse problems and the sample complexity of the sparse Radon transform

Authors: Giovanni S. Alberti, Alessandro Felisi, Matteo Santacesaria, S. Ivan Trapasso

Abstract: Compressed sensing allows for the recovery of sparse signals from few measurements, whose number is proportional to the sparsity of the unknown signal, up to logarithmic factors. The classical theory typically considers either random linear measurements or subsampled isometries and has found many applications, including accelerated magnetic resonance imaging, which is modeled by the subsampled Fou… ▽ More Compressed sensing allows for the recovery of sparse signals from few measurements, whose number is proportional to the sparsity of the unknown signal, up to logarithmic factors. The classical theory typically considers either random linear measurements or subsampled isometries and has found many applications, including accelerated magnetic resonance imaging, which is modeled by the subsampled Fourier transform. In this work, we develop a general theory of infinite-dimensional compressed sensing for abstract inverse problems, possibly ill-posed, involving an arbitrary forward operator. This is achieved by considering a generalized restricted isometry property, and a quasi-diagonalization property of the forward map. As a notable application, for the first time, we obtain rigorous recovery estimates for the sparse Radon transform (i.e., with a finite number of angles $θ_1,\dots,θ_m$), which models computed tomography, in both the parallel-beam and the fan-beam settings. In the case when the unknown signal is $s$-sparse with respect to an orthonormal basis of compactly supported wavelets, we prove stable recovery under the condition \[ m\gtrsim s, \] up to logarithmic factors. △ Less

Submitted 3 May, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

Comments: 57 pages

MSC Class: 42C40; 44A12; 60B20; 94A20

arXiv:2206.05289 [pdf, other]

doi 10.1137/22M1503221

Localized adversarial artifacts for compressed sensing MRI

Authors: Rima Alaifari, Giovanni S. Alberti, Tandri Gauksson

Abstract: As interest in deep neural networks (DNNs) for image reconstruction tasks grows, their reliability has been called into question (Antun et al., 2020; Gottschling et al., 2020). However, recent work has shown that, compared to total variation (TV) minimization, when appropriately regularized, DNNs show similar robustness to adversarial noise in terms of $\ell^2$-reconstruction error (Genzel et al.,… ▽ More As interest in deep neural networks (DNNs) for image reconstruction tasks grows, their reliability has been called into question (Antun et al., 2020; Gottschling et al., 2020). However, recent work has shown that, compared to total variation (TV) minimization, when appropriately regularized, DNNs show similar robustness to adversarial noise in terms of $\ell^2$-reconstruction error (Genzel et al., 2022). We consider a different notion of robustness, using the $\ell^\infty$-norm, and argue that localized reconstruction artifacts are a more relevant defect than the $\ell^2$-error. We create adversarial perturbations to undersampled magnetic resonance imaging measurements (in the frequency domain) which induce severe localized artifacts in the TV-regularized reconstruction. Notably, the same attack method is not as effective against DNN based reconstruction. Finally, we show that this phenomenon is inherent to reconstruction methods for which exact recovery can be guaranteed, as with compressed sensing reconstructions with $\ell^1$- or TV-minimization. △ Less

Submitted 11 January, 2024; v1 submitted 10 June, 2022; originally announced June 2022.

Comments: 14 pages, 7 figures

Journal ref: SIAM Journal on Imaging Sciences, 16(4):SC14-SC26, 2023

arXiv:2205.14627 [pdf, other]

Continuous Generative Neural Networks

Authors: Giovanni S. Alberti, Matteo Santacesaria, Silvia Sciutto

Abstract: In this work, we present and study Continuous Generative Neural Networks (CGNNs), namely, generative models in the continuous setting: the output of a CGNN belongs to an infinite-dimensional function space. The architecture is inspired by DCGAN, with one fully connected layer, several convolutional layers and nonlinear activation functions. In the continuous $L^2$ setting, the dimensions of the sp… ▽ More In this work, we present and study Continuous Generative Neural Networks (CGNNs), namely, generative models in the continuous setting: the output of a CGNN belongs to an infinite-dimensional function space. The architecture is inspired by DCGAN, with one fully connected layer, several convolutional layers and nonlinear activation functions. In the continuous $L^2$ setting, the dimensions of the spaces of each layer are replaced by the scales of a multiresolution analysis of a compactly supported wavelet. We present conditions on the convolutional filters and on the nonlinearity that guarantee that a CGNN is injective. This theory finds applications to inverse problems, and allows for deriving Lipschitz stability estimates for (possibly nonlinear) infinite-dimensional inverse problems with unknowns belonging to the manifold generated by a CGNN. Several numerical simulations, including signal deblurring, illustrate and validate this approach. △ Less

Submitted 20 April, 2023; v1 submitted 29 May, 2022; originally announced May 2022.

Comments: 40 pages, 8 figures

arXiv:2106.06513 [pdf, other]

Learning the optimal Tikhonov regularizer for inverse problems

Authors: Giovanni S. Alberti, Ernesto De Vito, Matti Lassas, Luca Ratti, Matteo Santacesaria

Abstract: In this work, we consider the linear inverse problem $y=Ax+ε$, where $A\colon X\to Y$ is a known linear operator between the separable Hilbert spaces $X$ and $Y$, $x$ is a random variable in $X$ and $ε$ is a zero-mean random process in $Y$. This setting covers several inverse problems in imaging including denoising, deblurring, and X-ray tomography. Within the classical framework of regularization… ▽ More In this work, we consider the linear inverse problem $y=Ax+ε$, where $A\colon X\to Y$ is a known linear operator between the separable Hilbert spaces $X$ and $Y$, $x$ is a random variable in $X$ and $ε$ is a zero-mean random process in $Y$. This setting covers several inverse problems in imaging including denoising, deblurring, and X-ray tomography. Within the classical framework of regularization, we focus on the case where the regularization functional is not given a priori but learned from data. Our first result is a characterization of the optimal generalized Tikhonov regularizer, with respect to the mean squared error. We find that it is completely independent of the forward operator $A$ and depends only on the mean and covariance of $x$. Then, we consider the problem of learning the regularizer from a finite training set in two different frameworks: one supervised, based on samples of both $x$ and $y$, and one unsupervised, based only on samples of $x$. In both cases, we prove generalization bounds, under some weak assumptions on the distribution of $x$ and $ε$, including the case of sub-Gaussian variables. Our bounds hold in infinite-dimensional spaces, thereby showing that finer and finer discretizations do not make this learning problem harder. The results are validated through numerical simulations. △ Less

Submitted 22 November, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

Journal ref: Advances in Neural Information Processing Systems 34 (2021)

arXiv:1804.07729 [pdf, other]

ADef: an Iterative Algorithm to Construct Adversarial Deformations

Authors: Rima Alaifari, Giovanni S. Alberti, Tandri Gauksson

Abstract: While deep neural networks have proven to be a powerful tool for many recognition and classification tasks, their stability properties are still not well understood. In the past, image classifiers have been shown to be vulnerable to so-called adversarial attacks, which are created by additively perturbing the correctly classified image. In this paper, we propose the ADef algorithm to construct a d… ▽ More While deep neural networks have proven to be a powerful tool for many recognition and classification tasks, their stability properties are still not well understood. In the past, image classifiers have been shown to be vulnerable to so-called adversarial attacks, which are created by additively perturbing the correctly classified image. In this paper, we propose the ADef algorithm to construct a different kind of adversarial attack created by iteratively applying small deformations to the image, found through a gradient descent step. We demonstrate our results on MNIST with convolutional neural networks and on ImageNet with Inception-v3 and ResNet-101. △ Less

Submitted 11 January, 2019; v1 submitted 20 April, 2018; originally announced April 2018.

Comments: ICLR 2019 conference paper. 25 pages, 20 figures

arXiv:1803.03251 [pdf, other]

Dynamic Spike Super-resolution and Applications to Ultrafast Ultrasound Imaging

Authors: Giovanni S. Alberti, Habib Ammari, Francisco Romero, Timothée Wintz

Abstract: We consider the dynamical super-resolution problem consisting in the recovery of positions and velocities of moving particles from low-frequency static measurements taken over multiple time steps. The standard approach to this issue is a two-step process: first, at each time step some static reconstruction method is applied to locate the positions of the particles with super-resolution and, second… ▽ More We consider the dynamical super-resolution problem consisting in the recovery of positions and velocities of moving particles from low-frequency static measurements taken over multiple time steps. The standard approach to this issue is a two-step process: first, at each time step some static reconstruction method is applied to locate the positions of the particles with super-resolution and, second, some tracking technique is applied to obtain the velocities. In this paper we propose a fully dynamical method based on a phase-space lifting of the positions and the velocities of the particles, which are simultaneously reconstructed with super-resolution. We provide a rigorous mathematical analysis of the recovery problem, both for the noiseless case and in presence of noise (in the discrete setting). Several numerical simulations illustrate and validate our method, which shows some advantage over existing techniques. We then discuss the application of this approach to the dynamical super-resolution problem in ultrafast ultrasound imaging: blood vessels' locations and blood flow velocities are recovered with super-resolution. △ Less

Submitted 30 November, 2018; v1 submitted 8 March, 2018; originally announced March 2018.

Comments: 31 pages, 14 figures

MSC Class: 65Z05; 42A05; 42A15; 94A08; 94A20; 65J22

arXiv:1710.11093 [pdf, ps, other]

doi 10.1016/j.acha.2019.08.002

Infinite dimensional compressed sensing from anisotropic measurements and applications to inverse problems in PDE

Authors: Giovanni S. Alberti, Matteo Santacesaria

Abstract: We consider a compressed sensing problem in which both the measurement and the sparsifying systems are assumed to be frames (not necessarily tight) of the underlying Hilbert space of signals, which may be finite or infinite dimensional. The main result gives explicit bounds on the number of measurements in order to achieve stable recovery, which depends on the mutual coherence of the two systems.… ▽ More We consider a compressed sensing problem in which both the measurement and the sparsifying systems are assumed to be frames (not necessarily tight) of the underlying Hilbert space of signals, which may be finite or infinite dimensional. The main result gives explicit bounds on the number of measurements in order to achieve stable recovery, which depends on the mutual coherence of the two systems. As a simple corollary, we prove the efficiency of nonuniform sampling strategies in cases when the two systems are not incoherent, but only asymptotically incoherent, as with the recovery of wavelet coefficients from Fourier samples. This general framework finds applications to inverse problems in partial differential equations, where the standard assumptions of compressed sensing are often not satisfied. Several examples are discussed, with a special focus on electrical impedance tomography. △ Less

Submitted 25 May, 2019; v1 submitted 30 October, 2017; originally announced October 2017.

Comments: 42 pages

MSC Class: 94A20; 94A08; 42C40; 35R30

Journal ref: Applied and Computational Harmonic Analysis, 50 (2021), pp. 105-146

Showing 1–9 of 9 results for author: Alberti, G S