-
Learning a Gaussian Mixture for Sparsity Regularization in Inverse Problems
Authors:
Giovanni S. Alberti,
Luca Ratti,
Matteo Santacesaria,
Silvia Sciutto
Abstract:
In inverse problems, it is widely recognized that the incorporation of a sparsity prior yields a regularization effect on the solution. This approach is grounded on the a priori assumption that the unknown can be appropriately represented in a basis with a limited number of significant components, while most coefficients are close to zero. This occurrence is frequently observed in real-world scena…
▽ More
In inverse problems, it is widely recognized that the incorporation of a sparsity prior yields a regularization effect on the solution. This approach is grounded on the a priori assumption that the unknown can be appropriately represented in a basis with a limited number of significant components, while most coefficients are close to zero. This occurrence is frequently observed in real-world scenarios, such as with piecewise smooth signals. In this study, we propose a probabilistic sparsity prior formulated as a mixture of degenerate Gaussians, capable of modeling sparsity with respect to a generic basis. Under this premise, we design a neural network that can be interpreted as the Bayes estimator for linear inverse problems. Additionally, we put forth both a supervised and an unsupervised training strategy to estimate the parameters of this network. To evaluate the effectiveness of our approach, we conduct a numerical comparison with commonly employed sparsity-promoting regularization techniques, namely LASSO, group LASSO, iterative hard thresholding, and sparse coding/dictionary learning. Notably, our reconstructions consistently exhibit lower mean square error values across all $1$D datasets utilized for the comparisons, even in cases where the datasets significantly deviate from a Gaussian mixture model.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Manifold Learning by Mixture Models of VAEs for Inverse Problems
Authors:
Giovanni S. Alberti,
Johannes Hertrich,
Matteo Santacesaria,
Silvia Sciutto
Abstract:
Representing a manifold of very high-dimensional data with generative models has been shown to be computationally efficient in practice. However, this requires that the data manifold admits a global parameterization. In order to represent manifolds of arbitrary topology, we propose to learn a mixture model of variational autoencoders. Here, every encoder-decoder pair represents one chart of a mani…
▽ More
Representing a manifold of very high-dimensional data with generative models has been shown to be computationally efficient in practice. However, this requires that the data manifold admits a global parameterization. In order to represent manifolds of arbitrary topology, we propose to learn a mixture model of variational autoencoders. Here, every encoder-decoder pair represents one chart of a manifold. We propose a loss function for maximum likelihood estimation of the model weights and choose an architecture that provides us the analytical expression of the charts and of their inverses. Once the manifold is learned, we use it for solving inverse problems by minimizing a data fidelity term restricted to the learned manifold. To solve the arising minimization problem we propose a Riemannian gradient descent algorithm on the learned manifold. We demonstrate the performance of our method for low-dimensional toy examples as well as for deblurring and electrical impedance tomography on certain image manifolds.
△ Less
Submitted 12 June, 2024; v1 submitted 27 March, 2023;
originally announced March 2023.
-
Compressed sensing for inverse problems and the sample complexity of the sparse Radon transform
Authors:
Giovanni S. Alberti,
Alessandro Felisi,
Matteo Santacesaria,
S. Ivan Trapasso
Abstract:
Compressed sensing allows for the recovery of sparse signals from few measurements, whose number is proportional to the sparsity of the unknown signal, up to logarithmic factors. The classical theory typically considers either random linear measurements or subsampled isometries and has found many applications, including accelerated magnetic resonance imaging, which is modeled by the subsampled Fou…
▽ More
Compressed sensing allows for the recovery of sparse signals from few measurements, whose number is proportional to the sparsity of the unknown signal, up to logarithmic factors. The classical theory typically considers either random linear measurements or subsampled isometries and has found many applications, including accelerated magnetic resonance imaging, which is modeled by the subsampled Fourier transform. In this work, we develop a general theory of infinite-dimensional compressed sensing for abstract inverse problems, possibly ill-posed, involving an arbitrary forward operator. This is achieved by considering a generalized restricted isometry property, and a quasi-diagonalization property of the forward map.
As a notable application, for the first time, we obtain rigorous recovery estimates for the sparse Radon transform (i.e., with a finite number of angles $θ_1,\dots,θ_m$), which models computed tomography, in both the parallel-beam and the fan-beam settings. In the case when the unknown signal is $s$-sparse with respect to an orthonormal basis of compactly supported wavelets, we prove stable recovery under the condition \[ m\gtrsim s, \] up to logarithmic factors.
△ Less
Submitted 3 May, 2024; v1 submitted 7 February, 2023;
originally announced February 2023.
-
Localized adversarial artifacts for compressed sensing MRI
Authors:
Rima Alaifari,
Giovanni S. Alberti,
Tandri Gauksson
Abstract:
As interest in deep neural networks (DNNs) for image reconstruction tasks grows, their reliability has been called into question (Antun et al., 2020; Gottschling et al., 2020). However, recent work has shown that, compared to total variation (TV) minimization, when appropriately regularized, DNNs show similar robustness to adversarial noise in terms of $\ell^2$-reconstruction error (Genzel et al.,…
▽ More
As interest in deep neural networks (DNNs) for image reconstruction tasks grows, their reliability has been called into question (Antun et al., 2020; Gottschling et al., 2020). However, recent work has shown that, compared to total variation (TV) minimization, when appropriately regularized, DNNs show similar robustness to adversarial noise in terms of $\ell^2$-reconstruction error (Genzel et al., 2022). We consider a different notion of robustness, using the $\ell^\infty$-norm, and argue that localized reconstruction artifacts are a more relevant defect than the $\ell^2$-error. We create adversarial perturbations to undersampled magnetic resonance imaging measurements (in the frequency domain) which induce severe localized artifacts in the TV-regularized reconstruction. Notably, the same attack method is not as effective against DNN based reconstruction. Finally, we show that this phenomenon is inherent to reconstruction methods for which exact recovery can be guaranteed, as with compressed sensing reconstructions with $\ell^1$- or TV-minimization.
△ Less
Submitted 11 January, 2024; v1 submitted 10 June, 2022;
originally announced June 2022.
-
Continuous Generative Neural Networks
Authors:
Giovanni S. Alberti,
Matteo Santacesaria,
Silvia Sciutto
Abstract:
In this work, we present and study Continuous Generative Neural Networks (CGNNs), namely, generative models in the continuous setting: the output of a CGNN belongs to an infinite-dimensional function space. The architecture is inspired by DCGAN, with one fully connected layer, several convolutional layers and nonlinear activation functions. In the continuous $L^2$ setting, the dimensions of the sp…
▽ More
In this work, we present and study Continuous Generative Neural Networks (CGNNs), namely, generative models in the continuous setting: the output of a CGNN belongs to an infinite-dimensional function space. The architecture is inspired by DCGAN, with one fully connected layer, several convolutional layers and nonlinear activation functions. In the continuous $L^2$ setting, the dimensions of the spaces of each layer are replaced by the scales of a multiresolution analysis of a compactly supported wavelet. We present conditions on the convolutional filters and on the nonlinearity that guarantee that a CGNN is injective. This theory finds applications to inverse problems, and allows for deriving Lipschitz stability estimates for (possibly nonlinear) infinite-dimensional inverse problems with unknowns belonging to the manifold generated by a CGNN. Several numerical simulations, including signal deblurring, illustrate and validate this approach.
△ Less
Submitted 20 April, 2023; v1 submitted 29 May, 2022;
originally announced May 2022.
-
Learning the optimal Tikhonov regularizer for inverse problems
Authors:
Giovanni S. Alberti,
Ernesto De Vito,
Matti Lassas,
Luca Ratti,
Matteo Santacesaria
Abstract:
In this work, we consider the linear inverse problem $y=Ax+ε$, where $A\colon X\to Y$ is a known linear operator between the separable Hilbert spaces $X$ and $Y$, $x$ is a random variable in $X$ and $ε$ is a zero-mean random process in $Y$. This setting covers several inverse problems in imaging including denoising, deblurring, and X-ray tomography. Within the classical framework of regularization…
▽ More
In this work, we consider the linear inverse problem $y=Ax+ε$, where $A\colon X\to Y$ is a known linear operator between the separable Hilbert spaces $X$ and $Y$, $x$ is a random variable in $X$ and $ε$ is a zero-mean random process in $Y$. This setting covers several inverse problems in imaging including denoising, deblurring, and X-ray tomography. Within the classical framework of regularization, we focus on the case where the regularization functional is not given a priori but learned from data. Our first result is a characterization of the optimal generalized Tikhonov regularizer, with respect to the mean squared error. We find that it is completely independent of the forward operator $A$ and depends only on the mean and covariance of $x$. Then, we consider the problem of learning the regularizer from a finite training set in two different frameworks: one supervised, based on samples of both $x$ and $y$, and one unsupervised, based only on samples of $x$. In both cases, we prove generalization bounds, under some weak assumptions on the distribution of $x$ and $ε$, including the case of sub-Gaussian variables. Our bounds hold in infinite-dimensional spaces, thereby showing that finer and finer discretizations do not make this learning problem harder. The results are validated through numerical simulations.
△ Less
Submitted 22 November, 2021; v1 submitted 11 June, 2021;
originally announced June 2021.
-
ADef: an Iterative Algorithm to Construct Adversarial Deformations
Authors:
Rima Alaifari,
Giovanni S. Alberti,
Tandri Gauksson
Abstract:
While deep neural networks have proven to be a powerful tool for many recognition and classification tasks, their stability properties are still not well understood. In the past, image classifiers have been shown to be vulnerable to so-called adversarial attacks, which are created by additively perturbing the correctly classified image. In this paper, we propose the ADef algorithm to construct a d…
▽ More
While deep neural networks have proven to be a powerful tool for many recognition and classification tasks, their stability properties are still not well understood. In the past, image classifiers have been shown to be vulnerable to so-called adversarial attacks, which are created by additively perturbing the correctly classified image. In this paper, we propose the ADef algorithm to construct a different kind of adversarial attack created by iteratively applying small deformations to the image, found through a gradient descent step. We demonstrate our results on MNIST with convolutional neural networks and on ImageNet with Inception-v3 and ResNet-101.
△ Less
Submitted 11 January, 2019; v1 submitted 20 April, 2018;
originally announced April 2018.
-
Dynamic Spike Super-resolution and Applications to Ultrafast Ultrasound Imaging
Authors:
Giovanni S. Alberti,
Habib Ammari,
Francisco Romero,
Timothée Wintz
Abstract:
We consider the dynamical super-resolution problem consisting in the recovery of positions and velocities of moving particles from low-frequency static measurements taken over multiple time steps. The standard approach to this issue is a two-step process: first, at each time step some static reconstruction method is applied to locate the positions of the particles with super-resolution and, second…
▽ More
We consider the dynamical super-resolution problem consisting in the recovery of positions and velocities of moving particles from low-frequency static measurements taken over multiple time steps. The standard approach to this issue is a two-step process: first, at each time step some static reconstruction method is applied to locate the positions of the particles with super-resolution and, second, some tracking technique is applied to obtain the velocities. In this paper we propose a fully dynamical method based on a phase-space lifting of the positions and the velocities of the particles, which are simultaneously reconstructed with super-resolution. We provide a rigorous mathematical analysis of the recovery problem, both for the noiseless case and in presence of noise (in the discrete setting). Several numerical simulations illustrate and validate our method, which shows some advantage over existing techniques. We then discuss the application of this approach to the dynamical super-resolution problem in ultrafast ultrasound imaging: blood vessels' locations and blood flow velocities are recovered with super-resolution.
△ Less
Submitted 30 November, 2018; v1 submitted 8 March, 2018;
originally announced March 2018.
-
Infinite dimensional compressed sensing from anisotropic measurements and applications to inverse problems in PDE
Authors:
Giovanni S. Alberti,
Matteo Santacesaria
Abstract:
We consider a compressed sensing problem in which both the measurement and the sparsifying systems are assumed to be frames (not necessarily tight) of the underlying Hilbert space of signals, which may be finite or infinite dimensional. The main result gives explicit bounds on the number of measurements in order to achieve stable recovery, which depends on the mutual coherence of the two systems.…
▽ More
We consider a compressed sensing problem in which both the measurement and the sparsifying systems are assumed to be frames (not necessarily tight) of the underlying Hilbert space of signals, which may be finite or infinite dimensional. The main result gives explicit bounds on the number of measurements in order to achieve stable recovery, which depends on the mutual coherence of the two systems. As a simple corollary, we prove the efficiency of nonuniform sampling strategies in cases when the two systems are not incoherent, but only asymptotically incoherent, as with the recovery of wavelet coefficients from Fourier samples. This general framework finds applications to inverse problems in partial differential equations, where the standard assumptions of compressed sensing are often not satisfied. Several examples are discussed, with a special focus on electrical impedance tomography.
△ Less
Submitted 25 May, 2019; v1 submitted 30 October, 2017;
originally announced October 2017.