-
Redatuming physical systems using symmetric autoencoders
Authors:
Pawan Bharadwaj,
Matthew Li,
Laurent Demanet
Abstract:
This paper considers physical systems described by hidden states and indirectly observed through repeated measurements corrupted by unmodeled nuisance parameters. A network-based representation learns to disentangle the coherent information (relative to the state) from the incoherent nuisance information (relative to the sensing). Instead of physical models, the representation uses symmetry and st…
▽ More
This paper considers physical systems described by hidden states and indirectly observed through repeated measurements corrupted by unmodeled nuisance parameters. A network-based representation learns to disentangle the coherent information (relative to the state) from the incoherent nuisance information (relative to the sensing). Instead of physical models, the representation uses symmetry and stochastic regularization to inform an autoencoder architecture called SymAE. It enables redatuming, i.e., creating virtual data instances where the nuisances are uniformized across measurements.
△ Less
Submitted 5 February, 2022; v1 submitted 5 August, 2021;
originally announced August 2021.
-
Accurate and Robust Deep Learning Framework for Solving Wave-Based Inverse Problems in the Super-Resolution Regime
Authors:
Matthew Li,
Laurent Demanet,
Leonardo Zepeda-Núñez
Abstract:
We propose an end-to-end deep learning framework that comprehensively solves the inverse wave scattering problem across all length scales. Our framework consists of the newly introduced wide-band butterfly network coupled with a simple training procedure that dynamically injects noise during training. While our trained network provides competitive results in classical imaging regimes, most notably…
▽ More
We propose an end-to-end deep learning framework that comprehensively solves the inverse wave scattering problem across all length scales. Our framework consists of the newly introduced wide-band butterfly network coupled with a simple training procedure that dynamically injects noise during training. While our trained network provides competitive results in classical imaging regimes, most notably it also succeeds in the super-resolution regime where other comparable methods fail. This encompasses both (i) reconstruction of scatterers with sub-wavelength geometric features, and (ii) accurate imaging when two or more scatterers are separated by less than the classical diffraction limit. We demonstrate these properties are retained even in the presence of strong noise and extend to scatterers not previously seen in the training set. In addition, our network is straightforward to train requiring no restarts and has an online runtime that is an order of magnitude faster than optimization-based algorithms. We perform experiments with a variety of wave scattering mediums and we demonstrate that our proposed framework outperforms both classical inversion and competing network architectures that specialize in oscillatory wave scattering data.
△ Less
Submitted 2 June, 2021;
originally announced June 2021.
-
Wide-band butterfly network: stable and efficient inversion via multi-frequency neural networks
Authors:
Matthew Li,
Laurent Demanet,
Leonardo Zepeda-Núñez
Abstract:
We introduce an end-to-end deep learning architecture called the wide-band butterfly network (WideBNet) for approximating the inverse scattering map from wide-band scattering data. This architecture incorporates tools from computational harmonic analysis, such as the butterfly factorization, and traditional multi-scale methods, such as the Cooley-Tukey FFT algorithm, to drastically reduce the numb…
▽ More
We introduce an end-to-end deep learning architecture called the wide-band butterfly network (WideBNet) for approximating the inverse scattering map from wide-band scattering data. This architecture incorporates tools from computational harmonic analysis, such as the butterfly factorization, and traditional multi-scale methods, such as the Cooley-Tukey FFT algorithm, to drastically reduce the number of trainable parameters to match the inherent complexity of the problem. As a result WideBNet is efficient: it requires fewer training points than off-the-shelf architectures, and has stable training dynamics, thus it can rely on standard weight initialization strategies. The architecture automatically adapts to the dimensions of the data with only a few hyper-parameters that the user must specify. WideBNet is able to produce images that are competitive with optimization-based approaches, but at a fraction of the cost, and we also demonstrate numerically that it learns to super-resolve scatterers in the full aperture scattering setup.
△ Less
Submitted 28 October, 2021; v1 submitted 24 November, 2020;
originally announced November 2020.
-
Conditioning of partial nonuniform Fourier matrices with clustered nodes
Authors:
Dmitry Batenkov,
Laurent Demanet,
Gil Goldman,
Yosef Yomdin
Abstract:
We prove sharp lower bounds for the smallest singular value of a partial Fourier matrix with arbitrary "off the grid" nodes (equivalently, a rectangular Vandermonde matrix with the nodes on the unit circle), in the case when some of the nodes are separated by less than the inverse bandwidth. The bound is polynomial in the reciprocal of the so-called "super-resolution factor", while the exponent is…
▽ More
We prove sharp lower bounds for the smallest singular value of a partial Fourier matrix with arbitrary "off the grid" nodes (equivalently, a rectangular Vandermonde matrix with the nodes on the unit circle), in the case when some of the nodes are separated by less than the inverse bandwidth. The bound is polynomial in the reciprocal of the so-called "super-resolution factor", while the exponent is controlled by the maximal number of nodes which are clustered together. As a corollary, we obtain sharp minimax bounds for the problem of sparse super-resolution on a grid under the partial clustering assumptions.
△ Less
Submitted 19 June, 2019; v1 submitted 3 September, 2018;
originally announced September 2018.
-
Leveraging Diversity and Sparsity in Blind Deconvolution
Authors:
Ali Ahmed,
Laurent Demanet
Abstract:
This paper considers recovering $L$-dimensional vectors $\boldsymbol{w}$, and $\boldsymbol{x}_1,\boldsymbol{x}_2, \ldots, \boldsymbol{x}_N$ from their circular convolutions $\boldsymbol{y}_n = \boldsymbol{w}*\boldsymbol{x}_n, \ n = 1,2,3, \ldots, N$. The vector $\boldsymbol{w}$ is assumed to be $S$-sparse in a known basis that is spread out in the Fourier domain, and each input $\boldsymbol{x}_n$…
▽ More
This paper considers recovering $L$-dimensional vectors $\boldsymbol{w}$, and $\boldsymbol{x}_1,\boldsymbol{x}_2, \ldots, \boldsymbol{x}_N$ from their circular convolutions $\boldsymbol{y}_n = \boldsymbol{w}*\boldsymbol{x}_n, \ n = 1,2,3, \ldots, N$. The vector $\boldsymbol{w}$ is assumed to be $S$-sparse in a known basis that is spread out in the Fourier domain, and each input $\boldsymbol{x}_n$ is a member of a known $K$-dimensional random subspace.
We prove that whenever $K + S\log^2S \lesssim L /\log^4(LN)$, the problem can be solved effectively by using only the nuclear-norm minimization as the convex relaxation, as long as the inputs are sufficiently diverse and obey $N \gtrsim \log^2(LN)$. By "diverse inputs", we mean that the $\boldsymbol{x}_n$'s belong to different, generic subspaces. To our knowledge, this is the first theoretical result on blind deconvolution where the subspace to which $\boldsymbol{w}$ belongs is not fixed, but needs to be determined.
We discuss the result in the context of multipath channel estimation in wireless communications. Both the fading coefficients, and the delays in the channel impulse response $\boldsymbol{w}$ are unknown. The encoder codes the $K$-dimensional message vectors randomly and then transmits coded messages $\boldsymbol{x}_n$'s over a fixed channel one after the other. The decoder then discovers all of the messages and the channel response when the number of samples taken for each received message are roughly greater than $(K+S\log^2S)\log^4(LN)$, and the number of messages is roughly at least $\log^2(LN)$.
△ Less
Submitted 15 December, 2017; v1 submitted 19 October, 2016;
originally announced October 2016.
-
Stable extrapolation of analytic functions
Authors:
Laurent Demanet,
Alex Townsend
Abstract:
This paper examines the problem of extrapolation of an analytic function for $x > 1$ given perturbed samples from an equally spaced grid on $[-1,1]$. Mathematical folklore states that extrapolation is in general hopelessly ill-conditioned, but we show that a more precise statement carries an interesting nuance. For a function $f$ on $[-1,1]$ that is analytic in a Bernstein ellipse with parameter…
▽ More
This paper examines the problem of extrapolation of an analytic function for $x > 1$ given perturbed samples from an equally spaced grid on $[-1,1]$. Mathematical folklore states that extrapolation is in general hopelessly ill-conditioned, but we show that a more precise statement carries an interesting nuance. For a function $f$ on $[-1,1]$ that is analytic in a Bernstein ellipse with parameter $ρ> 1$, and for a uniform perturbation level $ε$ on the function samples, we construct an asymptotically best extrapolant $e(x)$ as a least squares polynomial approximant of degree $M^*$ given explicitly. We show that the extrapolant $e(x)$ converges to $f(x)$ pointwise in the interval $I_ρ\in[1,(ρ+ρ^{-1})/2)$ as $ε\to 0$, at a rate given by a $x$-dependent fractional power of $ε$. More precisely, for each $x \in I_ρ$ we have
\[
|f(x) - e(x)| = \mathcal{O}\left( ε^{-\log r(x) / \logρ} \right), \qquad\qquad r(x) = \frac{x+\sqrt{x^2-1}}ρ,
\] up to log factors, provided that the oversampling conditioning is satisfied. That is,
\[
M^* \leq \frac{1}{2} \sqrt{N},
\] which is known to be needed from approximation theory. In short, extrapolation enjoys a weak form of stability, up to a fraction of the characteristic smoothness length. The number of function samples, $N+1$, does not bear on the size of the extrapolation error provided that it obeys the oversampling condition. We also show that one cannot construct an asymptotically more accurate extrapolant from $N+1$ equally spaced samples than $e(x)$, using any other linear or nonlinear procedure. The proofs involve original statements on the stability of polynomial approximation in the Chebyshev basis from equally spaced samples and these are expected to be of independent interest.
△ Less
Submitted 31 May, 2016;
originally announced May 2016.
-
The recoverability limit for superresolution via sparsity
Authors:
Laurent Demanet,
Nam Nguyen
Abstract:
We consider the problem of robustly recovering a $k$-sparse coefficient vector from the Fourier series that it generates, restricted to the interval $[- Ω, Ω]$. The difficulty of this problem is linked to the superresolution factor SRF, equal to the ratio of the Rayleigh length (inverse of $Ω$) by the spacing of the grid supporting the sparse vector. In the presence of additive deterministic noise…
▽ More
We consider the problem of robustly recovering a $k$-sparse coefficient vector from the Fourier series that it generates, restricted to the interval $[- Ω, Ω]$. The difficulty of this problem is linked to the superresolution factor SRF, equal to the ratio of the Rayleigh length (inverse of $Ω$) by the spacing of the grid supporting the sparse vector. In the presence of additive deterministic noise of norm $σ$, we show upper and lower bounds on the minimax error rate that both scale like $(SRF)^{2k-1} σ$, providing a partial answer to a question posed by Donoho in 1992. The scaling arises from comparing the noise level to a restricted isometry constant at sparsity $2k$, or equivalently from comparing $2k$ to the so-called $σ$-spark of the Fourier system. The proof involves new bounds on the singular values of restricted Fourier matrices, obtained in part from old techniques in complex analysis.
△ Less
Submitted 4 February, 2015;
originally announced February 2015.
-
Convex recovery from interferometric measurements
Authors:
Laurent Demanet,
Vincent Jugnon
Abstract:
This note formulates a deterministic recovery result for vectors $x$ from quadratic measurements of the form $(Ax)_i \overline{(Ax)_j}$ for some left-invertible $A$. Recovery is exact, or stable in the noisy case, when the couples $(i,j)$ are chosen as edges of a well-connected graph. One possible way of obtaining the solution is as a feasible point of a simple semidefinite program. Furthermore, w…
▽ More
This note formulates a deterministic recovery result for vectors $x$ from quadratic measurements of the form $(Ax)_i \overline{(Ax)_j}$ for some left-invertible $A$. Recovery is exact, or stable in the noisy case, when the couples $(i,j)$ are chosen as edges of a well-connected graph. One possible way of obtaining the solution is as a feasible point of a simple semidefinite program. Furthermore, we show how the proportionality constant in the error estimate depends on the spectral gap of a data-weighted graph Laplacian. Such quadratic measurements have found applications in phase retrieval, angular synchronization, and more recently interferometric waveform inversion.
△ Less
Submitted 15 January, 2018; v1 submitted 25 July, 2013;
originally announced July 2013.
-
Super-resolution via superset selection and pruning
Authors:
Laurent Demanet,
Deanna Needell,
Nam Nguyen
Abstract:
We present a pursuit-like algorithm that we call the "superset method" for recovery of sparse vectors from consecutive Fourier measurements in the super-resolution regime. The algorithm has a subspace identification step that hinges on the translation invariance of the Fourier transform, followed by a removal step to estimate the solution's support. The superset method is always successful in the…
▽ More
We present a pursuit-like algorithm that we call the "superset method" for recovery of sparse vectors from consecutive Fourier measurements in the super-resolution regime. The algorithm has a subspace identification step that hinges on the translation invariance of the Fourier transform, followed by a removal step to estimate the solution's support. The superset method is always successful in the noiseless regime (unlike L1-minimization) and generalizes to higher dimensions (unlike the matrix pencil method). Relative robustness to noise is demonstrated numerically.
△ Less
Submitted 10 June, 2013; v1 submitted 25 February, 2013;
originally announced February 2013.
-
Eventual linear convergence of the Douglas Rachford iteration for basis pursuit
Authors:
Laurent Demanet,
Xiangxiong Zhang
Abstract:
We provide a simple analysis of the Douglas-Rachford splitting algorithm in the context of $\ell^1$ minimization with linear constraints, and quantify the asymptotic linear convergence rate in terms of principal angles between relevant vector spaces. In the compressed sensing setting, we show how to bound this rate in terms of the restricted isometry constant. More general iterative schemes obtain…
▽ More
We provide a simple analysis of the Douglas-Rachford splitting algorithm in the context of $\ell^1$ minimization with linear constraints, and quantify the asymptotic linear convergence rate in terms of principal angles between relevant vector spaces. In the compressed sensing setting, we show how to bound this rate in terms of the restricted isometry constant. More general iterative schemes obtained by $\ell^2$-regularization and over-relaxation including the dual split Bregman method are also treated, which answers the question how to choose the relaxation and soft-thresholding parameters to accelerate the asymptotic convergence rate. We make no attempt at characterizing the transient regime preceding the onset of linear convergence.
△ Less
Submitted 29 May, 2013; v1 submitted 3 January, 2013;
originally announced January 2013.