-
An optimal transport analogue of the Rudin Osher Fatemi model and its corresponding multiscale theory
Authors:
Tristan Milne,
Adrian Nachman
Abstract:
We develop a theory for image restoration with a learned regularizer that is analogous to that of Meyer's characterization of solutions of the classical variational method of Rudin-Osher-Fatemi (ROF). The learned regularizer we use is a Kantorovich potential for an optimal transport problem of mapping a distribution of noisy images onto clean ones, as first proposed by Lunz, Öktem and Schönlieb. We show that the effect of their restoration method on the distribution of the images is an explicit Euler discretization of a gradient flow on probability space, while our variational problem, dubbed Wasserstein ROF (WROF), is the corresponding implicit discretization. We obtain our geometric characterisation of the solution in this setting by first proving a more general convex analysis theorem for variational problems with solutions characterised by projections. We then use optimal transport arguments to obtain our WROF theorem from this general result, as well as a decomposition of a transport map into large scale "features" and small scale "details", where scale refers to the magnitude of the transport distance. Further, we leverage our theory to analyze two algorithms which iterate WROF. We refer to these as iterative regularization and multiscale transport. For the former we prove convergence to the clean data. For the latter we produce successive approximations to the target distribution that match it up to finer and finer scales. These algorithms are in complete analogy to well-known effective methods based on ROF for iterative denoising, respectively hierarchical image decomposition. We also obtain an analogue of the Tadmor Nezzar Vese energy identity which decomposes the Wasserstein 2 distance between two measures into a sum of non-negative terms that correspond to transport costs at different scales.
Submitted 30 April, 2023;
originally announced May 2023.
-
A new method for determining Wasserstein 1 optimal transport maps from Kantorovich potentials, with deep learning applications
Authors:
Tristan Milne,
Étienne Bilocq,
Adrian Nachman
Abstract:
Wasserstein 1 optimal transport maps provide a natural correspondence between points from two probability distributions, $μ$ and $ν$, which is useful in many applications. Available algorithms for computing these maps do not appear to scale well to high dimensions. In deep learning applications, efficient algorithms have been developed for approximating solutions of the dual problem, known as Kantorovich potentials, using neural networks (e.g. [Gulrajani et al., 2017]). Importantly, such algorithms work well in high dimensions. In this paper we present an approach towards computing Wasserstein 1 optimal transport maps that relies only on Kantorovich potentials. In general, a Wasserstein 1 optimal transport map is not unique and is not computable from a potential alone. Our main result is to prove that if $μ$ has a density and $ν$ is supported on a submanifold of codimension at least 2, an optimal transport map is unique and can be written explicitly in terms of a potential. These assumptions are natural in many image processing contexts and other applications. When the Kantorovich potential is only known approximately, our result motivates an iterative procedure wherein data is moved in optimal directions and with the correct average displacement. Since this provides an approach for transforming one distribution to another, it can be used as a multipurpose algorithm for various transport problems; we demonstrate through several proof of concept experiments that this algorithm successfully performs various imaging tasks, such as denoising, generation, translation and deblurring, which normally require specialized techniques.
Submitted 1 November, 2022;
originally announced November 2022.
-
Trust the Critics: Generatorless and Multipurpose WGANs with Initial Convergence Guarantees
Authors:
Tristan Milne,
Étienne Bilocq,
Adrian Nachman
Abstract:
Inspired by ideas from optimal transport theory we present Trust the Critics (TTC), a new algorithm for generative modelling. This algorithm eliminates the trainable generator from a Wasserstein GAN; instead, it iteratively modifies the source data using gradient descent on a sequence of trained critic networks. This is motivated in part by the misalignment which we observed between the optimal transport directions provided by the gradients of the critic and the directions in which data points actually move when parametrized by a trainable generator. Previous work has arrived at similar ideas from different viewpoints, but our basis in optimal transport theory motivates the choice of an adaptive step size which greatly accelerates convergence compared to a constant step size. Using this step size rule, we prove an initial geometric convergence rate in the case of source distributions with densities. These convergence rates cease to apply only when a non-negligible set of generated data is essentially indistinguishable from real data. Resolving the misalignment issue improves performance, which we demonstrate in experiments that show that given a fixed number of training epochs, TTC produces higher quality images than a comparable WGAN, albeit at increased memory requirements. In addition, TTC provides an iterative formula for the transformed density, which traditional WGANs do not. Finally, TTC can be applied to map any source distribution onto any target; we demonstrate through experiments that TTC can obtain competitive performance in image generation, translation, and denoising without dedicated algorithms.
Submitted 29 November, 2021;
originally announced November 2021.
-
Wasserstein GANs with Gradient Penalty Compute Congested Transport
Authors:
Tristan Milne,
Adrian Nachman
Abstract:
Wasserstein GANs with Gradient Penalty (WGAN-GP) are a very popular method for training generative models to produce high quality synthetic data. While WGAN-GP were initially developed to calculate the Wasserstein 1 distance between generated and real data, recent works (e.g. [23]) have provided empirical evidence that this does not occur, and have argued that WGAN-GP perform well not in spite of this issue, but because of it. In this paper we show for the first time that WGAN-GP compute the minimum of a different optimal transport problem, the so-called congested transport [7]. Congested transport determines the cost of moving one distribution to another under a transport model that penalizes congestion. For WGAN-GP, we find that the congestion penalty has a spatially varying component determined by the sampling strategy used in [12] which acts like a local speed limit, making congestion cost less in some regions than others. This aspect of the congested transport problem is new, in that the congestion penalty turns out to be unbounded and depends on the distributions to be transported, and so we provide the necessary mathematical proofs for this setting. One facet of our discovery is a formula connecting the gradient of solutions to the optimization problem in WGAN-GP to the time averaged momentum of the optimal mass flow. This is in contrast to the gradient of Kantorovich potentials for the Wasserstein 1 distance, which is just the normalized direction of flow. Based on this and other considerations, we speculate on how our results explain the observed performance of WGAN-GP. Beyond applications to GANs, our theorems also point to the possibility of approximately solving large scale congested transport problems using neural network techniques.
Submitted 30 June, 2022; v1 submitted 1 September, 2021;
originally announced September 2021.
-
A Multiscale Theory for Image Registration and Nonlinear Inverse Problems
Authors:
Klas Modin,
Adrian Nachman,
Luca Rondi
Abstract:
In an influential paper, Tadmor, Nezzar and Vese (Multiscale Model. Simul. (2004)) introduced a hierarchical decomposition of an image as a sum of constituents of different scales. Here we construct analogous hierarchical expansions for diffeomorphisms, in the context of image registration, with the sum replaced by composition of maps. We treat this as a special case of a general framework for multiscale decompositions, applicable to a wide range of imaging and nonlinear inverse problems. As a paradigmatic example of the latter, we consider the Calderón inverse conductivity problem. We prove that we can simultaneously perform a numerical reconstruction and a multiscale decomposition of the unknown conductivity, driven by the inverse problem itself. We provide novel convergence proofs which work in the general abstract settings, yet are sharp enough to settle an open problem on the hierarchical decomposition of Tadmor, Nezzar and Vese for arbitrary functions in $L^2$. We also give counterexamples that show the optimality of our general results.
Submitted 19 March, 2020; v1 submitted 5 March, 2018;
originally announced March 2018.
-
Determining a Riemannian Metric from Minimal Areas
Authors:
Spyros Alexakis,
Tracey Balehowsky,
Adrian Nachman
Abstract:
We prove that if $(M,g)$ is a topological 3-ball with a $C^4$-smooth Riemannian metric $g$, and mean-convex boundary $\partial M$, then knowledge of least areas circumscribed by simple closed curves $γ\subset \partial M$ uniquely determines the metric $g$, under some additional geometric assumptions. These are that $g$ is either a) $C^3$-close to Euclidean or b) satisfies much weaker geometric conditions which hold when the manifold is to a sufficient degree either thin, or straight.
In fact, the least area data that we require is for a much more restricted class of curves $γ\subset \partial M$. We also prove a corresponding local result: assuming only that $(M,g)$ has strictly mean convex boundary at a point $p\in\partial M$, we prove that knowledge of the least areas circumscribed by any simple closed curve $γ$ in a neighbourhood $U\subset \partial M$ of $p$ uniquely determines the metric near $p$. Additionally, we sketch the proof of a global result with no thin/straight or curvature condition, but assuming the metric admits minimal foliations "from all directions".
The proofs rely on finding the metric along a continuous sweep-out of $M$ by area-minimizing surfaces; they bring together ideas from the 2D-Calderón inverse problem, minimal surface theory, and the careful analysis of a system of pseudo-differential equations.
Submitted 6 February, 2018; v1 submitted 26 November, 2017;
originally announced November 2017.
-
A Nonlinear Plancherel Theorem with Applications to Global Well-Posedness for the Defocusing Davey-Stewartson Equation and to the Inverse Boundary Value Problem of Calderón
Authors:
Adrian I. Nachman,
Idan Regev,
Daniel I. Tataru
Abstract:
We prove a Plancherel theorem for a nonlinear Fourier transform in two dimensions arising in the Inverse Scattering method for the defocusing Davey-Stewartson II equation. We then use it to prove global well-posedness and scattering in $L^2$ for defocusing DSII. This Plancherel theorem also implies global uniqueness in the inverse boundary value problem of Calderón in dimension $2$, for conductivities $σ>0$ with $\log σ\in \dot H^1$. The proof of the nonlinear Plancherel theorem includes new estimates on classical fractional integrals, as well as a new result on $L^2$-boundedness of pseudo-differential operators with non-smooth symbols, valid in all dimensions.
Submitted 18 September, 2019; v1 submitted 15 August, 2017;
originally announced August 2017.
-
A weighted minimum gradient problem with complete electrode model boundary conditions for conductivity imaging
Authors:
Adrian Nachman,
Alexandru Tamasan,
Johann Veras
Abstract:
We consider the inverse problem of recovering an isotropic electrical conductivity from interior knowledge of the magnitude of one current density field generated by applying current on a set of electrodes. The required interior data can be obtained by means of MRI measurements. On the boundary we only require knowledge of the electrodes, their impedances, and the corresponding average input currents. From the mathematical point of view, this practical question leads us to consider a new weighted minimum gradient problem for functions satisfying the boundary conditions coming from the Complete Electrode Model of Somersalo, Cheney and Isaacson. This variational problem has non-unique solutions. The surprising discovery is that the physical data is still sufficient to determine the geometry of the level sets of the minimizers. In particular, we obtain an interesting phase retrieval result: knowledge of the input current at the boundary allows determination of the full current vector field from its magnitude. We characterize the non-uniqueness in the variational problem. We also show that additional measurements of the voltage potential along one curve joining the electrodes yield unique determination of the conductivity. A nonlinear algorithm is proposed and implemented to illustrate the theoretical results.
Submitted 6 May, 2016; v1 submitted 16 February, 2015;
originally announced February 2015.
-
Uniqueness of minimizers of weighted least gradient problems arising in conductivity imaging
Authors:
Amir Moradifam,
Adrian Nachman,
Alexandru Tamasan
Abstract:
We prove uniqueness for minimizers of the weighted least gradient problem \[\inf \left\lbrace \int_Ω a|Du|: \ \ u\in BV(Ω), \ \ u|_{\partial Ω}=f \right\rbrace.\] The weight function $a$ is assumed to be continuous and it is allowed to vanish in certain subsets of $Ω$. Existence is assumed a priori. Our approach is motivated by the hybrid inverse problem of imaging electric conductivity from interior knowledge (obtainable by MRI) of the magnitude of one current density vector field.
Submitted 23 April, 2014;
originally announced April 2014.
-
Existence and uniqueness of minimizers of general least gradient problems
Authors:
Robert L. Jerrard,
Amir Moradifam,
Adrian I. Nachman
Abstract:
Motivated by problems arising in conductivity imaging, we prove existence, uniqueness, and comparison theorems - under certain sharp conditions - for minimizers of the general least gradient problem \[\inf_{u\in BV_f(Ω)} \int_Ω\varphi(x,Du),\] where $f:\partial Ω\to \mathbb{R}$ is continuous, \[ BV_f(Ω):=\{v\in BV(Ω): \ \ \forall x\in \partial Ω, \ \ \lim_{r\to 0} \ \operatorname{ess\,sup}_{y\in Ω, |x-y|<r} |f(x) - v(y)| = 0 \ \}, \] and $\varphi(x,ξ)$ is a function that, among other properties, is convex and homogeneous of degree 1 with respect to the $ξ$ variable. In particular we prove that if $a\in C^{1,1}(Ω)$ is bounded away from zero, then minimizers of the weighted least gradient problem $\inf_{u \in BV_f}\int_Ω a|Du|$ are unique in $BV_f(Ω)$. We construct counterexamples to show that the regularity assumption $a\in C^{1,1}$ is sharp, in the sense that it cannot be replaced by $a\in C^{1,α}(Ω)$ with any $α<1$.
Submitted 2 May, 2013;
originally announced May 2013.
-
Current Density Impedance Imaging of an Anisotropic Conductivity in a Known Conformal Class
Authors:
Nicholas Hoell,
Amir Moradifam,
Adrian Nachman
Abstract:
We present a procedure for recovering the conformal factor of an anisotropic conductivity matrix in a known conformal class in a domain in Euclidean space of dimension greater than or equal to 2. The method requires one internal measurement, together with a priori knowledge of the conformal class (local orientation) of the conductivity matrix. This problem arises in the coupled-physics medical imaging modality of Current Density Impedance Imaging (CDII) and the assumptions on the data are suitable for measurements determinable from cross-property based couplings of the two imaging modalities CDII and Diffusion Tensor Imaging (DTI). We show that the corresponding electric potential is the unique solution of a constrained minimization problem with respect to a weighted total variation functional defined in terms of the physical data. Further, we show that the associated equipotential surfaces are area minimizing with respect to a Riemannian metric obtained from the data. The results are also extended to allow the presence of perfectly conducting and/or insulating inclusions.
Submitted 27 February, 2013;
originally announced February 2013.
-
Conductivity imaging from one interior measurement in the presence of perfectly conducting and insulating inclusions
Authors:
Amir Moradifam,
Adrian Nachman,
Alexandru Tamasan
Abstract:
We consider the problem of recovering an isotropic conductivity outside some perfectly conducting or insulating inclusions from the interior measurement of the magnitude of one current density field $|J|$. We prove that the conductivity outside the inclusions, and the shape and position of the perfectly conducting and insulating inclusions are uniquely determined (except in an exceptional case) by the magnitude of the current generated by imposing a given boundary voltage. We have found an extension of the notion of admissibility to the case of possible presence of perfectly conducting and insulating inclusions. This also makes it possible to extend the results on uniqueness of the minimizers of the least gradient problem $F(u)=\int_Ωa|\nabla u|$ with $u|_{\partial Ω}=f$ to cases where $u$ has flat regions (is constant on open sets).
Submitted 8 December, 2011;
originally announced December 2011.
-
A convergent algorithm for the hybrid problem of reconstructing conductivity from minimal interior data
Authors:
Amir Moradifam,
Adrian Nachman,
Alexandre Timonov
Abstract:
We consider the hybrid problem of reconstructing the isotropic electric conductivity of a body $Ω$ from interior Current Density Imaging data obtainable using MRI measurements. We only require knowledge of the magnitude $|J|$ of one current generated by a given voltage $f$ on the boundary $\partialΩ$. As previously shown, the corresponding voltage potential $u$ in $Ω$ is a minimizer of the weighted least gradient problem
\[u=\hbox{argmin} \{\int_Ωa(x)|\nabla u|: u \in H^{1}(Ω), \ \ u|_{\partial Ω}=f\},\] with $a(x)= |J(x)|$. In this paper we present an alternating split Bregman algorithm for treating such least gradient problems, for $a\in L^2(Ω)$ non-negative and $f\in H^{1/2}(\partial Ω)$. We give a detailed convergence proof by focusing to a large extent on the dual problem. This leads naturally to the alternating split Bregman algorithm. The dual problem also turns out to yield a novel method to recover the full vector field $J$ from knowledge of its magnitude, and of the voltage $f$ on the boundary. We then present several numerical experiments that illustrate the convergence behavior of the proposed algorithm.
Submitted 8 December, 2011;
originally announced December 2011.
-
Convergence of the alternating split Bregman algorithm in infinite-dimensional Hilbert spaces
Authors:
Amir Moradifam,
Adrian Nachman
Abstract:
We prove results on weak convergence for the alternating split Bregman algorithm in infinite dimensional Hilbert spaces. We also show convergence of an approximate split Bregman algorithm, where errors are allowed at each step of the computation. To be able to treat the infinite dimensional case, our proofs focus mostly on the dual problem. We rely on Svaiter's theorem on weak convergence of the Douglas-Rachford splitting algorithm and on the relation between the alternating split Bregman and Douglas-Rachford splitting algorithms discovered by Setzer. Our motivation for this study is to provide a convergent algorithm for weighted least gradient problems arising in the hybrid method of imaging electric conductivity from interior knowledge (obtainable by MRI) of the magnitude of one current.
Submitted 8 December, 2011;
originally announced December 2011.
-
Reconstruction in the Calderon Problem with Partial Data
Authors:
Adrian Nachman,
Brian Street
Abstract:
We consider the problem of recovering the coefficient $σ(x)$ of the elliptic equation $\nabla \cdot(σ\nabla u)=0$ in a body from measurements of the Cauchy data on possibly very small subsets of its surface. We give a constructive proof of a uniqueness result by Kenig, Sjöstrand, and Uhlmann. We construct a uniquely specified family of solutions such that their traces on the boundary can be calculated by solving an integral equation which involves only the given partial Cauchy data. The construction entails a new family of Green's functions for the Laplacian, and corresponding single layer potentials, which may be of independent interest.
Submitted 27 August, 2009; v1 submitted 30 April, 2009;
originally announced April 2009.