-
Logic-Based Discrete-Steepest Descent: A Solution Method for Process Synthesis Generalized Disjunctive Programs
Authors:
Daniel Ovalle,
David A. Liñán,
Albert Lee,
Jorge M. Gómez,
Luis Ricardez-Sandoval,
Ignacio E. Grossmann,
David E. Bernal Neira
Abstract:
The optimization of chemical processes is challenging due to the nonlinearities arising from process physics and discrete design decisions. In particular, optimal synthesis and design of chemical processes can be posed as a Generalized Disjunctive Programming (GDP) superstructure problem. Various solution methods are available to address these problems, such as reformulating them as Mixed-Integer…
▽ More
The optimization of chemical processes is challenging due to the nonlinearities arising from process physics and discrete design decisions. In particular, optimal synthesis and design of chemical processes can be posed as a Generalized Disjunctive Programming (GDP) superstructure problem. Various solution methods are available to address these problems, such as reformulating them as Mixed-Integer Nonlinear Programming (MINLP) problems; nevertheless, algorithms explicitly designed to solve the GDP problem and potentially leverage its structure remain scarce. This paper presents the Logic-based Discrete-Steepest Descent Algorithm (LD-SDA) as a solution method for GDP problems involving ordered Boolean variables. The LD-SDA reformulates these ordered Boolean variables into integer decisions called external variables. The LD-SDA solves the reformulated GDP problem using a two-level decomposition approach where the upper-level subproblem determines external variable configurations. Subsequently, the remaining continuous and discrete variables are solved as a subproblem only involving those constraints relevant to the given external variable arrangement, effectively taking advantage of the structure of the GDP problem. The advantages of LD-SDA are illustrated through a batch processing case study, a reactor superstructure, a distillation column, and a catalytic distillation column, and its open-source implementation is available online. The results show convergence efficiency and solution quality improvements compared to conventional GDP and MINLP solvers.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Locally Regular and Efficient Tests in Non-Regular Semiparametric Models
Authors:
Adam Lee
Abstract:
This paper considers hypothesis testing in semiparametric models which may be non-regular. I show that C($α$) style tests are locally regular under mild conditions, including in cases where locally regular estimators do not exist, such as models which are (semi-parametrically) weakly identified. I characterise the appropriate limit experiment in which to study local (asymptotic) optimality of test…
▽ More
This paper considers hypothesis testing in semiparametric models which may be non-regular. I show that C($α$) style tests are locally regular under mild conditions, including in cases where locally regular estimators do not exist, such as models which are (semi-parametrically) weakly identified. I characterise the appropriate limit experiment in which to study local (asymptotic) optimality of tests in the non-regular case, permitting the generalisation of classical power bounds to this case. I give conditions under which these power bounds are attained by the proposed C($α$) style tests. The application of the theory to a single index model and an instrumental variables model is worked out in detail.
△ Less
Submitted 9 March, 2024;
originally announced March 2024.
-
Rigid-flexible values for symplectic embeddings of four-dimensional ellipsoids into almost-cubes
Authors:
Andrew Lee,
Cory H. Colbert
Abstract:
We consider the embedding function $c_b(a)$ describing the problem of symplectically embedding an ellipsoid $E(1,a)$ into the smallest possible scaling by $λ>1$ of the polydisc $P(1,b)$. In particular, we calculate rigid-flexible values, i.e. the minimum $a$ such that for $E(1,a')$ with $a'>a$, the embedding problem is determined only by volume. For $1<b<2$ we find that these values vary piecewise…
▽ More
We consider the embedding function $c_b(a)$ describing the problem of symplectically embedding an ellipsoid $E(1,a)$ into the smallest possible scaling by $λ>1$ of the polydisc $P(1,b)$. In particular, we calculate rigid-flexible values, i.e. the minimum $a$ such that for $E(1,a')$ with $a'>a$, the embedding problem is determined only by volume. For $1<b<2$ we find that these values vary piecewise smoothly outside a discrete set of discontinuities at $b\in\left(\frac{n+1}{n}\right)^2$.
△ Less
Submitted 25 February, 2024;
originally announced February 2024.
-
Designing Problems for Improved Instruction and Learning -- Linear Algebra
Authors:
Ryan H. Allaire,
Margaret Reynolds,
Andrew C. Lee
Abstract:
One of the grand challenges of Mathematics instruction is to provide students with problems that are both accessible and have a reasonably elegant solution. Instructors commonly resort to resources like course textbooks, online-learning platforms, or other automated problem-generating software to select problems for exams and assignments. However, reliance on such tools may result in limited contr…
▽ More
One of the grand challenges of Mathematics instruction is to provide students with problems that are both accessible and have a reasonably elegant solution. Instructors commonly resort to resources like course textbooks, online-learning platforms, or other automated problem-generating software to select problems for exams and assignments. However, reliance on such tools may result in limited control over problem parameters, potentially yielding intricate solutions that impede students' understanding. This article centers on Linear Algebra, wherein we devise algorithms for reverse engineering matrices of integers with integer outcomes through operations such as the inverse, LU decomposition, and QR decomposition. The focus is on empowering instructors to manipulate matrix properties deliberately, ensuring the creation of problems that enrich instruction and foster student confidence. The intellectual endeavor of reverse engineering such problems, grounded in both theory and matrix properties, proves mutually beneficial for both students and instructors alike.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Mixing time of the conditional backward sampling particle filter
Authors:
Joona Karjalainen,
Anthony Lee,
Sumeetpal S. Singh,
Matti Vihola
Abstract:
The conditional backward sampling particle filter (CBPF) is a powerful Markov chain Monte Carlo sampler for general state space hidden Markov model smoothing. It was proposed as an improvement over the conditional particle filter, which is known to have an $O(T^2)$ computational time complexity under a general `strong' mixing assumption, where $T$ is the time horizon. We provide the first proof th…
▽ More
The conditional backward sampling particle filter (CBPF) is a powerful Markov chain Monte Carlo sampler for general state space hidden Markov model smoothing. It was proposed as an improvement over the conditional particle filter, which is known to have an $O(T^2)$ computational time complexity under a general `strong' mixing assumption, where $T$ is the time horizon. We provide the first proof that the CBPF admits an $O(T \log T)$ time complexity under strong mixing, complementing strong empirical evidence of the superiority of the CBPF in practice. In particular, the CBPF's mixing time is upper bounded by $O(\log T)$, for any sufficiently large number of particles $N$ that depends only on the mixing assumptions and not $T$. We show that an $O(\log T)$ mixing time is optimal. The proof involves the analysis of a novel coupling of two CBPFs, which involves a maximal coupling of two particle systems at each time instant. The coupling is implementable, and thus can also be used to construct unbiased, finite variance, estimates of functionals which have arbitrary dependence on the latent state's path, with a total expected cost of $O(T \log T)$. We also investigate other couplings, and we show some of these alternatives have improved empirical behaviour.
△ Less
Submitted 22 February, 2024; v1 submitted 29 December, 2023;
originally announced December 2023.
-
Weak Poincaré Inequalities for Markov chains: theory and applications
Authors:
Christophe Andrieu,
Anthony Lee,
Sam Power,
Andi Q. Wang
Abstract:
We investigate the application of Weak Poincaré Inequalities (WPI) to Markov chains to study their rates of convergence and to derive complexity bounds. At a theoretical level we investigate the necessity of the existence of WPIs to ensure \mathrm{L}^{2}-convergence, in particular by establishing equivalence with the Resolvent Uniform Positivity-Improving (RUPI) condition and providing a counterex…
▽ More
We investigate the application of Weak Poincaré Inequalities (WPI) to Markov chains to study their rates of convergence and to derive complexity bounds. At a theoretical level we investigate the necessity of the existence of WPIs to ensure \mathrm{L}^{2}-convergence, in particular by establishing equivalence with the Resolvent Uniform Positivity-Improving (RUPI) condition and providing a counterexample. From a more practical perspective, we extend the celebrated Cheeger's inequalities to the subgeometric setting, and further apply these techniques to study random-walk Metropolis algorithms for heavy-tailed target distributions and to obtain lower bounds on pseudo-marginal algorithms.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
$O(k)$-Equivariant Dimensionality Reduction on Stiefel Manifolds
Authors:
Andrew Lee,
Harlin Lee,
Jose A. Perea,
Nikolas Schonsheck,
Madeleine Weinstein
Abstract:
Many real-world datasets live on high-dimensional Stiefel and Grassmannian manifolds, $V_k(\mathbb{R}^N)$ and $Gr(k, \mathbb{R}^N)$ respectively, and benefit from projection onto lower-dimensional Stiefel (respectively, Grassmannian) manifolds. In this work, we propose an algorithm called Principal Stiefel Coordinates (PSC) to reduce data dimensionality from $ V_k(\mathbb{R}^N)$ to…
▽ More
Many real-world datasets live on high-dimensional Stiefel and Grassmannian manifolds, $V_k(\mathbb{R}^N)$ and $Gr(k, \mathbb{R}^N)$ respectively, and benefit from projection onto lower-dimensional Stiefel (respectively, Grassmannian) manifolds. In this work, we propose an algorithm called Principal Stiefel Coordinates (PSC) to reduce data dimensionality from $ V_k(\mathbb{R}^N)$ to $V_k(\mathbb{R}^n)$ in an $O(k)$-equivariant manner ($k \leq n \ll N$). We begin by observing that each element $α\in V_n(\mathbb{R}^N)$ defines an isometric embedding of $V_k(\mathbb{R}^n)$ into $V_k(\mathbb{R}^N)$. Next, we optimize for such an embedding map that minimizes data fit error by warm-starting with the output of principal component analysis (PCA) and applying gradient descent. Then, we define a continuous and $O(k)$-equivariant map $π_α$ that acts as a ``closest point operator'' to project the data onto the image of $V_k(\mathbb{R}^n)$ in $V_k(\mathbb{R}^N)$ under the embedding determined by $α$, while minimizing distortion. Because this dimensionality reduction is $O(k)$-equivariant, these results extend to Grassmannian manifolds as well. Lastly, we show that the PCA output globally minimizes projection error in a noiseless setting, but that our algorithm achieves a meaningfully different and improved outcome when the data does not lie exactly on the image of a linearly embedded lower-dimensional Stiefel manifold as above. Multiple numerical experiments using synthetic and real-world data are performed.
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
On the Forgetting of Particle Filters
Authors:
Joona Karjalainen,
Anthony Lee,
Sumeetpal S. Singh,
Matti Vihola
Abstract:
We study the forgetting properties of the particle filter when its state - the collection of particles - is regarded as a Markov chain. Under a strong mixing assumption on the particle filter's underlying Feynman-Kac model, we find that the particle filter is exponentially mixing, and forgets its initial state in $O(\log N )$ `time', where $N$ is the number of particles and time refers to the numb…
▽ More
We study the forgetting properties of the particle filter when its state - the collection of particles - is regarded as a Markov chain. Under a strong mixing assumption on the particle filter's underlying Feynman-Kac model, we find that the particle filter is exponentially mixing, and forgets its initial state in $O(\log N )$ `time', where $N$ is the number of particles and time refers to the number of particle filter algorithm steps, each comprising a selection (or resampling) and mutation (or prediction) operation. We present an example which suggests that this rate is optimal. In contrast to our result, available results to-date are extremely conservative, suggesting $O(α^N)$ time steps are needed, for some $α>1$, for the particle filter to forget its initialisation. We also study the conditional particle filter (CPF) and extend our forgetting result to this context. We establish a similar conclusion, namely, CPF is exponentially mixing and forgets its initial state in $O(\log N )$ time. To support this analysis, we establish new time-uniform $L^p$ error estimates for CPF, which can be of independent interest.
△ Less
Submitted 15 September, 2023;
originally announced September 2023.
-
On Selecting Distance Metrics in $n$-Dimensional Normed Vector Spaces of Cells: A Novel Criterion and Similarity Measure Towards Efficient and Accurate Omics Analysis
Authors:
Okezue Bell,
Arthur Lee,
Elizabeth Engle
Abstract:
Single-cell omics enable the profiles of cells, which contain large numbers of biological features, to be quantified. Cluster analysis, a dimensionality reduction process, is used to reduce the dimensions of the data to make it computationally tractable. In these analyses, cells are represented as vectors in $n$-Dimensional space, where each dimension corresponds to a certain cell feature. The dis…
▽ More
Single-cell omics enable the profiles of cells, which contain large numbers of biological features, to be quantified. Cluster analysis, a dimensionality reduction process, is used to reduce the dimensions of the data to make it computationally tractable. In these analyses, cells are represented as vectors in $n$-Dimensional space, where each dimension corresponds to a certain cell feature. The distance between cells is used as a surrogate measure of similarity, providing insight into the cell's state, function, and genetic mechanisms. However, as cell profiles are clustered in 3D or higher-dimensional space, it remains unknown which distance metric provides the most accurate spatiotemporal representation of similarity, limiting the interpretability of the data. I propose and prove a generalized proposition and set of corollaries that serve as a criterion to determine which of the standard distance measures is most accurate for conveying cell profile heterogeneity. Each distance method is evaluated via statistical, geometric, and topological proofs, which are formalized into a set of criteria. In this paper, I present the putative, first-ever method to elect the most accurate and precise distance metrics with any profiling modality, which are determined to be the Wasserstein distance and cosine similarity metrics, respectively, in general cases. I also identify special cases in which the criterion may select non-standard metrics. Combining the metric properties selected by the criterion, I develop a novel, custom, optimal distance metric that demonstrates superior computational efficiency, peak annotation, motif identification, and footprinting for transcription factor binding sites when compared with leading methods.
△ Less
Submitted 5 June, 2024; v1 submitted 12 June, 2023;
originally announced June 2023.
-
Equality in the spacetime positive mass theorem II
Authors:
Lan-Hsuan Huang,
Dan A. Lee
Abstract:
We provide a new proof of the equality case of the spacetime positive mass theorem, which states that if a complete asymptotically flat initial data set $(M, g, k)$ satisfying the dominant energy condition has null ADM energy-momentum (that is, $|E|=|P|$), then $(M,g)$ must isometrically embed into Minkowski space with $k$ as its second fundamental form. Previous proofs either used spinor methods…
▽ More
We provide a new proof of the equality case of the spacetime positive mass theorem, which states that if a complete asymptotically flat initial data set $(M, g, k)$ satisfying the dominant energy condition has null ADM energy-momentum (that is, $|E|=|P|$), then $(M,g)$ must isometrically embed into Minkowski space with $k$ as its second fundamental form. Previous proofs either used spinor methods [Wit 81, BC96, CM06], relied on the Jang equation [HL20, Eic13], or assumed three spatial dimensions [HZ22]. In contrast, our new proof only requires knowing that $E\ge|P|$ for all complete initial data sets near $(g,k)$ on $M$ satisfying the dominant energy condition.
△ Less
Submitted 20 February, 2023; v1 submitted 12 February, 2023;
originally announced February 2023.
-
Reversible random number generation for adjoint Monte Carlo simulation of the heat equation
Authors:
Emil Løvbak,
Frédéric Blondeel,
Adam Lee,
Lander Vanroye,
Andreas Van Barel,
Giovanni Samaey
Abstract:
In PDE-constrained optimization, one aims to find design parameters that minimize some objective, subject to the satisfaction of a partial differential equation. A major challenges is computing gradients of the objective to the design parameters, as applying the chain rule requires computing the Jacobian of the design parameters to the PDE's state. The adjoint method avoids this Jacobian by comput…
▽ More
In PDE-constrained optimization, one aims to find design parameters that minimize some objective, subject to the satisfaction of a partial differential equation. A major challenges is computing gradients of the objective to the design parameters, as applying the chain rule requires computing the Jacobian of the design parameters to the PDE's state. The adjoint method avoids this Jacobian by computing partial derivatives of a Lagrangian. Evaluating these derivatives requires the solution of a second PDE with the adjoint differential operator to the constraint, resulting in a backwards-in-time simulation.
Particle-based Monte Carlo solvers are often used to compute the solution to high-dimensional PDEs. However, such solvers have the drawback of introducing noise to the computed results, thus requiring stochastic optimization methods. To guarantee convergence in this setting, both the constraint and adjoint Monte Carlo simulations should simulate the same particle trajectories. For large simulations, storing full paths from the constraint equation for re-use in the adjoint equation becomes infeasible due to memory limitations. In this paper, we provide a reversible extension to the family of permuted congruential pseudorandom number generators (PCG). We then use such a generator to recompute these time-reversed paths for the heat equation, avoiding these memory issues.
△ Less
Submitted 24 December, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Explicit convergence bounds for Metropolis Markov chains: isoperimetry, spectral gaps and profiles
Authors:
Christophe Andrieu,
Anthony Lee,
Sam Power,
Andi Q. Wang
Abstract:
We derive the first explicit bounds for the spectral gap of a random walk Metropolis algorithm on $R^d$ for any value of the proposal variance, which when scaled appropriately recovers the correct $d^{-1}$ dependence on dimension for suitably regular invariant distributions. We also obtain explicit bounds on the ${\rm L}^2$-mixing time for a broad class of models. In obtaining these results, we re…
▽ More
We derive the first explicit bounds for the spectral gap of a random walk Metropolis algorithm on $R^d$ for any value of the proposal variance, which when scaled appropriately recovers the correct $d^{-1}$ dependence on dimension for suitably regular invariant distributions. We also obtain explicit bounds on the ${\rm L}^2$-mixing time for a broad class of models. In obtaining these results, we refine the use of isoperimetric profile inequalities to obtain conductance profile bounds, which also enable the derivation of explicit bounds in a much broader class of models. We also obtain similar results for the preconditioned Crank--Nicolson Markov chain, obtaining dimension-independent bounds under suitable assumptions.
△ Less
Submitted 31 October, 2023; v1 submitted 16 November, 2022;
originally announced November 2022.
-
Noncompact Fill-Ins of Bartnik Data
Authors:
Dan A. Lee,
Martin Lesourd,
Ryan Unger
Abstract:
We generalize Y. Shi and L.-F.\ Tam's \cite{ShiTam} nonnegativity result for the Brown-York mass, by considering nonnegative scalar curvature (NNSC) fill-ins that need only be complete rather than compact. Moreover, the NNSC fill-ins need not even be complete as long the incompleteness is ``shielded'' by a region with positive scalar curvature and occurs occurs sufficiently far away.
We accompli…
▽ More
We generalize Y. Shi and L.-F.\ Tam's \cite{ShiTam} nonnegativity result for the Brown-York mass, by considering nonnegative scalar curvature (NNSC) fill-ins that need only be complete rather than compact. Moreover, the NNSC fill-ins need not even be complete as long the incompleteness is ``shielded'' by a region with positive scalar curvature and occurs occurs sufficiently far away.
We accomplish this by generalizing P.~Miao's~\cite{Miao02} positive mass theorem with corners to asymptotically flat manifolds that may have other complete ends, or possibly incomplete ends that are appropriately shielded. We can similarly extend other results on the compact NNSC fill-in problem to allow for complete (or shielded) NNSC fill-ins. In particular, we prove the following generalization of a theorem of Miao~\cite{Miao20}: Given any metric $γ$ on a closed manifold $Σ^{n-1}$, there exists a constant $λ$ such that for any complete (or shielded) NNSC fill-in $(Ω^n, g)$ of $(Σ^{n-1},γ)$, we have $\min_ΣH \le λ$, where $H$ is the mean curvature of $Σ$ with respect to $g$.
△ Less
Submitted 11 November, 2022;
originally announced November 2022.
-
Generalizing the German Tank Problem
Authors:
Anthony Lee,
Steven J. Miller
Abstract:
The German Tank Problem dates back to World War II when the Allies used a statistical approach to estimate the number of enemy tanks produced or on the field from observed serial numbers after battles. Assuming that the tanks are labeled consecutively starting from 1, if we observe $k$ tanks from a total of $N$ tanks with the maximum observed tank being $m$, then the best estimate for $N$ is…
▽ More
The German Tank Problem dates back to World War II when the Allies used a statistical approach to estimate the number of enemy tanks produced or on the field from observed serial numbers after battles. Assuming that the tanks are labeled consecutively starting from 1, if we observe $k$ tanks from a total of $N$ tanks with the maximum observed tank being $m$, then the best estimate for $N$ is $m(1 + 1/k) - 1$. We explore many generalizations. We looked at the discrete and continuous one dimensional case. We explored different estimators such as the $L$\textsuperscript{th} largest tank, and applied motivation from portfolio theory and studied a weighted average; however, the original formula was the best. We generalized the problem in two dimensions, with pairs instead of points, studying the discrete and continuous square and circle variants. There were complications from curvature issues and that not every number is representable as a sum of two squares. We often concentrated on the large $N$ limit. For the discrete and continuous square, we tested various statistics, finding the largest observed component did best; the scaling factor for both cases is $(2k+1)/2k$. The discrete case was especially involved because we had to use approximation formulas that gave us the number of lattice points inside the circle. Interestingly, the scaling factors were different for the cases. Lastly, we generalized the problem into $L$ dimensional squares and circles. The discrete and continuous square proved similar to the two dimensional square problem. However, for the $L$\textsuperscript{th} dimensional circle, we had to use formulas for the volume of the $L$-ball, and had to approximate the number of lattice points inside it. The formulas for the discrete circle were particularly interesting, as there was no $L$ dependence in the formula.
△ Less
Submitted 27 October, 2022;
originally announced October 2022.
-
Connectedness in Friends-and-Strangers Graphs of Spiders and Complements
Authors:
Alan Lee
Abstract:
Let $X$ and $Y$ be two graphs with vertex set $[n]$. Their friends-and-strangers graph $\mathsf{FS}(X,Y)$ is a graph with vertices corresponding to elements of the group $S_n$, and two permutations $σ$ and $σ'$ are adjacent if they are separated by a transposition $\{a,b\}$ such that $a$ and $b$ are adjacent in $X$ and $σ(a)$ and $σ(b)$ are adjacent in $Y$. Specific friends-and-strangers graphs su…
▽ More
Let $X$ and $Y$ be two graphs with vertex set $[n]$. Their friends-and-strangers graph $\mathsf{FS}(X,Y)$ is a graph with vertices corresponding to elements of the group $S_n$, and two permutations $σ$ and $σ'$ are adjacent if they are separated by a transposition $\{a,b\}$ such that $a$ and $b$ are adjacent in $X$ and $σ(a)$ and $σ(b)$ are adjacent in $Y$. Specific friends-and-strangers graphs such as $\mathsf{FS}(\mathsf{Path}_n,Y)$ and $\mathsf{FS}(\mathsf{Cycle}_n,Y)$ have been researched, and their connected components have been enumerated using various equivalence relations such as double-flip equivalence. A spider graph is a collection of path graphs that are all connected to a single center point. In this paper, we delve deeper into the question of when $\mathsf{FS}(X,Y)$ is connected when $X$ is a spider and $Y$ is the complement of a spider or a tadpole.
△ Less
Submitted 22 November, 2022; v1 submitted 5 October, 2022;
originally announced October 2022.
-
Connectedness and Cycle Spaces of Friends-and-Strangers Graphs
Authors:
Colin Defant,
David Dong,
Alan Lee,
Michelle Wei
Abstract:
If $X=(V(X),E(X))$ and $Y=(V(Y),E(Y))$ are $n$-vertex graphs, then their friends-and-strangers graph $\mathsf{FS}(X,Y)$ is the graph whose vertices are the bijections from $V(X)$ to $V(Y)$ in which two bijections $σ$ and $σ'$ are adjacent if and only if there is an edge $\{a,b\}\in E(X)$ such that $\{σ(a),σ(b)\}\in E(Y)$ and $σ'=σ\circ (a\,\,b)$, where $(a\,\,b)$ is the permutation of $V(X)$ that…
▽ More
If $X=(V(X),E(X))$ and $Y=(V(Y),E(Y))$ are $n$-vertex graphs, then their friends-and-strangers graph $\mathsf{FS}(X,Y)$ is the graph whose vertices are the bijections from $V(X)$ to $V(Y)$ in which two bijections $σ$ and $σ'$ are adjacent if and only if there is an edge $\{a,b\}\in E(X)$ such that $\{σ(a),σ(b)\}\in E(Y)$ and $σ'=σ\circ (a\,\,b)$, where $(a\,\,b)$ is the permutation of $V(X)$ that swaps $a$ and $b$. We prove general theorems that provide necessary and/or sufficient conditions for $\mathsf{FS}(X,Y)$ to be connected. As a corollary, we obtain a complete characterization of the graphs $Y$ such that $\mathsf{FS}(\mathsf{Dand}_{k,n},Y)$ is connected, where $\mathsf{Dand}_{k,n}$ is a dandelion graph; this substantially generalizes a theorem of the first author and Kravitz in the case $k=3$. For specific choices of $Y$, we characterize the spider graphs $X$ such that $\mathsf{FS}(X,Y)$ is connected. In a different vein, we study the cycle spaces of friends-and-strangers graphs. Naatz proved that if $X$ is a path graph, then the cycle space of $\mathsf{FS}(X,Y)$ is spanned by $4$-cycles and $6$-cycles; we show that the same statement holds when $X$ is a cycle and $Y$ has domination number at least $3$. When $X$ is a cycle and $Y$ has domination number at least $2$, our proof sheds light on how walks in $\mathsf{FS}(X,Y)$ behave under certain Coxeter moves.
△ Less
Submitted 4 September, 2022;
originally announced September 2022.
-
Sharp bounds on the price of bandit feedback for several models of mistake-bounded online learning
Authors:
Raymond Feng,
Jesse Geneson,
Andrew Lee,
Espen Slettnes
Abstract:
We determine sharp bounds on the price of bandit feedback for several variants of the mistake-bound model. The first part of the paper presents bounds on the $r$-input weak reinforcement model and the $r$-input delayed, ambiguous reinforcement model. In both models, the adversary gives $r$ inputs in each round and only indicates a correct answer if all $r$ guesses are correct. The only difference…
▽ More
We determine sharp bounds on the price of bandit feedback for several variants of the mistake-bound model. The first part of the paper presents bounds on the $r$-input weak reinforcement model and the $r$-input delayed, ambiguous reinforcement model. In both models, the adversary gives $r$ inputs in each round and only indicates a correct answer if all $r$ guesses are correct. The only difference between the two models is that in the delayed, ambiguous model, the learner must answer each input before receiving the next input of the round, while the learner receives all $r$ inputs at once in the weak reinforcement model. In the second part of the paper, we introduce models for online learning with permutation patterns, in which a learner attempts to learn a permutation from a set of permutations by guessing statistics related to sub-permutations. For these permutation models, we prove sharp bounds on the price of bandit feedback.
△ Less
Submitted 3 September, 2022;
originally announced September 2022.
-
Poincaré inequalities for Markov chains: a meeting with Cheeger, Lyapunov and Metropolis
Authors:
Christophe Andrieu,
Anthony Lee,
Sam Power,
Andi Q. Wang
Abstract:
We develop a theory of weak Poincaré inequalities to characterize convergence rates of ergodic Markov chains. Motivated by the application of Markov chains in the context of algorithms, we develop a relevant set of tools which enable the practical study of convergence rates in the setting of Markov chain Monte Carlo methods, but also well beyond.
We develop a theory of weak Poincaré inequalities to characterize convergence rates of ergodic Markov chains. Motivated by the application of Markov chains in the context of algorithms, we develop a relevant set of tools which enable the practical study of convergence rates in the setting of Markov chain Monte Carlo methods, but also well beyond.
△ Less
Submitted 10 August, 2022;
originally announced August 2022.
-
Hypergraph Fuss-Catalan Numbers
Authors:
Parth Chavan,
Andrew Lee,
Karthik Seetharaman
Abstract:
The Catalan numbers $C_n$ are an extremely well-studied sequence of numbers that appear as the answer to many combinatorial problems. Two generalizations of these numbers that have been studied are the Fuss-Catalan numbers and the Hypergraph Catalan numbers. In this paper, we study the combination of these, the Hypergraph Fuss-Catalan numbers. We provide some combinatorial interpretations of these…
▽ More
The Catalan numbers $C_n$ are an extremely well-studied sequence of numbers that appear as the answer to many combinatorial problems. Two generalizations of these numbers that have been studied are the Fuss-Catalan numbers and the Hypergraph Catalan numbers. In this paper, we study the combination of these, the Hypergraph Fuss-Catalan numbers. We provide some combinatorial interpretations of these numbers, as well as describe their generating function.
△ Less
Submitted 2 February, 2022;
originally announced February 2022.
-
Density and positive mass theorems for incomplete manifolds
Authors:
Dan A. Lee,
Martin Lesourd,
Ryan Unger
Abstract:
For manifolds with a distinguished asymptotically flat end, we prove a density theorem which produces harmonic asymptotics on the distinguished end, while allowing for points of incompleteness (or negative scalar curvature) away from this end. We use this to improve the "quantitative" version of the positive mass theorem (in dimensions $3\leq n\leq 7$), obtained by the last two named authors with…
▽ More
For manifolds with a distinguished asymptotically flat end, we prove a density theorem which produces harmonic asymptotics on the distinguished end, while allowing for points of incompleteness (or negative scalar curvature) away from this end. We use this to improve the "quantitative" version of the positive mass theorem (in dimensions $3\leq n\leq 7$), obtained by the last two named authors with S.-T. Yau [LUY21], where stronger decay was assumed on the distinguished end. We also give an alternative proof of this theorem based on a relationship between MOTS and $μ$-bubbles and our recent work on the spacetime positive mass theorem with boundary [LLU21].
△ Less
Submitted 11 November, 2022; v1 submitted 4 January, 2022;
originally announced January 2022.
-
Density and positive mass theorems for initial data sets with boundary
Authors:
Dan A. Lee,
Martin Lesourd,
Ryan Unger
Abstract:
We prove a harmonic asymptotics density theorem for asymptotically flat initial data sets with compact boundary that satisfy the dominant energy condition. We use this to settle the spacetime positive mass theorem, with rigidity, for initial data sets with apparent horizon boundary in dimensions less than $8$ without a spin assumption.
We prove a harmonic asymptotics density theorem for asymptotically flat initial data sets with compact boundary that satisfy the dominant energy condition. We use this to settle the spacetime positive mass theorem, with rigidity, for initial data sets with apparent horizon boundary in dimensions less than $8$ without a spin assumption.
△ Less
Submitted 11 November, 2022; v1 submitted 22 December, 2021;
originally announced December 2021.
-
A note on the positive mass theorem with boundary
Authors:
Gregory J. Galloway,
Dan A. Lee
Abstract:
In this short note we explain how one can use established results to prove various versions of the positive mass theorem for initial data sets with boundary, in dimensions less than 8.
In this short note we explain how one can use established results to prove various versions of the positive mass theorem for initial data sets with boundary, in dimensions less than 8.
△ Less
Submitted 7 January, 2022; v1 submitted 24 May, 2021;
originally announced May 2021.
-
Intrinsic flat convergence of points and applications to stability of the positive mass theorem
Authors:
Lan-Hsuan Huang,
Dan A. Lee,
Raquel Perales
Abstract:
We prove results on intrinsic flat convergence of points---a concept first explored by Sormani in \cite{Sormani-AA}. In particular, we discuss compatibility with Gromov-Hausdorff convergence of points---a concept first described by Gromov in \cite{Gromov-poly}.
We apply these results to the problem of stability of the positive mass theorem in mathematical relativity. Specifically, we revisit the…
▽ More
We prove results on intrinsic flat convergence of points---a concept first explored by Sormani in \cite{Sormani-AA}. In particular, we discuss compatibility with Gromov-Hausdorff convergence of points---a concept first described by Gromov in \cite{Gromov-poly}.
We apply these results to the problem of stability of the positive mass theorem in mathematical relativity. Specifically, we revisit the article \cite{HLS} on intrinsic flat stability for the case of graphical hypersurfaces of Euclidean space: We are able to fill in some details in the proofs of Theorems 1.4 and Lemma~5.1 of \cite{HLS} and strengthen some statements. Moreover, in light of an acknowledged error in the proof of Theorem~1.3 of \cite{HLS}, we provide an alternative proof that extends recent work of \cite{AP20}.
△ Less
Submitted 2 November, 2020; v1 submitted 15 October, 2020;
originally announced October 2020.
-
Firefighting on the Hexagonal Grid and on Infinite Trees
Authors:
Alexander Dean,
Sean English,
Tongyun Huang,
Robert A. Krueger,
Andy Lee,
Mose Mizrahi,
Casey Wheaton-Werle
Abstract:
The firefighter problem with $k$ firefighters on an infinite graph $G$ is an iterative graph process, defined as follows: Suppose a fire breaks out at a given vertex $v\in V(G)$ on Turn 1. On each subsequent even turn, $k$ firefighters protect $k$ vertices that are not on fire, and on each subsequent odd turn, any vertex that is on fire spreads the fire to all adjacent unprotected vertices. The fi…
▽ More
The firefighter problem with $k$ firefighters on an infinite graph $G$ is an iterative graph process, defined as follows: Suppose a fire breaks out at a given vertex $v\in V(G)$ on Turn 1. On each subsequent even turn, $k$ firefighters protect $k$ vertices that are not on fire, and on each subsequent odd turn, any vertex that is on fire spreads the fire to all adjacent unprotected vertices. The firefighters' goal is to eventually stop the spread of the fire. If there exists a strategy for $k$ firefighters to eventually stop the spread of the fire, then we say $G$ is $k$-containable.
We consider the firefighter problem on the hexagonal grid, which is the graph whose vertices and edges are exactly the vertices and edges of a regular hexagonal tiling of the plane. It is not known if the hexagonal grid is $1$-containable. In arXiv:1305.7076 [math.CO], it was shown that if the firefighters have one firefighter per turn and one extra firefighter on two turns, the firefighters can contain the fire. We improve on this result by showing that even with only one extra firefighter on one turn, the firefighters can still contain the fire.
In addition, we explore $k$-containability for birth sequence trees, which are infinite rooted trees that have the property that every vertex at the same level has the same degree. A birth sequence forest is an infinite forest, each component of which is a birth sequence tree. For birth sequence trees and forests, the fire always starts at the root of each tree. We provide a pseudopolynomial time algorithm to decide if all the vertices at a fixed level can be protected or not.
△ Less
Submitted 6 June, 2021; v1 submitted 10 October, 2020;
originally announced October 2020.
-
Bartnik mass minimizing initial data sets and improvability of the dominant energy scalar
Authors:
Lan-Hsuan Huang,
Dan A. Lee
Abstract:
We introduce the concept of improvability of the dominant energy scalar, and we derive strong consequences of non-improvability. In particular, we prove that a non-improvable initial data set without local symmetries must sit inside a null perfect fluid spacetime carrying a global Killing vector field. We also show that the dominant energy scalar is always almost improvable in a precise sense. Usi…
▽ More
We introduce the concept of improvability of the dominant energy scalar, and we derive strong consequences of non-improvability. In particular, we prove that a non-improvable initial data set without local symmetries must sit inside a null perfect fluid spacetime carrying a global Killing vector field. We also show that the dominant energy scalar is always almost improvable in a precise sense. Using these main results, we provide a characterization of Bartnik mass minimizing initial data sets which makes substantial progress toward Bartnik's stationary conjecture.
Along the way we observe that in dimensions greater than eight there exist pp-wave counterexamples (without the optimal decay rate for asymptotically flatness) to the equality case of the spacetime positive mass theorem. As a consequence, there exist counterexamples to Bartnik's stationary and strict positivity conjectures in those dimensions.
△ Less
Submitted 1 March, 2022; v1 submitted 1 July, 2020;
originally announced July 2020.
-
MAP Clustering under the Gaussian Mixture Model via Mixed Integer Nonlinear Optimization
Authors:
Patrick Flaherty,
Pitchaya Wiratchotisatian,
Ji Ah Lee,
Zhou Tang,
Andrew C. Trapp
Abstract:
We present a global optimization approach for solving the maximum a-posteriori (MAP) clustering problem under the Gaussian mixture model.Our approach can accommodate side constraints and it preserves the combinatorial structure of the MAP clustering problem by formulating it asa mixed-integer nonlinear optimization problem (MINLP). We approximate the MINLP through a mixed-integer quadratic program…
▽ More
We present a global optimization approach for solving the maximum a-posteriori (MAP) clustering problem under the Gaussian mixture model.Our approach can accommodate side constraints and it preserves the combinatorial structure of the MAP clustering problem by formulating it asa mixed-integer nonlinear optimization problem (MINLP). We approximate the MINLP through a mixed-integer quadratic program (MIQP) transformation that improves computational aspects while guaranteeing $ε$-global optimality. An important benefit of our approach is the explicit quantification of the degree of suboptimality, via the optimality gap, en route to finding the globally optimal MAP clustering. Numerical experiments comparing our method to other approaches show that our method finds a better solution than standard clustering methods. Finally, we cluster a real breast cancer gene expression data set incorporating intrinsic subtype information; the induced constraints substantially improve the computational performance and produce more coherent and bio-logically meaningful clusters.
△ Less
Submitted 16 March, 2020; v1 submitted 8 November, 2019;
originally announced November 2019.
-
A regularization approach for solving Poisson's equation with singular charge sources and diffuse interfaces
Authors:
Siwen Wang,
Arum Lee,
Emil Alexov,
Shan Zhao
Abstract:
Singular charge sources in terms of Dirac delta functions present a well-known numerical challenge for solving Poisson's equation. For a sharp interface between inhomogeneous media, singular charges could be analytically treated by fundamental solutions or regularization methods. However, no analytical treatment is known in the literature in case of a diffuse interface of complex shape. This lette…
▽ More
Singular charge sources in terms of Dirac delta functions present a well-known numerical challenge for solving Poisson's equation. For a sharp interface between inhomogeneous media, singular charges could be analytically treated by fundamental solutions or regularization methods. However, no analytical treatment is known in the literature in case of a diffuse interface of complex shape. This letter reports the first such regularization method that represents the Coulomb potential component analytically by Green's functions to account for singular charges. The other component, i.e., the reaction field potential, then satisfies a regularized Poisson equation with a smooth source and the original elliptic operator. The regularized equation can then be simply solved by any numerical method. For a spherical domain with diffuse interface, the proposed regularization method is numerically validated and compared with a semi-analytical quasi-harmonic method.
△ Less
Submitted 1 October, 2019;
originally announced October 2019.
-
Lower semicontinuity of ADM mass under intrinsic flat convergence
Authors:
Jeffrey L. Jauregui,
Dan A. Lee
Abstract:
A natural question in mathematical general relativity is how the ADM mass behaves as a functional on the space of asymptotically flat 3-manifolds of nonnegative scalar curvature. In previous results, lower semicontinuity has been established by the first-named author for pointed $C^2$ convergence, and more generally by both authors for pointed $C^0$ convergence (all in the Cheeger--Gromov sense).…
▽ More
A natural question in mathematical general relativity is how the ADM mass behaves as a functional on the space of asymptotically flat 3-manifolds of nonnegative scalar curvature. In previous results, lower semicontinuity has been established by the first-named author for pointed $C^2$ convergence, and more generally by both authors for pointed $C^0$ convergence (all in the Cheeger--Gromov sense). In this paper, we show this behavior persists for the much weaker notion of pointed Sormani--Wenger intrinsic flat ($\mathcal{F}$) volume convergence, under natural hypotheses. We consider smooth manifolds converging to asymptotically flat local integral current spaces (a new definition), using Huisken's isoperimetric mass as a replacement for the ADM mass. Along the way we prove results of independent interest about convergence of subregions of $\mathcal{F}$-converging sequences of integral current spaces.
△ Less
Submitted 3 March, 2019;
originally announced March 2019.
-
On the polygon determined by the short diagonals of a convex polygon
Authors:
Jacqueline Cho,
Dan Ismailescu,
Yiwon Kim,
Andrew Woojong Lee
Abstract:
Let $K$ be a convex pentagon in the plane and let $K_1$ be the pentagon bounded by the diagonals of $K$. It has been conjectured that the maximum of the ratio between the areas of $K_1$ and $K$ is reached when $K$ is an affine regular pentagon. In this paper we prove this conjecture. We also show that for polygons with at least six vertices the trivial answers are the best possible.
Let $K$ be a convex pentagon in the plane and let $K_1$ be the pentagon bounded by the diagonals of $K$. It has been conjectured that the maximum of the ratio between the areas of $K_1$ and $K$ is reached when $K$ is an affine regular pentagon. In this paper we prove this conjecture. We also show that for polygons with at least six vertices the trivial answers are the best possible.
△ Less
Submitted 18 December, 2018;
originally announced December 2018.
-
The rigid-flexible value for symplectic embeddings of four-dimensional ellipsoids into polydiscs
Authors:
Alvin **,
Andrew S. Lee
Abstract:
In previous work of Cristofaro-Gardiner, Frenkel, and Schlenk, the embedding function $c_b(a)$ describing the problem of symplectically embedding an ellipsoid $E(1,a)$ into the smallest scaling of the polydisc $P(1,b)$ was determined for all integers $b \geq 2.$ As in McDuff's work, Cristofaro-Gardiner, Frenkel, and Schlenk found staircases associated with the embedding function. More recently, Us…
▽ More
In previous work of Cristofaro-Gardiner, Frenkel, and Schlenk, the embedding function $c_b(a)$ describing the problem of symplectically embedding an ellipsoid $E(1,a)$ into the smallest scaling of the polydisc $P(1,b)$ was determined for all integers $b \geq 2.$ As in McDuff's work, Cristofaro-Gardiner, Frenkel, and Schlenk found staircases associated with the embedding function. More recently, Usher's work shows that the appearance of infinite staircases as $b$ varies is hard to predict. The intricate structure found there suggests that determining the entirety of the graph of $c_b(a)$ for all $b$ is intractable.
In contrast with this result, we show that for every polydisc $P(1,b)$ with $b>2$, there is an explicit formula for the minimum $a$ such that the embedding problem is determined only by volume. That is, when the ellipsoid is sufficiently stretched, there is a symplectic embedding of $E(1,a)$ fully filling an appropriately scaled polydisc $P(λ,λb)$. We call this value of $a$ the rigid-flexible value at $b$, or the RF-value. This formula is piecewise smooth in $b$.
To further describe the function $RF(b)$, we investigate its behavior as $b\to 1$. Frenkel and Müller showed that the $RF$-value at $b=1$ is $7 \frac{1}{32}$ and Cristofaro-Gardiner, Frenkel, and Schlenk showed that the $RF$-value at $b=2$ is $8 \frac{1}{36}.$ By exhibiting a sequence of obstructive classes for $b = \frac{n+1}{n}$ at $a=8$, in combination with the Frenkel-Müller result, we show that $RF$ is discontinuous at $b=1$.
△ Less
Submitted 8 April, 2019; v1 submitted 8 November, 2018;
originally announced November 2018.
-
Coupled conditional backward sampling particle filter
Authors:
Anthony Lee,
Sumeetpal S. Singh,
Matti Vihola
Abstract:
The conditional particle filter (CPF) is a promising algorithm for general hidden Markov model smoothing. Empirical evidence suggests that the variant of CPF with backward sampling (CBPF) performs well even with long time series. Previous theoretical results have not been able to demonstrate the improvement brought by backward sampling, whereas we provide rates showing that CBPF can remain effecti…
▽ More
The conditional particle filter (CPF) is a promising algorithm for general hidden Markov model smoothing. Empirical evidence suggests that the variant of CPF with backward sampling (CBPF) performs well even with long time series. Previous theoretical results have not been able to demonstrate the improvement brought by backward sampling, whereas we provide rates showing that CBPF can remain effective with a fixed number of particles independent of the time horizon. Our result is based on analysis of a new coupling of two CBPFs, the coupled conditional backward sampling particle filter (CCBPF). We show that CCBPF has good stability properties in the sense that with fixed number of particles, the coupling time in terms of iterations increases only linearly with respect to the time horizon under a general (strong mixing) condition. The CCBPF is useful not only as a theoretical tool, but also as a practical method that allows for unbiased estimation of smoothing expectations, following the recent developments by Jacob et al. (to appear). Unbiased estimation has many advantages, such as enabling the construction of asymptotically exact confidence intervals and straightforward parallelisation.
△ Less
Submitted 28 August, 2019; v1 submitted 15 June, 2018;
originally announced June 2018.
-
Double Kodaira fibrations with small signature
Authors:
Ju A Lee,
Michael Lönne,
Sönke Rollenske
Abstract:
Kodaira fibrations are surfaces of general type with a non-isotrivial fibration, which are differentiable fibre bundles. They are known to have positive signature divisible by $4$. Examples are known only with signature 16 and more. We review approaches to construct examples of low signature which admit two independent fibrations. Special attention is paid to ramified covers of product of curves w…
▽ More
Kodaira fibrations are surfaces of general type with a non-isotrivial fibration, which are differentiable fibre bundles. They are known to have positive signature divisible by $4$. Examples are known only with signature 16 and more. We review approaches to construct examples of low signature which admit two independent fibrations. Special attention is paid to ramified covers of product of curves which we analyse by studying the monodromy action for bundles of punctured curves.
As a by-product we obtain a classification of all fix-point-free automorphisms on curves of genus at most $9$.
△ Less
Submitted 6 November, 2017;
originally announced November 2017.
-
Equality in the Spacetime Positive Mass Theorem
Authors:
Lan-Hsuan Huang,
Dan A. Lee
Abstract:
We affirm the rigidity conjecture of the spacetime positive mass theorem in dimensions less than eight. Namely, if an asymptotically flat initial data set satisfies the dominant energy condition and has $E=|P|$, then $E=|P|=0$, where $(E, P)$ is the ADM energy-momentum vector. The dimensional restriction can be removed if we assume the positive mass inequality holds. Previously the result was only…
▽ More
We affirm the rigidity conjecture of the spacetime positive mass theorem in dimensions less than eight. Namely, if an asymptotically flat initial data set satisfies the dominant energy condition and has $E=|P|$, then $E=|P|=0$, where $(E, P)$ is the ADM energy-momentum vector. The dimensional restriction can be removed if we assume the positive mass inequality holds. Previously the result was only known for spin manifolds.
△ Less
Submitted 26 November, 2019; v1 submitted 12 June, 2017;
originally announced June 2017.
-
Variance bounding of delayed-acceptance kernels
Authors:
Chris Sherlock,
Anthony Lee
Abstract:
A delayed-acceptance version of a Metropolis--Hastings algorithm can be useful for Bayesian inference when it is computationally expensive to calculate the true posterior, but a computationally cheap approximation is available; the delayed-acceptance kernel targets the same posterior as its associated "parent" Metropolis-Hastings kernel. Although the asymptotic variance of the ergodic average of a…
▽ More
A delayed-acceptance version of a Metropolis--Hastings algorithm can be useful for Bayesian inference when it is computationally expensive to calculate the true posterior, but a computationally cheap approximation is available; the delayed-acceptance kernel targets the same posterior as its associated "parent" Metropolis-Hastings kernel. Although the asymptotic variance of the ergodic average of any functional of the chain cannot be less than that obtained using its parent, the average computational time per iteration can be much smaller and so for a given computational budget the delayed-acceptance kernel can be more efficient.
When the asymptotic variance of the ergodic averages of all $L^2$ functionals of the chain is finite, the kernel is said to be variance bounding. It has recently been noted that a delayed-acceptance kernel need not be variance bounding even when its parent is. We provide sufficient conditions for inheritance: for non-local algorithms, such as the independence sampler, the discrepancy between the log density of the approximation and that of the truth should be bounded; for local algorithms, two alternative sets of conditions are provided.
As a by-product of our initial, general result we also supply sufficient conditions on any pair of proposals such that, for any shared target distribution, if a Metropolis-Hastings kernel using one of the proposals is variance bounding then so is the Metropolis-Hastings kernel using the other proposal.
△ Less
Submitted 11 November, 2021; v1 submitted 7 June, 2017;
originally announced June 2017.
-
Pseudo-marginal Metropolis--Hastings using averages of unbiased estimators
Authors:
Chris Sherlock,
Alexandre Thiery,
Anthony Lee
Abstract:
We consider a pseudo-marginal Metropolis--Hastings kernel $P_m$ that is constructed using an average of $m$ exchangeable random variables, as well as an analogous kernel $P_s$ that averages $s<m$ of these same random variables. Using an embedding technique to facilitate comparisons, we show that the asymptotic variances of ergodic averages associated with $P_m$ are lower bounded in terms of those…
▽ More
We consider a pseudo-marginal Metropolis--Hastings kernel $P_m$ that is constructed using an average of $m$ exchangeable random variables, as well as an analogous kernel $P_s$ that averages $s<m$ of these same random variables. Using an embedding technique to facilitate comparisons, we show that the asymptotic variances of ergodic averages associated with $P_m$ are lower bounded in terms of those associated with $P_s$. We show that the bound provided is tight and disprove a conjecture that when the random variables to be averaged are independent, the asymptotic variance under $P_m$ is never less than $s/m$ times the variance under $P_s$. The conjecture does, however, hold when considering continuous-time Markov chains. These results imply that if the computational cost of the algorithm is proportional to $m$, it is often better to set $m=1$. We provide intuition as to why these findings differ so markedly from recent results for pseudo-marginal kernels employing particle filter approximations. Our results are exemplified through two simulation studies; in the first the computational cost is effectively proportional to $m$ and in the second there is a considerable start-up cost at each iteration.
△ Less
Submitted 31 October, 2016;
originally announced October 2016.
-
Proceedings of the third "international Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST'16)
Authors:
V. Abrol,
O. Absil,
P. -A. Absil,
S. Anthoine,
P. Antoine,
T. Arildsen,
N. Bertin,
F. Bleichrodt,
J. Bobin,
A. Bol,
A. Bonnefoy,
F. Caltagirone,
V. Cambareri,
C. Chenot,
V. Crnojević,
M. Daňková,
K. Degraux,
J. Eisert,
J. M. Fadili,
M. Gabrié,
N. Gac,
D. Giacobello,
A. Gonzalez,
C. A. Gomez Gonzalez,
A. González
, et al. (36 additional authors not shown)
Abstract:
The third edition of the "international - Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST) took place in Aalborg, the 4th largest city in Denmark situated beautifully in the northern part of the country, from the 24th to 26th of August 2016. The workshop venue was at the Aalborg University campus. One implicit objective of this biennial workshop is to foster collab…
▽ More
The third edition of the "international - Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST) took place in Aalborg, the 4th largest city in Denmark situated beautifully in the northern part of the country, from the 24th to 26th of August 2016. The workshop venue was at the Aalborg University campus. One implicit objective of this biennial workshop is to foster collaboration between international scientific teams by disseminating ideas through both specific oral/poster presentations and free discussions. For this third edition, iTWIST'16 gathered about 50 international participants and features 8 invited talks, 12 oral presentations, and 12 posters on the following themes, all related to the theory, application and generalization of the "sparsity paradigm": Sparsity-driven data sensing and processing (e.g., optics, computer vision, genomics, biomedical, digital communication, channel estimation, astronomy); Application of sparse models in non-convex/non-linear inverse problems (e.g., phase retrieval, blind deconvolution, self calibration); Approximate probabilistic inference for sparse problems; Sparse machine learning and inference; "Blind" inverse problems and dictionary learning; Optimization for sparse modelling; Information theory, geometry and randomness; Sparsity? What's next? (Discrete-valued signals; Union of low-dimensional spaces, Cosparsity, mixed/group norm, model-based, low-complexity models, ...); Matrix/manifold sensing/processing (graph, low-rank approximation, ...); Complexity/accuracy tradeoffs in numerical methods/optimization; Electronic/optical compressive sensors (hardware).
△ Less
Submitted 14 September, 2016;
originally announced September 2016.
-
Blind Deconvolution of PET Images using Anatomical Priors
Authors:
Stéphanie Guérit,
Adriana González,
Anne Bol,
John A. Lee,
Laurent Jacques
Abstract:
Images from positron emission tomography (PET) provide metabolic information about the human body. They present, however, a spatial resolution that is limited by physical and instrumental factors often modeled by a blurring function. Since this function is typically unknown, blind deconvolution (BD) techniques are needed in order to produce a useful restored PET image. In this work, we propose a g…
▽ More
Images from positron emission tomography (PET) provide metabolic information about the human body. They present, however, a spatial resolution that is limited by physical and instrumental factors often modeled by a blurring function. Since this function is typically unknown, blind deconvolution (BD) techniques are needed in order to produce a useful restored PET image. In this work, we propose a general BD technique that restores a low resolution blurry image using information from data acquired with a high resolution modality (e.g., CT-based delineation of regions with uniform activity in PET images). The proposed BD method is validated on synthetic and actual phantoms.
△ Less
Submitted 5 August, 2016;
originally announced August 2016.
-
Lower semicontinuity of mass under $C^0$ convergence and Huisken's isoperimetric mass
Authors:
Jeffrey L. Jauregui,
Dan A. Lee
Abstract:
Given a sequence of asymptotically flat 3-manifolds of nonnegative scalar curvature with outermost minimal boundary, converging in the pointed $C^0$ Cheeger--Gromov sense to an asymptotically flat limit space, we show that the total mass of the limit is bounded above by the liminf of the total masses of the sequence. In other words, total mass is lower semicontinuous under such convergence. In ord…
▽ More
Given a sequence of asymptotically flat 3-manifolds of nonnegative scalar curvature with outermost minimal boundary, converging in the pointed $C^0$ Cheeger--Gromov sense to an asymptotically flat limit space, we show that the total mass of the limit is bounded above by the liminf of the total masses of the sequence. In other words, total mass is lower semicontinuous under such convergence. In order to prove this, we use Huisken's isoperimetric mass concept, together with a modified weak mean curvature flow argument. We include a brief discussion of Huisken's work before explaining our extension of that work. The results are all specific to three dimensions.
△ Less
Submitted 1 February, 2016;
originally announced February 2016.
-
Surface bundles over surfaces with a fixed signature
Authors:
Ju A Lee
Abstract:
The signature of a surface bundle over a surface is known to be divisible by 4. It is also known that the signature vanishes if the fiber genus is less than or equal to 2 or the base genus is less than or equal to 1. In this article, we construct new smooth 4-manifolds with signature 4 which are surface bundles over surfaces with small fiber and base genera. From these we derive improved upper bou…
▽ More
The signature of a surface bundle over a surface is known to be divisible by 4. It is also known that the signature vanishes if the fiber genus is less than or equal to 2 or the base genus is less than or equal to 1. In this article, we construct new smooth 4-manifolds with signature 4 which are surface bundles over surfaces with small fiber and base genera. From these we derive improved upper bounds for the minimal genus of surfaces representing the second homology classes of a map** class group.
△ Less
Submitted 5 June, 2016; v1 submitted 20 November, 2015;
originally announced November 2015.
-
Sharp Interface Limits of the Cahn-Hilliard Equation with Degenerate Mobility
Authors:
Alpha Albert Lee,
Andreas Münch,
Endre Süli
Abstract:
In this work, the sharp interface limit of the degenerate Cahn-Hilliard equation (in two space dimensions) with a polynomial double well free energy and a quadratic mobility is derived via a matched asymptotic analysis involving exponentially large and small terms and multiple inner layers. In contrast to some results found in the literature, our analysis reveals that the interface motion is drive…
▽ More
In this work, the sharp interface limit of the degenerate Cahn-Hilliard equation (in two space dimensions) with a polynomial double well free energy and a quadratic mobility is derived via a matched asymptotic analysis involving exponentially large and small terms and multiple inner layers. In contrast to some results found in the literature, our analysis reveals that the interface motion is driven by a combination of surface diffusion flux proportional to the surface Laplacian of the interface curvature and an additional contribution from nonlinear, porous-medium type bulk diffusion, For higher degenerate mobilities, bulk diffusion is subdominant. The sharp interface models are corroborated by comparing relaxation rates of perturbations to a radially symmetric stationary state with those obtained by the phase field model.
△ Less
Submitted 9 July, 2015;
originally announced July 2015.
-
Post-Reconstruction Deconvolution of PET Images by Total Generalized Variation Regularization
Authors:
Stéphanie Guérit,
Laurent Jacques,
Benoît Macq,
John A. Lee
Abstract:
Improving the quality of positron emission tomography (PET) images, affected by low resolution and high level of noise, is a challenging task in nuclear medicine and radiotherapy. This work proposes a restoration method, achieved after tomographic reconstruction of the images and targeting clinical situations where raw data are often not accessible. Based on inverse problem methods, our contributi…
▽ More
Improving the quality of positron emission tomography (PET) images, affected by low resolution and high level of noise, is a challenging task in nuclear medicine and radiotherapy. This work proposes a restoration method, achieved after tomographic reconstruction of the images and targeting clinical situations where raw data are often not accessible. Based on inverse problem methods, our contribution introduces the recently developed total generalized variation (TGV) norm to regularize PET image deconvolution. Moreover, we stabilize this procedure with additional image constraints such as positivity and photometry invariance. A criterion for updating and adjusting automatically the regularization parameter in case of Poisson noise is also presented. Experiments are conducted on both synthetic data and real patient images.
△ Less
Submitted 16 June, 2015;
originally announced June 2015.
-
Degenerate Mobilities in Phase Field Models are Insufficient to Capture Surface Diffusion
Authors:
Alpha A Lee,
Andreas Münch,
Endre Süli
Abstract:
Phase field models frequently provide insight to phase transitions, and are robust numerical tools to solve free boundary problems corresponding to the motion of interfaces. A body of prior literature suggests that interface motion via surface diffusion is the long-time, sharp interface limit of microscopic phase field models such as the Cahn-Hilliard equation with a degenerate mobility function.…
▽ More
Phase field models frequently provide insight to phase transitions, and are robust numerical tools to solve free boundary problems corresponding to the motion of interfaces. A body of prior literature suggests that interface motion via surface diffusion is the long-time, sharp interface limit of microscopic phase field models such as the Cahn-Hilliard equation with a degenerate mobility function. Contrary to this conventional wisdom, we show that the long-time behaviour of degenerate Cahn-Hilliard equation with a polynomial free energy undergoes coarsening, reflecting the presence of bulk diffusion, rather than pure surface diffusion. This reveals an important limitation of phase field models that are frequently used to model surface diffusion.
△ Less
Submitted 23 May, 2015;
originally announced May 2015.
-
Perfect sampling for nonhomogeneous Markov chains and hidden Markov models
Authors:
Nick Whiteley,
Anthony Lee
Abstract:
We obtain a perfect sampling characterization of weak ergodicity for backward products of finite stochastic matrices, and equivalently, simultaneous tail triviality of the corresponding nonhomogeneous Markov chains. Applying these ideas to hidden Markov models, we show how to sample exactly from the finite-dimensional conditional distributions of the signal process given infinitely many observatio…
▽ More
We obtain a perfect sampling characterization of weak ergodicity for backward products of finite stochastic matrices, and equivalently, simultaneous tail triviality of the corresponding nonhomogeneous Markov chains. Applying these ideas to hidden Markov models, we show how to sample exactly from the finite-dimensional conditional distributions of the signal process given infinitely many observations, using an algorithm which requires only an almost surely finite number of observations to actually be accessed. A notion of "successful" coupling is introduced and its occurrence is characterized in terms of conditional ergodicity properties of the hidden Markov model and related to the stability of nonlinear filters.
△ Less
Submitted 6 January, 2016; v1 submitted 16 October, 2014;
originally announced October 2014.
-
The positive mass theorem for manifolds with distributional curvature
Authors:
Dan A. Lee,
Philippe G. LeFloch
Abstract:
We formulate and prove a positive mass theorem for n-dimensional spin manifolds whose metrics have only the Sobolev regularity $C^0 \cap W^{1,n}$. At this level of regularity, the curvature of the metric is defined in the distributional sense only, and we propose here a (generalized) notion of ADM mass for such a metric. Our main theorem establishes that if the manifold is asymptotically flat and…
▽ More
We formulate and prove a positive mass theorem for n-dimensional spin manifolds whose metrics have only the Sobolev regularity $C^0 \cap W^{1,n}$. At this level of regularity, the curvature of the metric is defined in the distributional sense only, and we propose here a (generalized) notion of ADM mass for such a metric. Our main theorem establishes that if the manifold is asymptotically flat and has non-negative scalar curvature distribution, then its (generalized) ADM mass is well-defined and non-negative, and vanishes only if the manifold is isometric to Euclidian space. Prior applications of Witten's spinor method by Lee and Parker and by Bartnik required the much stronger regularity $W^{2,2}$. Our proof is a generalization of Witten's arguments, in which we must treat the Dirac operator and its associated Lichnerowicz-Weitzenbock identity in the distributional sense and cope with certain averages of first-order derivatives of the metric over annuli that approach infinity. Finally, we observe that our arguments are not specific to scalar curvature and also allow us to establish a universal positive mass theorem.
△ Less
Submitted 19 August, 2014;
originally announced August 2014.
-
Intrinsic flat stability of the positive mass theorem for graphical hypersurfaces of Euclidean space
Authors:
Lan-Hsuan Huang,
Dan A. Lee,
Christina Sormani
Abstract:
The rigidity of the Positive Mass Theorem states that the only complete asymptotically flat manifold of nonnegative scalar curvature and zero mass is Euclidean space. We study the stability of this statement for spaces that can be realized as graphical hypersurfaces in Euclidean space. We prove (under certain technical hypotheses) that if a sequence of complete asymptotically flat graphs of nonneg…
▽ More
The rigidity of the Positive Mass Theorem states that the only complete asymptotically flat manifold of nonnegative scalar curvature and zero mass is Euclidean space. We study the stability of this statement for spaces that can be realized as graphical hypersurfaces in Euclidean space. We prove (under certain technical hypotheses) that if a sequence of complete asymptotically flat graphs of nonnegative scalar curvature has mass approaching zero, then the sequence must converge to Euclidean space in the pointed intrinsic flat sense. The appendix includes a new Gromov-Hausdorff and intrinsic flat compactness theorem for sequences of metric spaces with uniform Lipschitz bounds on their metrics.
△ Less
Submitted 25 May, 2015; v1 submitted 19 August, 2014;
originally announced August 2014.
-
Stability of the positive mass theorem for graphical hypersurfaces of Euclidean space
Authors:
Lan-Hsuan Huang,
Dan A. Lee
Abstract:
The rigidity of the positive mass theorem states that the only complete asymptotically flat manifold of nonnegative scalar curvature and zero mass is Euclidean space. We prove a corresponding stability theorem for spaces that can be realized as graphical hypersurfaces in $\mathbb{R}^{n+1}$. Specifically, for an asymptotically flat graphical hypersurface $M^n\subset \mathbb{R}^{n+1}$ of nonnegative…
▽ More
The rigidity of the positive mass theorem states that the only complete asymptotically flat manifold of nonnegative scalar curvature and zero mass is Euclidean space. We prove a corresponding stability theorem for spaces that can be realized as graphical hypersurfaces in $\mathbb{R}^{n+1}$. Specifically, for an asymptotically flat graphical hypersurface $M^n\subset \mathbb{R}^{n+1}$ of nonnegative scalar curvature (satisfying certain technical conditions), there is a horizontal hyperplane $Π\subset \mathbb{R}^{n+1}$ such that the flat distance between $M$ and $Π$ in any ball of radius $ρ$ can be bounded purely in terms of $n$, $ρ$, and the mass of $M$. In particular, this means that if the masses of a sequence of such graphs approach zero, then the sequence weakly converges (in the sense of currents, after a suitable vertical normalization) to a flat plane in $\mathbb{R}^{n+1}$. This result generalizes some of the earlier findings of the second author and C. Sormani and provides some evidence for a conjecture stated there.
△ Less
Submitted 18 September, 2014; v1 submitted 3 May, 2014;
originally announced May 2014.
-
Uniform Ergodicity of the Iterated Conditional SMC and Geometric Ergodicity of Particle Gibbs samplers
Authors:
Christophe Andrieu,
Anthony Lee,
Matti Vihola
Abstract:
We establish quantitative bounds for rates of convergence and asymptotic variances for iterated conditional sequential Monte Carlo (i-cSMC) Markov chains and associated particle Gibbs samplers. Our main findings are that the essential boundedness of potential functions associated with the i-cSMC algorithm provide necessary and sufficient conditions for the uniform ergodicity of the i-cSMC Markov c…
▽ More
We establish quantitative bounds for rates of convergence and asymptotic variances for iterated conditional sequential Monte Carlo (i-cSMC) Markov chains and associated particle Gibbs samplers. Our main findings are that the essential boundedness of potential functions associated with the i-cSMC algorithm provide necessary and sufficient conditions for the uniform ergodicity of the i-cSMC Markov chain, as well as quantitative bounds on its (uniformly geometric) rate of convergence. Furthermore, we show that the i-cSMC Markov chain cannot even be geometrically ergodic if this essential boundedness does not hold in many applications of interest. Our sufficiency and quantitative bounds rely on a novel non-asymptotic analysis of the expectation of a standard normalizing constant estimate with respect to a "doubly conditional" SMC algorithm. In addition, our results for i-cSMC imply that the rate of convergence can be improved arbitrarily by increasing N, the number of particles in the algorithm, and that in the presence of mixing assumptions, the rate of convergence can be kept constant by increasing N linearly with the time horizon. We translate the sufficiency of the boundedness condition for i-cSMC into sufficient conditions for the particle Gibbs Markov chain to be geometrically ergodic and quantitative bounds on its geometric rate of convergence, which imply convergence of properties of the particle Gibbs Markov chain to those of its corresponding Gibbs sampler. These results complement recently discovered, and related, conditions for the particle marginal Metropolis-Hastings (PMMH) Markov chain.
△ Less
Submitted 14 April, 2015; v1 submitted 22 December, 2013;
originally announced December 2013.
-
The Penrose inequality for asymptotically locally hyperbolic spaces with nonpositive mass
Authors:
Dan A. Lee,
André Neves
Abstract:
In the asymptotically locally hyperbolic setting it is possible to have metrics with scalar curvature at least -6 and negative mass when the genus of the conformal boundary at infinity is positive. Using inverse mean curvature flow, we prove a Penrose inequality for these negative mass metrics. The motivation comes from a previous result of P. Chruściel and W. Simon, which states that the Penrose…
▽ More
In the asymptotically locally hyperbolic setting it is possible to have metrics with scalar curvature at least -6 and negative mass when the genus of the conformal boundary at infinity is positive. Using inverse mean curvature flow, we prove a Penrose inequality for these negative mass metrics. The motivation comes from a previous result of P. Chruściel and W. Simon, which states that the Penrose inequality we prove implies a static uniqueness theorem for negative mass Kottler metrics.
△ Less
Submitted 10 October, 2013;
originally announced October 2013.
-
Feynman-Kac particle integration with geometric interacting jumps
Authors:
Pierre Del Moral,
Pierre E. Jacob,
Anthony Lee,
Lawrence Murray,
Gareth W. Peters
Abstract:
This article is concerned with the design and analysis of discrete time Feynman-Kac particle integration models with geometric interacting jump processes. We analyze two general types of model, corresponding to whether the reference process is in continuous or discrete time. For the former, we consider discrete generation particle models defined by arbitrarily fine time mesh approximations of the…
▽ More
This article is concerned with the design and analysis of discrete time Feynman-Kac particle integration models with geometric interacting jump processes. We analyze two general types of model, corresponding to whether the reference process is in continuous or discrete time. For the former, we consider discrete generation particle models defined by arbitrarily fine time mesh approximations of the Feynman-Kac models with continuous time path integrals. For the latter, we assume that the discrete process is observed at integer times and we design new approximation models with geometric interacting jumps in terms of a sequence of intermediate time steps between the integers. In both situations, we provide non asymptotic bias and variance theorems w.r.t. the time step and the size of the system, yielding what appear to be the first results of this type for this class of Feynman-Kac particle integration models. We also discuss uniform convergence estimates w.r.t. the time horizon. Our approach is based on an original semigroup analysis with first order decompositions of the fluctuation errors.
△ Less
Submitted 30 November, 2012;
originally announced November 2012.
-
Solving multivariate functional equations
Authors:
Michael Chon,
Christopher R. H. Hanusa,
Amy Lee
Abstract:
This paper presents a new method to solve functional equations of multivariate generating functions, such as $$F(r,s)=e(r,s)+xf(r,s)F(1,1)+xg(r,s)F(qr,1)+xh(r,s)F(qr,qs),$$ giving a formula for $F(r,s)$ in terms of a sum over finite sequences. We use this method to show how one would calculate the coefficients of the generating function for parallelogram polyominoes, which is impractical using oth…
▽ More
This paper presents a new method to solve functional equations of multivariate generating functions, such as $$F(r,s)=e(r,s)+xf(r,s)F(1,1)+xg(r,s)F(qr,1)+xh(r,s)F(qr,qs),$$ giving a formula for $F(r,s)$ in terms of a sum over finite sequences. We use this method to show how one would calculate the coefficients of the generating function for parallelogram polyominoes, which is impractical using other methods. We also apply this method to answer a question from fully commutative affine permutations.
△ Less
Submitted 3 December, 2013; v1 submitted 28 June, 2012;
originally announced June 2012.