Search | arXiv e-print repository

A Geometric Embedding Approach to Multiple Games and Multiple Populations

Authors: Bastian Boll, Jonas Cassel, Peter Albers, Stefania Petra, Christoph Schnörr

Abstract: This paper studies a meta-simplex concept and geometric embedding framework for multi-population replicator dynamics. Central results are two embedding theorems which constitute a formal reduction of multi-population replicator dynamics to single-population ones. In conjunction with a robust mathematical formalism, this provides a toolset for analyzing complex multi-population models. Our framewor… ▽ More This paper studies a meta-simplex concept and geometric embedding framework for multi-population replicator dynamics. Central results are two embedding theorems which constitute a formal reduction of multi-population replicator dynamics to single-population ones. In conjunction with a robust mathematical formalism, this provides a toolset for analyzing complex multi-population models. Our framework provides a unifying perspective on different population dynamics in the literature which in particular enables to establish a formal link between multi-population and multi-game dynamics. △ Less

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2401.05196 [pdf, other]

Accelerated Bregmann divergence optimization with SMART: an information geometry point of view

Authors: Maren Raus, Yara Elshiaty, Stefania Petra

Abstract: We investigate the problem of minimizing Kullback-Leibler divergence between a linear model $Ax$ and a positive vector $b$ in different convex domains (positive orthant, $n$-dimensional box, probability simplex). Our focus is on the SMART method that employs efficient multiplicative updates. We explore the exponentiated gradient method, which can be viewed as a Bregman proximal gradient method and… ▽ More We investigate the problem of minimizing Kullback-Leibler divergence between a linear model $Ax$ and a positive vector $b$ in different convex domains (positive orthant, $n$-dimensional box, probability simplex). Our focus is on the SMART method that employs efficient multiplicative updates. We explore the exponentiated gradient method, which can be viewed as a Bregman proximal gradient method and as a Riemannian gradient descent on the parameter manifold of a corresponding distribution of the exponential family. This dual interpretation enables us to establish connections and achieve accelerated SMART iterates while smoothly incorporating constraints. The performance of the proposed acceleration schemes is demonstrated by large-scale numerical examples. △ Less

Submitted 10 January, 2024; originally announced January 2024.

Comments: 37 pages, 11 figures, 3 tables, 4 algorithms. Submitted to Journal of Applied and Numerical Optimization for the Special Issue Dedicated to Prof. Yair Censor

arXiv:2207.04934 [pdf, other]

Multilevel Geometric Optimization for Regularised Constrained Linear Inverse Problems

Authors: Sebastian Müller, Stefania Petra, Matthias Zisler

Abstract: We present a geometric multilevel optimization approach that smoothly incorporates box constraints. Given a box constrained optimization problem, we consider a hierarchy of models with varying discretization levels. Finer models are accurate but expensive to compute, while coarser models are less accurate but cheaper to compute. When working at the fine level, multilevel optimisation computes the… ▽ More We present a geometric multilevel optimization approach that smoothly incorporates box constraints. Given a box constrained optimization problem, we consider a hierarchy of models with varying discretization levels. Finer models are accurate but expensive to compute, while coarser models are less accurate but cheaper to compute. When working at the fine level, multilevel optimisation computes the search direction based on a coarser model which speeds up updates at the fine level. Moreover, exploiting geometry induced by the hierarchy the feasibility of the updates is preserved. In particular, our approach extends classical components of multigrid methods like restriction and prolongation to the Riemannian structure of our constraints. △ Less

Submitted 22 April, 2024; v1 submitted 11 July, 2022; originally announced July 2022.

Comments: 25 pages, 6 figures

MSC Class: 65K10; 49J40; 49M37; 68U10; 74P20; 90C06

Journal ref: Pure and Applied Functional Analysis, Vol. 8 (2023), No. 3, pp. 855-880

arXiv:2201.11162 [pdf, other]

Self-Certifying Classification by Linearized Deep Assignment

Authors: Bastian Boll, Alexander Zeilmann, Stefania Petra, Christoph Schnörr

Abstract: We propose a novel class of deep stochastic predictors for classifying metric data on graphs within the PAC-Bayes risk certification paradigm. Classifiers are realized as linearly parametrized deep assignment flows with random initial conditions. Building on the recent PAC-Bayes literature and data-dependent priors, this approach enables (i) to use risk bounds as training objectives for learning p… ▽ More We propose a novel class of deep stochastic predictors for classifying metric data on graphs within the PAC-Bayes risk certification paradigm. Classifiers are realized as linearly parametrized deep assignment flows with random initial conditions. Building on the recent PAC-Bayes literature and data-dependent priors, this approach enables (i) to use risk bounds as training objectives for learning posterior distributions on the hypothesis space and (ii) to compute tight out-of-sample risk certificates of randomized classifiers more efficiently than related work. Comparison with empirical test set errors illustrates the performance and practicality of this self-certifying classification method. △ Less

Submitted 18 February, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

arXiv:2108.02571 [pdf, other]

Learning Linearized Assignment Flows for Image Labeling

Authors: Alexander Zeilmann, Stefania Petra, Christoph Schnörr

Abstract: We introduce a novel algorithm for estimating optimal parameters of linearized assignment flows for image labeling. An exact formula is derived for the parameter gradient of any loss function that is constrained by the linear system of ODEs determining the linearized assignment flow. We show how to efficiently evaluate this formula using a Krylov subspace and a low-rank approximation. This enables… ▽ More We introduce a novel algorithm for estimating optimal parameters of linearized assignment flows for image labeling. An exact formula is derived for the parameter gradient of any loss function that is constrained by the linear system of ODEs determining the linearized assignment flow. We show how to efficiently evaluate this formula using a Krylov subspace and a low-rank approximation. This enables us to perform parameter learning by Riemannian gradient descent in the parameter space, without the need to backpropagate errors or to solve an adjoint equation. Experiments demonstrate that our method performs as good as highly-tuned machine learning software using automatic differentiation. Unlike methods employing automatic differentiation, our approach yields a low-dimensional representation of internal parameters and their dynamics which helps to understand how assignment flows and more generally neural networks work and perform. △ Less

Submitted 4 April, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

MSC Class: 34C40; 62H35; 68U10; 68T05; 91A22

arXiv:2009.05814 [pdf, other]

Multi-Channel Potts-Based Reconstruction for Multi-Spectral Computed Tomography

Authors: Lukas Kiefer, Stefania Petra, Martin Storath, Andreas Weinmann

Abstract: We consider reconstructing multi-channel images from measurements performed by photon-counting and energy-discriminating detectors in the setting of multi-spectral X-ray computed tomography (CT). Our aim is to exploit the strong structural correlation that is known to exist between the channels of multi-spectral CT images. To that end, we adopt the multi-channel Potts prior to jointly reconstruct… ▽ More We consider reconstructing multi-channel images from measurements performed by photon-counting and energy-discriminating detectors in the setting of multi-spectral X-ray computed tomography (CT). Our aim is to exploit the strong structural correlation that is known to exist between the channels of multi-spectral CT images. To that end, we adopt the multi-channel Potts prior to jointly reconstruct all channels. This prior produces piecewise constant solutions with strongly correlated channels. In particular, edges are enforced to have the same spatial position across channels which is a benefit over TV-based methods. We consider the Potts prior in two frameworks: (a) in the context of a variational Potts model, and (b) in a Potts-superiorization approach that perturbs the iterates of a basic iterative least squares solver. We identify an alternating direction method of multipliers (ADMM) approach as well as a Potts-superiorized conjugate gradient method as particularly suitable. In numerical experiments, we compare the Potts prior based approaches to existing TV-type approaches on realistically simulated multi-spectral CT data and obtain improved reconstruction for compound solid bodies. △ Less

Submitted 10 March, 2021; v1 submitted 12 September, 2020; originally announced September 2020.

Comments: 37 pages, 12 figures

MSC Class: 94A08; 68U10; 65D18; 90C26; 90C39

arXiv:1911.05498 [pdf, other]

Superiorization vs. Accelerated Convex Optimization: The Superiorized/Regularized Least-Squares Case

Authors: Yair Censor, Stefania Petra, Christoph Schnörr

Abstract: We conduct a study and comparison of superiorization and optimization approaches for the reconstruction problem of superiorized/regularized least-squares solutions of underdetermined linear equations with nonnegativity variable bounds. Regarding superiorization, the state of the art is examined for this problem class, and a novel approach is proposed that employs proximal map**s and is structura… ▽ More We conduct a study and comparison of superiorization and optimization approaches for the reconstruction problem of superiorized/regularized least-squares solutions of underdetermined linear equations with nonnegativity variable bounds. Regarding superiorization, the state of the art is examined for this problem class, and a novel approach is proposed that employs proximal map**s and is structurally similar to the established forward-backward optimization approach. Regarding convex optimization, accelerated forward-backward splitting with inexact proximal maps is worked out and applied to both the natural splitting least-squares term/regularizer and to the reverse splitting regularizer/least-squares term. Our numerical findings suggest that superiorization can approach the solution of the optimization problem and leads to comparable results at significantly lower costs, after appropriate parameter tuning. On the other hand, applying accelerated forward-backward optimization to the reverse splitting slightly outperforms superiorization, which suggests that convex optimization can approach superiorization too, using a suitable problem splitting. △ Less

Submitted 1 April, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

MSC Class: 65F22; 65F10; 90C25; 90C06 ACM Class: G.1.6; G.1.3; G.4

arXiv:1910.09976 [pdf, other]

Learning Adaptive Regularization for Image Labeling Using Geometric Assignment

Authors: Ruben Hühnerbein, Fabrizio Savarino, Stefania Petra, Christoph Schnörr

Abstract: We study the inverse problem of model parameter learning for pixelwise image labeling, using the linear assignment flow and training data with ground truth. This is accomplished by a Riemannian gradient flow on the manifold of parameters that determine the regularization properties of the assignment flow. Using the symplectic partitioned Runge--Kutta method for numerical integration, it is shown t… ▽ More We study the inverse problem of model parameter learning for pixelwise image labeling, using the linear assignment flow and training data with ground truth. This is accomplished by a Riemannian gradient flow on the manifold of parameters that determine the regularization properties of the assignment flow. Using the symplectic partitioned Runge--Kutta method for numerical integration, it is shown that deriving the sensitivity conditions of the parameter learning problem and its discretization commute. A convenient property of our approach is that learning is based on exact inference. Carefully designed experiments demonstrate the performance of our approach, the expressiveness of the mathematical model as well as its limitations, from the viewpoint of statistical learning and optimal control. △ Less

Submitted 25 June, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

MSC Class: 62H35; 68U10; 68T05; 90C31; 62M45; 91A22

arXiv:1904.10863 [pdf, other]

doi 10.1007/s10851-019-00935-7

Unsupervised Assignment Flow: Label Learning on Feature Manifolds by Spatially Regularized Geometric Assignment

Authors: Artjom Zern, Matthias Zisler, Stefania Petra, Christoph Schnörr

Abstract: This paper introduces the unsupervised assignment flow that couples the assignment flow for supervised image labeling with Riemannian gradient flows for label evolution on feature manifolds. The latter component of the approach encompasses extensions of state-of-the-art clustering approaches to manifold-valued data. Coupling label evolution with the spatially regularized assignment flow induces a… ▽ More This paper introduces the unsupervised assignment flow that couples the assignment flow for supervised image labeling with Riemannian gradient flows for label evolution on feature manifolds. The latter component of the approach encompasses extensions of state-of-the-art clustering approaches to manifold-valued data. Coupling label evolution with the spatially regularized assignment flow induces a sparsifying effect that enables to learn compact label dictionaries in an unsupervised manner. Our approach alleviates the requirement for supervised labeling to have proper labels at hand, because an initial set of labels can evolve and adapt to better values while being assigned to given data. The separation between feature and assignment manifolds enables the flexible application which is demonstrated for three scenarios with manifold-valued features. Experiments demonstrate a beneficial effect in both directions: adaptivity of labels improves image labeling, and steering label evolution by spatially regularized assignments leads to proper labels, because the assignment flow for supervised labeling is exactly used without any approximation for label learning. △ Less

Submitted 16 December, 2019; v1 submitted 24 April, 2019; originally announced April 2019.

Comments: 34 pages, 13 figures, published in Journal of Mathematical Imaging and Vision (JMIV)

arXiv:1812.10471 [pdf, other]

Performance Bounds For Co-/Sparse Box Constrained Signal Recovery

Authors: Jan Kuske, Stefania Petra

Abstract: The recovery of structured signals from a few linear measurements is a central point in both compressed sensing (CS) and discrete tomography. In CS the signal structure is described by means of a low complexity model e.g. co-/sparsity. The CS theory shows that any signal/image can be undersampled at a rate dependent on its intrinsic complexity. Moreover, in such undersampling regimes, the signal c… ▽ More The recovery of structured signals from a few linear measurements is a central point in both compressed sensing (CS) and discrete tomography. In CS the signal structure is described by means of a low complexity model e.g. co-/sparsity. The CS theory shows that any signal/image can be undersampled at a rate dependent on its intrinsic complexity. Moreover, in such undersampling regimes, the signal can be recovered by sparsity promoting convex regularization like $\ell_1$- or total variation (TV-) minimization. Precise relations between many low complexity measures and the sufficient number of random measurements are known for many sparsity promoting norms. However, a precise estimate of the undersampling rate for the TV seminorm is still lacking. We address this issue by: a) providing dual certificates testing uniqueness of a given cosparse signal with bounded signal values, b) approximating the undersampling rates via the statistical dimension of the TV descent cone and c) showing empirically that the provided rates also hold for tomographic measurements. △ Less

Submitted 23 December, 2018; originally announced December 2018.

arXiv:1810.06970 [pdf, other]

doi 10.1088/1361-6420/ab2772

Geometric Numerical Integration of the Assignment Flow

Authors: Alexander Zeilmann, Fabrizio Savarino, Stefania Petra, Christoph Schnörr

Abstract: The assignment flow is a smooth dynamical system that evolves on an elementary statistical manifold and performs contextual data labeling on a graph. We derive and introduce the linear assignment flow that evolves nonlinearly on the manifold, but is governed by a linear ODE on the tangent space. Various numerical schemes adapted to the mathematical structure of these two models are designed and st… ▽ More The assignment flow is a smooth dynamical system that evolves on an elementary statistical manifold and performs contextual data labeling on a graph. We derive and introduce the linear assignment flow that evolves nonlinearly on the manifold, but is governed by a linear ODE on the tangent space. Various numerical schemes adapted to the mathematical structure of these two models are designed and studied, for the geometric numerical integration of both flows: embedded Runge-Kutta-Munthe-Kaas schemes for the nonlinear flow, adaptive Runge-Kutta schemes and exponential integrators for the linear flow. All algorithms are parameter free, except for setting a tolerance value that specifies adaptive step size selection by monitoring the local integration error, or fixing the dimension of the Krylov subspace approximation. These algorithms provide a basis for applying the assignment flow to machine learning scenarios beyond supervised labeling, including unsupervised labeling and learning from controlled assignment flows. △ Less

Submitted 5 October, 2018; originally announced October 2018.

MSC Class: 62H35; 62M40; 65K10; 68U10

arXiv:1703.03769 [pdf, other]

doi 10.1007/978-3-319-58771-4_19

A Novel Convex Relaxation for Non-Binary Discrete Tomography

Authors: Jan Kuske, Paul Swoboda, Stefania Petra

Abstract: We present a novel convex relaxation and a corresponding inference algorithm for the non-binary discrete tomography problem, that is, reconstructing discrete-valued images from few linear measurements. In contrast to state of the art approaches that split the problem into a continuous reconstruction problem for the linear measurement constraints and a discrete labeling problem to enforce discrete-… ▽ More We present a novel convex relaxation and a corresponding inference algorithm for the non-binary discrete tomography problem, that is, reconstructing discrete-valued images from few linear measurements. In contrast to state of the art approaches that split the problem into a continuous reconstruction problem for the linear measurement constraints and a discrete labeling problem to enforce discrete-valued reconstructions, we propose a joint formulation that addresses both problems simultaneously, resulting in a tighter convex relaxation. For this purpose a constrained graphical model is set up and evaluated using a novel relaxation optimized by dual decomposition. We evaluate our approach experimentally and show superior solutions both mathematically (tighter relaxation) and experimentally in comparison to previously proposed relaxations. △ Less

Submitted 10 March, 2017; originally announced March 2017.

arXiv:1603.05285 [pdf, other]

doi 10.1007/s10851-016-0702-4

Image Labeling by Assignment

Authors: Freddie Åström, Stefania Petra, Bernhard Schmitzer, Christoph Schnörr

Abstract: We introduce a novel geometric approach to the image labeling problem. Abstracting from specific labeling applications, a general objective function is defined on a manifold of stochastic matrices, whose elements assign prior data that are given in any metric space, to observed image measurements. The corresponding Riemannian gradient flow entails a set of replicator equations, one for each data p… ▽ More We introduce a novel geometric approach to the image labeling problem. Abstracting from specific labeling applications, a general objective function is defined on a manifold of stochastic matrices, whose elements assign prior data that are given in any metric space, to observed image measurements. The corresponding Riemannian gradient flow entails a set of replicator equations, one for each data point, that are spatially coupled by geometric averaging on the manifold. Starting from uniform assignments at the barycenter as natural initialization, the flow terminates at some global maximum, each of which corresponds to an image labeling that uniquely assigns the prior data. Our geometric variational approach constitutes a smooth non-convex inner approximation of the general image labeling problem, implemented with sparse interior-point numerics in terms of parallel multiplicative updates that converge efficiently. △ Less

Submitted 16 March, 2016; originally announced March 2016.

MSC Class: 62H35; 65K05; 68U10; 62M40

arXiv:1504.00231 [pdf, ps, other]

Single Projection Kaczmarz Extended Algorithms

Authors: Stefania Petra, Constantin Popa

Abstract: To find the least squares solution of a very large and inconsistent system of equations, one can employ the extended Kaczmarz algorithm. This method simultaneously removes the error term, such that a consistent system is asymptotically obtained, and applies Kaczmarz iterations for the current approximation of this system. For random corrections of the right hand side and Kaczmarz updates selected… ▽ More To find the least squares solution of a very large and inconsistent system of equations, one can employ the extended Kaczmarz algorithm. This method simultaneously removes the error term, such that a consistent system is asymptotically obtained, and applies Kaczmarz iterations for the current approximation of this system. For random corrections of the right hand side and Kaczmarz updates selected at random, convergence to the least squares solution has been shown. We consider the deterministic control strategies, and show convergence to a least squares solution when row and column updates are chosen according to the almost-cyclic or maximal-residual choice. △ Less

Submitted 1 April, 2015; originally announced April 2015.

Comments: 14 pages

MSC Class: 65F10; 65F20; 90C06; 90C25 ACM Class: G.1.3; G.1.6

arXiv:1311.0423 [pdf, other]

Phase Transitions and Cosparse Tomographic Recovery of Compound Solid Bodies from Few Projections

Authors: Andreea Deniţiu, Stefania Petra, Claudius Schnörr, Christoph Schnörr

Abstract: We study unique recovery of cosparse signals from limited-angle tomographic measurements of two- and three-dimensional domains. Admissible signals belong to the union of subspaces defined by all cosupports of maximal cardinality $\ell$ with respect to the discrete gradient operator. We relate $\ell$ both to the number of measurements and to a nullspace condition with respect to the measurement mat… ▽ More We study unique recovery of cosparse signals from limited-angle tomographic measurements of two- and three-dimensional domains. Admissible signals belong to the union of subspaces defined by all cosupports of maximal cardinality $\ell$ with respect to the discrete gradient operator. We relate $\ell$ both to the number of measurements and to a nullspace condition with respect to the measurement matrix, so as to achieve unique recovery by linear programming. These results are supported by comprehensive numerical experiments that show a high correlation of performance in practice and theoretical predictions. Despite poor properties of the measurement matrix from the viewpoint of compressed sensing, the class of uniquely recoverable signals basically seems large enough to cover practical applications, like contactless quality inspection of compound solid bodies composed of few materials. △ Less

Submitted 2 November, 2013; originally announced November 2013.

MSC Class: 65F22; 68U10

arXiv:1209.4316 [pdf, other]

Critical Parameter Values and Reconstruction Properties of Discrete Tomography: Application to Experimental Fluid Dynamics

Authors: Stefania Petra, Christoph Schnörr, Andreas Schröder

Abstract: We analyze representative ill-posed scenarios of tomographic PIV with a focus on conditions for unique volume reconstruction. Based on sparse random seedings of a region of interest with small particles, the corresponding systems of linear projection equations are probabilistically analyzed in order to determine (i) the ability of unique reconstruction in terms of the imaging geometry and the crit… ▽ More We analyze representative ill-posed scenarios of tomographic PIV with a focus on conditions for unique volume reconstruction. Based on sparse random seedings of a region of interest with small particles, the corresponding systems of linear projection equations are probabilistically analyzed in order to determine (i) the ability of unique reconstruction in terms of the imaging geometry and the critical sparsity parameter, and (ii) sharpness of the transition to non-unique reconstruction with ghost particles when choosing the sparsity parameter improperly. The sparsity parameter directly relates to the seeding density used for PIV in experimental fluids dynamics that is chosen empirically to date. Our results provide a basic mathematical characterization of the PIV volume reconstruction problem that is an essential prerequisite for any algorithm used to actually compute the reconstruction. Moreover, we connect the sparse volume function reconstruction problem from few tomographic projections to major developments in compressed sensing. △ Less

Submitted 19 September, 2012; originally announced September 2012.

Comments: 22 pages, submitted to Fundamenta Informaticae. arXiv admin note: text overlap with arXiv:1208.5894

MSC Class: 65F22; 68U10

arXiv:1208.5894 [pdf, other]

Average Case Recovery Analysis of Tomographic Compressive Sensing

Authors: Stefania Petra, Christoph Schnörr

Abstract: The reconstruction of three-dimensional sparse volume functions from few tomographic projections constitutes a challenging problem in image reconstruction and turns out to be a particular instance problem of compressive sensing. The tomographic measurement matrix encodes the incidence relation of the imaging process, and therefore is not subject to design up to small perturbations of non-zero entr… ▽ More The reconstruction of three-dimensional sparse volume functions from few tomographic projections constitutes a challenging problem in image reconstruction and turns out to be a particular instance problem of compressive sensing. The tomographic measurement matrix encodes the incidence relation of the imaging process, and therefore is not subject to design up to small perturbations of non-zero entries. We present an average case analysis of the recovery properties and a corresponding tail bound to establish weak thresholds, in excellent agreement with numerical experiments. Our result improve the state-of-the-art of tomographic imaging in experimental fluid dynamics by a factor of three. △ Less

Submitted 30 August, 2012; v1 submitted 29 August, 2012; originally announced August 2012.

MSC Class: 65F22; 68U10

Showing 1–17 of 17 results for author: Petra, S