Skip to main content

Showing 1–18 of 18 results for author: Weissmann, S

Searching in archive math. Search in all archives.
.
  1. arXiv:2405.13592  [pdf, other

    cs.LG math.OC

    Almost sure convergence rates of stochastic gradient methods under gradient domination

    Authors: Simon Weissmann, Sara Klein, Waïss Azizian, Leif Döring

    Abstract: Stochastic gradient methods are among the most important algorithms in training machine learning problems. While classical assumptions such as strong convexity allow a simple analysis they are rarely satisfied in applications. In recent years, global and local gradient domination properties have shown to be a more realistic replacement of strong convexity. They were proved to hold in diverse setti… ▽ More

    Submitted 27 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2402.01320  [pdf, ps, other

    math.NA stat.ME

    On the mean-field limit for Stein variational gradient descent: stability and multilevel approximation

    Authors: Simon Weissmann, Jakob Zech

    Abstract: In this paper we propose and analyze a novel multilevel version of Stein variational gradient descent (SVGD). SVGD is a recent particle based variational inference method. For Bayesian inverse problems with computationally expensive likelihood evaluations, the method can become prohibitive as it requires to evolve a discrete dynamical system over many time steps, each of which requires likelihood… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  3. arXiv:2401.11948  [pdf, ps, other

    math.NA stat.ME

    The Ensemble Kalman Filter for Dynamic Inverse Problems

    Authors: Simon Weissmann, Neil K. Chada, Xin T. Tong

    Abstract: In inverse problems, the goal is to estimate unknown model parameters from noisy observational data. Traditionally, inverse problems are solved under the assumption of a fixed forward operator describing the observation model. In this article, we consider the extension of this approach to situations where we have a dynamic forward model, motivated by applications in scientific computation and engi… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  4. arXiv:2312.13889  [pdf, ps, other

    stat.CO math.NA

    Metropolis-adjusted interacting particle sampling

    Authors: Björn Sprungk, Simon Weissmann, Jakob Zech

    Abstract: In recent years, various interacting particle samplers have been developed to sample from complex target distributions, such as those found in Bayesian inverse problems. These samplers are motivated by the mean-field limit perspective and implemented as ensembles of particles that move in the product state space according to coupled stochastic differential equations. The ensemble approximation and… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  5. arXiv:2312.13804  [pdf, ps, other

    math.NA

    On the ensemble Kalman inversion under inequality constraints

    Authors: Matei Hanu, Simon Weissmann

    Abstract: The ensemble Kalman inversion (EKI), a recently introduced optimisation method for solving inverse problems, is widely employed for the efficient and derivative-free estimation of unknown parameters. Specifically in cases involving ill-posed inverse problems and high-dimensional parameter spaces, the scheme has shown promising success. However, in its general form, the EKI does not take constraint… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    MSC Class: 65N21; 37C10; 90C56; 65M32

  6. arXiv:2310.02671  [pdf, other

    math.OC cs.LG stat.ML

    Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods

    Authors: Sara Klein, Simon Weissmann, Leif Döring

    Abstract: Markov Decision Processes (MDPs) are a formal framework for modeling and solving sequential decision-making problems. In finite-time horizons such problems are relevant for instance for optimal stop** or specific supply chain problems, but also in the training of large language models. In contrast to infinite horizon MDPs optimal policies are not stationary, policies must be learned for every si… ▽ More

    Submitted 6 May, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 54 pages, 2 figures, ICLR 2024

  7. arXiv:2208.05392  [pdf, ps, other

    math.NA

    Adaptive multilevel subset simulation with selective refinement

    Authors: Daniel Elfverson, Robert Scheichl, Simon Weissmann, F. Alejandro DiazDelaO

    Abstract: In this work we propose an adaptive multilevel version of subset simulation to estimate the probability of rare events for complex physical systems. Given a sequence of nested failure domains of increasing size, the rare event probability is expressed as a product of conditional probabilities. The proposed new estimator uses different model resolutions and varying numbers of samples across the hie… ▽ More

    Submitted 12 December, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    MSC Class: 65N30; 65C05; 65C40; 35R60

  8. arXiv:2204.13732  [pdf, other

    math.OC math.NA

    Multilevel Optimization for Inverse Problems

    Authors: Simon Weissmann, Ashia Wilson, Jakob Zech

    Abstract: Inverse problems occur in a variety of parameter identification tasks in engineering. Such problems are challenging in practice, as they require repeated evaluation of computationally expensive forward models. We introduce a unifying framework of multilevel optimization that can be applied to a wide range of optimization-based solvers. Our framework provably reduces the computational cost associat… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

    MSC Class: 65N21; 65N75; 65K10

  9. Gradient flow structure and convergence analysis of the ensemble Kalman inversion for nonlinear forward models

    Authors: Simon Weissmann

    Abstract: The ensemble Kalman inversion (EKI) is a particle based method which has been introduced as the application of the ensemble Kalman filter to inverse problems. In practice it has been widely used as derivative-free optimization method in order to estimate unknown parameters from noisy measurement data. For linear forward models the EKI can be viewed as gradient flow preconditioned by a certain samp… ▽ More

    Submitted 2 September, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    MSC Class: 37C10; 65N21; 65N75; 93D05

  10. arXiv:2112.11126  [pdf, other

    math.OC math.NA

    One-shot Learning of Surrogates in PDE-constrained Optimization Under Uncertainty

    Authors: Philipp A. Guth, Claudia Schillings, Simon Weissmann

    Abstract: We propose a general framework for machine learning based optimization under uncertainty. Our approach replaces the complex forward model by a surrogate, which is learned simultaneously in a one-shot sense when solving the optimal control problem. Our approach relies on a reformulation of the problem as a penalized empirical risk minimization problem for which we provide a consistency analysis in… ▽ More

    Submitted 22 December, 2023; v1 submitted 21 December, 2021; originally announced December 2021.

    MSC Class: 35Q93; 35R60; 60H35; 49M25

  11. Adaptive Tikhonov strategies for stochastic ensemble Kalman inversion

    Authors: Simon Weissmann, Neil K. Chada, Claudia Schillings, Xin T. Tong

    Abstract: Ensemble Kalman inversion (EKI) is a derivative-free optimizer aimed at solving inverse problems, taking motivation from the celebrated ensemble Kalman filter. The purpose of this article is to consider the introduction of adaptive Tikhonov strategies for EKI. This work builds upon Tikhonov EKI (TEKI) which was proposed for a fixed regularization constant. By adaptively learning the regularization… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    MSC Class: 65M32; 60G35; 65C35; 70F17

  12. arXiv:2107.14508  [pdf, other

    math.NA

    Continuous time limit of the stochastic ensemble Kalman inversion: Strong convergence analysis

    Authors: Dirk Blömker, Claudia Schillings, Philipp Wacker, Simon Weissmann

    Abstract: The Ensemble Kalman inversion (EKI) method is a method for the estimation of unknown parameters in the context of (Bayesian) inverse problems. The method approximates the underlying measure by an ensemble of particles and iteratively applies the ensemble Kalman update to evolve (the approximation of the) prior into the posterior measure. For the convergence analysis of the EKI it is common pract… ▽ More

    Submitted 30 July, 2021; originally announced July 2021.

    MSC Class: 65N21; 62F15; 65N75; 65C30; 90C56

  13. arXiv:2007.02677  [pdf, ps, other

    math.ST math.NA math.OC stat.ML

    Consistency analysis of bilevel data-driven learning in inverse problems

    Authors: Neil K. Chada, Claudia Schillings, Xin T. Tong, Simon Weissmann

    Abstract: One fundamental problem when solving inverse problems is how to find regularization parameters. This article considers solving this problem using data-driven bilevel optimization, i.e. we consider the adaptive learning of the regularization parameter from data by means of optimization. This approach can be interpreted as solving an empirical risk minimization problem, and we analyze its performanc… ▽ More

    Submitted 7 January, 2021; v1 submitted 6 July, 2020; originally announced July 2020.

    MSC Class: 35R30; 90C15; 62F12; 65K10

  14. arXiv:2005.02039  [pdf, ps, other

    math.NA

    Ensemble Kalman filter for neural network based one-shot inversion

    Authors: Philipp A. Guth, Claudia Schillings, Simon Weissmann

    Abstract: We study the use of novel techniques arising in machine learning for inverse problems. Our approach replaces the complex forward model by a neural network, which is trained simultaneously in a one-shot sense when estimating the unknown parameters from data, i.e. the neural network is trained only for the unknown parameter. By establishing a link to the Bayesian approach to inverse problems, an alg… ▽ More

    Submitted 14 September, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    MSC Class: 65N21; 62F15; 65N75; 90C56

  15. arXiv:1911.10832  [pdf, other

    math.NA

    Fokker-Planck particle systems for Bayesian inference: Computational approaches

    Authors: Sebastian Reich, Simon Weissmann

    Abstract: Bayesian inference can be embedded into an appropriately defined dynamics in the space of probability measures. In this paper, we take Brownian motion and its associated Fokker--Planck equation as a starting point for such embeddings and explore several interacting particle approximations. More specifically, we consider both deterministic and stochastic interacting particle systems and combine the… ▽ More

    Submitted 8 February, 2021; v1 submitted 25 November, 2019; originally announced November 2019.

  16. arXiv:1908.00696  [pdf, other

    math.NA math.OC

    On the Incorporation of Box-Constraints for Ensemble Kalman Inversion

    Authors: Neil K. Chada, Claudia Schillings, Simon Weissmann

    Abstract: The Bayesian approach to inverse problems is widely used in practice to infer unknown parameters from noisy observations. In this framework, the ensemble Kalman inversion has been successfully applied for the quantification of uncertainties in various areas of applications. In recent years, a complete analysis of the method has been developed for linear inverse problems adopting an optimization vi… ▽ More

    Submitted 14 October, 2019; v1 submitted 2 August, 2019; originally announced August 2019.

    MSC Class: 37C10; 49M15; 65M32; 65N20

  17. Well Posedness and Convergence Analysis of the Ensemble Kalman Inversion

    Authors: Dirk Blömker, Claudia Schillings, Philipp Wacker, Simon Weissmann

    Abstract: The ensemble Kalman inversion is widely used in practice to estimate unknown parameters from noisy measurement data. Its low computational costs, straightforward implementation, and non-intrusive nature makes the method appealing in various areas of application. We present a complete analysis of the ensemble Kalman inversion with perturbed observations for a fixed ensemble size when applied to lin… ▽ More

    Submitted 26 February, 2019; v1 submitted 19 October, 2018; originally announced October 2018.

  18. arXiv:0708.0979  [pdf, ps, other

    nlin.SI math.DS physics.flu-dyn

    A new doubly discrete analogue of smoke ring flow and the real time simulation of fluid flow

    Authors: Ulrich Pinkall, Boris Springborn, Steffen Weissmann

    Abstract: Modelling incompressible ideal fluids as a finite collection of vortex filaments is important in physics (super-fluidity, models for the onset of turbulence) as well as for numerical algorithms used in computer graphics for the real time simulation of smoke. Here we introduce a time-discrete evolution equation for arbitrary closed polygons in 3-space that is a discretisation of the localised ind… ▽ More

    Submitted 7 August, 2007; originally announced August 2007.

    Comments: 15 pages, 3 figures

    Journal ref: J. Phys. A: Math. Theor. 40 (2007) 12563-12576