Skip to main content

Showing 1–14 of 14 results for author: Stillfjord, T

.
  1. arXiv:2406.16649  [pdf, other

    math.OC

    Almost sure convergence of stochastic Hamiltonian descent methods

    Authors: Måns Williamson, Tony Stillfjord

    Abstract: Gradient normalization and soft clip** are two popular techniques for tackling instability issues and improving convergence of stochastic gradient descent (SGD) with momentum. In this article, we study these types of methods through the lens of dissipative Hamiltonian systems. Gradient normalization and certain types of soft clip** algorithms can be seen as (stochastic) implicit-explicit Euler… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.16640  [pdf, other

    math.OC

    Analysis of a Class of Stochastic Component-Wise Soft-Clip** Schemes

    Authors: Måns Williamson, Monika Eisenmann, Tony Stillfjord

    Abstract: Choosing the optimization algorithm that performs best on a given machine learning problem is often delicate, and there is no guarantee that current state-of-the-art algorithms will perform well across all tasks. Consequently, the more reliable methods that one has at hand, the larger the likelihood of a good end result. To this end, we introduce and analyze a large class of stochastic so-called s… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2402.13656  [pdf, other

    math.NA

    Numerical methods for closed-loop systems with non-autonomous data

    Authors: B. Baran, P. Benner, J. Saak, T. Stillfjord

    Abstract: By computing a feedback control via the linear quadratic regulator (LQR) approach and simulating a non-linear non-autonomous closed-loop system using this feedback, we combine two numerically challenging tasks. For the first task, the computation of the feedback control, we use the non-autonomous generalized differential Riccati equation (DRE), whose solution determines the time-varying feedback g… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    MSC Class: 65F45; 93A15; 93B52; 93C10

  4. arXiv:2310.13462  [pdf, other

    math.NA math.OC

    Computing the matrix exponential and the Cholesky factor of a related finite horizon Gramian

    Authors: Tony Stillfjord, Filip Tronarp

    Abstract: In this article, an efficient numerical method for computing finite-horizon controllability Gramians in Cholesky-factored form is proposed. The method is applicable to general dense matrices of moderate size and produces a Cholesky factor of the Gramian without computing the full product. In contrast to other methods applicable to this task, the proposed method is a generalization of the scaling-a… ▽ More

    Submitted 30 April, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  5. arXiv:2210.05375  [pdf, other

    math.NA

    A randomized operator splitting scheme inspired by stochastic optimization methods

    Authors: Monika Eisenmann, Tony Stillfjord

    Abstract: In this paper, we combine the operator splitting methodology for abstract evolution equations with that of stochastic methods for large-scale optimization problems. The combination results in a randomized splitting scheme, which in a given time step does not necessarily use all the parts of the split operator. This is in contrast to deterministic splitting schemes which always use every part at le… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    MSC Class: 65M12 (Primary) 65C99; 90C15; 65M55 (Secondary)

  6. arXiv:2201.12782  [pdf, other

    math.OC math.NA

    SRKCD: a stabilized Runge-Kutta method for stochastic optimization

    Authors: Tony Stillfjord, Måns Williamson

    Abstract: We introduce a family of stochastic optimization methods based on the Runge-Kutta-Chebyshev (RKC) schemes. The RKC methods are explicit methods originally designed for solving stiff ordinary differential equations by ensuring that their stability regions are of maximal size.In the optimization context, this allows for larger step sizes (learning rates) and better robustness compared to e.g. the po… ▽ More

    Submitted 30 January, 2022; originally announced January 2022.

    MSC Class: 90C15; 65K05; 65L20

  7. arXiv:2106.09286  [pdf, other

    math.OC math.NA

    Sub-linear convergence of a tamed stochastic gradient descent method in Hilbert space

    Authors: Monika Eisenmann, Tony Stillfjord

    Abstract: In this paper, we introduce the tamed stochastic gradient descent method (TSGD) for optimization problems. Inspired by the tamed Euler scheme, which is a commonly used method within the context of stochastic differential equations, TSGD is an explicit scheme that exhibits stability properties similar to those of implicit schemes. As its computational cost is essentially equivalent to that of the w… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    MSC Class: 46N10; 65K10; 90C15

  8. arXiv:2010.12348  [pdf, other

    math.OC math.NA

    Sub-linear convergence of a stochastic proximal iteration method in Hilbert space

    Authors: Monika Eisenmann, Tony Stillfjord, Måns Williamson

    Abstract: We consider a stochastic version of the proximal point algorithm for optimization problems posed on a Hilbert space. A typical application of this is supervised learning. While the method is not new, it has not been extensively analyzed in this form. Indeed, most related results are confined to the finite-dimensional setting, where error bounds could depend on the dimension of the space. On the ot… ▽ More

    Submitted 27 September, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Adjusted the setting, corrected minor typos and made several arguments and motivations more clear

    MSC Class: 46N10; 65K10; 90C15

  9. arXiv:2006.05370  [pdf, ps, other

    math.NA math.OC

    A linear implicit Euler method for the finite element discretization of a controlled stochastic heat equation

    Authors: Peter Benner, Tony Stillfjord, Christoph Trautwein

    Abstract: We consider a numerical approximation of a linear quadratic control problem constrained by the stochastic heat equation with non-homogeneous Neumann boundary conditions. This involves a combination of distributed and boundary control, as well as both distributed and boundary noise. We apply the finite element method for the spatial discretization and the linear implicit Euler method for the tempor… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    MSC Class: 65M12 (Primary) 65C30; 49N10 (Secondary)

  10. arXiv:1805.08990  [pdf, ps, other

    math.NA

    GPU acceleration of splitting schemes applied to differential matrix equations

    Authors: Hermann Mena, Lena-Maria Pfurtscheller, Tony Stillfjord

    Abstract: We consider differential Lyapunov and Riccati equations, and generalized versions thereof. Such equations arise in many different areas and are especially important within the field of optimal control. In order to approximate their solution, one may use several different kinds of numerical methods. Of these, splitting schemes are often a very competitive choice. In this article, we investigate the… ▽ More

    Submitted 22 October, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: 21 pages, 17 figures

    MSC Class: 65F30; 65Y05; 65F60

  11. arXiv:1804.02197  [pdf, ps, other

    math.OC math.NA

    Singular value decay of operator-valued differential Lyapunov and Riccati equations

    Authors: Tony Stillfjord

    Abstract: We consider operator-valued differential Lyapunov and Riccati equations, where the operators $B$ and $C$ may be relatively unbounded with respect to $A$ (in the standard notation). In this setting, we prove that the singular values of the solutions decay fast under certain conditions. In fact, the decay is exponential in the negative square root if $A$ generates an analytic semigroup and the range… ▽ More

    Submitted 25 June, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

    Comments: Corrected some misconceptions, which lead to more general results (e.g. exponential stability is no longer required). Also fixed some off-by-one errors, improved the presentation, and added/extended several remarks on possible generalizations. Now 22 pages, 8 figures

    MSC Class: 47A62; 47A11; 49N10

  12. Multiscale differential Riccati equations for linear quadratic regulator problems

    Authors: Axel Målqvist, Anna Persson, Tony Stillfjord

    Abstract: We consider approximations to the solutions of differential Riccati equations in the context of linear quadratic regulator problems, where the state equation is governed by a multiscale operator. Similarly to elliptic and parabolic problems, standard finite element discretizations perform poorly in this setting unless the grid resolves the fine-scale features of the problem. This results in unfeas… ▽ More

    Submitted 18 June, 2018; v1 submitted 14 June, 2017; originally announced June 2017.

    Comments: Accepted for publication in SIAM J. Sci. Comput. This version differs from the previous one only by the addition of Remark 7.2 and minor changes in formatting. 21 pages, 12 figures

    MSC Class: 49N10; 65N12; 65N30; 93C20

    Journal ref: SIAM J. Sci. Comput. 40(4) (2018), pp. A2406--A2426

  13. Adaptive high-order splitting schemes for large-scale differential Riccati equations

    Authors: Tony Stillfjord

    Abstract: We consider high-order splitting schemes for large-scale differential Riccati equations. Such equations arise in many different areas and are especially important within the field of optimal control. In the large-scale case, it is critical to employ structural properties of the matrix-valued solution, or the computational cost and storage requirements become infeasible. Our main contribution is th… ▽ More

    Submitted 26 June, 2017; v1 submitted 2 December, 2016; originally announced December 2016.

    Comments: 23 pages, 7 figures

    MSC Class: 15A24; 49N10; 65L05; 93A15

    Journal ref: Numer. Algor. 78(4) (2018), pp. 1129--1151

  14. Finite element convergence analysis for the thermoviscoelastic Joule heating problem

    Authors: Axel Målqvist, Tony Stillfjord

    Abstract: We consider a system of equations that model the temperature, electric potential and deformation of a thermoviscoelastic body. A typical application is a thermistor; an electrical component that can be used e.g. as a surge protector, temperature sensor or for very precise positioning. We introduce a full discretization based on standard finite elements in space and a semi-implicit Euler-type metho… ▽ More

    Submitted 20 February, 2017; v1 submitted 1 June, 2016; originally announced June 2016.

    Comments: 20 pages, 6 figures, 2 tables

    MSC Class: 65M12; 65M60; 74D05; 74H15

    Journal ref: BIT 57(3) (2017), pp.787-810