Skip to main content

Showing 1–26 of 26 results for author: Šiška, D

.
  1. arXiv:2405.20250  [pdf, ps, other

    math.OC cs.LG math.PR

    Entropy annealing for policy mirror descent in continuous time and space

    Authors: Deven Sethi, David Šiška, Yufei Zhang

    Abstract: Entropy regularization has been extensively used in policy optimization algorithms to regularize the optimization landscape and accelerate convergence; however, it comes at the cost of introducing an additional regularization bias. This work quantifies the impact of entropy regularization on the convergence of policy gradient methods for stochastic exit time control problems. We analyze a continuo… ▽ More

    Submitted 6 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    MSC Class: Primary 93E20; Secondary 49M29; 68Q25; 60H30; 35J61

  2. arXiv:2401.01198  [pdf, ps, other

    math.OC math.NA math.PR

    Mirror Descent for Stochastic Control Problems with Measure-valued Controls

    Authors: Bekzhan Kerimkulov, David Šiška, Łukasz Szpruch, Yufei Zhang

    Abstract: This paper studies the convergence of the mirror descent algorithm for finite horizon stochastic control problems with measure-valued control processes. The control objective involves a convex regularisation function, denoted as $h$, with regularisation strength determined by the weight $τ\ge 0$. The setting covers regularised relaxed control problems. Under suitable conditions, we establish the r… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    MSC Class: 93E20; 49M05; 68Q25; 60H30

  3. arXiv:2310.02951  [pdf, ps, other

    math.OC cs.LG math.PR

    A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces

    Authors: Bekzhan Kerimkulov, James-Michael Leahy, David Siska, Lukasz Szpruch, Yufei Zhang

    Abstract: We study the global convergence of a Fisher-Rao policy gradient flow for infinite-horizon entropy-regularised Markov decision processes with Polish state and action space. The flow is a continuous-time analogue of a policy mirror descent method. We establish the global well-posedness of the gradient flow and demonstrate its exponential convergence to the optimal policy. Moreover, we prove the flow… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    MSC Class: 90C40; 93E20; 90C26; 60B05; 90C53

  4. arXiv:2302.04345  [pdf, other

    q-fin.MF q-fin.TR

    Inefficiency of CFMs: hedging perspective and agent-based simulations

    Authors: Samuel Cohen, Marc Sabaté Vidales, David Šiška, Łukasz Szpruch

    Abstract: We investigate whether the fee income from trades on the CFM is sufficient for the liquidity providers to hedge away the exposure to market risk. We first analyse this problem through the lens of continuous-time financial mathematics and derive an upper bound for not-arbitrage fee income that would make CFM efficient and liquidity provision fair. We then evaluate our findings by performing multi-a… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  5. arXiv:2212.05784  [pdf, ps, other

    math.OC math.PR

    The Modified MSA, a Gradient Flow and Convergence

    Authors: Deven Sethi, David Šiška

    Abstract: The modified Method of Successive Approximations (MSA) is an iterative scheme for approximating solutions to stochastic control problems in continuous time based on Pontryagin Optimality Principle which, starting with an initial open loop control, solves the forward equation, the backward adjoint equation and then performs a static minimization step. We observe that this is an implicit Euler schem… ▽ More

    Submitted 7 October, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    MSC Class: 93E20; 60H30; 37N40; 65K99

  6. arXiv:2207.12871  [pdf, other

    math.PR math.AP math.FA

    Decaying derivative estimates for functions of solutions to non-autonomous SDEs

    Authors: Maria Lefter, David Šiška, Łukasz Szpruch

    Abstract: We produce uniform and decaying bounds in time for derivatives of the solution to the backwards Kolmogorov equation associated to a stochastic processes governed by a time dependent dynamics. These hold under assumptions over the integrability properties in finite time of the derivatives of the transition density associated to the process, together with the assumption of remaining close over all… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  7. arXiv:2201.07296  [pdf, ps, other

    math.OC cs.AI cs.LG math.PR stat.ML

    Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime

    Authors: Bekzhan Kerimkulov, James-Michael Leahy, David Šiška, Lukasz Szpruch

    Abstract: We study the global convergence of policy gradient for infinite-horizon, continuous state and action space, and entropy-regularized Markov decision processes (MDPs). We consider a softmax policy with (one-hidden layer) neural network approximation in a mean-field regime. Additional entropic regularization in the associated mean-field probability measure is added, and the corresponding gradient flo… ▽ More

    Submitted 16 June, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

  8. arXiv:2011.10630  [pdf, other

    q-fin.CP

    Solving path dependent PDEs with LSTM networks and path signatures

    Authors: Marc Sabate-Vidales, David Šiška, Lukasz Szpruch

    Abstract: Using a combination of recurrent neural networks and signature methods from the rough paths theory we design efficient algorithms for solving parametric families of path dependent partial differential equations (PPDEs) that arise in pricing and hedging of path-dependent derivatives or from use of non-Markovian model, such as rough volatility models in Jacquier and Oumgari, 2019. The solutions of P… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  9. arXiv:2007.05209  [pdf, ps, other

    math.OC math.PR

    A modified MSA for stochastic control problems

    Authors: Bekzhan Kerimkulov, David Šiška, Łukasz Szpruch

    Abstract: The classical Method of Successive Approximations (MSA) is an iterative method for solving stochastic control problems and is derived from Pontryagin's optimality principle. It is known that the MSA may fail to converge. Using careful estimates for the backward stochastic differential equation (BSDE) this paper suggests a modification to the MSA algorithm. This modified MSA is shown to converge fo… ▽ More

    Submitted 17 November, 2020; v1 submitted 10 July, 2020; originally announced July 2020.

  10. arXiv:2007.04154  [pdf, other

    q-fin.MF cs.LG stat.ML

    Robust pricing and hedging via neural SDEs

    Authors: Patryk Gierjatowicz, Marc Sabate-Vidales, David Šiška, Lukasz Szpruch, Žan Žurič

    Abstract: Mathematical modelling is ubiquitous in the financial industry and drives key decision processes. Any given model provides only a crude approximation to reality and the risk of using an inadequate model is hard to detect and quantify. By contrast, modern data science techniques are opening the door to more robust and data-driven model selection mechanisms. However, most machine learning models are… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    MSC Class: 65C30; 60H35; 60H30

  11. arXiv:2006.05956  [pdf, ps, other

    math.OC cs.LG math.PR

    Gradient Flows for Regularized Stochastic Control Problems

    Authors: David Šiška, Łukasz Szpruch

    Abstract: This paper studies stochastic control problems with the action space taken to be probability measures, with the objective penalised by the relative entropy. We identify suitable metric space on which we construct a gradient flow for the measure-valued control process, in the set of admissible controls, along which the cost functional is guaranteed to decrease. It is shown that any invariant measur… ▽ More

    Submitted 25 January, 2024; v1 submitted 10 June, 2020; originally announced June 2020.

    MSC Class: 93E20; 60H30; 37L40

  12. arXiv:1912.05475  [pdf, ps, other

    math.PR math.OC stat.ML

    Mean-Field Neural ODEs via Relaxed Optimal Control

    Authors: Jean-François Jabir, David Šiška, Łukasz Szpruch

    Abstract: We develop a framework for the analysis of deep neural networks and neural ODE models that are trained with stochastic gradient algorithms. We do that by identifying the connections between control theory, deep learning and theory of statistical sampling. We derive Pontryagin's optimality principle and study the corresponding gradient flow in the form of Mean-Field Langevin dynamics (MFLD) for sol… ▽ More

    Submitted 16 March, 2021; v1 submitted 11 December, 2019; originally announced December 2019.

  13. arXiv:1911.09647  [pdf, ps, other

    math.NA cs.LG math.PR stat.ML

    Uniform error estimates for artificial neural network approximations for heat equations

    Authors: Lukas Gonon, Philipp Grohs, Arnulf Jentzen, David Kofler, David Šiška

    Abstract: Recently, artificial neural networks (ANNs) in conjunction with stochastic gradient descent optimization methods have been employed to approximately compute solutions of possibly rather high-dimensional partial differential equations (PDEs). Very recently, there have also been a number of rigorous mathematical results in the scientific literature which examine the approximation capabilities of suc… ▽ More

    Submitted 15 June, 2020; v1 submitted 20 November, 2019; originally announced November 2019.

    MSC Class: 65C99; 65M99; 60H30

    Journal ref: IMA J. Numer. Anal. (2021), 1-64

  14. arXiv:1908.00955  [pdf, ps, other

    math.PR

    Weak Existence and Uniqueness for McKean-Vlasov SDEs with Common Noise

    Authors: William R. P. Hammersley, David Šiška, Łukasz Szpruch

    Abstract: This paper concerns the McKean-Vlasov stochastic differential equation (SDE) with common noise. An appropriate definition of a weak solution to such an equation is developed. The importance of the notion of compatibility in this definition is highlighted by a demonstration of its rôle in connecting weak solutions to McKean-Vlasov SDEs with common noise and solutions to corresponding stochastic par… ▽ More

    Submitted 26 June, 2020; v1 submitted 2 August, 2019; originally announced August 2019.

  15. arXiv:1905.07769  [pdf, ps, other

    math.PR math.OC stat.ML

    Mean-Field Langevin Dynamics and Energy Landscape of Neural Networks

    Authors: Kaitong Hu, Zhenjie Ren, David Siska, Lukasz Szpruch

    Abstract: Our work is motivated by a desire to study the theoretical underpinning for the convergence of stochastic gradient type algorithms widely used for non-convex learning tasks such as training of neural networks. The key insight, already observed in the works of Mei, Montanari and Nguyen (2018), Chizat and Bach (2018) as well as Rotskoff and Vanden-Eijnden (2018), is that a certain class of the finit… ▽ More

    Submitted 13 December, 2020; v1 submitted 19 May, 2019; originally announced May 2019.

    Comments: 31 pages

    MSC Class: 60H30; 37M25

  16. arXiv:1812.07846  [pdf, other

    math.OC math.PR

    Exponential Convergence and stability of Howards's Policy Improvement Algorithm for Controlled Diffusions

    Authors: B. Kerimkulov, D. Šiška, Ł. Szpruch

    Abstract: Optimal control problems are inherently hard to solve as the optimization must be performed simultaneously with updating the underlying system. Starting from an initial guess, Howard's policy improvement algorithm separates the step of updating the trajectory of the dynamical system from the optimization and iterations of this should converge to the optimal control. In the discrete space-time sett… ▽ More

    Submitted 22 May, 2020; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: Identical to the published version except minor typographical details

    MSC Class: 93E20; 60H30; 65N12; 49L20

    Journal ref: SIAM J. Control Optim., 58(3), 1314-1340, 2020

  17. arXiv:1810.05094  [pdf, other

    q-fin.CP cs.LG math.NA

    Unbiased deep solvers for linear parametric PDEs

    Authors: Marc Sabate Vidales, David Siska, Lukasz Szpruch

    Abstract: We develop several deep learning algorithms for approximating families of parametric PDE solutions. The proposed algorithms approximate solutions together with their gradients, which in the context of mathematical finance means that the derivative prices and hedging strategies are computed simulatenously. Having approximated the gradient of the solution one can combine it with a Monte-Carlo simula… ▽ More

    Submitted 17 January, 2022; v1 submitted 11 October, 2018; originally announced October 2018.

    MSC Class: 65M75; 60H30; 91G60

  18. arXiv:1802.03974  [pdf, ps, other

    math.PR

    McKean-Vlasov SDEs under Measure Dependent Lyapunov Conditions

    Authors: William Hammersley, David Šiška, Lukasz Szpruch

    Abstract: We prove the existence of weak solutions to McKean-Vlasov SDEs defined on a domain $D \subseteq \mathbb{R}^d$ with continuous and unbounded coefficients that satisfy Lyapunov type conditions, where the Lyapunov function may depend on measure. We propose a new type of {\em integrated} Lyapunov condition, where the inequality is only required to hold when integrated against the measure on which the… ▽ More

    Submitted 30 September, 2020; v1 submitted 12 February, 2018; originally announced February 2018.

  19. $L^p$-estimates and regularity for SPDEs with monotone semilinearity

    Authors: Neelima, David Šiška

    Abstract: Semilinear stochastic partial differential equations on bounded domains $\mathscr{D}$ are considered. The semilinear term may have arbitrary polynomial growth as long as it is continuous and monotone except perhaps near the origin. Typical examples are the stochastic Allen--Cahn and Ginzburg--Landau equations. The first main result of this article are $L^p$-estimates for such equations. The $L^p$-… ▽ More

    Submitted 24 September, 2019; v1 submitted 29 May, 2017; originally announced May 2017.

    MSC Class: 60H15; 35R60

    Journal ref: Stoch PDE: Anal Comp (2019)

  20. Coercivity condition for higher order moments for nonlinear SPDEs and existence of solution under local monotonicity

    Authors: Neelima, David Šiška

    Abstract: Higher order moment estimates for solutions to nonlinear SPDEs governed by locally-monotone operators are obtained under appropriate coercivity condition. These are then used to extend known existence and uniqueness results for nonlinear SPDEs under local monotonicity conditions to allow derivatives in the operator acting on the solution under the stochastic integral.

    Submitted 9 August, 2019; v1 submitted 18 October, 2016; originally announced October 2016.

    Comments: 32 pages

    MSC Class: 60H15; 65M60; 47J35

    Journal ref: Stochastics 2019

  21. Itô Formula for Processes Taking Values in Intersection of Finitely Many Banach Spaces

    Authors: István Gyöngy, David Šiška

    Abstract: Motivated by applications to SPDEs we extend the Itô formula for the square of the norm of a semimartingale $y(t)$ from Gyöngy and Krylov (Stochastics 6(3):153-173, 1982) to the case \begin{equation*} \sum_{i=1}^m \int_{(0,t]} v_i^{\ast}(s)\,dA(s) + h(t)=:y(t)\in V \quad \text{$dA\times \mathbb{P}$-a.e.}, \end{equation*} where $A$ is an increasing right-continuous adapted process, $v_i^{\ast}$ is… ▽ More

    Submitted 20 March, 2017; v1 submitted 5 September, 2016; originally announced September 2016.

    Comments: Updated to the version published in Stochastics and Partial Differential Equations: Analysis and Computations

    MSC Class: 60H15

    Journal ref: PDE: Anal Comp (2017). doi:10.1007/s40072-017-0093-6

  22. Nonlinear stochastic evolution equations of second order with dam**

    Authors: Etienne Emmrich, David Šiška

    Abstract: Convergence of a full discretization of a second order stochastic evolution equation with nonlinear dam** is shown and thus existence of a solution is established. The discretization scheme combines an implicit time step** scheme with an internal approximation. Uniqueness is proved as well.

    Submitted 11 October, 2016; v1 submitted 31 December, 2015; originally announced December 2015.

    Comments: This is the version of the article accepted for publication. The final publication is available at http://link.springer.com

    MSC Class: 60H15; 47J35; 60H35; 65M12

    Journal ref: Stoch PDE: Anal Comp (2016)

  23. arXiv:1407.7107  [pdf, ps, other

    math.PR math.NA

    Convergence of tamed Euler schemes for a class of stochastic evolution equations

    Authors: István Gyöngy, Sotirios Sabanis, David Šiška

    Abstract: We prove stability and convergence of a full discretization for a class of stochastic evolution equations with super-linearly growing operators appearing in the drift term. This is done using the recently developed tamed Euler method, which uses a fully explicit time step**, coupled with a Galerkin scheme for the spatial discretization.

    Submitted 13 August, 2015; v1 submitted 26 July, 2014; originally announced July 2014.

    MSC Class: 60H15; 65M12

  24. arXiv:1109.4032  [pdf, ps, other

    q-fin.CP eess.SY math.NA math.OC math.PR q-fin.PR

    Error estimates for finite difference approximations of American put option price

    Authors: David Šiška

    Abstract: Finite difference approximations to multi-asset American put option price are considered. The assets are modelled as a multi-dimensional diffusion process with variable drift and volatility. Approximation error of order one quarter with respect to the time discretisation parameter and one half with respect to the space discretisation parameter is proved by reformulating the corresponding optimal s… ▽ More

    Submitted 30 September, 2011; v1 submitted 19 September, 2011; originally announced September 2011.

    MSC Class: 65M06; 65M12; 60G40; 35R35; 91G80; 91G60

  25. arXiv:0705.2302  [pdf, ps, other

    math.PR math.OC

    On randomized stop**

    Authors: Istvan Gyongy, David Siska

    Abstract: A general result on the method of randomized stop** is proved. It is applied to optimal stop** of controlled diffusion processes with unbounded coefficients to reduce it to an optimal control problem without stop**. This is motivated by recent results of Krylov on numerical solutions to the Bellman equation.

    Submitted 15 May, 2008; v1 submitted 16 May, 2007; originally announced May 2007.

    Comments: Published in at http://dx.doi.org/10.3150/07-BEJ108 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

    Report number: IMS-BEJ-BEJ108

    Journal ref: Bernoulli 2008, Vol. 14, No. 2, 352-361

  26. On finite-difference approximations for normalized Bellman equations

    Authors: István Gyöngy, David Šiška

    Abstract: A class of stochastic optimal control problems involving optimal stop** is considered. Methods of Krylov are adapted to investigate the numerical solutions of the corresponding normalized Bellman equations and to estimate the rate of convergence of finite difference approximations for the optimal reward functions.

    Submitted 17 December, 2014; v1 submitted 27 October, 2006; originally announced October 2006.

    Comments: 36 pages, ArXiv version updated to the version accepted in Appl. Math. Optim

    MSC Class: 65M15; 35J60; 93E20

    Journal ref: Appl. Math. Optim., 60 (2009), no. 3, 297-339