Skip to main content

Showing 1–22 of 22 results for author: Szpruch, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.15834  [pdf, ps, other

    math.OC cs.LG math.PR

    A Fisher-Rao gradient flow for entropic mean-field min-max games

    Authors: Razvan-Andrei Lascu, Mateusz B. Majka, Łukasz Szpruch

    Abstract: Gradient flows play a substantial role in addressing many machine learning problems. We examine the convergence in continuous-time of a \textit{Fisher-Rao} (Mean-Field Birth-Death) gradient flow in the context of solving convex-concave min-max games with entropy regularization. We propose appropriate Lyapunov functions to demonstrate convergence with explicit rates to the unique mixed Nash equilib… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 18 pages. arXiv admin note: substantial text overlap with arXiv:2306.03033

  2. arXiv:2405.03624  [pdf, ps, other

    cs.LG math.OC q-fin.ST stat.ML

    $ε$-Policy Gradient for Online Pricing

    Authors: Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang

    Abstract: Combining model-based and model-free reinforcement learning approaches, this paper proposes and analyzes an $ε$-policy gradient algorithm for the online pricing learning task. The algorithm extends $ε$-greedy algorithm by replacing greedy exploitation with gradient descent step and facilitates learning via model inference. We optimize the regret of the proposed algorithm by quantifying the explora… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    MSC Class: 62J12; 68Q32; 65Y20

  3. arXiv:2402.08106  [pdf, ps, other

    math.OC cs.LG math.PR

    Mirror Descent-Ascent for mean-field min-max problems

    Authors: Razvan-Andrei Lascu, Mateusz B. Majka, Łukasz Szpruch

    Abstract: We study two variants of the mirror descent-ascent algorithm for solving min-max problems on the space of measures: simultaneous and sequential. We work under assumptions of convexity-concavity and relative smoothness of the payoff function with respect to a suitable Bregman divergence, defined on the space of measures via flat derivatives. We show that the convergence rates to mixed Nash equilibr… ▽ More

    Submitted 28 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 33 pages; updated introduction

  4. arXiv:2402.07025  [pdf, other

    stat.ML cs.IT cs.LG

    Generalization Error of Graph Neural Networks in the Mean-field Regime

    Authors: Gholamali Aminian, Yixuan He, Gesine Reinert, Łukasz Szpruch, Samuel N. Cohen

    Abstract: This work provides a theoretical framework for assessing the generalization error of graph neural networks in the over-parameterized regime, where the number of parameters surpasses the quantity of data points. We explore two widely utilized types of graph neural networks: graph convolutional neural networks and message passing graph neural networks. Prior to this study, existing bounds on the gen… ▽ More

    Submitted 1 July, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted in ICML 2024

  5. arXiv:2310.02951  [pdf, ps, other

    math.OC cs.LG math.PR

    A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces

    Authors: Bekzhan Kerimkulov, James-Michael Leahy, David Siska, Lukasz Szpruch, Yufei Zhang

    Abstract: We study the global convergence of a Fisher-Rao policy gradient flow for infinite-horizon entropy-regularised Markov decision processes with Polish state and action space. The flow is a continuous-time analogue of a policy mirror descent method. We establish the global well-posedness of the gradient flow and demonstrate its exponential convergence to the optimal policy. Moreover, we prove the flow… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    MSC Class: 90C40; 93E20; 90C26; 60B05; 90C53

  6. arXiv:2308.16538  [pdf

    cs.AI

    The AI Revolution: Opportunities and Challenges for the Finance Sector

    Authors: Carsten Maple, Lukasz Szpruch, Gregory Epiphaniou, Kalina Staykova, Simran Singh, William Penwarden, Yisi Wen, Zijian Wang, Jagdish Hariharan, Pavle Avramovic

    Abstract: This report examines Artificial Intelligence (AI) in the financial sector, outlining its potential to revolutionise the industry and identify its challenges. It underscores the criticality of a well-rounded understanding of AI, its capabilities, and its implications to effectively leverage its potential while mitigating associated risks. The potential of AI potential extends from augmenting existi… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  7. arXiv:2308.06935  [pdf, other

    q-fin.PR cs.LG q-fin.ST

    Insurance pricing on price comparison websites via reinforcement learning

    Authors: Tanut Treetanthiploet, Yufei Zhang, Lukasz Szpruch, Isaac Bowers-Barnard, Henrietta Ridley, James Hickey, Chris Pearce

    Abstract: The emergence of price comparison websites (PCWs) has presented insurers with unique challenges in formulating effective pricing strategies. Operating on PCWs requires insurers to strike a delicate balance between competitive premiums and profitability, amidst obstacles such as low historical conversion rates, limited visibility of competitors' actions, and a dynamic market environment. In additio… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  8. arXiv:2306.11623  [pdf, ps, other

    stat.ML cs.LG math.ST

    Mean-field Analysis of Generalization Errors

    Authors: Gholamali Aminian, Samuel N. Cohen, Łukasz Szpruch

    Abstract: We propose a novel framework for exploring weak and $L_2$ generalization errors of algorithms through the lens of differential calculus on the space of probability measures. Specifically, we consider the KL-regularized empirical risk minimization problem and establish generic conditions under which the generalization error convergence rate, when training on a sample of size $n$, is… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 49 pages

    MSC Class: 62B10; 60F99; 49N80; 46N30

  9. arXiv:2211.11540  [pdf, other

    cs.CR

    A Framework for Auditable Synthetic Data Generation

    Authors: Florimond Houssiau, Samuel N. Cohen, Lukasz Szpruch, Owen Daniel, Michaela G. Lawrence, Robin Mitra, Henry Wilde, Callum Mole

    Abstract: Synthetic data has gained significant momentum thanks to sophisticated machine learning tools that enable the synthesis of high-dimensional datasets. However, many generation techniques do not give the data controller control over what statistical patterns are captured, leading to concerns over privacy protection. While synthetic records are not linked to a particular real-world individual, they c… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  10. arXiv:2211.06550  [pdf, other

    cs.CR cs.AI cs.LG

    TAPAS: a Toolbox for Adversarial Privacy Auditing of Synthetic Data

    Authors: Florimond Houssiau, James Jordon, Samuel N. Cohen, Owen Daniel, Andrew Elliott, James Geddes, Callum Mole, Camila Rangel-Smith, Lukasz Szpruch

    Abstract: Personal data collected at scale promises to improve decision-making and accelerate innovation. However, sharing and using such data raises serious privacy concerns. A promising solution is to produce synthetic data, artificial records to share instead of real data. Since synthetic records are not linked to real persons, this intuitively prevents classical re-identification attacks. However, this… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: Published at the SyntheticData4ML Neurips workshop

  11. arXiv:2208.04466  [pdf, ps, other

    cs.LG math.OC math.PR stat.ML

    Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning

    Authors: Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang

    Abstract: This work uses the entropy-regularised relaxed stochastic control perspective as a principled framework for designing reinforcement learning (RL) algorithms. Herein agent interacts with the environment by generating noisy controls distributed according to the optimal relaxed policy. The noisy policies on the one hand, explore the space and hence facilitate learning but, on the other hand, introduc… ▽ More

    Submitted 14 September, 2023; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: Accepted by SIAM Journal on Control and Optimization

    MSC Class: 62L05; 49N10; 93E35; 94A17

  12. arXiv:2205.03257  [pdf, other

    cs.LG

    Synthetic Data -- what, why and how?

    Authors: James Jordon, Lukasz Szpruch, Florimond Houssiau, Mirko Bottarelli, Giovanni Cherubin, Carsten Maple, Samuel N. Cohen, Adrian Weller

    Abstract: This explainer document aims to provide an overview of the current state of the rapidly expanding work on synthetic data technologies, with a particular focus on privacy. The article is intended for a non-technical audience, though some formal definitions have been given to provide clarity to specialists. This article is intended to enable the reader to quickly become familiar with the notion of s… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: Commissioned by the Royal Society. 57 pages 2 figures

  13. arXiv:2201.07296  [pdf, ps, other

    math.OC cs.AI cs.LG math.PR stat.ML

    Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime

    Authors: Bekzhan Kerimkulov, James-Michael Leahy, David Šiška, Lukasz Szpruch

    Abstract: We study the global convergence of policy gradient for infinite-horizon, continuous state and action space, and entropy-regularized Markov decision processes (MDPs). We consider a softmax policy with (one-hidden layer) neural network approximation in a mean-field regime. Additional entropic regularization in the associated mean-field probability measure is added, and the corresponding gradient flo… ▽ More

    Submitted 16 June, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

  14. arXiv:2112.10264  [pdf, ps, other

    cs.LG math.OC math.PR stat.ML

    Exploration-exploitation trade-off for continuous-time episodic reinforcement learning with linear-convex models

    Authors: Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang

    Abstract: We develop a probabilistic framework for analysing model-based reinforcement learning in the episodic setting. We then apply it to study finite-time horizon stochastic control problems with linear dynamics but unknown coefficients and convex, but possibly irregular, objective function. Using probabilistic representations, we study regularity of the associated cost functions and establish precise e… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

    MSC Class: 93E35; 62G35; 93E11; 68Q32

  15. arXiv:2111.01207  [pdf, other

    cs.LG

    Sig-Wasserstein GANs for Time Series Generation

    Authors: Hao Ni, Lukasz Szpruch, Marc Sabate-Vidales, Baoren Xiao, Magnus Wiese, Shujian Liao

    Abstract: Synthetic data is an emerging technology that can significantly accelerate the development and deployment of AI machine learning pipelines. In this work, we develop high-fidelity time-series generators, the SigWGAN, by combining continuous-time stochastic models with the newly proposed signature $W_1$ metric. The former are the Logsig-RNN models based on the stochastic differential equations, wher… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: This paper is accepted by the 2nd ACM International Conference on AI in Finance 2021

    MSC Class: 60L10 ACM Class: I.6; G.3

  16. arXiv:2106.03498  [pdf, other

    cs.LG math.OC

    Identifiability in inverse reinforcement learning

    Authors: Haoyang Cao, Samuel N. Cohen, Lukasz Szpruch

    Abstract: Inverse reinforcement learning attempts to reconstruct the reward function in a Markov decision problem, using observations of agent actions. As already observed in Russell [1998] the problem is ill-posed, and the reward function is not identifiable, even under the presence of perfect information about optimal behavior. We provide a resolution to this non-identifiability for problems with entropy… ▽ More

    Submitted 8 November, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    MSC Class: 2020: 49N45; 93B30; 93E12; 93B15; 49N10; 90C40; 60J10; 62M05

  17. arXiv:2007.04154  [pdf, other

    q-fin.MF cs.LG stat.ML

    Robust pricing and hedging via neural SDEs

    Authors: Patryk Gierjatowicz, Marc Sabate-Vidales, David Šiška, Lukasz Szpruch, Žan Žurič

    Abstract: Mathematical modelling is ubiquitous in the financial industry and drives key decision processes. Any given model provides only a crude approximation to reality and the risk of using an inadequate model is hard to detect and quantify. By contrast, modern data science techniques are opening the door to more robust and data-driven model selection mechanisms. However, most machine learning models are… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    MSC Class: 65C30; 60H35; 60H30

  18. arXiv:2006.06102  [pdf, other

    stat.ML cs.LG math.PR math.ST stat.CO

    Multi-index Antithetic Stochastic Gradient Algorithm

    Authors: Mateusz B. Majka, Marc Sabate-Vidales, Łukasz Szpruch

    Abstract: Stochastic Gradient Algorithms (SGAs) are ubiquitous in computational statistics, machine learning and optimisation. Recent years have brought an influx of interest in SGAs, and the non-asymptotic analysis of their bias is by now well-developed. However, relatively little is known about the optimal choice of the random approximation (e.g mini-batching) of the gradient in SGAs as this relies on the… ▽ More

    Submitted 30 September, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: 51 pages, 8 figures. Revised version: an improved introduction, a completely new numerical section including experiments in non-convex settings, a new appendix discussing the dependence of the variance of SGLD on the mini-batch size

  19. arXiv:2006.05956  [pdf, ps, other

    math.OC cs.LG math.PR

    Gradient Flows for Regularized Stochastic Control Problems

    Authors: David Šiška, Łukasz Szpruch

    Abstract: This paper studies stochastic control problems with the action space taken to be probability measures, with the objective penalised by the relative entropy. We identify suitable metric space on which we construct a gradient flow for the measure-valued control process, in the set of admissible controls, along which the cost functional is guaranteed to decrease. It is shown that any invariant measur… ▽ More

    Submitted 25 January, 2024; v1 submitted 10 June, 2020; originally announced June 2020.

    MSC Class: 93E20; 60H30; 37L40

  20. arXiv:2006.05421  [pdf, other

    cs.LG stat.ML

    Conditional Sig-Wasserstein GANs for Time Series Generation

    Authors: Shujian Liao, Hao Ni, Lukasz Szpruch, Magnus Wiese, Marc Sabate-Vidales, Baoren Xiao

    Abstract: Generative adversarial networks (GANs) have been extremely successful in generating samples, from seemingly high dimensional probability measures. However, these methods struggle to capture the temporal dependence of joint probability distributions induced by time-series data. Furthermore, long time-series data streams hugely increase the dimension of the target space, which may render generative… ▽ More

    Submitted 11 October, 2023; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: This paper has been accepted for Mathematical Finance Special Issue on Machine Learning in Finance

  21. arXiv:1912.00894  [pdf, other

    stat.ML cs.LG math.AP math.ST

    On the geometry of Stein variational gradient descent

    Authors: A. Duncan, N. Nuesken, L. Szpruch

    Abstract: Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely on iterated steepest descent steps with respect to a reproducing kernel Hilbert space norm. This construction leads to interacting particle systems, the mean-fi… ▽ More

    Submitted 12 February, 2023; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: 40 pages, 4 figures

  22. arXiv:1810.05094  [pdf, other

    q-fin.CP cs.LG math.NA

    Unbiased deep solvers for linear parametric PDEs

    Authors: Marc Sabate Vidales, David Siska, Lukasz Szpruch

    Abstract: We develop several deep learning algorithms for approximating families of parametric PDE solutions. The proposed algorithms approximate solutions together with their gradients, which in the context of mathematical finance means that the derivative prices and hedging strategies are computed simulatenously. Having approximated the gradient of the solution one can combine it with a Monte-Carlo simula… ▽ More

    Submitted 17 January, 2022; v1 submitted 11 October, 2018; originally announced October 2018.

    MSC Class: 65M75; 60H30; 91G60