Search | arXiv e-print repository

Optimal nonparametric estimation of the expected shortfall risk

Abstract: We address the problem of estimating the expected shortfall risk of a financial loss using a finite number of i.i.d. data. It is well known that the classical plug-in estimator suffers from poor statistical performance when faced with (heavy-tailed) distributions that are commonly used in financial contexts. Further, it lacks robustness, as the modification of even a single data point can cause a… ▽ More We address the problem of estimating the expected shortfall risk of a financial loss using a finite number of i.i.d. data. It is well known that the classical plug-in estimator suffers from poor statistical performance when faced with (heavy-tailed) distributions that are commonly used in financial contexts. Further, it lacks robustness, as the modification of even a single data point can cause a significant distortion. We propose a novel procedure for the estimation of the expected shortfall and prove that it recovers the best possible statistical properties (dictated by the central limit theorem) under minimal assumptions and for all finite numbers of data. Further, this estimator is adversarially robust: even if a (small) proportion of the data is maliciously modified, the procedure continuous to optimally estimate the true expected shortfall risk. We demonstrate that our estimator outperforms the classical plug-in estimator through a variety of numerical experiments across a range of standard loss distributions. △ Less

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2311.04041 [pdf, other]

Hilbert's projective metric for functions of bounded growth and exponential convergence of Sinkhorn's algorithm

Authors: Stephan Eckstein

Abstract: Motivated by the entropic optimal transport problem in unbounded settings, we study versions of Hilbert's projective metric for spaces of integrable functions of bounded growth. These versions of Hilbert's metric originate from cones which are relaxations of the cone of all non-negative functions, in the sense that they include all functions having non-negative integral values when multiplied with… ▽ More Motivated by the entropic optimal transport problem in unbounded settings, we study versions of Hilbert's projective metric for spaces of integrable functions of bounded growth. These versions of Hilbert's metric originate from cones which are relaxations of the cone of all non-negative functions, in the sense that they include all functions having non-negative integral values when multiplied with certain test functions. We show that kernel integral operators are contractions with respect to suitable specifications of such metrics even for kernels which are not bounded away from zero, provided that the decay to zero of the kernel is controlled. As an application to entropic optimal transport, we show exponential convergence of Sinkhorn's algorithm in settings where the marginal distributions have sufficiently light tails compared to the growth of the cost function. △ Less

Submitted 17 January, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

Comments: Changes in this version: Added Section 3.1 illustrating the construction of the cones used and adjusted Section 5 on Sinkhorn's algorithm

arXiv:2303.14085 [pdf, other]

Optimal transport and Wasserstein distances for causal models

Authors: Patrick Cheridito, Stephan Eckstein

Abstract: In this paper, we introduce a variant of optimal transport adapted to the causal structure given by an underlying directed graph $G$. Different graph structures lead to different specifications of the optimal transport problem. For instance, a fully connected graph yields standard optimal transport, a linear graph structure corresponds to causal optimal transport between the distributions of two d… ▽ More In this paper, we introduce a variant of optimal transport adapted to the causal structure given by an underlying directed graph $G$. Different graph structures lead to different specifications of the optimal transport problem. For instance, a fully connected graph yields standard optimal transport, a linear graph structure corresponds to causal optimal transport between the distributions of two discrete-time stochastic processes, and an empty graph leads to a notion of optimal transport related to CO-OT, Gromov-Wasserstein distances and factored OT. We derive different characterizations of $G$-causal transport plans and introduce Wasserstein distances between causal models that respect the underlying graph structure. We show that average treatment effects are continuous with respect to $G$-causal Wasserstein distances and small perturbations of structural causal models lead to small deviations in $G$-causal Wasserstein distance. We also introduce an interpolation between causal models based on $G$-causal Wasserstein distance and compare it to standard Wasserstein interpolation. △ Less

Submitted 4 July, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

arXiv:2212.00367 [pdf, other]

Stability and Sample Complexity of Divergence Regularized Optimal Transport

Authors: Erhan Bayraktar, Stephan Eckstein, Xin Zhang

Abstract: We study stability and sample complexity properties of divergence regularized optimal transport (DOT). First, we obtain quantitative stability results for optimizers of DOT measured in Wasserstein distance, which are applicable to a wide class of divergences and simultaneously improve known results for entropic optimal transport. Second, we study the case of sample complexity, where the DOT proble… ▽ More We study stability and sample complexity properties of divergence regularized optimal transport (DOT). First, we obtain quantitative stability results for optimizers of DOT measured in Wasserstein distance, which are applicable to a wide class of divergences and simultaneously improve known results for entropic optimal transport. Second, we study the case of sample complexity, where the DOT problem is approximated using empirical measures of the marginals. We show that divergence regularization can improve the corresponding convergence rate compared to unregularized optimal transport. To this end, we prove upper bounds which exploit both the regularity of cost function and divergence functional, as well as the intrinsic dimension of the marginals. Along the way, we establish regularity properties of dual optimizers of DOT, as well as general limit theorems for empirical measures with suitable classes of test functions. △ Less

Submitted 16 January, 2024; v1 submitted 1 December, 2022; originally announced December 2022.

MSC Class: 90C25; 49N05

arXiv:2208.14391 [pdf, ps, other]

Convergence Rates for Regularized Optimal Transport via Quantization

Authors: Stephan Eckstein, Marcel Nutz

Abstract: We study the convergence of divergence-regularized optimal transport as the regularization parameter vanishes. Sharp rates for general divergences including relative entropy or $L^{p}$ regularization, general transport costs and multi-marginal problems are obtained. A novel methodology using quantization and martingale couplings is suitable for non-compact marginals and achieves, in particular, th… ▽ More We study the convergence of divergence-regularized optimal transport as the regularization parameter vanishes. Sharp rates for general divergences including relative entropy or $L^{p}$ regularization, general transport costs and multi-marginal problems are obtained. A novel methodology using quantization and martingale couplings is suitable for non-compact marginals and achieves, in particular, the sharp leading-order term of entropically regularized 2-Wasserstein distance for all marginals with finite $(2+δ)$-moment. △ Less

Submitted 21 June, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

Comments: Forthcoming in 'Mathematics of Operations Research'

arXiv:2203.09347 [pdf, other]

Dimensionality Reduction and Wasserstein Stability for Kernel Regression

Authors: Stephan Eckstein, Armin Iske, Mathias Trabs

Abstract: In a high-dimensional regression framework, we study consequences of the naive two-step procedure where first the dimension of the input variables is reduced and second, the reduced input variables are used to predict the output variable with kernel regression. In order to analyze the resulting regression errors, a novel stability result for kernel regression with respect to the Wasserstein distan… ▽ More In a high-dimensional regression framework, we study consequences of the naive two-step procedure where first the dimension of the input variables is reduced and second, the reduced input variables are used to predict the output variable with kernel regression. In order to analyze the resulting regression errors, a novel stability result for kernel regression with respect to the Wasserstein distance is derived. This allows us to bound errors that occur when perturbed input data is used to fit the regression function. We apply the general stability result to principal component analysis (PCA). Exploiting known estimates from the literature on both principal component analysis and kernel regression, we deduce convergence rates for the two-step procedure. The latter turns out to be particularly useful in a semi-supervised setting. △ Less

Submitted 27 November, 2023; v1 submitted 17 March, 2022; originally announced March 2022.

Comments: Forthcoming in JMLR

arXiv:2203.05005 [pdf, other]

Computational methods for adapted optimal transport

Authors: Stephan Eckstein, Gudmund Pammer

Abstract: Adapted optimal transport (AOT) problems are optimal transport problems for distributions of a time series where couplings are constrained to have a temporal causal structure. In this paper, we develop computational tools for solving AOT problems numerically. First, we show that AOT problems are stable with respect to perturbations in the marginals and thus arbitrary AOT problems can be approximat… ▽ More Adapted optimal transport (AOT) problems are optimal transport problems for distributions of a time series where couplings are constrained to have a temporal causal structure. In this paper, we develop computational tools for solving AOT problems numerically. First, we show that AOT problems are stable with respect to perturbations in the marginals and thus arbitrary AOT problems can be approximated by sequences of linear programs. We further study entropic methods to solve AOT problems. We show that any entropically regularized AOT problem converges to the corresponding unregularized problem if the regularization parameter goes to zero. The proof is based on a novel method - even in the non-adapted case - to easily obtain smooth approximations of a given coupling with fixed marginals. Finally, we show tractability of the adapted version of Sinkhorn's algorithm. We give explicit solutions for the occurring projections and prove that the procedure converges to the optimizer of the entropic AOT problem. △ Less

Submitted 25 April, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

Comments: Forthcoming in Annals of Applied Probability

arXiv:2110.06798 [pdf, ps, other]

Quantitative Stability of Regularized Optimal Transport and Convergence of Sinkhorn's Algorithm

Authors: Stephan Eckstein, Marcel Nutz

Abstract: We study the stability of entropically regularized optimal transport with respect to the marginals. Lipschitz continuity of the value and Hölder continuity of the optimal coupling in $p$-Wasserstein distance are obtained under general conditions including quadratic costs and unbounded marginals. The results for the value extend to regularization by an arbitrary divergence. As an application, we sh… ▽ More We study the stability of entropically regularized optimal transport with respect to the marginals. Lipschitz continuity of the value and Hölder continuity of the optimal coupling in $p$-Wasserstein distance are obtained under general conditions including quadratic costs and unbounded marginals. The results for the value extend to regularization by an arbitrary divergence. As an application, we show convergence of Sinkhorn's algorithm in Wasserstein sense, including for quadratic cost. Two techniques are presented: The first compares an optimal coupling with its so-called shadow, a coupling induced on other marginals by an explicit construction. The second transforms one set of marginals by a change of coordinates and thus reduces the comparison of differing marginals to the comparison of differing cost functions under the same marginals. △ Less

Submitted 5 July, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

Comments: Forthcoming in 'SIAM Journal on Mathematical Analysis'

MSC Class: 90C25; 49N05

arXiv:2010.11502 [pdf, other]

MinMax Methods for Optimal Transport and Beyond: Regularization, Approximation and Numerics

Authors: Luca De Gennaro Aquino, Stephan Eckstein

Abstract: We study MinMax solution methods for a general class of optimization problems related to (and including) optimal transport. Theoretically, the focus is on fitting a large class of problems into a single MinMax framework and generalizing regularization techniques known from classical optimal transport. We show that regularization techniques justify the utilization of neural networks to solve such p… ▽ More We study MinMax solution methods for a general class of optimization problems related to (and including) optimal transport. Theoretically, the focus is on fitting a large class of problems into a single MinMax framework and generalizing regularization techniques known from classical optimal transport. We show that regularization techniques justify the utilization of neural networks to solve such problems by proving approximation theorems and illustrating fundamental issues if no regularization is used. We further study the relation to the literature on generative adversarial nets, and analyze which algorithmic techniques used therein are particularly suitable to the class of problems studied in this paper. Several numerical experiments showcase the generality of the setting and highlight which theoretical insights are most beneficial in practice. △ Less

Submitted 22 October, 2020; originally announced October 2020.

Comments: NeurIPS 2020

arXiv:2009.13881 [pdf, ps, other]

Lipschitz neural networks are dense in the set of all Lipschitz functions

Authors: Stephan Eckstein

Abstract: This note shows that, for a fixed Lipschitz constant $L > 0$, one layer neural networks that are $L$-Lipschitz are dense in the set of all $L$-Lipschitz functions with respect to the uniform norm on bounded sets. This note shows that, for a fixed Lipschitz constant $L > 0$, one layer neural networks that are $L$-Lipschitz are dense in the set of all $L$-Lipschitz functions with respect to the uniform norm on bounded sets. △ Less

Submitted 29 September, 2020; originally announced September 2020.

Comments: 7 pages

arXiv:2007.08815 [pdf, ps, other]

Limits of random walks with distributionally robust transition probabilities

Authors: Daniel Bartl, Stephan Eckstein, Michael Kupper

Abstract: We consider a nonlinear random walk which, in each time step, is free to choose its own transition probability within a neighborhood (w.r.t. Wasserstein distance) of the transition probability of a fixed Lévy process. In analogy to the classical framework we show that, when passing from discrete to continuous time via a scaling limit, this nonlinear random walk gives rise to a nonlinear semigroup.… ▽ More We consider a nonlinear random walk which, in each time step, is free to choose its own transition probability within a neighborhood (w.r.t. Wasserstein distance) of the transition probability of a fixed Lévy process. In analogy to the classical framework we show that, when passing from discrete to continuous time via a scaling limit, this nonlinear random walk gives rise to a nonlinear semigroup. We explicitly compute the generator of this semigroup and corresponding PDE as a perturbation of the generator of the initial Lévy process. △ Less

Submitted 27 April, 2021; v1 submitted 17 July, 2020; originally announced July 2020.

Comments: 14 pages, forthcoming in ECP

arXiv:1909.03870 [pdf, other]

Robust pricing and hedging of options on multiple assets and its numerics

Authors: Stephan Eckstein, Gaoyue Guo, Tongseok Lim, Jan Obloj

Abstract: We consider robust pricing and hedging for options written on multiple assets given market option prices for the individual assets. The resulting problem is called the multi-marginal martingale optimal transport problem. We propose two numerical methods to solve such problems: using discretisation and linear programming applied to the primal side and using penalisation and deep neural networks opt… ▽ More We consider robust pricing and hedging for options written on multiple assets given market option prices for the individual assets. The resulting problem is called the multi-marginal martingale optimal transport problem. We propose two numerical methods to solve such problems: using discretisation and linear programming applied to the primal side and using penalisation and deep neural networks optimisation applied to the dual side. We prove convergence for our methods and compare their numerical performance. We show how adding further information about call option prices at additional maturities can be incorporated and narrows down the no-arbitrage pricing bounds. Finally, we obtain structural results for the case of the payoff given by a weighted sum of covariances between the assets. △ Less

Submitted 7 October, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

Comments: Forthcoming in SIAM Journal on Financial Mathematics

arXiv:1908.10242 [pdf, ps, other]

doi 10.1080/14697688.2020.1787493

Martingale transport with homogeneous stock movements

Authors: Stephan Eckstein, Michael Kupper

Abstract: We study a variant of the martingale optimal transport problem in a multi-period setting to derive robust price bounds of a financial derivative. On top of marginal and martingale constraints, we introduce a time-homogeneity assumption, which restricts the variability of the forward-looking transitions of the martingale across time. We provide a dual formulation in terms of superhedging and discus… ▽ More We study a variant of the martingale optimal transport problem in a multi-period setting to derive robust price bounds of a financial derivative. On top of marginal and martingale constraints, we introduce a time-homogeneity assumption, which restricts the variability of the forward-looking transitions of the martingale across time. We provide a dual formulation in terms of superhedging and discuss relaxations of the time-homogeneity assumption by adding market frictions. In financial terms, the introduced time-homogeneity corresponds to a time-consistency condition for call prices, given the state of the stock. The time homogeneity assumption leads to improved price bounds as market data from many time points can be incorporated effectively. The approach is illustrated with two numerical examples. △ Less

Submitted 6 May, 2021; v1 submitted 27 August, 2019; originally announced August 2019.

Comments: 20 pages, 2 figures

Journal ref: Quantitative Finance (2021), 21:2, 271-280

arXiv:1811.00304 [pdf, other]

Robust risk aggregation with neural networks

Authors: Stephan Eckstein, Michael Kupper, Mathias Pohl

Abstract: We consider settings in which the distribution of a multivariate random variable is partly ambiguous. We assume the ambiguity lies on the level of the dependence structure, and that the marginal distributions are known. Furthermore, a current best guess for the distribution, called reference measure, is available. We work with the set of distributions that are both close to the given reference mea… ▽ More We consider settings in which the distribution of a multivariate random variable is partly ambiguous. We assume the ambiguity lies on the level of the dependence structure, and that the marginal distributions are known. Furthermore, a current best guess for the distribution, called reference measure, is available. We work with the set of distributions that are both close to the given reference measure in a transportation distance (e.g. the Wasserstein distance), and additionally have the correct marginal structure. The goal is to find upper and lower bounds for integrals of interest with respect to distributions in this set. The described problem appears naturally in the context of risk aggregation. When aggregating different risks, the marginal distributions of these risks are known and the task is to quantify their joint effect on a given system. This is typically done by applying a meaningful risk measure to the sum of the individual risks. For this purpose, the stochastic interdependencies between the risks need to be specified. In practice the models of this dependence structure are however subject to relatively high model ambiguity. The contribution of this paper is twofold: Firstly, we derive a dual representation of the considered problem and prove that strong duality holds. Secondly, we propose a generally applicable and computationally feasible method, which relies on neural networks, in order to numerically solve the derived dual problem. The latter method is tested on a number of toy examples, before it is finally applied to perform robust risk aggregation in a real world instance. △ Less

Submitted 26 May, 2020; v1 submitted 1 November, 2018; originally announced November 2018.

Comments: Revised version. Accepted for publication in "Mathematical Finance"

arXiv:1802.08539 [pdf, other]

Computation of optimal transport and related hedging problems via penalization and neural networks

Authors: Stephan Eckstein, Michael Kupper

Abstract: This paper presents a widely applicable approach to solving (multi-marginal, martingale) optimal transport and related problems via neural networks. The core idea is to penalize the optimization problem in its dual formulation and reduce it to a finite dimensional one which corresponds to optimizing a neural network with smooth objective function. We present numerical examples from optimal transpo… ▽ More This paper presents a widely applicable approach to solving (multi-marginal, martingale) optimal transport and related problems via neural networks. The core idea is to penalize the optimization problem in its dual formulation and reduce it to a finite dimensional one which corresponds to optimizing a neural network with smooth objective function. We present numerical examples from optimal transport, martingale optimal transport, portfolio optimization under uncertainty and generative adversarial networks that showcase the generality and effectiveness of the approach. △ Less

Submitted 25 January, 2019; v1 submitted 23 February, 2018; originally announced February 2018.

arXiv:1709.02278 [pdf, other]

doi 10.1017/apr.2019.6

Extended Laplace Principle for Empirical Measures of a Markov Chain

Authors: Stephan Eckstein

Abstract: We consider discrete time Markov chains with Polish state space. The large deviations principle for empirical measures of a Markov chain can equivalently be stated in Laplace principle form, which builds on the convex dual pair of relative entropy (or Kullback-Leibler divergence) and cumulant generating functional $f\mapsto \ln \int \exp(f)$. Following the approach by Lacker in the i.i.d. case, we… ▽ More We consider discrete time Markov chains with Polish state space. The large deviations principle for empirical measures of a Markov chain can equivalently be stated in Laplace principle form, which builds on the convex dual pair of relative entropy (or Kullback-Leibler divergence) and cumulant generating functional $f\mapsto \ln \int \exp(f)$. Following the approach by Lacker in the i.i.d. case, we generalize the Laplace principle to a greater class of convex dual pairs. We present in depth one application arising from this extension, which includes large deviations results and a weak law of large numbers for certain robust Markov chains - similar to Markov set chains - where we model robustness via the first Wasserstein distance. The setting and proof of the extended Laplace principle are based on the weak convergence approach to large deviations by Dupuis and Ellis. △ Less

Submitted 7 September, 2017; originally announced September 2017.

MSC Class: 60F10; 60J05

Journal ref: Adv. Appl. Probab. 51 (2019) 136-167

arXiv:1709.00641 [pdf, ps, other]

Marginal and dependence uncertainty: bounds, optimal transport, and sharpness

Authors: Daniel Bartl, Michael Kupper, Thibaut Lux, Antonis Papapantoleon, Stephan Eckstein

Abstract: Motivated by applications in model-free finance and quantitative risk management, we consider Fréchet classes of multivariate distribution functions where additional information on the joint distribution is assumed, while uncertainty in the marginals is also possible. We derive optimal transport duality results for these Fréchet classes that extend previous results in the related literature. These… ▽ More Motivated by applications in model-free finance and quantitative risk management, we consider Fréchet classes of multivariate distribution functions where additional information on the joint distribution is assumed, while uncertainty in the marginals is also possible. We derive optimal transport duality results for these Fréchet classes that extend previous results in the related literature. These proofs are based on representation results for increasing convex functionals and the explicit computation of the conjugates. We show that the dual transport problem admits an explicit solution for the function $f=1_B$, where $B$ is a rectangular subset of $\mathbb R^d$, and provide an intuitive geometric interpretation of this result. The improved Fréchet--Hoeffding bounds provide ad-hoc upper bounds for these Fréchet classes. We show that the improved Fréchet--Hoeffding bounds are pointwise sharp for these classes in the presence of uncertainty in the marginals, while a counterexample yields that they are not pointwise sharp in the absence of uncertainty in the marginals, even in dimension 2. The latter result sheds new light on the improved Fréchet--Hoeffding bounds, since Tankov [30] has showed that, under certain conditions, these bounds are sharp in dimension 2. △ Less

Submitted 17 August, 2018; v1 submitted 2 September, 2017; originally announced September 2017.

Comments: 24 pages, 4 figures. Revised version with new title

MSC Class: 60E15; 49N15; 28A35

arXiv:1610.06320 [pdf, other]

On the full space--time discretization of the generalized Stokes equations: The Dirichlet case

Authors: S. Eckstein, M. Ruzicka

Abstract: In this work we treat the space-time discretization of the generalized Stokes equations in the case of Dirichlet boundary conditions. We prove error estimates in the case $p\in[\frac{2d}{d+2},\infty)$ that are independent of the degeneracy parameter $δ\in[0,δ_0]$. For $p\leq 2$, our convergence rate is optimal. In this work we treat the space-time discretization of the generalized Stokes equations in the case of Dirichlet boundary conditions. We prove error estimates in the case $p\in[\frac{2d}{d+2},\infty)$ that are independent of the degeneracy parameter $δ\in[0,δ_0]$. For $p\leq 2$, our convergence rate is optimal. △ Less

Submitted 20 October, 2016; originally announced October 2016.

MSC Class: 65M15; 65M60; 76A05; 35Q35

Showing 1–18 of 18 results for author: Eckstein, S