Skip to main content

Showing 1–11 of 11 results for author: Vadori, N

.
  1. arXiv:2306.05366  [pdf, other

    cs.GT cs.LG

    Ordinal Potential-based Player Rating

    Authors: Nelson Vadori, Rahul Savani

    Abstract: It was recently observed that Elo ratings fail at preserving transitive relations among strategies and therefore cannot correctly extract the transitive component of a game. We provide a characterization of transitive games as a weak variant of ordinal potential games and show that Elo ratings actually do preserve transitivity when computed in the right space, using suitable invertible map**s. L… ▽ More

    Submitted 6 March, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

  2. arXiv:2210.07184  [pdf, other

    cs.MA cs.AI cs.GT q-fin.CP

    Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations

    Authors: Nelson Vadori, Leo Ardon, Sumitra Ganesh, Thomas Spooner, Selim Amrouni, Jared Vann, Mengda Xu, Zeyu Zheng, Tucker Balch, Manuela Veloso

    Abstract: We study a game between liquidity provider and liquidity taker agents interacting in an over-the-counter market, for which the typical example is foreign exchange. We show how a suitable design of parameterized families of reward functions coupled with shared policy learning constitutes an efficient solution to this problem. By playing against each other, our deep-reinforcement-learning-driven age… ▽ More

    Submitted 1 August, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

  3. arXiv:2203.06865  [pdf, other

    q-fin.CP cs.AI cs.LG q-fin.MF

    Calibration of Derivative Pricing Models: a Multi-Agent Reinforcement Learning Perspective

    Authors: Nelson Vadori

    Abstract: One of the most fundamental questions in quantitative finance is the existence of continuous-time diffusion models that fit market prices of a given set of options. Traditionally, one employs a mix of intuition, theoretical and empirical analysis to find models that achieve exact or approximate fits. Our contribution is to show how a suitable game theoretical formulation of this problem can help s… ▽ More

    Submitted 6 October, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

  4. arXiv:2110.06829  [pdf, other

    cs.MA cs.AI cs.LG q-fin.TR

    Towards a fully RL-based Market Simulator

    Authors: Leo Ardon, Nelson Vadori, Thomas Spooner, Mengda Xu, Jared Vann, Sumitra Ganesh

    Abstract: We present a new financial framework where two families of RL-based agents representing the Liquidity Providers and Liquidity Takers learn simultaneously to satisfy their objective. Thanks to a parametrized reward formulation and the use of Deep RL, each group learns a shared policy able to generalize and interpolate over a wide range of behaviors. This is a step towards a fully RL-based market si… ▽ More

    Submitted 8 November, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

    Journal ref: ACM International Conference on AI in Finance, 2021

  5. arXiv:2106.02615  [pdf, other

    cs.GT cs.LG

    Consensus Multiplicative Weights Update: Learning to Learn using Projector-based Game Signatures

    Authors: Nelson Vadori, Rahul Savani, Thomas Spooner, Sumitra Ganesh

    Abstract: Cheung and Piliouras (2020) recently showed that two variants of the Multiplicative Weights Update method - OMWU and MWU - display opposite convergence properties depending on whether the game is zero-sum or cooperative. Inspired by this work and the recent literature on learning to optimize for single functions, we introduce a new framework for learning last-iterate convergence to Nash Equilibria… ▽ More

    Submitted 11 June, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: ICML 2022, the 39th International Conference on Machine Learning

  6. arXiv:2102.10362  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs

    Authors: Thomas Spooner, Nelson Vadori, Sumitra Ganesh

    Abstract: Policy gradient methods can solve complex tasks but often fail when the dimensionality of the action-space or objective multiplicity grow very large. This occurs, in part, because the variance on score-based gradient estimators scales quadratically. In this paper, we address this problem through a factor baseline which exploits independence structure encoded in a novel action-target influence netw… ▽ More

    Submitted 23 November, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021; 19 pages, 19 figures, 1 table

  7. arXiv:2006.13085  [pdf, other

    cs.MA cs.LG

    Calibration of Shared Equilibria in General Sum Partially Observable Markov Games

    Authors: Nelson Vadori, Sumitra Ganesh, Prashant Reddy, Manuela Veloso

    Abstract: Training multi-agent systems (MAS) to achieve realistic equilibria gives us a useful tool to understand and model real-world systems. We consider a general sum partially observable Markov game where agents of different types share a single policy network, conditioned on agent-specific information. This paper aims at i) formally understanding equilibria reached by such agents, and ii) matching emer… ▽ More

    Submitted 23 October, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020, Thirty-fourth Conference on Neural Information Processing Systems

  8. arXiv:2006.12686  [pdf, other

    cs.LG q-fin.RM stat.ML

    Risk-Sensitive Reinforcement Learning: a Martingale Approach to Reward Uncertainty

    Authors: Nelson Vadori, Sumitra Ganesh, Prashant Reddy, Manuela Veloso

    Abstract: We introduce a novel framework to account for sensitivity to rewards uncertainty in sequential decision-making problems. While risk-sensitive formulations for Markov decision processes studied so far focus on the distribution of the cumulative reward as a whole, we aim at learning policies sensitive to the uncertain/stochastic nature of the rewards, which has the advantage of being conceptually mo… ▽ More

    Submitted 15 September, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Published at ICAIF 2020: ACM International Conference on AI in Finance

  9. arXiv:1911.05892  [pdf, other

    q-fin.TR cs.LG cs.MA

    Reinforcement Learning for Market Making in a Multi-agent Dealer Market

    Authors: Sumitra Ganesh, Nelson Vadori, Mengda Xu, Hua Zheng, Prashant Reddy, Manuela Veloso

    Abstract: Market makers play an important role in providing liquidity to markets by continuously quoting prices at which they are willing to buy and sell, and managing inventory risk. In this paper, we build a multi-agent simulation of a dealer market and demonstrate that it can be used to understand the behavior of a reinforcement learning (RL) based market maker agent. We use the simulator to train an RL-… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

  10. arXiv:1601.01710  [pdf, other

    q-fin.MF q-fin.TR

    A Semi-Markovian Modeling of Limit Order Markets

    Authors: Anatoliy Swishchuk, Nelson Vadori

    Abstract: R. Cont and A. de Larrard (SIAM J. Finan. Math, 2013) introduced a tractable stochastic model for the dynamics of a limit order book, computing various quantities of interest such as the probability of a price increase or the diffusion limit of the price process. As suggested by empirical observations, we extend their framework to 1) arbitrary distributions for book events inter-arrival times (pos… ▽ More

    Submitted 7 January, 2016; originally announced January 2016.

    Comments: 27 pages, 1 figure, 13 tables

    MSC Class: 60K15; 60K20; 90B22; 91B24; 91B70

  11. arXiv:1304.4169  [pdf, ps, other

    math.PR

    Law of Large Numbers for Semi-Markov inhomogeneous Random Evolutions on Banach spaces

    Authors: Nelson Vadori, Anatoliy Swishchuk

    Abstract: Using backward propagators, we construct inhomogeneous Random Evolutions on Banach spaces driven by (uniformly ergodic) Semi-Markov processes. After studying some of their properties (measurability, continuity, integral representation), we establish a Law of Large Numbers for such inhomogeneous Random Evolutions, and more precisely their weak convergence - in the Skorohod space $D$ - to an inhomog… ▽ More

    Submitted 27 May, 2013; v1 submitted 15 April, 2013; originally announced April 2013.

    Comments: v2: typos removed. Remark 4.14 corrected. Intro slightly changed v3: typos removed. Intro slightly changed