Skip to main content

Showing 1–10 of 10 results for author: Delage, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.04670  [pdf, other

    cs.LG

    End-to-end Conditional Robust Optimization

    Authors: Abhilash Chenreddy, Erick Delage

    Abstract: The field of Contextual Optimization (CO) integrates machine learning and optimization to solve decision making problems under uncertainty. Recently, a risk sensitive variant of CO, known as Conditional Robust Optimization (CRO), combines uncertainty quantification with robust optimization in order to promote safety and reliability in high stake applications. Exploiting modern differentiable optim… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  2. arXiv:2306.10374  [pdf, ps, other

    math.OC cs.LG

    A Survey of Contextual Optimization Methods for Decision Making under Uncertainty

    Authors: Utsav Sadana, Abhilash Chenreddy, Erick Delage, Alexandre Forel, Emma Fre**ger, Thibaut Vidal

    Abstract: Recently there has been a surge of interest in operations research (OR) and the machine learning (ML) community in combining prediction algorithms and optimization techniques to solve decision-making problems in the face of uncertainty. This gave rise to the field of contextual optimization, under which data-driven procedures are developed to prescribe actions to the decision-maker that make the b… ▽ More

    Submitted 2 February, 2024; v1 submitted 17 June, 2023; originally announced June 2023.

  3. arXiv:2306.05937  [pdf, other

    math.OC cs.LG stat.ME

    Robust Data-driven Prescriptiveness Optimization

    Authors: Mehran Poursoltani, Erick Delage, Angelos Georghiou

    Abstract: The abundance of data has led to the emergence of a variety of optimization techniques that attempt to leverage available side information to provide more anticipative decisions. The wide range of methods and contexts of application have motivated the design of a universal unitless measure of performance known as the coefficient of prescriptiveness. This coefficient was designed to quantify both t… ▽ More

    Submitted 3 June, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

  4. arXiv:2304.12477  [pdf, other

    math.OC cs.AI

    On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes

    Authors: Jia Lin Hau, Erick Delage, Mohammad Ghavamzadeh, Marek Petrik

    Abstract: Optimizing static risk-averse objectives in Markov decision processes is difficult because they do not admit standard dynamic programming equations common in Reinforcement Learning (RL) algorithms. Dynamic programming decompositions that augment the state space with discrete risk levels have recently gained popularity in the RL community. Prior work has shown that these decompositions are optimal… ▽ More

    Submitted 23 April, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

    Journal ref: Advances in Neural Information Processing Systems (Neurips), 2023

  5. arXiv:2210.15837  [pdf, other

    cs.LG cs.GT cs.IR math.OC

    Risk-Aware Bid Optimization for Online Display Advertisement

    Authors: Rui Fan, Erick Delage

    Abstract: This research focuses on the bid optimization problem in the real-time bidding setting for online display advertisements, where an advertiser, or the advertiser's agent, has access to the features of the website visitor and the type of ad slots, to decide the optimal bid prices given a predetermined total advertisement budget. We propose a risk-aware data-driven bid optimization model that maximiz… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: Accepted for CIKM '22

  6. arXiv:2109.07005  [pdf, other

    q-fin.PM cs.LG

    WaveCorr: Correlation-savvy Deep Reinforcement Learning for Portfolio Management

    Authors: Saeed Marzban, Erick Delage, Jonathan Yumeng Li, Jeremie Desgagne-Bouchard, Carl Dussault

    Abstract: The problem of portfolio management represents an important and challenging class of dynamic decision making problems, where rebalancing decisions need to be made over time with the consideration of many factors such as investors preferences, trading environments, and market conditions. In this paper, we present a new portfolio policy network architecture for deep reinforcement learning (DRL)that… ▽ More

    Submitted 28 September, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

  7. arXiv:2109.04001  [pdf, other

    q-fin.PR cs.LG

    Deep Reinforcement Learning for Equal Risk Pricing and Hedging under Dynamic Expectile Risk Measures

    Authors: Saeed Marzban, Erick Delage, Jonathan Yumeng Li

    Abstract: Recently equal risk pricing, a framework for fair derivative pricing, was extended to consider dynamic risk measures. However, all current implementations either employ a static risk measure that violates time consistency, or are based on traditional dynamic programming solution schemes that are impracticable in problems with a large number of underlying assets (due to the curse of dimensionality)… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

  8. arXiv:2105.08877  [pdf, ps, other

    cs.AI

    Deep Reinforcement Learning for Optimal Stop** with Application in Financial Engineering

    Authors: Abderrahim Fathan, Erick Delage

    Abstract: Optimal stop** is the problem of deciding the right time at which to take a particular action in a stochastic system, in order to maximize an expected reward. It has many applications in areas such as finance, healthcare, and statistics. In this paper, we employ deep Reinforcement Learning (RL) to learn optimal stop** policies in two financial engineering applications: namely option pricing, a… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

  9. arXiv:2010.05373  [pdf, other

    stat.ML cs.LG math.ST

    Distributionally Robust Local Non-parametric Conditional Estimation

    Authors: Viet Anh Nguyen, Fan Zhang, Jose Blanchet, Erick Delage, Yinyu Ye

    Abstract: Conditional estimation given specific covariate values (i.e., local conditional estimation or functional estimation) is ubiquitously useful with applications in engineering, social and natural sciences. Existing data-driven non-parametric estimators mostly focus on structured homogeneous data (e.g., weakly independent and stationary data), thus they are sensitive to adversarial noise and may perfo… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

  10. arXiv:2003.07915  [pdf, ps, other

    cs.GT math.OC

    The value of randomized strategies in distributionally robust risk averse network interdiction games

    Authors: Utsav Sadana, Erick Delage

    Abstract: Conditional Value at Risk (CVaR) is widely used to account for the preferences of a risk-averse agent in the extreme loss scenarios. To study the effectiveness of randomization in interdiction games with an interdictor that is both risk and ambiguity averse, we introduce a distributionally robust network interdiction game where the interdictor randomizes over the feasible interdiction plans in ord… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.