Showing 1–1 of 1 results for author: Thoma, V

  1. arXiv:2406.01575  [pdf, other]

    math.OC cs.AI cs.LG stat.ML

    Stochastic Bilevel Optimization with Lower-Level Contextual Markov Decision Processes

    Authors: Vinzenz Thoma, Barna Pasztor, Andreas Krause, Giorgia Ramponi, Yifan Hu

    Abstract: In various applications, the optimal policy in a strategic decision-making problem depends both on the environmental configuration and exogenous events. For these settings, we introduce Bilevel Optimization with Contextual Markov Decision Processes (BO-CMDP), a stochastic bilevel decision-making model, where the lower level consists of solving a contextual Markov Decision Process (CMDP). BO-CMDP c…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 54 pages, 18 Figures