Skip to main content

Showing 1–4 of 4 results for author: Kamoutsi, A

Searching in archive math. Search in all archives.
.
  1. arXiv:2405.15509  [pdf, other

    math.OC cs.LG

    Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces

    Authors: Angeliki Kamoutsi, Peter Schmitt-Förster, Tobias Sutter, Volkan Cevher, John Lygeros

    Abstract: This work studies discrete-time discounted Markov decision processes with continuous state and action spaces and addresses the inverse problem of inferring a cost function from observed optimal behavior. We first consider the case in which we have access to the entire expert policy and characterize the set of solutions to the inverse problem by using occupation measures, linear duality, and comple… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 29 pages, 4 figures

  2. arXiv:2201.00039  [pdf, ps, other

    cs.LG math.OC

    Stochastic convex optimization for provably efficient apprenticeship learning

    Authors: Angeliki Kamoutsi, Goran Banjac, John Lygeros

    Abstract: We consider large-scale Markov decision processes (MDPs) with an unknown cost function and employ stochastic convex optimization tools to address the problem of imitation learning, which consists of learning a policy from a finite set of expert demonstrations. We adopt the apprenticeship learning formalism, which carries the assumption that the true cost function can be represented as a linear c… ▽ More

    Submitted 31 December, 2021; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:2112.14004

    Journal ref: Optimization Foundations for Reinforcement Learning Workshop at NeurIPS 2019, Vancouver, Canada

  3. arXiv:2112.14004  [pdf, ps, other

    cs.LG cs.AI math.OC

    Efficient Performance Bounds for Primal-Dual Reinforcement Learning from Demonstrations

    Authors: Angeliki Kamoutsi, Goran Banjac, John Lygeros

    Abstract: We consider large-scale Markov decision processes with an unknown cost function and address the problem of learning a policy from a finite set of expert demonstrations. We assume that the learner is not allowed to interact with the expert and has no access to reinforcement signal of any kind. Existing inverse reinforcement learning methods come with strong theoretical guarantees, but are computati… ▽ More

    Submitted 28 December, 2021; originally announced December 2021.

    Journal ref: International Conference of Machine Learning (ICML) 2021

  4. On Infinite Linear Programming and the Moment Approach to Deterministic Infinite Horizon Discounted Optimal Control Problems

    Authors: Angeliki Kamoutsi, Tobias Sutter, Peyman Mohajerin Esfahani, John Lygeros

    Abstract: We revisit the linear programming approach to deterministic, continuous time, infinite horizon discounted optimal control problems. In the first part, we relax the original problem to an infinite-dimensional linear program over a measure space and prove equivalence of the two formulations under mild assumptions, significantly weaker than those found in the literature until now. The proof is based… ▽ More

    Submitted 7 June, 2017; v1 submitted 27 March, 2017; originally announced March 2017.

    Comments: 7 pages, 1 figure

    MSC Class: 49L20; 49M20; 90C22; 90C48