Skip to main content

Showing 1–6 of 6 results for author: Shiarlis, K

.
  1. arXiv:2309.14003  [pdf, other

    cs.LG cs.RO

    Hierarchical Imitation Learning for Stochastic Environments

    Authors: Maximilian Igl, Punit Shah, Paul Mougin, Sirish Srinivasan, Tarun Gupta, Brandyn White, Kyriacos Shiarlis, Shimon Whiteson

    Abstract: Many applications of imitation learning require the agent to generate the full distribution of behaviour observed in the training data. For example, to evaluate the safety of autonomous vehicles in simulation, accurate and diverse behaviour models of other road users are paramount. Existing methods that improve this distributional realism typically rely on hierarchical policies. These condition th… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: Published at IROS'23

  2. arXiv:2205.03195  [pdf, other

    cs.LG cs.RO

    Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation

    Authors: Maximilian Igl, Daewoo Kim, Alex Kuefler, Paul Mougin, Punit Shah, Kyriacos Shiarlis, Dragomir Anguelov, Mark Palatucci, Brandyn White, Shimon Whiteson

    Abstract: Simulation is a crucial tool for accelerating the development of autonomous vehicles. Making simulation realistic requires models of the human road users who interact with such cars. Such models can be obtained by applying learning from demonstration (LfD) to trajectories observed by cars already on the road. However, existing LfD methods are typically insufficient, yielding policies that frequent… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: Accepted to ICRA-2022

  3. arXiv:1910.08348  [pdf, other

    cs.LG stat.ML

    VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

    Authors: Luisa Zintgraf, Kyriacos Shiarlis, Maximilian Igl, Sebastian Schulze, Yarin Gal, Katja Hofmann, Shimon Whiteson

    Abstract: Trading off exploration and exploitation in an unknown environment is key to maximising expected return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but on the agent's uncertainty about the environment. Computing a Bayes-optimal policy is however intractable for all but the smallest tasks. In this paper, we introduce var… ▽ More

    Submitted 27 February, 2020; v1 submitted 18 October, 2019; originally announced October 2019.

    Comments: Published at ICLR 2020

  4. arXiv:1811.03516  [pdf, other

    cs.LG stat.ML

    Learning from Demonstration in the Wild

    Authors: Feryal Behbahani, Kyriacos Shiarlis, Xi Chen, Vitaly Kurin, Sudhanshu Kasewa, Ciprian Stirbu, João Gomes, Supratik Paul, Frans A. Oliehoek, João Messias, Shimon Whiteson

    Abstract: Learning from demonstration (LfD) is useful in settings where hand-coding behaviour or a reward function is impractical. It has succeeded in a wide range of problems but typically relies on manually generated demonstrations or specially deployed sensors and has not generally been able to leverage the copious demonstrations available in the wild: those that capture behaviours that were occurring an… ▽ More

    Submitted 25 March, 2019; v1 submitted 8 November, 2018; originally announced November 2018.

    Comments: Accepted to the IEEE International Conference on Robotics and Automation (ICRA) 2019; extended version with appendix

  5. arXiv:1810.03642  [pdf, other

    cs.LG stat.ML

    Fast Context Adaptation via Meta-Learning

    Authors: Luisa M Zintgraf, Kyriacos Shiarlis, Vitaly Kurin, Katja Hofmann, Shimon Whiteson

    Abstract: We propose CAVIA for meta-learning, a simple extension to MAML that is less prone to meta-overfitting, easier to parallelise, and more interpretable. CAVIA partitions the model parameters into two parts: context parameters that serve as additional input to the model and are adapted on individual tasks, and shared parameters that are meta-trained and shared across tasks. At test time, only the cont… ▽ More

    Submitted 10 June, 2019; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: Published at the International Conference on Machine Learning (ICML) 2019

  6. arXiv:1803.01840  [pdf, other

    cs.LG stat.ML

    TACO: Learning Task Decomposition via Temporal Alignment for Control

    Authors: Kyriacos Shiarlis, Markus Wulfmeier, Sasha Salter, Shimon Whiteson, Ingmar Posner

    Abstract: Many advanced Learning from Demonstration (LfD) methods consider the decomposition of complex, real-world tasks into simpler sub-tasks. By reusing the corresponding sub-policies within and between tasks, they provide training data for each policy from different high-level tasks and compose them to perform novel ones. Existing approaches to modular LfD focus either on learning a single high-level t… ▽ More

    Submitted 10 August, 2018; v1 submitted 2 March, 2018; originally announced March 2018.

    Comments: 12 Pages. Published at ICML 2018