Showing 1–6 of 6 results for author: Vuorio, R

  1. arXiv:2211.02667  [pdf, other]

    cs.LG stat.ML

    Deconfounded Imitation Learning

    Authors: Risto Vuorio, Johann Brehmer, Hanno Ackermann, Daniel Dijkman, Taco Cohen, Pim de Haan

    Abstract: Standard imitation learning can fail when the expert demonstrators have different sensory inputs than the imitating agent. This is because partial observability gives rise to hidden confounders in the causal graph. We break down the space of confounded imitation learning problems and identify three settings with different data requirements in which the correct imitation policy can be identified. W…

    Submitted 4 November, 2022; originally announced November 2022.

  2. arXiv:2112.00478  [pdf, other]

    cs.LG cs.AI stat.ML

    On the Practical Consistency of Meta-Reinforcement Learning Algorithms

    Authors: Zheng Xiong, Luisa Zintgraf, Jacob Beck, Risto Vuorio, Shimon Whiteson

    Abstract: Consistency is the theoretical property of a meta learning algorithm that ensures that, under certain assumptions, it can adapt to any task at test time. An open question is whether and how theoretical consistency translates into practice, in comparison to inconsistent algorithms. In this paper, we empirically investigate this question on a set of representative meta-RL algorithms. We find that th…

    Submitted 1 December, 2021; originally announced December 2021.

  3. arXiv:1911.11260  [pdf, other]

    cs.LG cs.AI stat.ML

    Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem

    Authors: John Holler, Risto Vuorio, Zhiwei Qin, Xiaocheng Tang, Yan Jiao, Tiancheng Jin, Satinder Singh, Chenxi Wang, Jieping Ye

    Abstract: Order dispatching and driver repositioning (also known as fleet management) in the face of spatially and temporally varying supply and demand are central to a ride-sharing platform marketplace. Hand-crafting heuristic solutions that account for the dynamics in these resource allocation problems is difficult, and may be better handled by an end-to-end machine learning method. Previous works have ex…

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: ICDM 2019 Short Paper

  4. arXiv:1910.13616  [pdf, other]

    cs.LG cs.AI stat.ML

    Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation

    Authors: Risto Vuorio, Shao-Hua Sun, Hexiang Hu, Joseph J. Lim

    Abstract: Model-agnostic meta-learners aim to acquire meta-learned parameters from similar tasks to adapt to novel tasks from the same distribution with few gradient updates. With the flexibility in the choice of models, those frameworks demonstrate appealing performance on a variety of domains such as few-shot image classification and reinforcement learning. However, one important limitation of such framew…

    Submitted 29 October, 2019; originally announced October 2019.

  5. arXiv:1812.07172  [pdf, other]

    cs.LG cs.AI stat.ML

    Toward Multimodal Model-Agnostic Meta-Learning

    Authors: Risto Vuorio, Shao-Hua Sun, Hexiang Hu, Joseph J. Lim

    Abstract: Gradient-based meta-learners such as MAML are able to learn a meta-prior from similar tasks to adapt to novel tasks from the same distribution with few gradient updates. One important limitation of such frameworks is that they seek a common initialization shared across the entire task distribution, substantially limiting the diversity of the task distributions that they are able to learn from. In…

    Submitted 18 December, 2018; originally announced December 2018.

  6. arXiv:1806.06928  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Meta Continual Learning

    Authors: Risto Vuorio, Dong-Yeon Cho, Daejoong Kim, Jiwon Kim

    Abstract: Using neural networks in practical settings would benefit from the ability of the networks to learn new tasks throughout their lifetimes without forgetting the previous tasks. This ability is limited in the current deep neural networks by a problem called catastrophic forgetting, where training on new tasks tends to severely degrade performance on previous tasks. One way to lessen the impact of th…

    Submitted 11 June, 2018; originally announced June 2018.