Skip to main content

Showing 1–4 of 4 results for author: van der Heiden, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2203.03355  [pdf, other

    cs.AI cs.LG cs.MA

    Reliably Re-Acting to Partner's Actions with the Social Intrinsic Motivation of Transfer Empowerment

    Authors: Tessa van der Heiden, Herke van Hoof, Efstratios Gavves, Christoph Salge

    Abstract: We consider multi-agent reinforcement learning (MARL) for cooperative communication and coordination tasks. MARL agents can be brittle because they can overfit their training partners' policies. This overfitting can produce agents that adopt policies that act under the expectation that other agents will act in a certain way rather than react to their actions. Our objective is to bias the learning… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:2012.08255

  2. arXiv:2012.08255  [pdf, other

    cs.MA

    Robust Multi-Agent Reinforcement Learning with Social Empowerment for Coordination and Communication

    Authors: T. van der Heiden, C. Salge, E. Gavves, H. van Hoof

    Abstract: We consider the problem of robust multi-agent reinforcement learning (MARL) for cooperative communication and coordination tasks. MARL agents, mainly those trained in a centralized way, can be brittle because they can adopt policies that act under the expectation that other agents will act a certain way rather than react to their actions. Our objective is to bias the learning process towards findi… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

  3. arXiv:2003.08158  [pdf, other

    cs.MA cs.AI cs.LG

    Social Navigation with Human Empowerment driven Deep Reinforcement Learning

    Authors: Tessa van der Heiden, Florian Mirus, Herke van Hoof

    Abstract: Mobile robot navigation has seen extensive research in the last decades. The aspect of collaboration with robots and humans sharing workspaces will become increasingly important in the future. Therefore, the next generation of mobile robots needs to be socially-compliant to be accepted by their human collaborators. However, a formal definition of compliance is not straightforward. On the other han… ▽ More

    Submitted 5 August, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

  4. arXiv:1910.06673  [pdf, other

    cs.LG cs.CV stat.ML

    SafeCritic: Collision-Aware Trajectory Prediction

    Authors: Tessa van der Heiden, Naveen Shankar Nagaraja, Christian Weiss, Efstratios Gavves

    Abstract: Navigating complex urban environments safely is a key to realize fully autonomous systems. Predicting future locations of vulnerable road users, such as pedestrians and cyclists, thus, has received a lot of attention in the recent years. While previous works have addressed modeling interactions with the static (obstacles) and dynamic (humans) environment agents, we address an important gap in traj… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: To Appear as workshop paper for the British Machine Vision Conference (BMVC) 2019