Skip to main content

Showing 1–6 of 6 results for author: Seurin, M

.
  1. arXiv:2105.09992  [pdf, other

    cs.LG

    Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness

    Authors: Mathieu Seurin, Florian Strub, Philippe Preux, Olivier Pietquin

    Abstract: Sparse rewards are double-edged training signals in reinforcement learning: easy to design but hard to optimize. Intrinsic motivation guidances have thus been developed toward alleviating the resulting exploration problem. They usually incentivize agents to look for new states through novelty signals. Yet, such methods encourage exhaustive exploration of the state space rather than focusing on the… ▽ More

    Submitted 31 May, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: Accepted at Internationnal Joint Conference on Artificial Intelligence (IJCAI'21) and Self-Supervision for Reinforcement Learning Workshop (SSL-RL @ICLR'21)

  2. arXiv:2008.03127  [pdf, other

    eess.AS cs.LG cs.SD

    A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning

    Authors: Mathieu Seurin, Florian Strub, Philippe Preux, Olivier Pietquin

    Abstract: Speaker recognition is a well known and studied task in the speech processing domain. It has many applications, either for security or speaker adaptation of personal devices. In this paper, we present a new paradigm for automatic speaker recognition that we call Interactive Speaker Recognition (ISR). In this paradigm, the recognition system aims to incrementally build a representation of the speak… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

  3. arXiv:1910.09451  [pdf, other

    cs.LG cs.CL stat.ML

    HIGhER : Improving instruction following with Hindsight Generation for Experience Replay

    Authors: Geoffrey Cideron, Mathieu Seurin, Florian Strub, Olivier Pietquin

    Abstract: Language creates a compact representation of the world and allows the description of unlimited situations and objectives through compositionality. While these characterizations may foster instructing, conditioning or structuring interactive agent behavior, it remains an open-problem to correctly relate language understanding and reinforcement learning in even simple instruction following scenarios… ▽ More

    Submitted 10 December, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: Accepted at ADPRL'20

  4. arXiv:1910.02078  [pdf, other

    cs.LG stat.ML

    I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action

    Authors: Mathieu Seurin, Philippe Preux, Olivier Pietquin

    Abstract: The use of Reinforcement Learning (RL) is still restricted to simulation or to enhance human-operated systems through recommendations. Real-world environments (e.g. industrial robots or power grids) are generally designed with safety constraints in mind implemented in the shape of valid actions masks or contingency controllers. For example, the range of motion and the angles of the motors of a rob… ▽ More

    Submitted 13 August, 2020; v1 submitted 4 October, 2019; originally announced October 2019.

    Comments: Accepted at Internationnal Joint Conference on Neural Networks (IJCNN'2020)

  5. arXiv:1808.04446  [pdf, other

    cs.CV cs.CL cs.LG stat.ML

    Visual Reasoning with Multi-hop Feature Modulation

    Authors: Florian Strub, Mathieu Seurin, Ethan Perez, Harm de Vries, Jérémie Mary, Philippe Preux, Aaron Courville, Olivier Pietquin

    Abstract: Recent breakthroughs in computer vision and natural language processing have spurred interest in challenging multi-modal tasks such as visual question-answering and visual dialogue. For such tasks, one successful approach is to condition image-based convolutional network computation on language via Feature-wise Linear Modulation (FiLM) layers, i.e., per-channel scaling and shifting. We propose to… ▽ More

    Submitted 12 October, 2018; v1 submitted 3 August, 2018; originally announced August 2018.

    Comments: In Proc of ECCV 2018

  6. arXiv:1709.05185  [pdf, other

    cs.AI cs.CV cs.RO

    Unsupervised state representation learning with robotic priors: a robustness benchmark

    Authors: Timothée Lesort, Mathieu Seurin, Xinrui Li, Natalia Díaz-Rodríguez, David Filliat

    Abstract: Our understanding of the world depends highly on our capacity to produce intuitive and simplified representations which can be easily used to solve problems. We reproduce this simplification process using a neural network to build a low dimensional state representation of the world from images acquired by a robot. As in Jonschkowski et al. 2015, we learn in an unsupervised way using prior knowledg… ▽ More

    Submitted 15 September, 2017; originally announced September 2017.

    Comments: ICRA 2018 submission