Showing 1–18 of 18 results for author: Altché, F

  1. arXiv:2211.10515  [pdf, other]

    stat.ML cs.LG

    Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments

    Authors: Daniel Jarrett, Corentin Tallec, Florent Altché, Thomas Mesnard, Rémi Munos, Michal Valko

    Abstract: Consider the problem of exploration in sparse-reward or reward-free environments, such as in Montezuma's Revenge. In the curiosity-driven paradigm, the agent is rewarded for how much each realized outcome differs from their predicted outcome. But using predictive error as intrinsic motivation is fragile in stochastic environments, as the agent may become trapped by high-entropy areas of the state-…

    Submitted 14 July, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

    Journal ref: In Proc. 40th International Conference on Machine Learning (ICML 2023)

  2. arXiv:2211.04236  [pdf, other]

    cs.CL cs.LG

    Self-conditioned Embedding Diffusion for Text Generation

    Authors: Robin Strudel, Corentin Tallec, Florent Altché, Yilun Du, Yaroslav Ganin, Arthur Mensch, Will Grathwohl, Nikolay Savinov, Sander Dieleman, Laurent Sifre, Rémi Leblond

    Abstract: Can continuous diffusion models bring the same performance breakthrough on natural language they did for image generation? To circumvent the discrete nature of text data, we can simply project tokens in a continuous space of embeddings, as is standard in language modeling. We propose Self-conditioned Embedding Diffusion, a continuous diffusion mechanism that operates on token embeddings and allows…

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: 15 pages

  3. arXiv:2206.08332  [pdf, other]

    cs.LG cs.AI stat.ML

    BYOL-Explore: Exploration by Bootstrapped Prediction

    Authors: Zhaohan Daniel Guo, Shantanu Thakoor, Miruna Pîslar, Bernardo Avila Pires, Florent Altché, Corentin Tallec, Alaa Saade, Daniele Calandriello, Jean-Bastien Grill, Yunhao Tang, Michal Valko, Rémi Munos, Mohammad Gheshlaghi Azar, Bilal Piot

    Abstract: We present BYOL-Explore, a conceptually simple yet general approach for curiosity-driven exploration in visually-complex environments. BYOL-Explore learns a world representation, the world dynamics, and an exploration policy all-together by optimizing a single prediction loss in the latent space with no additional auxiliary objective. We show that BYOL-Explore is effective in DM-HARD-8, a challeng…

    Submitted 16 June, 2022; originally announced June 2022.

  4. arXiv:2103.16559  [pdf, other]

    cs.CV

    Broaden Your Views for Self-Supervised Video Learning

    Authors: Adrià Recasens, Pauline Luc, Jean-Baptiste Alayrac, Luyu Wang, Ross Hemsley, Florian Strub, Corentin Tallec, Mateusz Malinowski, Viorica Patraucean, Florent Altché, Michal Valko, Jean-Bastien Grill, Aäron van den Oord, Andrew Zisserman

    Abstract: Most successful self-supervised learning methods are trained to align the representations of two independent views from the data. State-of-the-art methods in video are inspired by image techniques, where these two views are similarly extracted by cropping and augmenting the resulting crop. However, these methods miss a crucial element in the video domain: time. We introduce BraVe, a self-supervise…

    Submitted 19 October, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: This paper is an extended version of our ICCV-21 paper. It includes more results as well as a minor architectural variation which improves results

  5. arXiv:2010.10241  [pdf, ps, other]

    stat.ML cs.CV cs.LG

    BYOL works even without batch statistics

    Authors: Pierre H. Richemond, Jean-Bastien Grill, Florent Altché, Corentin Tallec, Florian Strub, Andrew Brock, Samuel Smith, Soham De, Razvan Pascanu, Bilal Piot, Michal Valko

    Abstract: Bootstrap Your Own Latent (BYOL) is a self-supervised learning approach for image representation. From an augmented view of an image, BYOL trains an online network to predict a target network representation of a different augmented view of the same image. Unlike contrastive methods, BYOL does not explicitly use a repulsion term built from negative pairs in its training objective. Yet, it avoids co…

    Submitted 20 October, 2020; originally announced October 2020.

  6. arXiv:2007.12509  [pdf, other]

    cs.LG stat.ML

    Monte-Carlo Tree Search as Regularized Policy Optimization

    Authors: Jean-Bastien Grill, Florent Altché, Yunhao Tang, Thomas Hubert, Michal Valko, Ioannis Antonoglou, Rémi Munos

    Abstract: The combination of Monte-Carlo tree search (MCTS) with deep reinforcement learning has led to significant advances in artificial intelligence. However, AlphaZero, the current state-of-the-art MCTS algorithm, still relies on handcrafted heuristics that are only partially understood. In this paper, we show that AlphaZero's search heuristics, along with other common ones such as UCT, are an approxima…

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: Accepted to International Conference on Machine Learning (ICML), 2020

  7. arXiv:2006.07733  [pdf, other]

    cs.LG cs.CV stat.ML

    Bootstrap your own latent: A new approach to self-supervised Learning

    Authors: Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu, Rémi Munos, Michal Valko

    Abstract: We introduce Bootstrap Your Own Latent (BYOL), a new approach to self-supervised image representation learning. BYOL relies on two neural networks, referred to as online and target networks, that interact and learn from each other. From an augmented view of an image, we train the online network to predict the target network representation of the same image under a different augmented view. At the…

    Submitted 10 September, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

  8. arXiv:2004.14646  [pdf, other]

    cs.LG cs.AI

    Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning

    Authors: Daniel Guo, Bernardo Avila Pires, Bilal Piot, Jean-Bastien Grill, Florent Altché, Rémi Munos, Mohammad Gheshlaghi Azar

    Abstract: Learning a good representation is an essential component for deep reinforcement learning (RL). Representation learning is especially important in multitask and partially observable settings where building a representation of the unknown environment is crucial to solve the tasks. Here we introduce Prediction of Bootstrap Latents (PBL), a simple and flexible self-supervised representation learning a…

    Submitted 30 April, 2020; originally announced April 2020.

  9. arXiv:1902.07685  [pdf, other]

    cs.AI stat.AP stat.ML

    World Discovery Models

    Authors: Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo Avila Pires, Jean-Bastien Grill, Florent Altché, Rémi Munos

    Abstract: As humans we are driven by a strong desire for seeking novelty in our world. Also upon observing a novel pattern we are capable of refining our understanding of the world based on the new information---humans can discover their world. The outstanding ability of the human mind for discovery has led to many breakthroughs in science, art and technology. Here we investigate the possibility of building…

    Submitted 1 March, 2019; v1 submitted 20 February, 2019; originally announced February 2019.

  10. arXiv:1810.09365  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Coupled Longitudinal and Lateral Control of a Vehicle using Deep Learning

    Authors: Guillaume Devineau, Philip Polack, Florent Altché, Fabien Moutarde

    Abstract: This paper explores the capability of deep neural networks to capture key characteristics of vehicle dynamics, and their ability to perform coupled longitudinal and lateral control of a vehicle. To this extent, two different artificial neural networks are trained to compute vehicle controls corresponding to a reference trajectory, using a dataset based on high-fidelity simulations of vehicle dynam…

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: Published in the IEEE 2018 International Conference on Intelligent Transportation Systems (ITSC 2018)

  11. arXiv:1804.08290  [pdf, other]

    math.OC

    Guaranteeing Consistency in a Motion Planning and Control Architecture Using a Kinematic Bicycle Model

    Authors: Philip Polack, Florent Altché, Brigitte d'Andréa-Novel, Arnaud de La Fortelle

    Abstract: This paper proposes to combine a 10Hz motion planner based on a kinematic bicycle Model Predictive Control (MPC) and a 100Hz closed-loop Proportional-Integral-Derivative (PID) controller to cope with normal driving situations. Its novelty consists in ensuring the feasibility of the computed trajectory by the motion planner through a limitation of the steering angle depending on the speed. This ens…

    Submitted 23 April, 2018; originally announced April 2018.

    Comments: 7 pages, 9 figures, American Control Conference 2018, Milwaukee, WI (accepted)

  12. arXiv:1801.07962  [pdf, other]

    cs.RO cs.LG

    An LSTM Network for Highway Trajectory Prediction

    Authors: Florent Altché, Arnaud de La Fortelle

    Abstract: In order to drive safely and efficiently on public roads, autonomous vehicles will have to understand the intentions of surrounding vehicles, and adapt their own behavior accordingly. If experienced human drivers are generally good at inferring other vehicles' motion up to a few seconds in the future, most current Advanced Driving Assistance Systems (ADAS) are unable to perform such medium-term fo…

    Submitted 24 January, 2018; originally announced January 2018.

    Comments: Presented at IEEE ITSC 2017

  13. arXiv:1801.07961  [pdf, other]

    eess.SY

    Partitioning of the Free Space-Time for On-Road Navigation of Autonomous Ground Vehicles

    Authors: Florent Altché, Arnaud de La Fortelle

    Abstract: In this article, we consider the problem of trajectory planning and control for on-road driving of an autonomous ground vehicle (AGV) in presence of static or moving obstacles. We propose a systematic approach to partition the collision-free portion of the space-time into convex sub-regions that can be interpreted in terms of relative positions with respect to a set of fixed or mobile obstacles. W…

    Submitted 24 January, 2018; originally announced January 2018.

    Comments: Presented at IEEE CDC2017

  14. arXiv:1706.08046  [pdf, other]

    cs.MA cs.RO

    An Algorithm for Supervised Driving of Cooperative Semi-Autonomous Vehicles (Extended)

    Authors: Florent Altché, Xiangjun Qian, Arnaud de La Fortelle

    Abstract: Before reaching full autonomy, vehicles will gradually be equipped with more and more advanced driver assistance systems (ADAS), effectively rendering them semi-autonomous. However, current ADAS technologies seem unable to handle complex traffic situations, notably when dealing with vehicles arriving from the sides, either at intersections or when merging on highways. The high rate of accidents in…

    Submitted 25 June, 2017; originally announced June 2017.

  15. arXiv:1704.01003  [pdf, other]

    eess.SY

    High-Speed Trajectory Planning for Autonomous Vehicles Using a Simple Dynamic Model

    Authors: Florent Altché, Philip Polack, Arnaud de La Fortelle

    Abstract: To improve safety and energy efficiency, autonomous vehicles are expected to drive smoothly in most situations, while maintaining their velocity below a predetermined speed limit. However, some scenarios such as low road adherence or inadequate speed limit may require vehicles to automatically adapt their velocity without external input, while nearing the limits of their dynamic capacities. Many o…

    Submitted 4 April, 2017; originally announced April 2017.

  16. arXiv:1703.01225  [pdf, other]

    eess.SY

    A Simple Dynamic Model for Aggressive, Near-Limits Trajectory Planning

    Authors: Florent Altché, Philip Polack, Arnaud de La Fortelle

    Abstract: In normal on-road situations, autonomous vehicles will be expected to have smooth trajectories with relatively little demand on the vehicle dynamics to ensure passenger comfort and driving safety. However, the occurrence of unexpected events may require vehicles to perform aggressive maneuvers, near the limits of their dynamic capacities. In order to ensure the occupant's safety in these situation…

    Submitted 23 June, 2017; v1 submitted 3 March, 2017; originally announced March 2017.

    Comments: Published in the IEEE IV 2017 conference

  17. arXiv:1605.00026  [pdf, other]

    cs.RO

    A Distributed Model Predictive Control Framework for Road-Following Formation Control of Car-like Vehicles (Extended Version)

    Authors: Xiangjun Qian, Florent Altché, Arnaud de La Fortelle, Fabien Moutarde

    Abstract: This work presents a novel framework for the formation control of multiple autonomous ground vehicles in an on-road environment. Unique challenges of this problem lie in 1) the design of collision avoidance strategies with obstacles and with other vehicles in a highly structured environment, 2) dynamic reconfiguration of the formation to handle different task specifications. In this paper, we desi…

    Submitted 29 April, 2016; originally announced May 2016.

    Comments: Extended version of the conference paper submission on ICARCV'16

  18. Time-optimal Coordination of Mobile Robots along Specified Paths

    Authors: Florent Altché, Xiangjun Qian, Arnaud de La Fortelle

    Abstract: In this paper, we address the problem of time-optimal coordination of mobile robots under kinodynamic constraints along specified paths. We propose a novel approach based on time discretization that leads to a mixed-integer linear programming (MILP) formulation. This problem can be solved using general-purpose MILP solvers in a reasonable time, resulting in a resolution-optimal solution. Moreover,…

    Submitted 5 April, 2017; v1 submitted 15 March, 2016; originally announced March 2016.

    Comments: Published in 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)