Showing 1–16 of 16 results for author: Gazeau, M

  1. arXiv:2312.09187  [pdf, other]

    cs.LG

    Vision-Language Models as a Source of Rewards

    Authors: Kate Baumli, Satinder Baveja, Feryal Behbahani, Harris Chan, Gheorghe Comanici, Sebastian Flennerhag, Maxime Gazeau, Kristian Holsheimer, Dan Horgan, Michael Laskin, Clare Lyle, Hussain Masoom, Kay McKinney, Volodymyr Mnih, Alexander Neitz, Fabio Pardo, Jack Parker-Holder, John Quan, Tim Rocktäschel, Himanshu Sahni, Tom Schaul, Yannick Schroecker, Stephen Spencer, Richie Steigerwald, Luyu Wang, et al. (1 additional author not shown)

    Abstract: Building generalist agents that can accomplish many goals in rich open-ended environments is one of the research frontiers for reinforcement learning. A key limiting factor for building generalist agents with RL has been the need for a large number of reward functions for achieving different goals. We investigate the feasibility of using off-the-shelf vision-language models, or VLMs, as sources of…

    Submitted 21 February, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 10 pages, 5 figures

  2. arXiv:2210.14215  [pdf, other]

    cs.LG cs.AI

    In-context Reinforcement Learning with Algorithm Distillation

    Authors: Michael Laskin, Luyu Wang, Junhyuk Oh, Emilio Parisotto, Stephen Spencer, Richie Steigerwald, DJ Strouse, Steven Hansen, Angelos Filos, Ethan Brooks, Maxime Gazeau, Himanshu Sahni, Satinder Singh, Volodymyr Mnih

    Abstract: We propose Algorithm Distillation (AD), a method for distilling reinforcement learning (RL) algorithms into neural networks by modeling their training histories with a causal sequence model. Algorithm Distillation treats learning to reinforcement learn as an across-episode sequential prediction problem. A dataset of learning histories is generated by a source RL algorithm, and then a causal transf…

    Submitted 25 October, 2022; originally announced October 2022.

  3. arXiv:2206.04798  [pdf, other]

    cs.AI cs.LG

    A*Net: A Scalable Path-based Reasoning Approach for Knowledge Graphs

    Authors: Zhaocheng Zhu, Xinyu Yuan, Mikhail Galkin, Sophie Xhonneux, Ming Zhang, Maxime Gazeau, Jian Tang

    Abstract: Reasoning on large-scale knowledge graphs has been long dominated by embedding methods. While path-based methods possess the inductive capacity that embeddings lack, their scalability is limited by the exponential number of paths. Here we present A*Net, a scalable path-based method for knowledge graph reasoning. Inspired by the A* algorithm for shortest path problems, our A*Net learns a priority f…

    Submitted 8 November, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2023

  4. arXiv:2102.06229  [pdf, other]

    stat.ML cs.LG

    Higher Order Generalization Error for First Order Discretization of Langevin Diffusion

    Authors: Mufan Bill Li, Maxime Gazeau

    Abstract: We propose a novel approach to analyze generalization error for discretizations of Langevin diffusion, such as the stochastic gradient Langevin dynamics (SGLD). For an $ε$ tolerance of expected generalization error, it is known that a first order discretization can reach this target if we run $Ω(ε^{-1} \log (ε^{-1}) )$ iterations with $Ω(ε^{-1})$ samples. In this article, we show that with additio…

    Submitted 11 February, 2021; originally announced February 2021.

  5. arXiv:1902.08234  [pdf, other]

    cs.LG stat.ML

    An Empirical Study of Large-Batch Stochastic Gradient Descent with Structured Covariance Noise

    Authors: Yeming Wen, Kevin Luk, Maxime Gazeau, Guodong Zhang, Harris Chan, Jimmy Ba

    Abstract: The choice of batch-size in a stochastic optimization algorithm plays a substantial role for both optimization and generalization. Increasing the batch-size used typically improves optimization but degrades generalization. To address the problem of improving generalization while maintaining optimal convergence in large-batch training, we propose to add covariance noise to the gradients. We demonst…

    Submitted 28 February, 2020; v1 submitted 21 February, 2019; originally announced February 2019.

    Journal ref: The 23rd International Conference on Artificial Intelligence and Statistics, 2020

  6. arXiv:1810.13108  [pdf, other]

    cs.LG math.CA math.DS math.OC stat.ML

    A general system of differential equations to model first order adaptive algorithms

    Authors: André Belotto da Silva, Maxime Gazeau

    Abstract: First order optimization algorithms play a major role in large scale machine learning. A new class of methods, called adaptive algorithms, were recently introduced to adjust iteratively the learning rate for each coordinate. Despite great practical success in deep learning, their behavior and performance on more general loss functions are not well understood. In this paper, we derive a non-autonom…

    Submitted 30 September, 2019; v1 submitted 31 October, 2018; originally announced October 2018.

  7. arXiv:1807.02150  [pdf, other]

    cs.IR cs.LG stat.ML

    Scalable Recommender Systems through Recursive Evidence Chains

    Authors: Elias Tragas, Calvin Luo, Maxime Gazeau, Kevin Luk, David Duvenaud

    Abstract: Recommender systems can be formulated as a matrix completion problem, predicting ratings from user and item parameter vectors. Optimizing these parameters by subsampling data becomes difficult as the number of users and items grows. We develop a novel approach to generate all latent variables on demand from the ratings matrix itself and a fixed pool of parameters. We estimate missing ratings using…

    Submitted 5 July, 2018; originally announced July 2018.

  8. arXiv:1710.11260  [pdf, other]

    stat.ML

    Implicit Manifold Learning on Generative Adversarial Networks

    Authors: Kry Yik Chau Lui, Yanshuai Cao, Maxime Gazeau, Kelvin Shuangjian Zhang

    Abstract: This paper raises an implicit manifold learning perspective in Generative Adversarial Networks (GANs), by studying how the support of the learned distribution, modelled as a submanifold $\mathcal{M}_θ$, perfectly match with $\mathcal{M}_{r}$, the support of the real data distribution. We show that optimizing Jensen-Shannon divergence forces $\mathcal{M}_θ$ to perfectly match with…

    Submitted 30 October, 2017; originally announced October 2017.

    Journal ref: ICML 2017 Workshop on Implicit Models

  9. VUV-absorption cross section of carbon dioxide from 150 to 800 K and applications to warm exoplanetary atmospheres

    Authors: Olivia Venot, Yves Bénilan, Nicolas Fray, Marie-Claire Gazeau, Franck Lefèvre, Et-touhami Es-sebbar, Eric Hébrard, Martin Schwell, Chiheb Bahrini, Franck Montmessin, Maxence Lefèvre, Ingo P. Waldmann

    Abstract: Most exoplanets detected so far have atmospheric T significantly higher than 300K. Often close to their star, they receive an intense UV photons flux that triggers important photodissociation processes. The T dependency of VUV absorption cross sections are poorly known, leading to an undefined uncertainty in atmospheric models. Similarly, data measured at low T similar to that of the high atmosphe…

    Submitted 25 September, 2017; originally announced September 2017.

    Comments: 14 pages, 17 figures, accepted for publication in A&A

    Journal ref: A&A 609, A34 (2018)

  10. arXiv:1706.07417  [pdf, other]

    math.AP math-ph math.SP

    Bloch theory and spectral gaps for linearized water waves

    Authors: Walter Craig, Maxime Gazeau, Christophe Lacave, Catherine Sulem

    Abstract: The system of equations for water waves, when linearized about equilibrium of a fluid body with a varying bottom boundary, is described by a spectral problem for the Dirichlet -- Neumann operator of the unperturbed free surface. This spectral problem is fundamental in questions of stability, as well as to the perturbation theory of evolution of the free surface in such settings. In addition, the D…

    Submitted 27 February, 2018; v1 submitted 22 June, 2017; originally announced June 2017.

  11. Characterization of aromaticity in analogues of Titan's atmospheric aerosols with two-step laser desorption ionization mass spectrometry

    Authors: Ahmed Mahjoub, Martin Schwell, Nathalie Carrasco, Yves Benilan, Guy Cernogora, Cyril Szopa, Marie-Claire Gazeau

    Abstract: The role of polycyclic aromatic hydrocarbons (PAH) and Nitrogen containing PAH (PANH) as intermediates of aerosol production in the atmosphere of Titan has been a subject of controversy for a long time. An analysis of the atmospheric emission band observed by the Visible and Infrared Mapping Spectrometer (VIMS) at 3.28 micrometer suggests the presence of neutral polycyclic aromatic species in the…

    Submitted 19 May, 2016; originally announced May 2016.

  12. VUV-absorption cross section of CO2 at high temperatures and impact on exoplanet atmospheres

    Authors: Olivia Venot, Nicolas Fray, Yves Bénilan, Marie-Claire Gazeau, Eric Hébrard, Gwenaelle Larcher, Martin Schwell, Michel Dobrijevic, Franck Selsis

    Abstract: Ultraviolet (UV) absorption cross sections are an essential ingredient of photochemical atmosphere models. Exoplanet searches have unveiled a large population of short-period objects with hot atmospheres, very different from what we find in our solar system. Transiting exoplanets whose atmospheres can now be studied by transit spectroscopy receive extremely strong UV fluxes and have typical temper…

    Submitted 26 February, 2015; originally announced February 2015.

    Comments: 8 pages, 3 figures, BIO Web of Conferences, Vol. 2, EPOV 2012 : From Planets to Life - Colloquium of the CNRS Interdisciplinary Initiative Planetary Environments and Origins of Life (2014)

  13. Analysis and simulation of rare events for SPDE

    Authors: Charles-Edouard Bréhier, Maxime Gazeau, Ludovic Goudenège, Mathias Rousset

    Abstract: In this work, we consider the numerical estimation of the probability for a stochastic process to hit a set B before reaching another set A. This event is assumed to be rare. We consider reactive trajectories of the stochastic Allen-Cahn partial differential evolution equation (with double well potential) in dimension 1. Reactive trajectories are defined as the probability distribution of the traj…

    Submitted 7 January, 2014; originally announced January 2014.

    Journal ref: ESAIM: Proceedings and Surveys. January 2015, Vol. 48, p. 364-384

  14. arXiv:1308.1576  [pdf, ps, other]

    math.NA math.AP

    Strong order of convergence of a semidiscrete scheme for the stochastic Manakov equation

    Authors: Maxime Gazeau

    Abstract: It is well accepted by physicists that the Manakov PMD equation is a good model to describe the evolution of nonlinear electric fields in optical fibers with randomly varying birefringence. In the regime of the diffusion approximation theory, an effective asymptotic dynamics has recently been obtained to describe this evolution. This equation is called the stochastic Manakov equation. In this arti…

    Submitted 8 August, 2013; v1 submitted 7 August, 2013; originally announced August 2013.

  15. High-temperature measurements of VUV-absorption cross sections of CO2 and their application to exoplanets

    Authors: Olivia Venot, Nicolas Fray, Yves Bénilan, Marie-Claire Gazeau, Eric Hébrard, Gwenaelle Larcher, Martin Schwell, Michel Dobrijevic, Franck Selsis

    Abstract: UV absorption cross sections are an essential ingredient of photochemical atmosphere models. Exoplanet searches have unveiled a large population of short-period objects with hot atmospheres, very different from what we find in our solar system. Transiting exoplanets whose atmospheres can now be studied by transit spectroscopy receive extremely strong UV fluxes and have typical temperatures ranging…

    Submitted 12 February, 2013; v1 submitted 11 February, 2013; originally announced February 2013.

    Comments: 9 pages, 12 figures

  16. arXiv:1105.4048  [pdf, ps, other]

    math.AP math.PR

    A diffusion approximation theorem for a nonlinear PDE with application to random birefringent optical fibers

    Authors: A. de Bouard, M. Gazeau

    Abstract: In this article we propose a generalization of the theory of diffusion approximation for random ODE to a nonlinear system of random Schrödinger equations. This system arises in the study of pulse propagation in randomly birefringent optical fibers. We first show existence and uniqueness of solutions for the random PDE and the limiting equation. We follow the work of Garnier and Marty [Wave Motion…

    Submitted 13 December, 2012; v1 submitted 20 May, 2011; originally announced May 2011.

    Comments: Published at http://dx.doi.org/10.1214/11-AAP839 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AAP-AAP839

    Journal ref: Annals of Applied Probability 2012, Vol. 22, No. 6, 2460-2504