Skip to main content

Showing 1–6 of 6 results for author: Jaques, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2010.00581  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Emergent Social Learning via Multi-agent Reinforcement Learning

    Authors: Kamal Ndousse, Douglas Eck, Sergey Levine, Natasha Jaques

    Abstract: Social learning is a key component of human and animal intelligence. By taking cues from the behavior of experts in their environment, social learners can acquire sophisticated behavior and rapidly adapt to new circumstances. This paper investigates whether independent reinforcement learning (RL) agents in a multi-agent environment can learn to use social learning to improve their performance. We… ▽ More

    Submitted 22 June, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: 14 pages, 19 figures. To be published in ICML 2021

  2. arXiv:1909.07547  [pdf, other

    cs.LG cs.AI stat.ML

    Hierarchical Reinforcement Learning for Open-Domain Dialog

    Authors: Abdelrhman Saleh, Natasha Jaques, Asma Ghandeharioun, Judy Hanwen Shen, Rosalind Picard

    Abstract: Open-domain dialog generation is a challenging problem; maximum likelihood training can lead to repetitive outputs, models have difficulty tracking long-term conversational goals, and training on standard movie or online datasets may lead to the generation of inappropriate, biased, or offensive text. Reinforcement Learning (RL) is a powerful framework that could potentially address these issues, f… ▽ More

    Submitted 31 December, 2019; v1 submitted 16 September, 2019; originally announced September 2019.

  3. arXiv:1907.00456  [pdf, other

    cs.LG cs.AI stat.ML

    Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog

    Authors: Natasha Jaques, Asma Ghandeharioun, Judy Hanwen Shen, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard

    Abstract: Most deep reinforcement learning (RL) systems are not able to learn effectively from off-policy data, especially if they cannot explore online in the environment. These are critical shortcomings for applying RL to real-world problems where collecting data is expensive, and models must be tested offline before being deployed to interact with the environment -- e.g. systems that learn from human int… ▽ More

    Submitted 8 July, 2019; v1 submitted 30 June, 2019; originally announced July 2019.

  4. arXiv:1906.09308  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems

    Authors: Asma Ghandeharioun, Judy Hanwen Shen, Natasha Jaques, Craig Ferguson, Noah Jones, Agata Lapedriza, Rosalind Picard

    Abstract: Building an open-domain conversational agent is a challenging problem. Current evaluation methods, mostly post-hoc judgments of static conversation, do not capture conversation quality in a realistic interactive context. In this paper, we investigate interactive human evaluation and provide evidence for its necessity; we then introduce a novel, model-agnostic, and dataset-agnostic method to approx… ▽ More

    Submitted 3 November, 2019; v1 submitted 21 June, 2019; originally announced June 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  5. arXiv:1906.05433  [pdf, other

    cs.CY cs.AI cs.LG stat.ML

    Tackling Climate Change with Machine Learning

    Authors: David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio

    Abstract: Climate change is one of the greatest challenges facing humanity, and we, as machine learning experts, may wonder how we can help. Here we describe how machine learning can be a powerful tool in reducing greenhouse gas emissions and hel** society adapt to a changing climate. From smart grids to disaster management, we identify high impact problems where existing gaps can be filled by machine lea… ▽ More

    Submitted 5 November, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: For additional resources, please visit the website that accompanies this paper: https://www.climatechange.ai/

  6. arXiv:1810.08647  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning

    Authors: Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Caglar Gulcehre, Pedro A. Ortega, DJ Strouse, Joel Z. Leibo, Nando de Freitas

    Abstract: We propose a unified mechanism for achieving coordination and communication in Multi-Agent Reinforcement Learning (MARL), through rewarding agents for having causal influence over other agents' actions. Causal influence is assessed using counterfactual reasoning. At each timestep, an agent simulates alternate actions that it could have taken, and computes their effect on the behavior of other agen… ▽ More

    Submitted 18 June, 2019; v1 submitted 19 October, 2018; originally announced October 2018.