Showing 1–50 of 57 results for author: Clune, J

  1. arXiv:2405.15568

    cs.AI

    OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code

    Authors: Maxence Faldor, Jenny Zhang, Antoine Cully, Jeff Clune

    Abstract: Open-ended and AI-generating algorithms aim to continuously generate and solve increasingly complex tasks indefinitely, offering a promising path toward more general intelligence. To accomplish this grand vision, learning must occur within a vast array of potential tasks. Existing approaches to automatically generating environments are constrained within manually predefined, often narrow distribut…

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2405.15143

    cs.LG cs.AI cs.CL

    Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

    Authors: Cong Lu, Shengran Hu, Jeff Clune

    Abstract: Go-Explore is a powerful family of algorithms designed to solve hard-exploration problems, built on the principle of archiving discovered states, and iteratively returning to and exploring from the most promising states. This approach has led to superhuman performance across a wide variety of challenging problems including Atari games and robotic control, but requires manually designing heuristics…

    Submitted 30 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2404.10179

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi, et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio…

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  4. arXiv:2402.15391

    cs.LG cs.AI cs.CV

    Genie: Generative Interactive Environments

    Authors: Jake Bruce, Michael Dennis, Ashley Edwards, Jack Parker-Holder, Yuge Shi, Edward Hughes, Matthew Lai, Aditi Mavalankar, Richie Steigerwald, Chris Apps, Yusuf Aytar, Sarah Bechtle, Feryal Behbahani, Stephanie Chan, Nicolas Heess, Lucy Gonzalez, Simon Osindero, Sherjil Ozair, Scott Reed, Jingwei Zhang, Konrad Zolna, Jeff Clune, Nando de Freitas, Satinder Singh, Tim Rocktäschel

    Abstract: We introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual worlds described through text, synthetic images, photographs, and even sketches. At 11B parameters, Genie can be considered a foundation world model. It is comprised of a spatiotem…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: https://sites.google.com/corp/view/genie-2024/

  5. arXiv:2310.17688

    cs.CY cs.AI cs.CL cs.LG

    Managing extreme AI risks amid rapid progress

    Authors: Yoshua Bengio, Geoffrey Hinton, Andrew Yao, Dawn Song, Pieter Abbeel, Trevor Darrell, Yuval Noah Harari, Ya-Qin Zhang, Lan Xue, Shai Shalev-Shwartz, Gillian Hadfield, Jeff Clune, Tegan Maharaj, Frank Hutter, Atılım Güneş Baydin, Sheila McIlraith, Qiqi Gao, Ashwin Acharya, David Krueger, Anca Dragan, Philip Torr, Stuart Russell, Daniel Kahneman, Jan Brauner, Sören Mindermann

    Abstract: Artificial Intelligence (AI) is progressing rapidly, and companies are shifting their focus to developing generalist AI systems that can autonomously act and pursue goals. Increases in capabilities and autonomy may soon massively amplify AI's impact, with risks that include large-scale social harms, malicious uses, and an irreversible loss of human control over autonomous AI systems. Although rese…

    Submitted 22 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Published in Science: https://www.science.org/doi/10.1126/science.adn0117

  6. arXiv:2310.13032

    cs.CL cs.AI cs.LG cs.NE

    Quality-Diversity through AI Feedback

    Authors: Herbie Bradley, Andrew Dai, Hannah Teufel, Jenny Zhang, Koen Oostermeijer, Marco Bellagente, Jeff Clune, Kenneth Stanley, Grégory Schott, Joel Lehman

    Abstract: In many text-generation problems, users may prefer not only a single response, but a diverse range of high-quality outputs from which to choose. Quality-diversity (QD) search algorithms aim at such outcomes, by continually improving and diversifying a population of candidates. However, the applicability of QD to qualitative domains, like creative writing, has been limited by the difficulty of algo…

    Submitted 7 December, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: minor additions to supplementary results

  7. arXiv:2310.12103

    cs.AI cs.NE

    Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization

    Authors: Li Ding, Jenny Zhang, Jeff Clune, Lee Spector, Joel Lehman

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has shown potential in qualitative tasks where easily defined performance measures are lacking. However, there are drawbacks when RLHF is commonly used to optimize for average human preferences, especially in generative tasks that demand diverse model responses. Meanwhile, Quality Diversity (QD) algorithms excel at identifying diverse and high-qual…

    Submitted 4 June, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: ICML 2024

  8. arXiv:2310.07996

    cs.LG cs.CV cs.NE

    Reset It and Forget It: Relearning Last-Layer Weights Improves Continual and Transfer Learning

    Authors: Lapo Frati, Neil Traft, Jeff Clune, Nick Cheney

    Abstract: This work identifies a simple pre-training mechanism that leads to representations exhibiting better continual and transfer learning. This mechanism -- the repeated resetting of weights in the last layer, which we nickname "zapping" -- was originally designed for a meta-continual-learning procedure, yet we show it is surprisingly applicable in many settings beyond both meta-learning and continual…

    Submitted 11 October, 2023; originally announced October 2023.

  9. arXiv:2307.02276

    cs.LG cs.AI

    First-Explore, then Exploit: Meta-Learning Intelligent Exploration

    Authors: Ben Norman, Jeff Clune

    Abstract: Standard reinforcement learning (RL) agents never intelligently explore like a human (i.e. by taking into account complex domain priors and previous explorations). Even the most basic intelligent exploration strategies such as exhaustive search are only inefficiently or poorly approximated by approaches such as novelty search or intrinsic motivation, let alone more complicated strategies like lear…

    Submitted 5 July, 2023; originally announced July 2023.

  10. arXiv:2306.01711

    cs.AI cs.LG

    OMNI: Open-endedness via Models of human Notions of Interestingness

    Authors: Jenny Zhang, Joel Lehman, Kenneth Stanley, Jeff Clune

    Abstract: Open-ended algorithms aim to learn new, interesting behaviors forever. That requires a vast environment search space, but there are thus infinitely many possible tasks. Even after filtering for tasks the current agent can learn (i.e., learning progress), countless learnable yet uninteresting tasks remain (e.g., minor variations of previously learned tasks). An Achilles Heel of open-endedness resea…

    Submitted 14 February, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 47 pages, 33 figures

  11. arXiv:2306.00323

    cs.AI cs.LG

    Thought Cloning: Learning to Think while Acting by Imitating Human Thinking

    Authors: Shengran Hu, Jeff Clune

    Abstract: Language is often considered a key aspect of human thinking, providing us with exceptional abilities to generalize, explore, plan, replan, and adapt to new situations. However, Reinforcement Learning (RL) agents are far from human-level performance in any of these abilities. We hypothesize one reason for such cognitive deficiencies is that they lack the benefits of thinking in language and that we…

    Submitted 17 January, 2024; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023 as a spotlight

  12. arXiv:2206.11795

    cs.LG cs.AI

    Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

    Authors: Bowen Baker, Ilge Akkaya, Peter Zhokhov, Joost Huizinga, Jie Tang, Adrien Ecoffet, Brandon Houghton, Raul Sampedro, Jeff Clune

    Abstract: Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for training models with broad, general capabilities for text, images, and other modalities. However, for many sequential decision domains such as robotics, video games, and computer use, publicly available data does not contain the labels required to train behavioral priors in the same way. We extend the interne…

    Submitted 23 June, 2022; originally announced June 2022.

  13. arXiv:2108.12056

    cs.LG cs.AI cs.CV

    Continual learning under domain transfer with sparse synaptic bursting

    Authors: Shawn L. Beaulieu, Jeff Clune, Nick Cheney

    Abstract: Existing machines are functionally specific tools that were made for easy prediction and control. Tomorrow's machines may be closer to biological systems in their mutability, resilience, and autonomy. But first they must be capable of learning and retaining new information without being exposed to it arbitrarily often. Past efforts to engineer such systems have sought to build or regulate artifici…

    Submitted 16 January, 2024; v1 submitted 26 August, 2021; originally announced August 2021.

  14. arXiv:2106.14876

    cs.LG stat.ML

    Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft

    Authors: Ingmar Kanitscheider, Joost Huizinga, David Farhi, William Hebgen Guss, Brandon Houghton, Raul Sampedro, Peter Zhokhov, Bowen Baker, Adrien Ecoffet, Jie Tang, Oleg Klimov, Jeff Clune

    Abstract: An important challenge in reinforcement learning is training agents that can solve a wide variety of tasks. If tasks depend on each other (e.g. needing to learn to walk before learning to run), curriculum learning can speed up learning by focusing on the next best task to learn. We explore curriculum learning in a complex, visual domain with many hard exploration challenges: Minecraft. We find tha…

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: first submission

  15. Program Equivalence for Assisted Grading of Functional Programs (Extended Version)

    Authors: Joshua Clune, Vijay Ramamurthy, Ruben Martins, Umut A. Acar

    Abstract: In courses that involve programming assignments, giving meaningful feedback to students is an important challenge. Human beings can give useful feedback by manually grading the programs but this is a time-consuming, labor intensive, and usually boring process. Automatic graders can be fast and scale well but they usually provide poor feedback. Although there has been research on improving automati…

    Submitted 15 October, 2020; originally announced October 2020.

  16. arXiv:2006.07495

    cs.NE

    Open Questions in Creating Safe Open-ended AI: Tensions Between Control and Creativity

    Authors: Adrien Ecoffet, Jeff Clune, Joel Lehman

    Abstract: Artificial life originated and has long studied the topic of open-ended evolution, which seeks the principles underlying artificial systems that innovate continually, inspired by biological evolution. Recently, interest has grown within the broader field of AI in a generalization of open-ended evolution, here called open-ended search, wherein such questions of open-endedness are explored for advan…

    Submitted 12 June, 2020; originally announced June 2020.

  17. arXiv:2005.13092

    cs.LG stat.ML

    Synthetic Petri Dish: A Novel Surrogate Model for Rapid Architecture Search

    Authors: Aditya Rawal, Joel Lehman, Felipe Petroski Such, Jeff Clune, Kenneth O. Stanley

    Abstract: Neural Architecture Search (NAS) explores a large space of architectural motifs -- a compute-intensive process that often involves ground-truth evaluation of each motif by instantiating it within a large network, and training and evaluating the network with thousands of domain-specific data samples. Inspired by how biological motifs such as cells are sometimes extracted from their natural environm…

    Submitted 26 May, 2020; originally announced May 2020.

  18. First return, then explore

    Authors: Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: The promise of reinforcement learning is to solve complex sequential decision problems autonomously by specifying a high-level reward function only. However, reinforcement learning algorithms struggle when, as is often the case, simple and intuitive rewards provide sparse and deceptive feedback. Avoiding these pitfalls requires thoroughly exploring the environment, but creating algorithms that can…

    Submitted 16 September, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: 47 pages, 14 figures, 4 tables; reorganized sections and modified SI text extensively; added reference to the published version, changed title to published title; added reference to published unformatted pdf

    Journal ref: Nature 590, 580-586 (2021)

  19. arXiv:2003.11164

    cs.LG stat.ML

    Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods

    Authors: Jiale Zhi, Rui Wang, Jeff Clune, Kenneth O. Stanley

    Abstract: Recent advances in machine learning are consistently enabled by increasing amounts of computation. Reinforcement learning (RL) and population-based methods in particular pose unique challenges for efficiency and flexibility to the underlying distributed computing frameworks. These challenges include frequent interaction with simulations, the need for dynamic scaling, and the need for a user interf…

    Submitted 24 March, 2020; originally announced March 2020.

  20. arXiv:2003.08536

    cs.NE

    Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions

    Authors: Rui Wang, Joel Lehman, Aditya Rawal, Jiale Zhi, Yulun Li, Jeff Clune, Kenneth O. Stanley

    Abstract: Creating open-ended algorithms, which generate their own never-ending stream of novel and appropriately challenging learning opportunities, could help to automate and accelerate progress in machine learning. A recent step in this direction is the Paired Open-Ended Trailblazer (POET), an algorithm that generates and solves its own challenges, and allows solutions to goal-switch between challenges t…

    Submitted 13 April, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: 23 pages, 14 figures

  21. arXiv:2003.01825

    cs.NE cs.AI cs.LG

    Scaling MAP-Elites to Deep Neuroevolution

    Authors: Cédric Colas, Joost Huizinga, Vashisht Madhavan, Jeff Clune

    Abstract: Quality-Diversity (QD) algorithms, and MAP-Elites (ME) in particular, have proven very useful for a broad range of applications including enabling real robots to recover quickly from joint damage, solving strongly deceptive maze tasks or evolving robot morphologies to discover new gaits. However, present implementations of MAP-Elites and other QD algorithms seem to be limited to low-dimensional co…

    Submitted 5 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: Accepted to GECCO 2020

  22. arXiv:2002.10585

    cs.NE

    Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity

    Authors: Thomas Miconi, Aditya Rawal, Jeff Clune, Kenneth O. Stanley

    Abstract: The impressive lifelong learning in animal brains is primarily enabled by plastic changes in synaptic connectivity. Importantly, these changes are not passive, but are actively controlled by neuromodulation, which is itself under the control of the brain. The resulting self-modifying abilities of the brain play an important role in learning and adaptation, and are a major basis for biological rein…

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: Presented at the 7th International Conference on Learning Representations (ICLR 2019)

    Journal ref: 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019

  23. arXiv:2002.09571

    cs.LG cs.CV cs.NE stat.ML

    Learning to Continually Learn

    Authors: Shawn Beaulieu, Lapo Frati, Thomas Miconi, Joel Lehman, Kenneth O. Stanley, Jeff Clune, Nick Cheney

    Abstract: Continual lifelong learning requires an agent or model to learn many sequentially ordered tasks, building on previous knowledge without catastrophically forgetting it. Much work has gone towards preventing the default tendency of machine learning models to catastrophically forget, yet virtually all such work involves manually-designed solutions to the problem. We instead advocate meta-learning a s…

    Submitted 3 March, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

  24. arXiv:1912.07768

    cs.LG stat.ML

    Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data

    Authors: Felipe Petroski Such, Aditya Rawal, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: This paper investigates the intriguing question of whether we can create learning algorithms that automatically generate training data, learning environments, and curricula in order to help AI agents rapidly learn. We show that such algorithms are possible via Generative Teaching Networks (GTNs), a general approach that is, in theory, applicable to supervised, unsupervised, and reinforcement learn…

    Submitted 16 December, 2019; originally announced December 2019.

  25. arXiv:1910.09716

    cs.LG cs.CV eess.IV stat.ML

    A deep active learning system for species identification and counting in camera trap images

    Authors: Mohammad Sadegh Norouzzadeh, Dan Morris, Sara Beery, Neel Joshi, Nebojsa Jojic, Jeff Clune

    Abstract: Biodiversity conservation depends on accurate, up-to-date information about wildlife population distributions. Motion-activated cameras, also known as camera traps, are a critical tool for population surveys, as they are cheap and non-intrusive. However, extracting useful information from camera trap images is a cumbersome process: a typical camera trap survey may produce millions of images that r…

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: 15 pages, 5 figures

  26. arXiv:1907.06077

    cs.NE

    Evolvability ES: Scalable and Direct Optimization of Evolvability

    Authors: Alexander Gajewski, Jeff Clune, Kenneth O. Stanley, Joel Lehman

    Abstract: Designing evolutionary algorithms capable of uncovering highly evolvable representations is an open challenge; such evolvability is important because it accelerates evolution and enables fast adaptation to changing circumstances. This paper introduces evolvability ES, an evolutionary algorithm designed to explicitly and efficiently optimize for evolvability, i.e. the ability to further adapt. The…

    Submitted 13 July, 2019; originally announced July 2019.

    Comments: Published in GECCO 2019

  27. arXiv:1905.10985

    cs.AI

    AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence

    Authors: Jeff Clune

    Abstract: Perhaps the most ambitious scientific quest in human history is the creation of general artificial intelligence, which roughly means AI that is as smart or smarter than humans. The dominant approach in the machine learning community is to attempt to discover each of the pieces required for intelligence, with the implicit assumption that some future group will complete the Herculean task of figurin…

    Submitted 31 January, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

  28. arXiv:1904.08939

    cs.LG cs.AI cs.CV stat.ML

    Understanding Neural Networks via Feature Visualization: A survey

    Authors: Anh Nguyen, Jason Yosinski, Jeff Clune

    Abstract: A neuroscience method to understanding the brain is to find and study the preferred stimuli that highly activate an individual cell or groups of cells. Recent advances in machine learning enable a family of methods to synthesize preferred stimuli that cause a neuron in an artificial or biological brain to fire strongly. Those methods are known as Activation Maximization (AM) or Feature Visualizati…

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: A book chapter in an Interpretable ML book (http://www.interpretable-ml.org/book/)

  29. arXiv:1901.10995

    cs.LG cs.AI stat.ML

    Go-Explore: a New Approach for Hard-Exploration Problems

    Authors: Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: A grand challenge in reinforcement learning is intelligent exploration, especially when rewards are sparse or deceptive. Two Atari games serve as benchmarks for such hard-exploration domains: Montezuma's Revenge and Pitfall. On both games, current RL algorithms perform poorly, even those with intrinsic motivation, which is the dominant method to improve performance on hard-exploration domains. To…

    Submitted 26 February, 2021; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: 37 pages, 14 figures; added references to Goyal et al. and Oh et al., updated reference to Colas et al; updated author emails; point readers to updated paper

  30. arXiv:1901.01753

    cs.NE

    Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions

    Authors: Rui Wang, Joel Lehman, Jeff Clune, Kenneth O. Stanley

    Abstract: While the history of machine learning so far largely encompasses a series of problems posed by researchers and algorithms that learn their solutions, an important question is whether the problems themselves can be generated by the algorithm at the same time as they are being solved. Such a process would in effect build its own diverse and expanding curricula, and the solutions to problems at vario…

    Submitted 20 February, 2019; v1 submitted 7 January, 2019; originally announced January 2019.

    Comments: 28 pages, 9 figures

  31. arXiv:1812.10687

    cs.LG stat.ML

    Robustness to Out-of-Distribution Inputs via Task-Aware Generative Uncertainty

    Authors: Rowan McAllister, Gregory Kahn, Jeff Clune, Sergey Levine

    Abstract: Deep learning provides a powerful tool for machine perception when the observations resemble the training data. However, real-world robotic systems must react intelligently to their observations even in unexpected circumstances. This requires a system to reason about its own uncertainty given unfamiliar, out-of-distribution observations. Approximate Bayesian approaches are commonly used to estimat…

    Submitted 27 December, 2018; originally announced December 2018.

  32. arXiv:1812.07069

    cs.NE

    An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents

    Authors: Felipe Petroski Such, Vashisht Madhavan, Rosanne Liu, Rui Wang, Pablo Samuel Castro, Yulun Li, Jiale Zhi, Ludwig Schubert, Marc G. Bellemare, Jeff Clune, Joel Lehman

    Abstract: Much human and computational effort has aimed to improve how deep reinforcement learning algorithms perform on benchmarks such as the Atari Learning Environment. Comparatively less effort has focused on understanding what has been learned by such methods, and investigating and comparing the representations learned by different families of reinforcement learning (RL) algorithms. Sources of friction…

    Submitted 29 May, 2019; v1 submitted 17 December, 2018; originally announced December 2018.

  33. arXiv:1807.03392

    cs.NE

    Evolving Multimodal Robot Behavior via Many Stepping Stones with the Combinatorial Multi-Objective Evolutionary Algorithm

    Authors: Joost Huizinga, Jeff Clune

    Abstract: An important challenge in reinforcement learning, including evolutionary robotics, is to solve multimodal problems, where agents have to act in qualitatively different ways depending on the circumstances. Because multimodal problems are often too difficult to solve directly, it is helpful to take advantage of staging, where a difficult task is divided into simpler subtasks that can serve as steppi…

    Submitted 10 December, 2019; v1 submitted 9 July, 2018; originally announced July 2018.

  34. arXiv:1806.00553

    cs.AI

    Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on Challenging Deep Reinforcement Learning Problems

    Authors: Christopher Stanton, Jeff Clune

    Abstract: Traditional exploration methods in RL require agents to perform random actions to find rewards. But these approaches struggle on sparse-reward domains like Montezuma's Revenge where the probability that any random action sequence leads to reward is extremely low. Recent algorithms have performed well on such tasks by encouraging agents to visit new states or perform new actions in relation to all…

    Submitted 23 November, 2018; v1 submitted 1 June, 2018; originally announced June 2018.

  35. arXiv:1805.01141

    cs.NE cs.AI

    VINE: An Open Source Interactive Data Visualization Tool for Neuroevolution

    Authors: Rui Wang, Jeff Clune, Kenneth O. Stanley

    Abstract: Recent advances in deep neuroevolution have demonstrated that evolutionary algorithms, such as evolution strategies (ES) and genetic algorithms (GA), can scale to train deep neural networks to solve difficult reinforcement learning (RL) problems. However, it remains a challenge to analyze and interpret the underlying process of neuroevolution in such high dimensions. To begin to address this chall…

    Submitted 3 May, 2018; originally announced May 2018.

  36. arXiv:1804.02464

    cs.NE cs.LG stat.ML

    Differentiable plasticity: training plastic neural networks with backpropagation

    Authors: Thomas Miconi, Jeff Clune, Kenneth O. Stanley

    Abstract: How can we build agents that keep learning from experience, quickly and efficiently, after their initial training? Here we take inspiration from the main mechanism of learning in biological brains: synaptic plasticity, carefully tuned by evolution to produce efficient lifelong learning. We show that plasticity, just like connection weights, can be optimized by gradient descent in large (millions o…

    Submitted 31 July, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

    Comments: Presented at ICML 2018

    Journal ref: Proceedings of the 35th International Conference on Machine Learning (ICML2018), Stockholm, Sweden, PMLR 80, 2018

  37. arXiv:1803.03453

    cs.NE

    The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities

    Authors: Joel Lehman, Jeff Clune, Dusan Misevic, Christoph Adami, Lee Altenberg, Julie Beaulieu, Peter J. Bentley, Samuel Bernard, Guillaume Beslon, David M. Bryson, Patryk Chrabaszcz, Nick Cheney, Antoine Cully, Stephane Doncieux, Fred C. Dyer, Kai Olav Ellefsen, Robert Feldt, Stephan Fischer, Stephanie Forrest, Antoine Frénoy, Christian Gagné, Leni Le Goff, Laura M. Grabowski, Babak Hodjat, Frank Hutter, et al. (28 additional authors not shown)

    Abstract: Biological evolution provides a creative fount of complex and subtle adaptations, often surprising the scientists who discover them. However, because evolution is an algorithmic process that transcends the substrate in which it occurs, evolution's creativity is not limited to nature. Indeed, many researchers in the field of digital evolution have observed their evolving algorithms and organisms su…

    Submitted 21 November, 2019; v1 submitted 9 March, 2018; originally announced March 2018.

  38. arXiv:1712.06568

    cs.NE cs.AI

    ES Is More Than Just a Traditional Finite-Difference Approximator

    Authors: Joel Lehman, Jay Chen, Jeff Clune, Kenneth O. Stanley

    Abstract: An evolution strategy (ES) variant based on a simplification of a natural evolution strategy recently attracted attention because it performs surprisingly well in challenging deep reinforcement learning domains. It searches for neural network parameters by generating perturbations to the current set of parameters, checking their performance, and moving in the aggregate direction of higher reward.…

    Submitted 1 May, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  39. arXiv:1712.06567

    cs.NE cs.LG

    Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

    Authors: Felipe Petroski Such, Vashisht Madhavan, Edoardo Conti, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: Deep artificial neural networks (DNNs) are typically trained via gradient-based learning algorithms, namely backpropagation. Evolution strategies (ES) can rival backprop-based algorithms such as Q-learning and policy gradients on challenging deep reinforcement learning (RL) problems. However, ES can be considered a gradient-based algorithm because it performs stochastic gradient descent via an ope…

    Submitted 20 April, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  40. arXiv:1712.06564

    cs.NE

    On the Relationship Between the OpenAI Evolution Strategy and Stochastic Gradient Descent

    Authors: Xingwen Zhang, Jeff Clune, Kenneth O. Stanley

    Abstract: Because stochastic gradient descent (SGD) has shown promise optimizing neural networks with millions of parameters and few if any alternatives are known to exist, it has moved to the heart of leading approaches to reinforcement learning (RL). For that reason, the recent result from OpenAI showing that a particular kind of evolution strategy (ES) can rival the performance of SGD-based deep RL metho…

    Submitted 18 December, 2017; originally announced December 2017.

  41. arXiv:1712.06563  [pdf, other]

    cs.NE cs.AI

    Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients

    Authors: Joel Lehman, Jay Chen, Jeff Clune, Kenneth O. Stanley

    Abstract: While neuroevolution (evolving neural networks) has a successful track record across a variety of domains from reinforcement learning to artificial life, it is rarely applied to large, deep neural networks. A central reason is that while random mutation generally works in low dimensions, a random perturbation of thousands or millions of weights is likely to break existing functionality, providing…

    Submitted 1 May, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  42. arXiv:1712.06560  [pdf, other]

    cs.AI

    Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents

    Authors: Edoardo Conti, Vashisht Madhavan, Felipe Petroski Such, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: Evolution strategies (ES) are a family of black-box optimization algorithms able to train deep neural networks roughly as well as Q-learning and policy gradient methods on challenging deep reinforcement learning (RL) problems, but are much faster (e.g. hours vs. days) because they parallelize better. However, many RL problems require directed exploration because they have reward functions that are…

    Submitted 29 October, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  43. Diffusion-based neuromodulation can eliminate catastrophic forgetting in simple neural networks

    Authors: Roby Velez, Jeff Clune

    Abstract: A long-term goal of AI is to produce agents that can learn a diversity of skills throughout their lifetimes and continuously improve those skills via experience. A longstanding obstacle towards that goal is catastrophic forgetting, which is when learning new information erases previously learned information. Catastrophic forgetting occurs in artificial neural networks (ANNs), which have fueled mos…

    Submitted 18 November, 2017; v1 submitted 19 May, 2017; originally announced May 2017.

  44. The Emergence of Canalization and Evolvability in an Open-Ended, Interactive Evolutionary System

    Authors: Joost Huizinga, Kenneth O. Stanley, Jeff Clune

    Abstract: Natural evolution has produced a tremendous diversity of functional organisms. Many believe an essential component of this process was the evolution of evolvability, whereby evolution speeds up its ability to innovate by generating a more adaptive pool of offspring. One hypothesized mechanism for evolvability is developmental canalization, wherein certain dimensions of variation become more likely…

    Submitted 14 February, 2019; v1 submitted 17 April, 2017; originally announced April 2017.

    Comments: SI can be found at: http://www.evolvingai.org/files/SI_0.zip

    Journal ref: Artificial Life, 24(3), pp. 157-181 (2018)

  45. arXiv:1703.05830  [pdf, other]

    cs.CV cs.LG

    Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning

    Authors: Mohammed Sadegh Norouzzadeh, Anh Nguyen, Margaret Kosmala, Ali Swanson, Meredith Palmer, Craig Packer, Jeff Clune

    Abstract: Having accurate, detailed, and up-to-date information about the location and behavior of animals in the wild would revolutionize our ability to study and conserve ecosystems. We investigate the ability to automatically, accurately, and inexpensively collect such data, which could transform many fields of biology, ecology, and zoology into "big data" sciences. Motion sensor "camera traps" enable co…

    Submitted 15 November, 2017; v1 submitted 16 March, 2017; originally announced March 2017.

  46. arXiv:1612.00005  [pdf, other]

    cs.CV

    Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space

    Authors: Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski

    Abstract: Generating high-resolution, photo-realistic images has been a long-standing goal in machine learning. Recently, Nguyen et al. (2016) showed one interesting way to synthesize novel images by performing gradient ascent in the latent space of a generator network to maximize the activations of one or multiple neurons in a separate classifier network. In this paper we extend this method by introducing…

    Submitted 12 April, 2017; v1 submitted 30 November, 2016; originally announced December 2016.

    Comments: CVPR camera-ready

  47. arXiv:1605.09304  [pdf, other]

    cs.NE cs.AI cs.CV cs.LG

    Synthesizing the preferred inputs for neurons in neural networks via deep generator networks

    Authors: Anh Nguyen, Alexey Dosovitskiy, Jason Yosinski, Thomas Brox, Jeff Clune

    Abstract: Deep neural networks (DNNs) have demonstrated state-of-the-art results on many pattern recognition tasks, especially vision classification problems. Understanding the inner workings of such computational brains is both fascinating basic science that is interesting in its own right - similar to why we study the human brain - and will enable researchers to further improve DNNs. One path to understan…

    Submitted 23 November, 2016; v1 submitted 30 May, 2016; originally announced May 2016.

    Comments: 29 pages, 35 figures, NIPS camera-ready

  48. arXiv:1602.03616  [pdf, other]

    cs.NE cs.CV

    Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks

    Authors: Anh Nguyen, Jason Yosinski, Jeff Clune

    Abstract: We can better understand deep neural networks by identifying which features each of their neurons has learned to detect. To do so, researchers have created Deep Visualization techniques including activation maximization, which synthetically generates inputs (e.g. images) that maximally activate each neuron. A limitation of current techniques is that they assume each neuron detects only one type o…

    Submitted 7 May, 2016; v1 submitted 11 February, 2016; originally announced February 2016.

    Comments: 23 pages (including SI), 24 figures

  49. arXiv:1511.07543  [pdf, other]

    cs.LG cs.NE

    Convergent Learning: Do different neural networks learn the same representations?

    Authors: Yixuan Li, Jason Yosinski, Jeff Clune, Hod Lipson, John Hopcroft

    Abstract: Recent success in training deep neural networks has prompted active investigation into the features learned on their intermediate layers. Such research is difficult because it requires making sense of non-linear computations performed by millions of parameters, but valuable because it increases our ability to understand current models and create improved versions of them. In this paper we investi…

    Submitted 28 February, 2016; v1 submitted 23 November, 2015; originally announced November 2015.

    Comments: Published as a conference paper at ICLR 2016

  50. arXiv:1506.06579  [pdf, other]

    cs.CV cs.LG cs.NE

    Understanding Neural Networks Through Deep Visualization

    Authors: Jason Yosinski, Jeff Clune, Anh Nguyen, Thomas Fuchs, Hod Lipson

    Abstract: Recent years have produced great advances in training large, deep neural networks (DNNs), including notable successes in training convolutional neural networks (convnets) to recognize natural images. However, our understanding of how these models work, especially what computations they perform at intermediate layers, has lagged behind. Progress in the field will be further accelerated by the devel…

    Submitted 22 June, 2015; originally announced June 2015.

    Comments: 12 pages. To appear at ICML Deep Learning Workshop 2015