
Showing 1–31 of 31 results for author: Stanley, K O

  1. arXiv:2206.08896  [pdf, other]

    cs.NE

    Evolution through Large Models

    Authors: Joel Lehman, Jonathan Gordon, Shawn Jain, Kamal Ndousse, Cathy Yeh, Kenneth O. Stanley

    Abstract: This paper pursues the insight that large language models (LLMs) trained to generate code can vastly improve the effectiveness of mutation operators applied to programs in genetic programming (GP). Because such LLMs benefit from training data that includes sequential changes and modifications, they can approximate likely changes that humans would make. To highlight the breadth of implications of s…

    Submitted 17 June, 2022; originally announced June 2022.

  2. arXiv:2106.12070  [pdf, other]

    cs.LG cs.AI cs.CV

    Towards Consistent Predictive Confidence through Fitted Ensembles

    Authors: Navid Kardan, Ankit Sharma, Kenneth O. Stanley

    Abstract: Deep neural networks are behind many of the recent successes in machine learning applications. However, these models can produce overconfident decisions while encountering out-of-distribution (OOD) examples or making a wrong prediction. This inconsistent predictive confidence limits the integration of independently-trained learning models into a larger system. This paper introduces separable conce…

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: IJCNN 2021

  3. arXiv:2005.13092  [pdf, other]

    cs.LG stat.ML

    Synthetic Petri Dish: A Novel Surrogate Model for Rapid Architecture Search

    Authors: Aditya Rawal, Joel Lehman, Felipe Petroski Such, Jeff Clune, Kenneth O. Stanley

    Abstract: Neural Architecture Search (NAS) explores a large space of architectural motifs -- a compute-intensive process that often involves ground-truth evaluation of each motif by instantiating it within a large network, and training and evaluating the network with thousands of domain-specific data samples. Inspired by how biological motifs such as cells are sometimes extracted from their natural environm…

    Submitted 26 May, 2020; originally announced May 2020.

  4. First return, then explore

    Authors: Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: The promise of reinforcement learning is to solve complex sequential decision problems autonomously by specifying a high-level reward function only. However, reinforcement learning algorithms struggle when, as is often the case, simple and intuitive rewards provide sparse and deceptive feedback. Avoiding these pitfalls requires thoroughly exploring the environment, but creating algorithms that can…

    Submitted 16 September, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: 47 pages, 14 figures, 4 tables; reorganized sections and modified SI text extensively; added reference to the published version, changed title to published title; added reference to published unformatted pdf

    Journal ref: Nature 590, 580-586 (2021)

  5. arXiv:2003.11164  [pdf, other]

    cs.LG stat.ML

    Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods

    Authors: Jiale Zhi, Rui Wang, Jeff Clune, Kenneth O. Stanley

    Abstract: Recent advances in machine learning are consistently enabled by increasing amounts of computation. Reinforcement learning (RL) and population-based methods in particular pose unique challenges for efficiency and flexibility to the underlying distributed computing frameworks. These challenges include frequent interaction with simulations, the need for dynamic scaling, and the need for a user interf…

    Submitted 24 March, 2020; originally announced March 2020.

  6. arXiv:2003.08536  [pdf, other]

    cs.NE

    Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions

    Authors: Rui Wang, Joel Lehman, Aditya Rawal, Jiale Zhi, Yulun Li, Jeff Clune, Kenneth O. Stanley

    Abstract: Creating open-ended algorithms, which generate their own never-ending stream of novel and appropriately challenging learning opportunities, could help to automate and accelerate progress in machine learning. A recent step in this direction is the Paired Open-Ended Trailblazer (POET), an algorithm that generates and solves its own challenges, and allows solutions to goal-switch between challenges t…

    Submitted 13 April, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: 23 pages, 14 figures

  7. arXiv:2002.10585  [pdf, other]

    cs.NE

    Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity

    Authors: Thomas Miconi, Aditya Rawal, Jeff Clune, Kenneth O. Stanley

    Abstract: The impressive lifelong learning in animal brains is primarily enabled by plastic changes in synaptic connectivity. Importantly, these changes are not passive, but are actively controlled by neuromodulation, which is itself under the control of the brain. The resulting self-modifying abilities of the brain play an important role in learning and adaptation, and are a major basis for biological rein…

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: Presented at the 7th International Conference on Learning Representations (ICLR 2019)

    Journal ref: 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019

  8. arXiv:2002.09571  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Learning to Continually Learn

    Authors: Shawn Beaulieu, Lapo Frati, Thomas Miconi, Joel Lehman, Kenneth O. Stanley, Jeff Clune, Nick Cheney

    Abstract: Continual lifelong learning requires an agent or model to learn many sequentially ordered tasks, building on previous knowledge without catastrophically forgetting it. Much work has gone towards preventing the default tendency of machine learning models to catastrophically forget, yet virtually all such work involves manually-designed solutions to the problem. We instead advocate meta-learning a s…

    Submitted 3 March, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

  9. arXiv:2001.01683  [pdf, other]

    cs.NE cs.LG stat.ML

    Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures

    Authors: Sebastian Risi, Kenneth O. Stanley

    Abstract: Deep reinforcement learning approaches have shown impressive results in a variety of different domains, however, more complex heterogeneous architectures such as world models require the different neural components to be trained separately instead of end-to-end. While a simple genetic algorithm recently showed end-to-end training is possible, it failed to solve a more complex 3D task. This paper p…

    Submitted 23 February, 2021; v1 submitted 29 December, 2019; originally announced January 2020.

  10. arXiv:1912.07768  [pdf, other]

    cs.LG stat.ML

    Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data

    Authors: Felipe Petroski Such, Aditya Rawal, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: This paper investigates the intriguing question of whether we can create learning algorithms that automatically generate training data, learning environments, and curricula in order to help AI agents rapidly learn. We show that such algorithms are possible via Generative Teaching Networks (GTNs), a general approach that is, in theory, applicable to supervised, unsupervised, and reinforcement learn…

    Submitted 16 December, 2019; originally announced December 2019.

  11. An Overview of Open-Ended Evolution: Editorial Introduction to the Open-Ended Evolution II Special Issue

    Authors: Norman Packard, Mark A. Bedau, Alastair Channon, Takashi Ikegami, Steen Rasmussen, Kenneth O. Stanley, Tim Taylor

    Abstract: Nature's spectacular inventiveness, reflected in the enormous diversity of form and function displayed by the biosphere, is a feature of life that distinguishes living most strongly from nonliving. It is, therefore, not surprising that this aspect of life should become a central focus of artificial life. We have known since Darwin that the diversity is produced dynamically, through the process of…

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: This article is published in the Artificial Life journal (https://www.mitpressjournals.org/loi/artl) and is copyright (c) 2019 Massachusetts Institute of Technology. It is posted on arXiv.org after the publication embargo period in accordance with MIT Press Journals' author posting guidelines (https://www.mitpressjournals.org/for_authors#authorposting)

    Journal ref: Artificial Life, 25(2), pp. 93-103, 2019

  12. arXiv:1907.06077  [pdf, other]

    cs.NE

    Evolvability ES: Scalable and Direct Optimization of Evolvability

    Authors: Alexander Gajewski, Jeff Clune, Kenneth O. Stanley, Joel Lehman

    Abstract: Designing evolutionary algorithms capable of uncovering highly evolvable representations is an open challenge; such evolvability is important because it accelerates evolution and enables fast adaptation to changing circumstances. This paper introduces evolvability ES, an evolutionary algorithm designed to explicitly and efficiently optimize for evolvability, i.e. the ability to further adapt. The…

    Submitted 13 July, 2019; originally announced July 2019.

    Comments: Published in GECCO 2019

  13. arXiv:1906.08857  [pdf, other]

    cs.NE cs.AI

    Deep Neuroevolution of Recurrent and Discrete World Models

    Authors: Sebastian Risi, Kenneth O. Stanley

    Abstract: Neural architectures inspired by our own human cognitive system, such as the recently introduced world models, have been shown to outperform traditional deep reinforcement learning (RL) methods in a variety of different domains. Instead of the relatively simple architectures employed in most RL experiments, world models rely on multiple different neural components that are responsible for visual i…

    Submitted 28 April, 2019; originally announced June 2019.

  14. arXiv:1901.10995  [pdf, other]

    cs.LG cs.AI stat.ML

    Go-Explore: a New Approach for Hard-Exploration Problems

    Authors: Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: A grand challenge in reinforcement learning is intelligent exploration, especially when rewards are sparse or deceptive. Two Atari games serve as benchmarks for such hard-exploration domains: Montezuma's Revenge and Pitfall. On both games, current RL algorithms perform poorly, even those with intrinsic motivation, which is the dominant method to improve performance on hard-exploration domains. To…

    Submitted 26 February, 2021; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: 37 pages, 14 figures; added references to Goyal et al. and Oh et al., updated reference to Colas et al; updated author emails; point readers to updated paper

  15. arXiv:1901.01753  [pdf, other]

    cs.NE

    Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions

    Authors: Rui Wang, Joel Lehman, Jeff Clune, Kenneth O. Stanley

    Abstract: While the history of machine learning so far largely encompasses a series of problems posed by researchers and algorithms that learn their solutions, an important question is whether the problems themselves can be generated by the algorithm at the same time as they are being solved. Such a process would in effect build its own diverse and expanding curricula, and the solutions to problems at vario…

    Submitted 20 February, 2019; v1 submitted 7 January, 2019; originally announced January 2019.

    Comments: 28 pages, 9 figures

  16. arXiv:1805.01141  [pdf, other]

    cs.NE cs.AI

    VINE: An Open Source Interactive Data Visualization Tool for Neuroevolution

    Authors: Rui Wang, Jeff Clune, Kenneth O. Stanley

    Abstract: Recent advances in deep neuroevolution have demonstrated that evolutionary algorithms, such as evolution strategies (ES) and genetic algorithms (GA), can scale to train deep neural networks to solve difficult reinforcement learning (RL) problems. However, it remains a challenge to analyze and interpret the underlying process of neuroevolution in such high dimensions. To begin to address this chall…

    Submitted 3 May, 2018; originally announced May 2018.

  17. arXiv:1804.02464  [pdf, other]

    cs.NE cs.LG stat.ML

    Differentiable plasticity: training plastic neural networks with backpropagation

    Authors: Thomas Miconi, Jeff Clune, Kenneth O. Stanley

    Abstract: How can we build agents that keep learning from experience, quickly and efficiently, after their initial training? Here we take inspiration from the main mechanism of learning in biological brains: synaptic plasticity, carefully tuned by evolution to produce efficient lifelong learning. We show that plasticity, just like connection weights, can be optimized by gradient descent in large (millions o…

    Submitted 31 July, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

    Comments: Presented at ICML 2018

    Journal ref: Proceedings of the 35th International Conference on Machine Learning (ICML2018), Stockholm, Sweden, PMLR 80, 2018

  18. arXiv:1803.03453  [pdf, other]

    cs.NE

    The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities

    Authors: Joel Lehman, Jeff Clune, Dusan Misevic, Christoph Adami, Lee Altenberg, Julie Beaulieu, Peter J. Bentley, Samuel Bernard, Guillaume Beslon, David M. Bryson, Patryk Chrabaszcz, Nick Cheney, Antoine Cully, Stephane Doncieux, Fred C. Dyer, Kai Olav Ellefsen, Robert Feldt, Stephan Fischer, Stephanie Forrest, Antoine Frénoy, Christian Gagné, Leni Le Goff, Laura M. Grabowski, Babak Hodjat, Frank Hutter, et al. (28 additional authors not shown)

    Abstract: Biological evolution provides a creative fount of complex and subtle adaptations, often surprising the scientists who discover them. However, because evolution is an algorithmic process that transcends the substrate in which it occurs, evolution's creativity is not limited to nature. Indeed, many researchers in the field of digital evolution have observed their evolving algorithms and organisms su…

    Submitted 21 November, 2019; v1 submitted 9 March, 2018; originally announced March 2018.

  19. arXiv:1712.06568  [pdf, other]

    cs.NE cs.AI

    ES Is More Than Just a Traditional Finite-Difference Approximator

    Authors: Joel Lehman, Jay Chen, Jeff Clune, Kenneth O. Stanley

    Abstract: An evolution strategy (ES) variant based on a simplification of a natural evolution strategy recently attracted attention because it performs surprisingly well in challenging deep reinforcement learning domains. It searches for neural network parameters by generating perturbations to the current set of parameters, checking their performance, and moving in the aggregate direction of higher reward.…

    Submitted 1 May, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  20. arXiv:1712.06567  [pdf, other]

    cs.NE cs.LG

    Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

    Authors: Felipe Petroski Such, Vashisht Madhavan, Edoardo Conti, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: Deep artificial neural networks (DNNs) are typically trained via gradient-based learning algorithms, namely backpropagation. Evolution strategies (ES) can rival backprop-based algorithms such as Q-learning and policy gradients on challenging deep reinforcement learning (RL) problems. However, ES can be considered a gradient-based algorithm because it performs stochastic gradient descent via an ope…

    Submitted 20 April, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  21. arXiv:1712.06564  [pdf, other]

    cs.NE

    On the Relationship Between the OpenAI Evolution Strategy and Stochastic Gradient Descent

    Authors: Xingwen Zhang, Jeff Clune, Kenneth O. Stanley

    Abstract: Because stochastic gradient descent (SGD) has shown promise optimizing neural networks with millions of parameters and few if any alternatives are known to exist, it has moved to the heart of leading approaches to reinforcement learning (RL). For that reason, the recent result from OpenAI showing that a particular kind of evolution strategy (ES) can rival the performance of SGD-based deep RL metho…

    Submitted 18 December, 2017; originally announced December 2017.

  22. arXiv:1712.06563  [pdf, other]

    cs.NE cs.AI

    Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients

    Authors: Joel Lehman, Jay Chen, Jeff Clune, Kenneth O. Stanley

    Abstract: While neuroevolution (evolving neural networks) has a successful track record across a variety of domains from reinforcement learning to artificial life, it is rarely applied to large, deep neural networks. A central reason is that while random mutation generally works in low dimensions, a random perturbation of thousands or millions of weights is likely to break existing functionality, providing…

    Submitted 1 May, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  23. arXiv:1712.06560  [pdf, other]

    cs.AI

    Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents

    Authors: Edoardo Conti, Vashisht Madhavan, Felipe Petroski Such, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: Evolution strategies (ES) are a family of black-box optimization algorithms able to train deep neural networks roughly as well as Q-learning and policy gradient methods on challenging deep reinforcement learning (RL) problems, but are much faster (e.g. hours vs. days) because they parallelize better. However, many RL problems require directed exploration because they have reward functions that are…

    Submitted 29 October, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

  24. The Emergence of Canalization and Evolvability in an Open-Ended, Interactive Evolutionary System

    Authors: Joost Huizinga, Kenneth O. Stanley, Jeff Clune

    Abstract: Natural evolution has produced a tremendous diversity of functional organisms. Many believe an essential component of this process was the evolution of evolvability, whereby evolution speeds up its ability to innovate by generating a more adaptive pool of offspring. One hypothesized mechanism for evolvability is developmental canalization, wherein certain dimensions of variation become more likely…

    Submitted 14 February, 2019; v1 submitted 17 April, 2017; originally announced April 2017.

    Comments: SI can be found at: http://www.evolvingai.org/files/SI_0.zip

    Journal ref: Artificial life, 24(3), pp.157-181 (2018)

  25. Born to Learn: the Inspiration, Progress, and Future of Evolved Plastic Artificial Neural Networks

    Authors: Andrea Soltoggio, Kenneth O. Stanley, Sebastian Risi

    Abstract: Biological plastic neural networks are systems of extraordinary computational capabilities shaped by evolution, development, and lifetime learning. The interplay of these elements leads to the emergence of adaptive behavior and intelligence. Inspired by such intricate natural phenomena, Evolved Plastic Artificial Neural Networks (EPANNs) use simulated evolution in-silico to breed plastic neural ne…

    Submitted 8 August, 2018; v1 submitted 30 March, 2017; originally announced March 2017.

    Journal ref: Neural Networks, 2018

  26. arXiv:1609.02226  [pdf, other]

    cs.AI cs.LG cs.NE

    Fitted Learning: Models with Awareness of their Limits

    Authors: Navid Kardan, Kenneth O. Stanley

    Abstract: Though deep learning has pushed the boundaries of classification forward, in recent years hints of the limits of standard classification have begun to emerge. Problems such as fooling, adding new classes over time, and the need to retrain learning models only for small changes to the original problem all point to a potential shortcoming in the classic classification regime, where a comprehensive a…

    Submitted 9 July, 2018; v1 submitted 7 September, 2016; originally announced September 2016.

  27. arXiv:1407.3000  [pdf, ps, other]

    cs.NE

    A Proposed Infrastructure for Adding Online Interaction to Any Evolutionary Domain

    Authors: Paul Szerlip, Kenneth O. Stanley

    Abstract: To address the difficulty of creating online collaborative evolutionary systems, this paper presents a new prototype library called Worldwide Infrastructure for Neuroevolution (WIN) and its accompanying site WIN Online (http://winark.org/). The WIN library is a collection of software packages built on top of Node.js that reduce the complexity of creating fully persistent, online, and interactive (…

    Submitted 10 July, 2014; originally announced July 2014.

    Comments: Presented at WebAL-1: Workshop on Artificial Life and the Web 2014 (arXiv:1406.2507)

    Report number: WebAL1/2014/05

  28. arXiv:1406.1833  [pdf, other]

    cs.NE cs.LG

    Unsupervised Feature Learning through Divergent Discriminative Feature Accumulation

    Authors: Paul A. Szerlip, Gregory Morse, Justin K. Pugh, Kenneth O. Stanley

    Abstract: Unlike unsupervised approaches such as autoencoders that learn to reconstruct their inputs, this paper introduces an alternative approach to unsupervised feature learning called divergent discriminative feature accumulation (DDFA) that instead continually accumulates features that make novel discriminations among the training set. Thus DDFA features are inherently discriminative from the start eve…

    Submitted 9 June, 2014; v1 submitted 6 June, 2014; originally announced June 2014.

    Comments: Corrected citation formatting

  29. Evolvability Is Inevitable: Increasing Evolvability Without the Pressure to Adapt

    Authors: Joel Lehman, Kenneth O. Stanley

    Abstract: Why evolvability appears to have increased over evolutionary time is an important unresolved biological question. Unlike most candidate explanations, this paper proposes that increasing evolvability can result without any pressure to adapt. The insight is that if evolvability is heritable, then an unbiased drifting process across genotypes can still create a distribution of phenotypes biased towar…

    Submitted 5 February, 2013; originally announced February 2013.

  30. arXiv:1207.6682  [pdf, other]

    cs.NE

    Exploring Promising Stepping Stones by Combining Novelty Search with Interactive Evolution

    Authors: Brian G. Woolley, Kenneth O. Stanley

    Abstract: The field of evolutionary computation is inspired by the achievements of natural evolution, in which there is no final objective. Yet the pursuit of objectives is ubiquitous in simulated evolution. A significant problem is that objective approaches assume that intermediate stepping stones will increasingly resemble the final objective when in fact they often do not. The consequence is that while s…

    Submitted 27 July, 2012; originally announced July 2012.

    Comments: 15 pages, 7 figures

    ACM Class: I.2.6

  31. Competitive Coevolution through Evolutionary Complexification

    Authors: R. Miikkulainen, K. O. Stanley

    Abstract: Two major goals in machine learning are the discovery and improvement of solutions to complex problems. In this paper, we argue that complexification, i.e. the incremental elaboration of solutions through adding new structure, achieves both these goals. We demonstrate the power of complexification through the NeuroEvolution of Augmenting Topologies (NEAT) method, which evolves increasingly compl…

    Submitted 30 June, 2011; originally announced July 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 21, pages 63-100, 2004