Skip to main content

Showing 1–13 of 13 results for author: Guss, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2107.03374  [pdf, other

    cs.LG

    Evaluating Large Language Models Trained on Code

    Authors: Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter , et al. (33 additional authors not shown)

    Abstract: We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J sol… ▽ More

    Submitted 14 July, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: corrected typos, added references, added authors, added acknowledgements

  2. arXiv:2107.01969  [pdf, other

    cs.LG cs.AI

    The MineRL BASALT Competition on Learning from Human Feedback

    Authors: Rohin Shah, Cody Wild, Steven H. Wang, Neel Alex, Brandon Houghton, William Guss, Sharada Mohanty, Anssi Kanervisto, Stephanie Milani, Nicholay Topin, Pieter Abbeel, Stuart Russell, Anca Dragan

    Abstract: The last decade has seen a significant increase of interest in deep learning research, with many public successes that have demonstrated its potential. As such, these systems are now being incorporated into commercial products. With this comes an additional challenge: how can we build AI systems that solve tasks where there is not a crisp, well-defined specification? While multiple solutions have… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021 Competition Track

  3. arXiv:2106.14876  [pdf, other

    cs.LG stat.ML

    Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft

    Authors: Ingmar Kanitscheider, Joost Huizinga, David Farhi, William Hebgen Guss, Brandon Houghton, Raul Sampedro, Peter Zhokhov, Bowen Baker, Adrien Ecoffet, Jie Tang, Oleg Klimov, Jeff Clune

    Abstract: An important challenge in reinforcement learning is training agents that can solve a wide variety of tasks. If tasks depend on each other (e.g. needing to learn to walk before learning to run), curriculum learning can speed up learning by focusing on the next best task to learn. We explore curriculum learning in a complex, visual domain with many hard exploration challenges: Minecraft. We find tha… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: first submission

  4. arXiv:2106.03748  [pdf, other

    cs.LG cs.AI cs.NE cs.RO stat.ML

    Towards robust and domain agnostic reinforcement learning competitions

    Authors: William Hebgen Guss, Stephanie Milani, Nicholay Topin, Brandon Houghton, Sharada Mohanty, Andrew Melnik, Augustin Harter, Benoit Buschmaas, Bjarne Jaster, Christoph Berganski, Dennis Heitkamp, Marko Henning, Helge Ritter, Chengjie Wu, Xiaotian Hao, Yiming Lu, Hangyu Mao, Yihuan Mao, Chao Wang, Michal Opanowicz, Anssi Kanervisto, Yanick Schraner, Christian Scheller, Xiren Zhou, Lu Liu , et al. (4 additional authors not shown)

    Abstract: Reinforcement learning competitions have formed the basis for standard research benchmarks, galvanized advances in the state-of-the-art, and shaped the direction of the field. Despite this, a majority of challenges suffer from the same fundamental problems: participant solutions to the posed challenge are usually domain-specific, biased to maximally exploit compute resources, and not guaranteed to… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: 20 pages, several figures, published PMLR

  5. arXiv:2103.15332  [pdf, other

    cs.LG cs.AI

    Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark

    Authors: Sharada Mohanty, Jyotish Poonganam, Adrien Gaidon, Andrey Kolobov, Blake Wulfe, Dipam Chakraborty, Gražvydas Šemetulskis, João Schapke, Jonas Kubilius, Jurgis Pašukonis, Linas Klimas, Matthew Hausknecht, Patrick MacAlpine, Quang Nhat Tran, Thomas Tumiel, Xiaocheng Tang, Xinwei Chen, Christopher Hesse, Jacob Hilton, William Hebgen Guss, Sahika Genc, John Schulman, Karl Cobbe

    Abstract: The NeurIPS 2020 Procgen Competition was designed as a centralized benchmark with clearly defined tasks for measuring Sample Efficiency and Generalization in Reinforcement Learning. Generalization remains one of the most fundamental challenges in deep reinforcement learning, and yet we do not have enough benchmarks to measure the progress of the community on Generalization in Reinforcement Learnin… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

  6. arXiv:2101.11071  [pdf, other

    cs.LG cs.AI stat.ML

    The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors

    Authors: William H. Guss, Mario Ynocente Castro, Sam Devlin, Brandon Houghton, Noboru Sean Kuno, Crissman Loomis, Stephanie Milani, Sharada Mohanty, Keisuke Nakata, Ruslan Salakhutdinov, John Schulman, Shinya Shiroshita, Nicholay Topin, Avinash Ummadisingu, Oriol Vinyals

    Abstract: Although deep reinforcement learning has led to breakthroughs in many difficult domains, these successes have required an ever-increasing number of samples, affording only a shrinking segment of the AI community access to their development. Resolution of these limitations requires new, sample-efficient methods. To facilitate research in this direction, we propose this second iteration of the MineR… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 37 pages, initial submission, accepted at NeurIPS. arXiv admin note: substantial text overlap with arXiv:1904.10079

  7. arXiv:2005.06041  [pdf, ps, other

    cs.LG stat.ML

    Guaranteeing Reproducibility in Deep Learning Competitions

    Authors: Brandon Houghton, Stephanie Milani, Nicholay Topin, William Guss, Katja Hofmann, Diego Perez-Liebana, Manuela Veloso, Ruslan Salakhutdinov

    Abstract: To encourage the development of methods with reproducible and robust training behavior, we propose a challenge paradigm where competitors are evaluated directly on the performance of their learning procedures rather than pre-trained agents. Since competition organizers re-train proposed methods in a controlled setting they can guarantee reproducibility, and -- by retraining submissions using a hel… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: Accepted as a poster presentation to the 2019 NeruIPS Challenges in Machine Learning workshop (CiML)

  8. arXiv:2003.05012  [pdf, other

    cs.LG cs.AI stat.ML

    Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning

    Authors: Stephanie Milani, Nicholay Topin, Brandon Houghton, William H. Guss, Sharada P. Mohanty, Keisuke Nakata, Oriol Vinyals, Noboru Sean Kuno

    Abstract: To facilitate research in the direction of sample efficient reinforcement learning, we held the MineRL Competition on Sample Efficient Reinforcement Learning Using Human Priors at the Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019). The primary goal of this competition was to promote the development of algorithms that use human demonstrations alongside reinforcement… ▽ More

    Submitted 18 June, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

    Comments: To appear in Proceedings of Machine Learning Research: NeurIPS 2019 Competition & Demonstration Track Postproceedings. 12 pages, 2 figures

  9. arXiv:1910.01545  [pdf, ps, other

    cs.LG math.FA stat.ML

    On Universal Approximation by Neural Networks with Uniform Guarantees on Approximation of Infinite Dimensional Maps

    Authors: William H. Guss, Ruslan Salakhutdinov

    Abstract: The study of universal approximation of arbitrary functions $f: \mathcal{X} \to \mathcal{Y}$ by neural networks has a rich and thorough history dating back to Kolmogorov (1957). In the case of learning finite dimensional maps, many authors have shown various forms of the universality of both fixed depth and fixed width neural networks. However, in many cases, these classical results fail to extend… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: 12 pages

  10. arXiv:1907.13440  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    MineRL: A Large-Scale Dataset of Minecraft Demonstrations

    Authors: William H. Guss, Brandon Houghton, Nicholay Topin, Phillip Wang, Cayden Codel, Manuela Veloso, Ruslan Salakhutdinov

    Abstract: The sample inefficiency of standard deep reinforcement learning methods precludes their application to many real-world problems. Methods which leverage human demonstrations require fewer samples but have been researched less. As demonstrated in the computer vision and natural language processing communities, large-scale datasets have the capacity to facilitate research by serving as an experimenta… ▽ More

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: Accepted at IJCAI 2019, 7 pages, 6 figures. arXiv admin note: text overlap with arXiv:1904.10079

  11. arXiv:1904.10079  [pdf, other

    cs.LG cs.AI stat.ML

    The MineRL 2019 Competition on Sample Efficient Reinforcement Learning using Human Priors

    Authors: William H. Guss, Cayden Codel, Katja Hofmann, Brandon Houghton, Noboru Kuno, Stephanie Milani, Sharada Mohanty, Diego Perez Liebana, Ruslan Salakhutdinov, Nicholay Topin, Manuela Veloso, Phillip Wang

    Abstract: Though deep reinforcement learning has led to breakthroughs in many difficult domains, these successes have required an ever-increasing number of samples. As state-of-the-art reinforcement learning (RL) systems require an exponentially increasing number of samples, their development is restricted to a continually shrinking segment of the AI community. Likewise, many of these systems cannot be appl… ▽ More

    Submitted 19 January, 2021; v1 submitted 22 April, 2019; originally announced April 2019.

    Comments: accepted at NeurIPS 2019, 28 pages

  12. arXiv:1802.04443  [pdf, other

    cs.LG cs.CG cs.NE math.AT stat.ML

    On Characterizing the Capacity of Neural Networks using Algebraic Topology

    Authors: William H. Guss, Ruslan Salakhutdinov

    Abstract: The learnability of different neural architectures can be characterized directly by computable measures of data complexity. In this paper, we reframe the problem of architecture selection as understanding how data determines the most expressive and generalizable architectures suited to that data, beyond inductive bias. After suggesting algebraic topology as a measure for data complexity, we show t… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

    Comments: 13 pages, 11 figures

  13. arXiv:1612.04799  [pdf, other

    stat.ML cs.CV cs.LG

    Deep Function Machines: Generalized Neural Networks for Topological Layer Expression

    Authors: William H. Guss

    Abstract: In this paper we propose a generalization of deep neural networks called deep function machines (DFMs). DFMs act on vector spaces of arbitrary (possibly infinite) dimension and we show that a family of DFMs are invariant to the dimension of input data; that is, the parameterization of the model does not directly hinge on the quality of the input (eg. high resolution images). Using this generalizat… ▽ More

    Submitted 6 November, 2017; v1 submitted 14 December, 2016; originally announced December 2016.

    Comments: 23 pages, 9 figures, with experiments