Skip to main content

Showing 1–10 of 10 results for author: Brown, T B

.
  1. arXiv:2010.14701  [pdf, other

    cs.LG cs.CL cs.CV

    Scaling Laws for Autoregressive Generative Modeling

    Authors: Tom Henighan, Jared Kaplan, Mor Katz, Mark Chen, Christopher Hesse, Jacob Jackson, Heewoo Jun, Tom B. Brown, Prafulla Dhariwal, Scott Gray, Chris Hallacy, Benjamin Mann, Alec Radford, Aditya Ramesh, Nick Ryder, Daniel M. Ziegler, John Schulman, Dario Amodei, Sam McCandlish

    Abstract: We identify empirical scaling laws for the cross-entropy loss in four domains: generative image modeling, video modeling, multimodal image$\leftrightarrow$text models, and mathematical problem solving. In all cases autoregressive Transformers smoothly improve in performance as model size and compute budgets increase, following a power-law plus constant scaling law. The optimal model size also depe… ▽ More

    Submitted 5 November, 2020; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: 20+17 pages, 33 figures; added appendix with additional language results

  2. arXiv:2005.14165  [pdf, other

    cs.CL

    Language Models are Few-Shot Learners

    Authors: Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess , et al. (6 additional authors not shown)

    Abstract: Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few… ▽ More

    Submitted 22 July, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: 40+32 pages

  3. arXiv:2005.04305  [pdf

    cs.LG cs.CV stat.ML

    Measuring the Algorithmic Efficiency of Neural Networks

    Authors: Danny Hernandez, Tom B. Brown

    Abstract: Three factors drive the advance of AI: algorithmic innovation, data, and the amount of compute available for training. Algorithmic progress has traditionally been more difficult to quantify than compute and data. In this work, we argue that algorithmic progress has an aspect that is both straightforward to measure and interesting: reductions over time in the compute needed to reach past capabiliti… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: 20 pages, 5 figures

  4. arXiv:2001.08361  [pdf, other

    cs.LG stat.ML

    Scaling Laws for Neural Language Models

    Authors: Jared Kaplan, Sam McCandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, Scott Gray, Alec Radford, Jeffrey Wu, Dario Amodei

    Abstract: We study empirical scaling laws for language model performance on the cross-entropy loss. The loss scales as a power-law with model size, dataset size, and the amount of compute used for training, with some trends spanning more than seven orders of magnitude. Other architectural details such as network width or depth have minimal effects within a wide range. Simple equations govern the dependence… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

    Comments: 19 pages, 15 figures

  5. arXiv:1909.08593  [pdf, other

    cs.CL cs.LG stat.ML

    Fine-Tuning Language Models from Human Preferences

    Authors: Daniel M. Ziegler, Nisan Stiennon, Jeffrey Wu, Tom B. Brown, Alec Radford, Dario Amodei, Paul Christiano, Geoffrey Irving

    Abstract: Reward learning enables the application of reinforcement learning (RL) to tasks where reward is defined by human judgment, building a model of reward by asking humans questions. Most work on reward learning has used simulated environments, but complex information about values is often expressed in natural language, and we believe reward learning for language is a key to making RL practical and saf… ▽ More

    Submitted 8 January, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

  6. arXiv:1809.08352  [pdf, other

    stat.ML cs.CV cs.LG

    Unrestricted Adversarial Examples

    Authors: Tom B. Brown, Nicholas Carlini, Chiyuan Zhang, Catherine Olsson, Paul Christiano, Ian Goodfellow

    Abstract: We introduce a two-player contest for evaluating the safety and robustness of machine learning systems, with a large prize pool. Unlike most prior work in ML robustness, which studies norm-constrained adversaries, we shift our focus to unconstrained adversaries. Defenders submit machine learning models, and try to achieve high accuracy and coverage on non-adversarial data while making no confident… ▽ More

    Submitted 21 September, 2018; originally announced September 2018.

  7. arXiv:1802.08768  [pdf, other

    stat.ML cs.LG

    Is Generator Conditioning Causally Related to GAN Performance?

    Authors: Augustus Odena, Jacob Buckman, Catherine Olsson, Tom B. Brown, Christopher Olah, Colin Raffel, Ian Goodfellow

    Abstract: Recent work (Pennington et al, 2017) suggests that controlling the entire distribution of Jacobian singular values is an important design consideration in deep learning. Motivated by this, we study the distribution of singular values of the Jacobian of the generator in Generative Adversarial Networks (GANs). We find that this Jacobian generally becomes ill-conditioned at the beginning of training.… ▽ More

    Submitted 18 June, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

  8. arXiv:1712.09665  [pdf, other

    cs.CV

    Adversarial Patch

    Authors: Tom B. Brown, Dandelion Mané, Aurko Roy, Martín Abadi, Justin Gilmer

    Abstract: We present a method to create universal, robust, targeted adversarial image patches in the real world. The patches are universal because they can be used to attack any scene, robust because they work under a wide variety of transformations, and targeted because they can cause a classifier to output any target class. These adversarial patches can be printed, added to any scene, photographed, and pr… ▽ More

    Submitted 16 May, 2018; v1 submitted 27 December, 2017; originally announced December 2017.

  9. arXiv:1706.03741  [pdf, other

    stat.ML cs.AI cs.HC cs.LG

    Deep reinforcement learning from human preferences

    Authors: Paul Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei

    Abstract: For sophisticated reinforcement learning (RL) systems to interact usefully with real-world environments, we need to communicate complex goals to these systems. In this work, we explore goals defined in terms of (non-expert) human preferences between pairs of trajectory segments. We show that this approach can effectively solve complex RL tasks without access to the reward function, including Atari… ▽ More

    Submitted 17 February, 2023; v1 submitted 12 June, 2017; originally announced June 2017.

  10. arXiv:nucl-ex/0105007  [pdf, ps, other

    nucl-ex

    Global Lifetime Measurements of Highly-Deformed and Other Rotational Structures in the A~135 Light Rare-Earth Region: Probing the Single-Particle Motion in a Rotating Potential

    Authors: M. A. Riley, R. W. Laird, F. G. Kondev, D. J. Hartley, D. E. Archer, T. B. Brown, R. M. Clark, M. D evlin, P. Fallon, I. M. Hibbert, D. T. Joss, D. R. LaFosse, P. J. Nolan, N. J. O'Brien, E. S. Paul, J. Pfohl, D. G. Sarantites, R. K. Sheline, S. L. Shepherd, J. Simpson, R. Wadsworth, M. T. Matev, A. V. Afanasjev, J. Dobaczewski, G. A. Lalazissis , et al. (2 additional authors not shown)

    Abstract: It has been possible, using GAMMASPHERE plus Microball,to extract differential lifetime measurements free from common systematic errors for over 15 different nuclei (various isotopes of Ce, Pr, Nd, Pm, and Sm) at high spin within a single experiment. This comprehensive study establishes the effective single-particle quadrupole moments in the A~135 light rare-earth region. Detailed comparisons ar… ▽ More

    Submitted 15 May, 2001; originally announced May 2001.

    Comments: 14 pages, 3 postscript figures. "High Spin Physics 2001" Warsaw, Poland, February 6--10, 2001, Acta Physica Polonica (in press)

    Journal ref: Acta Phys.Polon. B32 (2001) 2683-2696