
Showing 1–4 of 4 results for author: Fetterman, A J

  1. arXiv:2306.08055  [pdf, other]

    cs.LG cs.AI

    Tune As You Scale: Hyperparameter Optimization For Compute Efficient Training

    Authors: Abraham J. Fetterman, Ellie Kitanidis, Joshua Albrecht, Zachary Polizzi, Bryden Fogelman, Maksis Knutins, Bartosz Wróblewski, James B. Simon, Kanjun Qiu

    Abstract: Hyperparameter tuning of deep learning models can lead to order-of-magnitude performance gains for the same amount of compute. Despite this, systematic tuning is uncommon, particularly for large models, which are expensive to evaluate and tend to have many hyperparameters, necessitating difficult judgment calls about tradeoffs, budgets, and search bounds. To address these issues and propose a prac…

    Submitted 13 June, 2023; originally announced June 2023.

  2. arXiv:2303.15438  [pdf, other]

    cs.LG

    On the Stepwise Nature of Self-Supervised Learning

    Authors: James B. Simon, Maksis Knutins, Liu Ziyin, Daniel Geisz, Abraham J. Fetterman, Joshua Albrecht

    Abstract: We present a simple picture of the training process of joint embedding self-supervised learning methods. We find that these methods learn their high-dimensional embeddings one dimension at a time in a sequence of discrete, well-separated steps. We arrive at this conclusion via the study of a linearized model of Barlow Twins applicable to the case in which the trained network is infinitely wide. We…

    Submitted 30 May, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 9 pages (main text) + 14 pages (refs + appendices). ICML '23

  3. arXiv:2212.06295  [pdf, other]

    cs.CL cs.AI

    Despite "super-human" performance, current LLMs are unsuited for decisions about ethics and safety

    Authors: Joshua Albrecht, Ellie Kitanidis, Abraham J. Fetterman

    Abstract: Large language models (LLMs) have exploded in popularity in the past few years and have achieved undeniably impressive results on benchmarks as varied as question answering and text summarization. We provide a simple new prompting strategy that leads to yet another supposedly "super-human" result, this time outperforming humans at common sense ethical reasoning (as measured by accuracy on a subset…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: ML Safety Workshop, NeurIPS 2022

  4. arXiv:2210.13417  [pdf, other]

    cs.AI cs.LG

    Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds

    Authors: Joshua Albrecht, Abraham J. Fetterman, Bryden Fogelman, Ellie Kitanidis, Bartosz Wróblewski, Nicole Seo, Michael Rosenthal, Maksis Knutins, Zachary Polizzi, James B. Simon, Kanjun Qiu

    Abstract: Despite impressive successes, deep reinforcement learning (RL) systems still fall short of human performance on generalization to new tasks and environments that differ from their training. As a benchmark tailored for studying RL generalization, we introduce Avalon, a set of tasks in which embodied agents in highly diverse procedural 3D worlds must survive by navigating terrain, hunting or gatheri…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS Datasets and Benchmarks 2022. Video and links to all code, data, etc. can be found at https://generallyintelligent.com/avalon/