Showing 1–6 of 6 results for author: Snell, J C

  1. arXiv:2311.14601  [pdf, other]

    cs.LG cs.NE stat.ML

    A Metalearned Neural Circuit for Nonparametric Bayesian Inference

    Authors: Jake C. Snell, Gianluca Bencomo, Thomas L. Griffiths

    Abstract: Most applications of machine learning to classification assume a closed set of balanced classes. This is at odds with the real world, where class occurrence statistics often follow a long-tailed power-law distribution and it is unlikely that all classes are seen in a single sample. Nonparametric Bayesian models naturally capture this phenomenon, but have significant practical barriers to widesprea…

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 13 pages, 3 figures. Code available at https://github.com/jakesnell/neural-circuits

  2. arXiv:2311.13628  [pdf, other]

    cs.LG cs.AI cs.CL

    Prompt Risk Control: A Rigorous Framework for Responsible Deployment of Large Language Models

    Authors: Thomas P. Zollo, Todd Morrill, Zhun Deng, Jake C. Snell, Toniann Pitassi, Richard Zemel

    Abstract: The recent explosion in the capabilities of large language models has led to a wave of interest in how best to prompt a model to perform a given task. While it may be tempting to simply choose a prompt based on average performance on a validation set, this can lead to a deployment where unexpectedly poor responses are generated, especially for the worst-off users. To mitigate this prospect, we pro…

    Submitted 27 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: 34 pages, 10 figures, published as conference paper at ICLR 2024, and accepted to the Socially Responsible Language Modelling Research (SoLaR) workshop at NeurIPS 2023

  3. arXiv:2311.10580  [pdf, other]

    cs.LG eess.SY stat.ML

    Implicit Maximum a Posteriori Filtering via Adaptive Optimization

    Authors: Gianluca M. Bencomo, Jake C. Snell, Thomas L. Griffiths

    Abstract: Bayesian filtering approximates the true underlying behavior of a time-varying system by inverting an explicit generative model to convert noisy measurements into state estimates. This process typically requires either storage, inversion, and multiplication of large matrices or Monte Carlo estimation, neither of which are practical in high-dimensional state spaces such as the weight spaces of arti…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Under review at ICLR 2024

  4. arXiv:2309.13786  [pdf, other]

    cs.LG stat.ML

    Distribution-Free Statistical Dispersion Control for Societal Applications

    Authors: Zhun Deng, Thomas P. Zollo, Jake C. Snell, Toniann Pitassi, Richard Zemel

    Abstract: Explicit finite-sample statistical guarantees on model performance are an important ingredient in responsible machine learning. Previous work has focused mainly on bounding either the expected loss of a predictor or the probability that an individual prediction will incur a loss value in a specified range. However, for many high-stakes applications, it is crucial to understand and control the disp…

    Submitted 6 March, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted by NeurIPS as spotlight (top 3% among submissions)

  5. arXiv:2305.17262  [pdf, other]

    cs.CV cs.AI

    Im-Promptu: In-Context Composition from Image Prompts

    Authors: Bhishma Dedhia, Michael Chang, Jake C. Snell, Thomas L. Griffiths, Niraj K. Jha

    Abstract: Large language models are few-shot learners that can solve diverse tasks from a handful of demonstrations. This implicit understanding of tasks suggests that the attention mechanisms over word tokens may play a role in analogical reasoning. In this work, we investigate whether analogical reasoning can enable in-context composition over composable elements of visual stimuli. First, we introduce a s…

    Submitted 22 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  6. arXiv:2212.13629  [pdf, other]

    cs.LG stat.ML

    Quantile Risk Control: A Flexible Framework for Bounding the Probability of High-Loss Predictions

    Authors: Jake C. Snell, Thomas P. Zollo, Zhun Deng, Toniann Pitassi, Richard Zemel

    Abstract: Rigorous guarantees about the performance of predictive algorithms are necessary in order to ensure their responsible use. Previous work has largely focused on bounding the expected loss of a predictor, but this is not sufficient in many risk-sensitive applications where the distribution of errors is important. In this work, we propose a flexible framework to produce a family of bounds on quantile…

    Submitted 27 December, 2022; originally announced December 2022.

    Comments: 24 pages, 4 figures. Code is available at https://github.com/jakesnell/quantile-risk-control