Skip to main content

Showing 1–4 of 4 results for author: Cherian, J J

.
  1. arXiv:2406.09714  [pdf, other

    stat.ML cs.LG stat.ME

    Large language model validity via enhanced conformal prediction methods

    Authors: John J. Cherian, Isaac Gibbs, Emmanuel J. Candès

    Abstract: We develop new conformal inference methods for obtaining validity guarantees on the output of large language models (LLMs). Prior work in conformal language modeling identifies a subset of the text that satisfies a high-probability guarantee of correctness. These methods work by filtering claims from the LLM's original response if a scoring function evaluated on the claim fails to exceed a thresho… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 20 pages, 8 figures

  2. arXiv:2305.12616  [pdf, other

    stat.ME

    Conformal Prediction With Conditional Guarantees

    Authors: Isaac Gibbs, John J. Cherian, Emmanuel J. Candès

    Abstract: We consider the problem of constructing distribution-free prediction sets with finite-sample conditional guarantees. Prior work has shown that it is impossible to provide exact conditional coverage universally in finite samples. Thus, most popular methods only provide marginal coverage over the covariates. This paper bridges this gap by defining a spectrum of problems that interpolate between marg… ▽ More

    Submitted 20 December, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: 46 pages, 11 figures

  3. arXiv:2305.03712  [pdf, other

    stat.ME cs.CY cs.LG

    Statistical Inference for Fairness Auditing

    Authors: John J. Cherian, Emmanuel J. Candès

    Abstract: Before deploying a black-box model in high-stakes problems, it is important to evaluate the model's performance on sensitive subpopulations. For example, in a recidivism prediction task, we may wish to identify demographic groups for which our prediction model has unacceptably high false positive rates or certify that no such groups exist. In this paper, we frame this task, often referred to as "f… ▽ More

    Submitted 8 June, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: 44 pages, 8 figures

  4. arXiv:2008.06431  [pdf, other

    stat.ML cs.LG

    Efficient hyperparameter optimization by way of PAC-Bayes bound minimization

    Authors: John J. Cherian, Andrew G. Taube, Robert T. McGibbon, Panagiotis Angelikopoulos, Guy Blanc, Michael Snarski, Daniel D. Richman, John L. Klepeis, David E. Shaw

    Abstract: Identifying optimal values for a high-dimensional set of hyperparameters is a problem that has received growing attention given its importance to large-scale machine learning applications such as neural architecture search. Recently developed optimization methods can be used to select thousands or even millions of hyperparameters. Such methods often yield overfit models, however, leading to poor p… ▽ More

    Submitted 14 August, 2020; originally announced August 2020.