Skip to main content

Showing 1–5 of 5 results for author: Rambachan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03689  [pdf, other

    cs.CL cs.AI

    Evaluating the World Model Implicit in a Generative Model

    Authors: Keyon Vafa, Justin Y. Chen, Jon Kleinberg, Sendhil Mullainathan, Ashesh Rambachan

    Abstract: Recent work suggests that large language models may implicitly learn world models. How should we assess this possibility? We formalize this question for the case where the underlying reality is governed by a deterministic finite automaton. This includes problems as diverse as simple logical reasoning, geographic navigation, game-playing, and chemistry. We propose new evaluation metrics for world m… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2406.01382  [pdf, other

    cs.CL cs.AI

    Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization Function

    Authors: Keyon Vafa, Ashesh Rambachan, Sendhil Mullainathan

    Abstract: What makes large language models (LLMs) impressive is also what makes them hard to evaluate: their diversity of uses. To evaluate these models, we must understand the purposes they will be used for. We consider a setting where these deployment decisions are made by people, and in particular, people's beliefs about where an LLM will perform well. We model such beliefs as the consequence of a human… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: To appear in ICML 2024

  3. arXiv:2212.09844  [pdf, other

    econ.EM cs.CY cs.LG stat.ME

    Robust Design and Evaluation of Predictive Algorithms under Unobserved Confounding

    Authors: Ashesh Rambachan, Amanda Coston, Edward Kennedy

    Abstract: Predictive algorithms inform consequential decisions in settings where the outcome is selectively observed given choices made by human decision makers. We propose a unified framework for the robust design and evaluation of predictive algorithms in selectively observed data. We impose general assumptions on how much the outcome may vary on average between unselected and selected units conditional o… ▽ More

    Submitted 19 May, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

  4. arXiv:2101.00352  [pdf, other

    cs.LG stat.ML

    Characterizing Fairness Over the Set of Good Models Under Selective Labels

    Authors: Amanda Coston, Ashesh Rambachan, Alexandra Chouldechova

    Abstract: Algorithmic risk assessments are used to inform decisions in a wide variety of high-stakes settings. Often multiple predictive models deliver similar overall performance but differ markedly in their predictions for individual cases, an empirical phenomenon known as the "Rashomon Effect." These models may have different properties over various groups, and therefore have different predictive fairnes… ▽ More

    Submitted 30 April, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: Added comparison methods to the empirical lending analysis

  5. Bias In, Bias Out? Evaluating the Folk Wisdom

    Authors: Ashesh Rambachan, Jonathan Roth

    Abstract: We evaluate the folk wisdom that algorithmic decision rules trained on data produced by biased human decision-makers necessarily reflect this bias. We consider a setting where training labels are only generated if a biased decision-maker takes a particular action, and so "biased" training data arise due to discriminatory selection into the training data. In our baseline model, the more biased the… ▽ More

    Submitted 19 December, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

    Journal ref: 1st Symposium on Foundations of Responsible Computing (FORC 2020)