
Showing 1–15 of 15 results for author: Kalai, A T

Searching in archive stat.
  1. arXiv:2310.02304  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation

    Authors: Eric Zelikman, Eliana Lorch, Lester Mackey, Adam Tauman Kalai

    Abstract: Several recent advances in AI systems (e.g., Tree-of-Thoughts and Program-Aided Language Models) solve problems by providing a "scaffolding" program that structures multiple calls to language models to generate better outputs. A scaffolding program is written in a programming language such as Python. In this work, we use a language-model-infused scaffolding program to improve itself. We start with…

    Submitted 1 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

  2. arXiv:2304.09424  [pdf, other]

    cs.LG cs.AI stat.ML

    Loss Minimization Yields Multicalibration for Large Neural Networks

    Authors: Jarosław Błasiok, Parikshit Gopalan, Lunjia Hu, Adam Tauman Kalai, Preetum Nakkiran

    Abstract: Multicalibration is a notion of fairness for predictors that requires them to provide calibrated predictions across a large set of protected groups. Multicalibration is known to be a distinct goal from loss minimization, even for simple predictors such as linear functions. In this work, we consider the setting where the protected groups can be represented by neural networks of size $k$, and the…

    Submitted 7 December, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: In ITCS 2024

  3. arXiv:2209.00735  [pdf, other]

    cs.LG stat.ML

    Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms

    Authors: Surbhi Goel, Sham Kakade, Adam Tauman Kalai, Cyril Zhang

    Abstract: Neural networks (NNs) struggle to efficiently solve certain problems, such as learning parities, even when there are simple learning algorithms for those problems. Can NNs discover learning algorithms on their own? We exhibit a NN architecture that, in polynomial time, learns as well as any efficient learning algorithm describable by a constant-sized program. For example, on parity problems, the N…

    Submitted 15 January, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: v2: final camera-ready revisions for NeurIPS 2022

  4. arXiv:2205.09838  [pdf, ps, other]

    cs.LG stat.ML

    Why GANs are overkill for NLP

    Authors: David Alvarez-Melis, Vikas Garg, Adam Tauman Kalai

    Abstract: This work offers a novel theoretical perspective on why, despite numerous attempts, adversarial approaches to generative modeling (e.g., GANs) have not been as popular for certain generation tasks, particularly sequential tasks such as Natural Language Generation, as they have in others, such as Computer Vision. In particular, on sequential data such as text, maximum-likelihood approaches are sign…

    Submitted 19 May, 2022; originally announced May 2022.

  5. arXiv:2109.05389  [pdf, other]

    cs.LG stat.ML

    Omnipredictors

    Authors: Parikshit Gopalan, Adam Tauman Kalai, Omer Reingold, Vatsal Sharan, Udi Wieder

    Abstract: Loss minimization is a dominant paradigm in machine learning, where a predictor is trained to minimize some loss function that depends on an uncertain event (e.g., "will it rain tomorrow?"). Different loss functions imply different learning algorithms and, at times, very different predictors. While widespread and appealing, a clear drawback of this approach is that the loss function may not be kn…

    Submitted 11 September, 2021; originally announced September 2021.

    Comments: 35 pages, 1 figure

  6. arXiv:2105.14119  [pdf, other]

    cs.LG cs.AI cs.DS stat.ML

    Towards optimally abstaining from prediction with OOD test examples

    Authors: Adam Tauman Kalai, Varun Kanade

    Abstract: A common challenge across all areas of machine learning is that training data is not distributed like test data, due to natural shifts, "blind spots," or adversarial examples; such test examples are referred to as out-of-distribution (OOD) test examples. We consider a model where one may abstain from predicting, at a fixed cost. In particular, our transductive abstention algorithm takes labeled tr…

    Submitted 27 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: In NeurIPS 2021 (+spotlight), 24 pages

  7. arXiv:2007.05145  [pdf, other]

    cs.LG stat.ML

    Beyond Perturbations: Learning Guarantees with Arbitrary Adversarial Test Examples

    Authors: Shafi Goldwasser, Adam Tauman Kalai, Yael Tauman Kalai, Omar Montasser

    Abstract: We present a transductive learning algorithm that takes as input training examples from a distribution $P$ and arbitrary (unlabeled) test examples, possibly chosen by an adversary. This is unlike prior work that assumes that test examples are small perturbations of $P$. Our algorithm outputs a selective classifier, which abstains from predicting on some examples. By considering selective transduct…

    Submitted 30 September, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: To appear in NeurIPS 2020

  8. arXiv:1904.11875  [pdf, other]

    cs.LG stat.ML

    Learning to Prune: Speeding up Repeated Computations

    Authors: Daniel Alabi, Adam Tauman Kalai, Katrina Ligett, Cameron Musco, Christos Tzamos, Ellen Vitercik

    Abstract: It is common to encounter situations where one must solve a sequence of similar computational problems. Running a standard algorithm with worst-case runtime guarantees on each instance will fail to take advantage of valuable structure shared across the problem instances. For example, when a commuter drives from work to home, there are typically only a handful of routes that will ever be the shorte…

    Submitted 26 April, 2019; originally announced April 2019.

  9. arXiv:1904.05233  [pdf, other]

    cs.LG cs.CL stat.ML

    What's in a Name? Reducing Bias in Bios without Access to Protected Attributes

    Authors: Alexey Romanov, Maria De-Arteaga, Hanna Wallach, Jennifer Chayes, Christian Borgs, Alexandra Chouldechova, Sahin Geyik, Krishnaram Kenthapadi, Anna Rumshisky, Adam Tauman Kalai

    Abstract: There is a growing body of work that proposes methods for mitigating bias in machine learning systems. These methods typically rely on access to protected attributes such as race, gender, or age. However, this raises two significant challenges: (1) protected attributes may not be available or it may not be legal to use them, and (2) it is often desirable to simultaneously consider multiple protect…

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: Accepted at NAACL 2019; Best Thematic Paper

  10. arXiv:1902.02783  [pdf, other]

    cs.CL cs.LG stat.ML

    Humor in Word Embeddings: Cockamamie Gobbledegook for Nincompoops

    Authors: Limor Gultchin, Genevieve Patterson, Nancy Baym, Nathaniel Swinger, Adam Tauman Kalai

    Abstract: While humor is often thought to be beyond the reach of Natural Language Processing, we show that several aspects of single-word humor correlate with simple linear directions in Word Embeddings. In particular: (a) the word vectors capture multiple aspects discussed in humor theories from various disciplines; (b) each individual's sense of humor can be represented by a vector, which can predict diff…

    Submitted 24 May, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

  11. arXiv:1901.09451  [pdf, other]

    cs.IR cs.LG stat.ML

    Bias in Bios: A Case Study of Semantic Representation Bias in a High-Stakes Setting

    Authors: Maria De-Arteaga, Alexey Romanov, Hanna Wallach, Jennifer Chayes, Christian Borgs, Alexandra Chouldechova, Sahin Geyik, Krishnaram Kenthapadi, Adam Tauman Kalai

    Abstract: We present a large-scale study of gender bias in occupation classification, a task where the use of machine learning may lead to negative outcomes on people's lives. We analyze the potential allocation harms that can result from semantic representation bias. To do so, we study the impact on occupation classification of including explicit gender indicators---such as first names and pronouns---in di…

    Submitted 27 January, 2019; originally announced January 2019.

    Comments: Accepted at ACM Conference on Fairness, Accountability, and Transparency (ACM FAT*), 2019

  12. arXiv:1804.04503  [pdf, other]

    cs.LG cs.DS stat.ML

    Unleashing Linear Optimizers for Group-Fair Learning and Optimization

    Authors: Daniel Alabi, Nicole Immorlica, Adam Tauman Kalai

    Abstract: Most systems and learning algorithms optimize average performance or average loss -- one reason being computational complexity. However, many objectives of practical interest are more complex than simply average loss. This arises, for example, when balancing performance or loss with fairness across people. We prove that, from a computational perspective, optimizing arbitrary objectives that take i…

    Submitted 4 June, 2018; v1 submitted 10 April, 2018; originally announced April 2018.

    Comments: Accepted for presentation at the Conference on Learning Theory (COLT) 2018

  13. arXiv:1709.08669  [pdf, other]

    cs.LG cs.AI stat.ML

    Glass-Box Program Synthesis: A Machine Learning Approach

    Authors: Konstantina Christakopoulou, Adam Tauman Kalai

    Abstract: Recently proposed models which learn to write computer programs from data use either input/output examples or rich execution traces. Instead, we argue that a novel alternative is to use a glass-box loss function, given as a program itself that can be directly inspected. Glass-box optimization covers a wide range of problems, from computing the greatest common divisor of two integers, to learning-t…

    Submitted 25 September, 2017; originally announced September 2017.

  14. arXiv:1504.00064  [pdf, other]

    stat.ML cs.LG

    Crowdsourcing Feature Discovery via Adaptively Chosen Comparisons

    Authors: James Y. Zou, Kamalika Chaudhuri, Adam Tauman Kalai

    Abstract: We introduce an unsupervised approach to efficiently discover the underlying features in a data set via crowdsourcing. Our queries ask crowd members to articulate a feature common to two out of three displayed examples. In addition we also ask the crowd to provide binary labels to the remaining examples based on the discovered features. The triples are chosen adaptively based on the labels of the…

    Submitted 31 March, 2015; originally announced April 2015.

  15. arXiv:1104.2018  [pdf, other]

    cs.AI cs.LG stat.ML

    Efficient Learning of Generalized Linear and Single Index Models with Isotonic Regression

    Authors: Sham Kakade, Adam Tauman Kalai, Varun Kanade, Ohad Shamir

    Abstract: Generalized Linear Models (GLMs) and Single Index Models (SIMs) provide powerful generalizations of linear regression, where the target variable is assumed to be a (possibly unknown) 1-dimensional function of a linear predictor. In general, these problems entail non-convex estimation procedures, and, in practice, iterative local search heuristics are often used. Kalai and Sastry (2009) recently pr…

    Submitted 11 April, 2011; originally announced April 2011.