Skip to main content

Showing 1–5 of 5 results for author: Dennis, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02348  [pdf, other

    cs.LG

    Revisiting Cascaded Ensembles for Efficient Inference

    Authors: Steven Kolawole, Don Dennis, Ameet Talwalkar, Virginia Smith

    Abstract: A common approach to make machine learning inference more efficient is to use example-specific adaptive schemes, which route or select models for each example at inference time. In this work we study a simple scheme for adaptive inference. We build a cascade of ensembles (CoE), beginning with resource-efficient models and growing to larger, more expressive models, where ensemble agreement serves a… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: ES-FOMO, ICML 2024

  2. arXiv:2302.10093  [pdf, other

    cs.LG

    Progressive Ensemble Distillation: Building Ensembles for Efficient Inference

    Authors: Don Kurian Dennis, Abhishek Shetty, Anish Sevekari, Kazuhito Koishida, Virginia Smith

    Abstract: We study the problem of progressive ensemble distillation: Given a large, pretrained teacher model $g$, we seek to decompose the model into smaller, low-inference cost student models $f_i$, such that progressively evaluating additional models in this ensemble leads to improved predictions. The resulting ensemble allows for flexibly tuning accuracy vs. inference cost at runtime, which is useful for… ▽ More

    Submitted 9 November, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

  3. arXiv:2302.02931  [pdf, other

    cs.LG

    Bitrate-Constrained DRO: Beyond Worst Case Robustness To Unknown Group Shifts

    Authors: Amrith Setlur, Don Dennis, Benjamin Eysenbach, Aditi Raghunathan, Chelsea Finn, Virginia Smith, Sergey Levine

    Abstract: Training machine learning models robust to distribution shifts is critical for real-world applications. Some robust training algorithms (e.g., Group DRO) specialize to group shifts and require group information on all training points. Other methods (e.g., CVaR DRO) that do not need group annotations can be overly conservative, since they naively upweight high loss points which may form a contrived… ▽ More

    Submitted 11 October, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Journal ref: ICLR 2023

  4. arXiv:2211.00241  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Adversarial Policies Beat Superhuman Go AIs

    Authors: Tony T. Wang, Adam Gleave, Tom Tseng, Kellin Pelrine, Nora Belrose, Joseph Miller, Michael D. Dennis, Yawen Duan, Viktor Pogrebniak, Sergey Levine, Stuart Russell

    Abstract: We attack the state-of-the-art Go-playing AI system KataGo by training adversarial policies against it, achieving a >97% win rate against KataGo running at superhuman settings. Our adversaries do not win by playing Go well. Instead, they trick KataGo into making serious blunders. Our attack transfers zero-shot to other superhuman Go-playing AIs, and is comprehensible to the extent that human exper… ▽ More

    Submitted 13 July, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

    Comments: Accepted to ICML 2023, see paper for changelog

    ACM Class: I.2.6

  5. arXiv:2103.00697  [pdf, other

    cs.LG

    Heterogeneity for the Win: One-Shot Federated Clustering

    Authors: Don Kurian Dennis, Tian Li, Virginia Smith

    Abstract: In this work, we explore the unique challenges -- and opportunities -- of unsupervised federated learning (FL). We develop and analyze a one-shot federated clustering scheme, $k$-FED, based on the widely-used Lloyd's method for $k$-means clustering. In contrast to many supervised problems, we show that the issue of statistical heterogeneity in federated networks can in fact benefit our analysis. W… ▽ More

    Submitted 5 October, 2021; v1 submitted 28 February, 2021; originally announced March 2021.