Skip to main content

Showing 1–50 of 71 results for author: Talwalkar, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02348  [pdf, other

    cs.LG

    Revisiting Cascaded Ensembles for Efficient Inference

    Authors: Steven Kolawole, Don Dennis, Ameet Talwalkar, Virginia Smith

    Abstract: A common approach to make machine learning inference more efficient is to use example-specific adaptive schemes, which route or select models for each example at inference time. In this work we study a simple scheme for adaptive inference. We build a cascade of ensembles (CoE), beginning with resource-efficient models and growing to larger, more expressive models, where ensemble agreement serves a… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: ES-FOMO, ICML 2024

  2. arXiv:2404.02806  [pdf, other

    cs.SE cs.AI cs.HC

    The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers

    Authors: Hussein Mozannar, Valerie Chen, Mohammed Alsobay, Subhro Das, Sebastian Zhao, Dennis Wei, Manish Nagireddy, Prasanna Sattigeri, Ameet Talwalkar, David Sontag

    Abstract: Evaluation of large language models (LLMs) for code has primarily relied on static benchmarks, including HumanEval (Chen et al., 2021), which measure the ability of LLMs to generate complete code that passes unit tests. As LLMs are increasingly used as programmer assistants, we study whether gains on existing benchmarks translate to gains in programmer productivity when coding with LLMs, including… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  3. arXiv:2403.07187  [pdf, other

    cs.LG

    UPS: Efficiently Building Foundation Models for PDE Solving via Cross-Modal Adaptation

    Authors: Junhong Shen, Tanya Marwah, Ameet Talwalkar

    Abstract: We present Unified PDE Solvers (UPS), a data- and compute-efficient approach to develo** unified neural operators for diverse families of spatiotemporal PDEs from various domains, dimensions, and resolutions. UPS embeds different PDEs into a shared representation space and processes them using a FNO-transformer architecture. Rather than training the network from scratch, which is data-demanding… ▽ More

    Submitted 23 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  4. arXiv:2402.05406  [pdf, other

    cs.LG cs.CL

    Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes

    Authors: Lucio Dery, Steven Kolawole, Jean-François Kagy, Virginia Smith, Graham Neubig, Ameet Talwalkar

    Abstract: Given the generational gap in available hardware between lay practitioners and the most endowed institutions, LLMs are becoming increasingly inaccessible as they grow in size. Whilst many approaches have been proposed to compress LLMs to make their resource consumption manageable, these methods themselves tend to be resource intensive, putting them out of the reach of the very user groups they tar… ▽ More

    Submitted 9 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 15 pages, 4 fiigures, 15 tables

  5. arXiv:2312.03151  [pdf, other

    cs.LG

    Multitask Learning Can Improve Worst-Group Outcomes

    Authors: Atharva Kulkarni, Lucio Dery, Amrith Setlur, Aditi Raghunathan, Ameet Talwalkar, Graham Neubig

    Abstract: In order to create machine learning systems that serve a variety of users well, it is vital to not only achieve high average performance but also ensure equitable outcomes across diverse groups. However, most machine learning methods are designed to improve a model's average performance on a chosen end task without consideration for their impact on worst group error. Multitask learning (MTL) is on… ▽ More

    Submitted 28 February, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 20 pages, 7 tables, 6 Figures

  6. arXiv:2311.04076  [pdf, other

    cs.CL

    Do LLMs exhibit human-like response biases? A case study in survey design

    Authors: Lindia Tjuatja, Valerie Chen, Sherry Tongshuang Wu, Ameet Talwalkar, Graham Neubig

    Abstract: As large language models (LLMs) become more capable, there is growing excitement about the possibility of using LLMs as proxies for humans in real-world tasks where subjective labels are desired, such as in surveys and opinion polling. One widely-cited barrier to the adoption of LLMs as proxies for humans in subjective tasks is their sensitivity to prompt wording - but interestingly, humans also d… ▽ More

    Submitted 5 February, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

  7. arXiv:2310.02246  [pdf, other

    cs.LG cs.AI math.NA stat.ML

    Learning to Relax: Setting Solver Parameters Across a Sequence of Linear System Instances

    Authors: Mikhail Khodak, Edmond Chow, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: Solving a linear system $Ax=b$ is a fundamental scientific computing primitive for which numerous solvers and preconditioners have been developed. These come with parameters whose optimal values depend on the system being solved and are often impossible or too expensive to identify; thus in practice sub-optimal heuristics are used. We consider the common setting in which many related linear system… ▽ More

    Submitted 2 May, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Spotlight

  8. arXiv:2307.15475  [pdf, other

    cs.HC cs.AI cs.LG

    FeedbackLogs: Recording and Incorporating Stakeholder Feedback into Machine Learning Pipelines

    Authors: Matthew Barker, Emma Kallina, Dhananjay Ashok, Katherine M. Collins, Ashley Casovan, Adrian Weller, Ameet Talwalkar, Valerie Chen, Umang Bhatt

    Abstract: Even though machine learning (ML) pipelines affect an increasing array of stakeholders, there is little work on how input from stakeholders is recorded and incorporated. We propose FeedbackLogs, addenda to existing documentation of ML pipelines, to track the input of multiple stakeholders. Each log records important details about the feedback collection process, the feedback itself, and how the fe… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  9. Where Does My Model Underperform? A Human Evaluation of Slice Discovery Algorithms

    Authors: Nari Johnson, Ángel Alexander Cabrera, Gregory Plumb, Ameet Talwalkar

    Abstract: Machine learning (ML) models that achieve high average accuracy can still underperform on semantically coherent subsets ("slices") of data. This behavior can have significant societal consequences for the safety or bias of the model in deployment, but identifying these underperforming slices can be difficult in practice, especially in domains where practitioners lack access to group annotations to… ▽ More

    Submitted 9 February, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 11(1), 65-76. Best Paper Award

  10. arXiv:2304.06701  [pdf, other

    cs.LG cs.AI cs.CY cs.HC

    Learning Personalized Decision Support Policies

    Authors: Umang Bhatt, Valerie Chen, Katherine M. Collins, Parameswaran Kamalaruban, Emma Kallina, Adrian Weller, Ameet Talwalkar

    Abstract: Individual human decision-makers may benefit from different forms of support to improve decision outcomes, but when each form of support will yield better outcomes? In this work, we posit that personalizing access to decision support tools can be an effective mechanism for instantiating the appropriate use of AI assistance. Specifically, we propose the general problem of learning a decision suppor… ▽ More

    Submitted 27 May, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: 29 pages, 12 figures

  11. arXiv:2302.08450  [pdf, other

    cs.LG cs.HC

    Assisting Human Decisions in Document Matching

    Authors: Joon Sik Kim, Valerie Chen, Danish Pruthi, Nihar B. Shah, Ameet Talwalkar

    Abstract: Many practical applications, ranging from paper-reviewer assignment in peer review to job-applicant matching for hiring, require human decision makers to identify relevant matches by combining their expertise with predictions from machine learning models. In many such model-assisted document matching tasks, the decision makers have stressed the need for assistive information about the model output… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  12. arXiv:2302.05738  [pdf, other

    cs.LG

    Cross-Modal Fine-Tuning: Align then Refine

    Authors: Junhong Shen, Liam Li, Lucio M. Dery, Corey Staten, Mikhail Khodak, Graham Neubig, Ameet Talwalkar

    Abstract: Fine-tuning large-scale pretrained models has led to tremendous progress in well-studied modalities such as vision and NLP. However, similar gains have not been observed in many other modalities due to a lack of relevant pretrained models. In this work, we propose ORCA, a general cross-modal fine-tuning framework that extends the applicability of a single large-scale pretrained model to diverse mo… ▽ More

    Submitted 18 March, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

  13. Zeno: An Interactive Framework for Behavioral Evaluation of Machine Learning

    Authors: Ángel Alexander Cabrera, Erica Fu, Donald Bertucci, Kenneth Holstein, Ameet Talwalkar, Jason I. Hong, Adam Perer

    Abstract: Machine learning models with high accuracy on test data can still produce systematic failures, such as harmful biases and safety issues, when deployed in the real world. To detect and mitigate such failures, practitioners run behavioral evaluation of their models, checking model outputs for specific types of inputs. Behavioral evaluation is important but challenging, requiring that practitioners d… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  14. arXiv:2212.08930  [pdf, other

    cs.LG

    On Noisy Evaluation in Federated Hyperparameter Tuning

    Authors: Kevin Kuo, Pratiksha Thaker, Mikhail Khodak, John Nguyen, Daniel Jiang, Ameet Talwalkar, Virginia Smith

    Abstract: Hyperparameter tuning is critical to the success of federated learning applications. Unfortunately, appropriately selecting hyperparameters is challenging in federated networks. Issues of scale, privacy, and heterogeneity introduce noise in the tuning process and make it difficult to evaluate the performance of various hyperparameters. In this work, we perform the first systematic study on the eff… ▽ More

    Submitted 15 May, 2023; v1 submitted 17 December, 2022; originally announced December 2022.

    Comments: v1: 19 pages, 15 figures, submitted to MLSys2023; v2: Fixed citation formatting; v3: Fixed typo, update acks v4: MLSys2023 camera-ready

  15. arXiv:2210.03324  [pdf, other

    cs.LG cs.AI stat.ML

    AutoML for Climate Change: A Call to Action

    Authors: Renbo Tu, Nicholas Roberts, Vishak Prasad, Sibasis Nayak, Paarth Jain, Frederic Sala, Ganesh Ramakrishnan, Ameet Talwalkar, Willie Neiswanger, Colin White

    Abstract: The challenge that climate change poses to humanity has spurred a rapidly develo** field of artificial intelligence research focused on climate change applications. The climate change AI (CCAI) community works on a diverse, challenging set of problems which often involve physics-constrained ML or heterogeneous spatiotemporal data. It would be desirable to use automated machine learning (AutoML)… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  16. arXiv:2208.12218  [pdf, other

    cs.LG

    SONAR: Joint Architecture and System Optimization Search

    Authors: Elias Jääsaari, Michelle Ma, Ameet Talwalkar, Tianqi Chen

    Abstract: There is a growing need to deploy machine learning for different tasks on a wide array of new hardware platforms. Such deployment scenarios require tackling multiple challenges, including identifying a model architecture that can achieve a suitable predictive accuracy (architecture search), and finding an efficient implementation of the model to satisfy underlying hardware-specific systems constra… ▽ More

    Submitted 25 August, 2022; originally announced August 2022.

  17. arXiv:2207.10199  [pdf, other

    cs.LG stat.ML

    Provably tuning the ElasticNet across instances

    Authors: Maria-Florina Balcan, Mikhail Khodak, Dravyansh Sharma, Ameet Talwalkar

    Abstract: An important unresolved challenge in the theory of regularization is to set the regularization coefficients of popular techniques like the ElasticNet with general provable guarantees. We consider the problem of tuning the regularization parameters of Ridge regression, LASSO, and the ElasticNet across multiple problem instances, a setting that encompasses both cross-validation and multi-task hyperp… ▽ More

    Submitted 15 January, 2024; v1 submitted 20 July, 2022; originally announced July 2022.

  18. arXiv:2207.04104  [pdf, other

    cs.LG cs.CV

    Towards a More Rigorous Science of Blindspot Discovery in Image Classification Models

    Authors: Gregory Plumb, Nari Johnson, Ángel Alexander Cabrera, Ameet Talwalkar

    Abstract: A growing body of work studies Blindspot Discovery Methods ("BDM"s): methods that use an image embedding to find semantically meaningful (i.e., united by a human-understandable concept) subsets of the data where an image classifier performs significantly worse. Motivated by observed gaps in prior work, we introduce a new framework for evaluating BDMs, SpotCheck, that uses synthetic image datasets… ▽ More

    Submitted 11 July, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: reviewed on OpenReview: https://openreview.net/forum?id=MaDvbLaBiF

    Journal ref: TMLR 2023

  19. arXiv:2206.13503  [pdf, other

    cs.LG cs.HC

    On the Importance of Application-Grounded Experimental Design for Evaluating Explainable ML Methods

    Authors: Kasun Amarasinghe, Kit T. Rodolfa, Sérgio Jesus, Valerie Chen, Vladimir Balayan, Pedro Saleiro, Pedro Bizarro, Ameet Talwalkar, Rayid Ghani

    Abstract: Most existing evaluations of explainable machine learning (ML) methods rely on simplifying assumptions or proxies that do not reflect real-world use cases; the handful of more robust evaluations on real-world settings have shortcomings in their design, resulting in limited conclusions of methods' real-world utility. In this work, we seek to bridge this gap by conducting a study that evaluates thre… ▽ More

    Submitted 21 February, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

  20. arXiv:2206.02256  [pdf, other

    cs.HC cs.AI cs.LG

    Use-Case-Grounded Simulations for Explanation Evaluation

    Authors: Valerie Chen, Nari Johnson, Nicholay Topin, Gregory Plumb, Ameet Talwalkar

    Abstract: A growing body of research runs human subject evaluations to study whether providing users with explanations of machine learning models can help them with practical real-world use cases. However, running user studies is challenging and costly, and consequently each study typically only evaluates a limited number of different settings, e.g., studies often only evaluate a few arbitrarily selected ex… ▽ More

    Submitted 20 August, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

  21. arXiv:2205.14082  [pdf, other

    cs.LG cs.AI

    AANG: Automating Auxiliary Learning

    Authors: Lucio M. Dery, Paul Michel, Mikhail Khodak, Graham Neubig, Ameet Talwalkar

    Abstract: Auxiliary objectives, supplementary learning signals that are introduced to help aid learning on data-starved or highly complex end-tasks, are commonplace in machine learning. Whilst much work has been done to formulate useful auxiliary objectives, their construction is still an art which proceeds by slow and tedious hand-design. Intuition for how and when these objectives improve end-task perform… ▽ More

    Submitted 27 February, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to ICLR 2023 22 pages, 7 tables and 5 figures

  22. arXiv:2205.06905  [pdf, other

    cs.LG

    Perspectives on Incorporating Expert Feedback into Model Updates

    Authors: Valerie Chen, Umang Bhatt, Hoda Heidari, Adrian Weller, Ameet Talwalkar

    Abstract: Machine learning (ML) practitioners are increasingly tasked with develo** models that are aligned with non-technical experts' values and goals. However, there has been insufficient consideration on how practitioners should translate domain expertise into ML updates. In this paper, we consider how to capture interactions between practitioners and experts systematically. We devise a taxonomy to ma… ▽ More

    Submitted 16 July, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

  23. arXiv:2204.07554  [pdf, other

    cs.LG cs.AI

    Efficient Architecture Search for Diverse Tasks

    Authors: Junhong Shen, Mikhail Khodak, Ameet Talwalkar

    Abstract: While neural architecture search (NAS) has enabled automated machine learning (AutoML) for well-researched areas, its application to tasks beyond computer vision is still under-explored. As less-studied domains are precisely those where we expect AutoML to have the greatest impact, in this work we study NAS for efficiently solving diverse problems. Seeking an approach that is fast, simple, and bro… ▽ More

    Submitted 9 October, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: NeurIPS 2022 Camera-Ready; code available at https://github.com/sjunhongshen/DASH

  24. arXiv:2202.09312  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Learning Predictions for Algorithms with Predictions

    Authors: Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar, Sergei Vassilvitskii

    Abstract: A burgeoning paradigm in algorithm design is the field of algorithms with predictions, in which algorithms can take advantage of a possibly-imperfect prediction of some aspect of the problem. While much work has focused on using predictions to improve competitive ratios, running times, or other performance measures, less effort has been devoted to the question of how to obtain the predictions them… ▽ More

    Submitted 17 October, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022 camera-ready

  25. arXiv:2112.06283  [pdf, other

    cs.GT cs.LG

    Bayesian Persuasion for Algorithmic Recourse

    Authors: Keegan Harris, Valerie Chen, Joon Sik Kim, Ameet Talwalkar, Hoda Heidari, Zhiwei Steven Wu

    Abstract: When subjected to automated decision-making, decision subjects may strategically modify their observable features in ways they believe will maximize their chances of receiving a favorable decision. In many practical situations, the underlying assessment rule is deliberately kept secret to avoid gaming and maintain competitive advantage. The resulting opacity forces the decision subjects to rely on… ▽ More

    Submitted 7 October, 2022; v1 submitted 12 December, 2021; originally announced December 2021.

    Comments: In the thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  26. arXiv:2110.05668  [pdf, other

    cs.CV cs.LG

    NAS-Bench-360: Benchmarking Neural Architecture Search on Diverse Tasks

    Authors: Renbo Tu, Nicholas Roberts, Mikhail Khodak, Junhong Shen, Frederic Sala, Ameet Talwalkar

    Abstract: Most existing neural architecture search (NAS) benchmarks and algorithms prioritize well-studied tasks, e.g. image classification on CIFAR or ImageNet. This makes the performance of NAS approaches in more diverse areas poorly understood. In this paper, we present NAS-Bench-360, a benchmark suite to evaluate methods on domains beyond those traditionally studied in architecture search, and use it to… ▽ More

    Submitted 19 January, 2023; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2022 Datasets and Benchmarks Track

  27. arXiv:2109.07437  [pdf, other

    cs.LG cs.CL

    Should We Be Pre-training? An Argument for End-task Aware Training as an Alternative

    Authors: Lucio M. Dery, Paul Michel, Ameet Talwalkar, Graham Neubig

    Abstract: In most settings of practical concern, machine learning practitioners know in advance what end-task they wish to boost with auxiliary tasks. However, widely used methods for leveraging auxiliary data like pre-training and its continued-pretraining variant are end-task agnostic: they rarely, if ever, exploit knowledge of the target task. We study replacing end-task agnostic continued training of pr… ▽ More

    Submitted 6 February, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: 18 pages, 4 figures

  28. arXiv:2108.08770  [pdf, other

    cs.LG

    Learning-to-learn non-convex piecewise-Lipschitz functions

    Authors: Maria-Florina Balcan, Mikhail Khodak, Dravyansh Sharma, Ameet Talwalkar

    Abstract: We analyze the meta-learning of the initialization and step-size of learning algorithms for piecewise-Lipschitz functions, a non-convex setting with applications to both machine learning and algorithms. Starting from recent regret bounds for the exponential forecaster on losses with dispersed discontinuities, we generalize them to be initialization-dependent and then use this result to propose a p… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

  29. arXiv:2107.06917  [pdf, other

    cs.LG

    A Field Guide to Federated Optimization

    Authors: Jianyu Wang, Zachary Charles, Zheng Xu, Gauri Joshi, H. Brendan McMahan, Blaise Aguera y Arcas, Maruan Al-Shedivat, Galen Andrew, Salman Avestimehr, Katharine Daly, Deepesh Data, Suhas Diggavi, Hubert Eichner, Advait Gadhikar, Zachary Garrett, Antonious M. Girgis, Filip Hanzely, Andrew Hard, Chaoyang He, Samuel Horvath, Zhouyuan Huo, Alex Ingerman, Martin Jaggi, Tara Javidi, Peter Kairouz , et al. (28 additional authors not shown)

    Abstract: Federated learning and analytics are a distributed approach for collaboratively learning models (or statistics) from decentralized data, motivated by and designed for privacy protection. The distributed learning process can be formulated as solving federated optimization problems, which emphasize communication efficiency, data heterogeneity, compatibility with privacy and system requirements, and… ▽ More

    Submitted 14 July, 2021; originally announced July 2021.

  30. arXiv:2106.04502  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    Federated Hyperparameter Tuning: Challenges, Baselines, and Connections to Weight-Sharing

    Authors: Mikhail Khodak, Renbo Tu, Tian Li, Liam Li, Maria-Florina Balcan, Virginia Smith, Ameet Talwalkar

    Abstract: Tuning hyperparameters is a crucial but arduous part of the machine learning pipeline. Hyperparameter optimization is even more challenging in federated learning, where models are learned over a distributed network of heterogeneous devices; here, the need to keep data on device and perform local training makes it difficult to efficiently train and evaluate configurations. In this work, we investig… ▽ More

    Submitted 4 November, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  31. arXiv:2106.02112  [pdf, other

    cs.LG

    Finding and Fixing Spurious Patterns with Explanations

    Authors: Gregory Plumb, Marco Tulio Ribeiro, Ameet Talwalkar

    Abstract: Image classifiers often use spurious patterns, such as "relying on the presence of a person to detect a tennis racket, which do not generalize. In this work, we present an end-to-end pipeline for identifying and mitigating spurious patterns for such models, under the assumption that we have access to pixel-wise object-annotations. We start by identifying patterns such as "the model's prediction fo… ▽ More

    Submitted 17 August, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

  32. arXiv:2105.06506  [pdf, other

    cs.LG

    Sanity Simulations for Saliency Methods

    Authors: Joon Sik Kim, Gregory Plumb, Ameet Talwalkar

    Abstract: Saliency methods are a popular class of feature attribution explanation methods that aim to capture a model's predictive reasoning by identifying "important" pixels in an input image. However, the development and adoption of these methods are hindered by the lack of access to ground-truth model reasoning, which prevents accurate evaluation. In this work, we design a synthetic benchmarking framewor… ▽ More

    Submitted 16 June, 2022; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: Accepted to International Conference on Machine Learning (ICML 2022)

  33. arXiv:2103.15798  [pdf, other

    cs.LG cs.AI cs.CV math.NA stat.ML

    Rethinking Neural Operations for Diverse Tasks

    Authors: Nicholas Roberts, Mikhail Khodak, Tri Dao, Liam Li, Christopher Ré, Ameet Talwalkar

    Abstract: An important goal of AutoML is to automate-away the design of neural networks on new tasks in under-explored domains. Motivated by this goal, we study the problem of enabling users to discover the right neural operations given data from their specific domain. We introduce a search space of operations called XD-Operations that mimic the inductive bias of standard multi-channel convolutions while be… ▽ More

    Submitted 4 November, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: NeurIPS 2021

  34. arXiv:2103.06254  [pdf, other

    cs.LG

    Interpretable Machine Learning: Moving From Mythos to Diagnostics

    Authors: Valerie Chen, Jeffrey Li, Joon Sik Kim, Gregory Plumb, Ameet Talwalkar

    Abstract: Despite increasing interest in the field of Interpretable Machine Learning (IML), a significant gap persists between the technical objectives targeted by researchers' methods and the high-level goals of consumers' use cases. In this work, we synthesize foundational work on IML methods and evaluation into an actionable taxonomy. This taxonomy serves as a tool to conceptualize the gap between resear… ▽ More

    Submitted 28 July, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: Presented at ICML HILL Workshop 2021

  35. arXiv:2103.00065  [pdf, other

    cs.LG stat.ML

    Gradient Descent on Neural Networks Typically Occurs at the Edge of Stability

    Authors: Jeremy M. Cohen, Simran Kaur, Yuanzhi Li, J. Zico Kolter, Ameet Talwalkar

    Abstract: We empirically demonstrate that full-batch gradient descent on neural network training objectives typically operates in a regime we call the Edge of Stability. In this regime, the maximum eigenvalue of the training loss Hessian hovers just above the numerical value $2 / \text{(step size)}$, and the training loss behaves non-monotonically over short timescales, yet consistently decreases over long… ▽ More

    Submitted 23 November, 2022; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: ICLR 2021. v3 moves several figures from the appendix into the main text, and adds more discussion regarding Jastrzębski et al (2020): https://doi.org/10.48550/arXiv.2002.09572

  36. arXiv:2102.00127  [pdf, other

    cs.LG cs.AI stat.ML

    On Data Efficiency of Meta-learning

    Authors: Maruan Al-Shedivat, Liam Li, Eric Xing, Ameet Talwalkar

    Abstract: Meta-learning has enabled learning statistical models that can be quickly adapted to new prediction tasks. Motivated by use-cases in personalized federated learning, we study the often overlooked aspect of the modern meta-learning algorithms -- their data efficiency. To shed more light on which methods are more efficient, we use techniques from algorithmic stability to derive bounds on the transfe… ▽ More

    Submitted 29 January, 2021; originally announced February 2021.

    Comments: Preliminary version. An updated version is to appear in AISTATS 2021

  37. arXiv:2011.01205  [pdf, other

    cs.LG stat.ML

    A Learning Theoretic Perspective on Local Explainability

    Authors: Jeffrey Li, Vaishnavh Nagarajan, Gregory Plumb, Ameet Talwalkar

    Abstract: In this paper, we explore connections between interpretable machine learning and learning theory through the lens of local approximation explanations. First, we tackle the traditional problem of performance generalization and bound the test-time accuracy of a model using a notion of how locally explainable it is. Second, we explore the novel problem of explanation generalization which is an import… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

  38. arXiv:2004.07802  [pdf, other

    cs.LG cs.CV cs.NE math.OC stat.ML

    Geometry-Aware Gradient Algorithms for Neural Architecture Search

    Authors: Liam Li, Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: Recent state-of-the-art methods for neural architecture search (NAS) exploit gradient-based optimization by relaxing the problem into continuous optimization over architectures and shared-weights, a noisy process that remains poorly understood. We argue for the study of single-level empirical risk minimization to understand NAS with weight-sharing, reducing the design of NAS methods to devising op… ▽ More

    Submitted 18 March, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: ICLR 2021 Camera-Ready

  39. arXiv:2004.03424  [pdf, other

    cs.LG cs.CY stat.ML

    FACT: A Diagnostic for Group Fairness Trade-offs

    Authors: Joon Sik Kim, Jiahao Chen, Ameet Talwalkar

    Abstract: Group fairness, a class of fairness notions that measure how different groups of individuals are treated differently according to their protected attributes, has been shown to conflict with one another, often with a necessary cost in loss of model's predictive performance. We propose a general diagnostic that enables systematic characterization of these trade-offs in group fairness. We observe tha… ▽ More

    Submitted 7 July, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted to International Conference on Machine Learning (ICML 2020)

  40. arXiv:2003.01640  [pdf, other

    cs.LG stat.ML

    Explaining Groups of Points in Low-Dimensional Representations

    Authors: Gregory Plumb, Jonathan Terhorst, Sriram Sankararaman, Ameet Talwalkar

    Abstract: A common workflow in data exploration is to learn a low-dimensional representation of the data, identify groups of points in that representation, and examine the differences between the groups to determine what they represent. We treat this workflow as an interpretable machine learning problem by leveraging the model that learned the low-dimensional representation to help identify the key differen… ▽ More

    Submitted 14 August, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

  41. arXiv:2001.01920  [pdf, other

    cs.LG stat.ML

    FedDANE: A Federated Newton-Type Method

    Authors: Tian Li, Anit Kumar Sahu, Manzil Zaheer, Maziar Sanjabi, Ameet Talwalkar, Virginia Smith

    Abstract: Federated learning aims to jointly learn statistical models over massively distributed remote devices. In this work, we propose FedDANE, an optimization method that we adapt from DANE, a method for classical distributed optimization, to handle the practical constraints of federated learning. We provide convergence guarantees for this method when learning over both convex and non-convex functions.… ▽ More

    Submitted 7 January, 2020; originally announced January 2020.

    Comments: Asilomar Conference on Signals, Systems, and Computers 2019

  42. arXiv:1909.05830  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Differentially Private Meta-Learning

    Authors: Jeffrey Li, Mikhail Khodak, Sebastian Caldas, Ameet Talwalkar

    Abstract: Parameter-transfer is a well-known and versatile approach for meta-learning, with applications including few-shot learning, federated learning, and reinforcement learning. However, parameter-transfer algorithms often require sharing models that have been trained on the samples from specific tasks, thus leaving the task-owners susceptible to breaches of privacy. We conduct the first formal study of… ▽ More

    Submitted 21 February, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

  43. arXiv:1908.07873  [pdf, other

    cs.LG cs.DC stat.ML

    Federated Learning: Challenges, Methods, and Future Directions

    Authors: Tian Li, Anit Kumar Sahu, Ameet Talwalkar, Virginia Smith

    Abstract: Federated learning involves training statistical models over remote devices or siloed data centers, such as mobile phones or hospitals, while kee** data localized. Training in heterogeneous and potentially massive networks introduces novel challenges that require a fundamental departure from standard approaches for large-scale machine learning, distributed optimization, and privacy-preserving da… ▽ More

    Submitted 21 August, 2019; originally announced August 2019.

  44. arXiv:1906.11813  [pdf, ps, other

    cs.LG stat.ML

    Learning Fair Representations for Kernel Models

    Authors: Zilong Tan, Samuel Yeom, Matt Fredrikson, Ameet Talwalkar

    Abstract: Fair representations are a powerful tool for establishing criteria like statistical parity, proxy non-discrimination, and equality of opportunity in learned models. Existing techniques for learning these representations are typically model-agnostic, as they preprocess the original data such that the output satisfies some fairness criterion, and can be used with arbitrary learning methods. In contr… ▽ More

    Submitted 20 January, 2020; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: The 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020)

  45. arXiv:1906.02717  [pdf, other

    cs.LG cs.AI stat.ML

    Adaptive Gradient-Based Meta-Learning Methods

    Authors: Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: We build a theoretical framework for designing and understanding practical meta-learning methods that integrates sophisticated formalizations of task-similarity with the extensive literature on online convex optimization and sequential prediction algorithms. Our approach enables the task-similarity to be learned adaptively, provides sharper transfer-risk bounds in the setting of statistical learni… ▽ More

    Submitted 6 December, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019

  46. arXiv:1906.01431  [pdf, other

    cs.LG stat.ML

    Regularizing Black-box Models for Improved Interpretability (HILL 2019 Version)

    Authors: Gregory Plumb, Maruan Al-Shedivat, Eric Xing, Ameet Talwalkar

    Abstract: Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, which lack guarantees about their explanation quality. We propose an alternative to these approaches by directly regularizing a black-box model for interpretability at training time. Our approac… ▽ More

    Submitted 31 May, 2019; originally announced June 2019.

    Comments: presented at 2019 ICML Workshop on Human in the Loop Learning (HILL 2019), Long Beach, USA. arXiv admin note: substantial text overlap with arXiv:1902.06787

  47. arXiv:1904.03257  [pdf, ps, other

    cs.LG cs.DB cs.DC cs.SE stat.ML

    MLSys: The New Frontier of Machine Learning Systems

    Authors: Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood , et al. (44 additional authors not shown)

    Abstract: Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne… ▽ More

    Submitted 1 December, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

  48. arXiv:1903.05176  [pdf, other

    cs.LG stat.ML

    Exploiting Reuse in Pipeline-Aware Hyperparameter Tuning

    Authors: Liam Li, Evan Sparks, Kevin Jamieson, Ameet Talwalkar

    Abstract: Hyperparameter tuning of multi-stage pipelines introduces a significant computational burden. Motivated by the observation that work can be reused across pipelines if the intermediate computations are the same, we propose a pipeline-aware approach to hyperparameter tuning. Our approach optimizes both the design and execution of pipelines to maximize reuse. We design pipelines amenable for reuse by… ▽ More

    Submitted 12 March, 2019; originally announced March 2019.

  49. arXiv:1902.11175  [pdf, other

    cs.LG stat.ML

    One-Shot Federated Learning

    Authors: Neel Guha, Ameet Talwalkar, Virginia Smith

    Abstract: We present one-shot federated learning, where a central server learns a global model over a network of federated devices in a single round of communication. Our approach - drawing on ensemble learning and knowledge aggregation - achieves an average relative gain of 51.5% in AUC over local baselines and comes within 90.1% of the (unattainable) global ideal. We discuss these methods and identify sev… ▽ More

    Submitted 5 March, 2019; v1 submitted 28 February, 2019; originally announced February 2019.

    Comments: 5 pages, 3 figures, 1 table. 2nd Workshop on Machine Learning on the Phone and other Consumer Devices, NeurIPs 2018

  50. arXiv:1902.10644  [pdf, other

    cs.LG cs.AI stat.ML

    Provable Guarantees for Gradient-Based Meta-Learning

    Authors: Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: We study the problem of meta-learning through the lens of online convex optimization, develo** a meta-algorithm bridging the gap between popular gradient-based meta-learning and classical regularization-based multi-task transfer methods. Our method is the first to simultaneously satisfy good sample efficiency guarantees in the convex setting, with generalization bounds that improve with task-sim… ▽ More

    Submitted 16 May, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: ICML 2019