Skip to main content

Showing 1–17 of 17 results for author: Slack, D

.
  1. arXiv:2405.17450  [pdf, other

    cs.CV cs.LG

    The Power of Next-Frame Prediction for Learning Physical Laws

    Authors: Thomas Winterbottom, G. Thomas Hudson, Daniel Kluvanec, Dean Slack, Jamie Sterling, Junjie Shentu, Chenghao Xiao, Zheming Zhou, Noura Al Moubayed

    Abstract: Next-frame prediction is a useful and powerful method for modelling and understanding the dynamics of video data. Inspired by the empirical success of causal language modelling and next-token prediction in language modelling, we explore the extent to which next-frame prediction serves as a strong foundational learning strategy (analogous to language modelling) for inducing an understanding of the… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 7 Figures, 12 Pages, 1 Table

    MSC Class: 68T45 ACM Class: I.2.6; I.2.10

  2. arXiv:2405.00332  [pdf, other

    cs.CL cs.AI cs.LG

    A Careful Examination of Large Language Model Performance on Grade School Arithmetic

    Authors: Hugh Zhang, Jeff Da, Dean Lee, Vaughn Robinson, Catherine Wu, Will Song, Tiffany Zhao, Pranav Raja, Dylan Slack, Qin Lyu, Sean Hendryx, Russell Kaplan, Michele Lunati, Summer Yue

    Abstract: Large language models (LLMs) have achieved impressive success on many benchmarks for mathematical reasoning. However, there is growing concern that some of this performance actually reflects dataset contamination, where data closely resembling benchmark questions leaks into the training data, instead of true reasoning ability. To investigate this claim rigorously, we commission Grade School Math 1… ▽ More

    Submitted 3 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  3. arXiv:2305.11426  [pdf, other

    cs.CL cs.AI

    Post Hoc Explanations of Language Models Can Improve Language Models

    Authors: Satyapriya Krishna, Jiaqi Ma, Dylan Slack, Asma Ghandeharioun, Sameer Singh, Himabindu Lakkaraju

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities in performing complex tasks. Moreover, recent research has shown that incorporating human-annotated rationales (e.g., Chain-of-Thought prompting) during in-context learning can significantly enhance the performance of these models, particularly on tasks that require reasoning capabilities. However, incorporating such rationales… ▽ More

    Submitted 7 December, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

  4. arXiv:2304.13188  [pdf, other

    cs.LG cs.CL

    TABLET: Learning From Instructions For Tabular Data

    Authors: Dylan Slack, Sameer Singh

    Abstract: Acquiring high-quality data is often a significant challenge in training machine learning (ML) models for tabular prediction, particularly in privacy-sensitive and costly domains like medicine and finance. Providing natural language instructions to large language models (LLMs) offers an alternative solution. However, it is unclear how effectively instructions leverage the knowledge in LLMs for sol… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Please find the TABLET demo and code at https://dylanslacks.website/Tablet

  5. arXiv:2207.04154  [pdf, other

    cs.LG cs.AI cs.CL

    TalkToModel: Explaining Machine Learning Models with Interactive Natural Language Conversations

    Authors: Dylan Slack, Satyapriya Krishna, Himabindu Lakkaraju, Sameer Singh

    Abstract: Machine Learning (ML) models are increasingly used to make critical decisions in real-world applications, yet they have become more complex, making them harder to understand. To this end, researchers have proposed several techniques to explain model predictions. However, practitioners struggle to use these explainability techniques because they often do not know which one to choose and how to inte… ▽ More

    Submitted 6 March, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: Pre-print; comments welcome! Reach out to [email protected] v3 update title and abstract

  6. arXiv:2202.04849  [pdf, other

    cs.LG

    SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition

    Authors: Dylan Slack, Yinlam Chow, Bo Dai, Nevan Wichers

    Abstract: Methods that extract policy primitives from offline demonstrations using deep generative models have shown promise at accelerating reinforcement learning(RL) for new tasks. Intuitively, these methods should also help to trainsafeRLagents because they enforce useful skills. However, we identify these techniques are not well equipped for safe policy learning because they ignore negative experiences(… ▽ More

    Submitted 30 June, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

  7. arXiv:2202.01875  [pdf, other

    cs.LG

    Rethinking Explainability as a Dialogue: A Practitioner's Perspective

    Authors: Himabindu Lakkaraju, Dylan Slack, Yuxin Chen, Chenhao Tan, Sameer Singh

    Abstract: As practitioners increasingly deploy machine learning models in critical domains such as health care, finance, and policy, it becomes vital to ensure that domain experts function effectively alongside these models. Explainability is one way to bridge the gap between human decision-makers and machine learning models. However, most of the existing work on explainability focuses on one-off, static ex… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  8. arXiv:2106.12563  [pdf, other

    cs.LG cs.CR

    Feature Attributions and Counterfactual Explanations Can Be Manipulated

    Authors: Dylan Slack, Sophie Hilgard, Sameer Singh, Himabindu Lakkaraju

    Abstract: As machine learning models are increasingly used in critical decision-making settings (e.g., healthcare, finance), there has been a growing emphasis on develo** methods to explain model predictions. Such \textit{explanations} are used to understand and establish trust in models and are vital components in machine learning pipelines. Though explanations are a critical piece in these systems, ther… ▽ More

    Submitted 25 June, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: arXiv admin note: text overlap with arXiv:2106.02666

  9. arXiv:2106.04631  [pdf, other

    cs.CL cs.LG

    On the Lack of Robust Interpretability of Neural Text Classifiers

    Authors: Muhammad Bilal Zafar, Michele Donini, Dylan Slack, Cédric Archambeau, Sanjiv Das, Krishnaram Kenthapadi

    Abstract: With the ever-increasing complexity of neural language models, practitioners have turned to methods for understanding the predictions of these models. One of the most well-adopted approaches for model interpretability is feature-based interpretability, i.e., ranking the features in terms of their impact on model predictions. Several prior studies have focused on assessing the fidelity of feature-b… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: Appearing at ACL Findings 2021

  10. arXiv:2106.02666  [pdf, other

    cs.LG

    Counterfactual Explanations Can Be Manipulated

    Authors: Dylan Slack, Sophie Hilgard, Himabindu Lakkaraju, Sameer Singh

    Abstract: Counterfactual explanations are emerging as an attractive option for providing recourse to individuals adversely impacted by algorithmic decisions. As they are deployed in critical applications (e.g. law enforcement, financial lending), it becomes important to ensure that we clearly understand the vulnerabilities of these methods and find ways to address them. However, there is little understandin… ▽ More

    Submitted 3 November, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

  11. arXiv:2102.06162  [pdf, other

    cs.LG

    Defuse: Harnessing Unrestricted Adversarial Examples for Debugging Models Beyond Test Accuracy

    Authors: Dylan Slack, Nathalie Rauschmayr, Krishnaram Kenthapadi

    Abstract: We typically compute aggregate statistics on held-out test data to assess the generalization of machine learning models. However, statistics on test data often overstate model generalization, and thus, the performance of deployed machine learning models can be variable and untrustworthy. Motivated by these concerns, we develop methods to automatically discover and correct model errors beyond those… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  12. arXiv:2009.05886  [pdf, other

    cs.LG cs.CL cs.CR

    Differentially Private Language Models Benefit from Public Pre-training

    Authors: Gavin Kerrigan, Dylan Slack, Jens Tuyls

    Abstract: Language modeling is a keystone task in natural language processing. When training a language model on sensitive information, differential privacy (DP) allows us to quantify the degree to which our private data is protected. However, training algorithms which enforce differential privacy often lead to degradation in model quality. We study the feasibility of learning a language model which is simu… ▽ More

    Submitted 26 October, 2020; v1 submitted 12 September, 2020; originally announced September 2020.

  13. arXiv:2008.05030  [pdf, other

    cs.LG stat.ML

    Reliable Post hoc Explanations: Modeling Uncertainty in Explainability

    Authors: Dylan Slack, Sophie Hilgard, Sameer Singh, Himabindu Lakkaraju

    Abstract: As black box explanations are increasingly being employed to establish model credibility in high-stakes settings, it is important to ensure that these explanations are accurate and reliable. However, prior work demonstrates that explanations generated by state-of-the-art techniques are inconsistent, unstable, and provide very little insight into their correctness and reliability. In addition, thes… ▽ More

    Submitted 6 November, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

  14. arXiv:1911.04336  [pdf, other

    cs.LG stat.ML

    Fair Meta-Learning: Learning How to Learn Fairly

    Authors: Dylan Slack, Sorelle Friedler, Emile Givental

    Abstract: Data sets for fairness relevant tasks can lack examples or be biased according to a specific label in a sensitive attribute. We demonstrate the usefulness of weight based meta-learning approaches in such situations. For models that can be trained through gradient descent, we demonstrate that there are some parameter configurations that allow models to be optimized from a few number of gradient ste… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1908.09092

  15. arXiv:1911.02508  [pdf, other

    cs.LG cs.AI stat.ML

    Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods

    Authors: Dylan Slack, Sophie Hilgard, Emily Jia, Sameer Singh, Himabindu Lakkaraju

    Abstract: As machine learning black boxes are increasingly being deployed in domains such as healthcare and criminal justice, there is growing emphasis on building tools and techniques for explaining these black boxes in an interpretable manner. Such explanations are being leveraged by domain experts to diagnose systematic errors and underlying biases of black boxes. In this paper, we demonstrate that post… ▽ More

    Submitted 3 February, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

  16. arXiv:1908.09092  [pdf, other

    cs.LG stat.ML

    Fairness Warnings and Fair-MAML: Learning Fairly with Minimal Data

    Authors: Dylan Slack, Sorelle Friedler, Emile Givental

    Abstract: Motivated by concerns surrounding the fairness effects of sharing and transferring fair machine learning tools, we propose two algorithms: Fairness Warnings and Fair-MAML. The first is a model-agnostic algorithm that provides interpretable boundary conditions for when a fairly trained model may not behave fairly on similar but slightly different tasks within a given domain. The second is a fair me… ▽ More

    Submitted 5 December, 2019; v1 submitted 24 August, 2019; originally announced August 2019.

  17. arXiv:1902.03501  [pdf, other

    cs.LG cs.HC stat.ML

    Assessing the Local Interpretability of Machine Learning Models

    Authors: Dylan Slack, Sorelle A. Friedler, Carlos Scheidegger, Chitradeep Dutta Roy

    Abstract: The increasing adoption of machine learning tools has led to calls for accountability via model interpretability. But what does it mean for a machine learning model to be interpretable by humans, and how can this be assessed? We focus on two definitions of interpretability that have been introduced in the machine learning literature: simulatability (a user's ability to run a model on a given input… ▽ More

    Submitted 2 August, 2019; v1 submitted 9 February, 2019; originally announced February 2019.