Skip to main content

Showing 1–19 of 19 results for author: Wong, E

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.00611  [pdf, other

    cs.LG stat.ME

    DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation

    Authors: Yinjun Wu, Mayank Keoliya, Kan Chen, Neelay Velingker, Ziyang Li, Emily J Getzen, Qi Long, Mayur Naik, Ravi B Parikh, Eric Wong

    Abstract: Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect estimation (ITE). ITE prediction models deployed in critical settings such as healthcare should ideally be (i) accurate, and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. 22 pages, 5 figures

  2. arXiv:2310.03684  [pdf, other

    cs.LG cs.AI stat.ML

    SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks

    Authors: Alexander Robey, Eric Wong, Hamed Hassani, George J. Pappas

    Abstract: Despite efforts to align large language models (LLMs) with human intentions, widely-used LLMs such as GPT, Llama, and Claude are susceptible to jailbreaking attacks, wherein an adversary fools a targeted LLM into generating objectionable content. To address this vulnerability, we propose SmoothLLM, the first algorithm designed to mitigate jailbreaking attacks. Based on our finding that adversarial… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  3. arXiv:2302.00464  [pdf

    stat.AP physics.ed-ph

    Improving Models for Student Retention and Graduation using Markov Chains

    Authors: Mason N Tedeschi, Tiana M Hose, Emily K Mehlman, Scott Franklin, Tony E Wong

    Abstract: Graduation rates are a key measure of the long-term efficacy of academic interventions. However, challenges to using traditional estimates of graduation rates for underrepresented students include inherently small sample sizes and high data requirements. Here, we show that a Markov model increases confidence and reduces biases in estimated graduation rates for underrepresented minority and first-g… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  4. arXiv:2211.16460  [pdf

    physics.ao-ph stat.AP

    Sea Level and Socioeconomic Uncertainty Drives High-End Coastal Adaptation Costs

    Authors: Tony E. Wong, Catherine Ledna, Lisa Rennels, Hannah Sheets, Frank C. Errickson, Delavane Diaz, David Anthoff

    Abstract: Sea-level rise and associated flood hazards pose severe risks to the millions of people globally living in coastal zones. Models representing coastal adaptation and impacts are important tools to inform the design of strategies to manage these risks. Representing the often deep uncertainties influencing these risks poses nontrivial challenges. A common uncertainty characterization approach is to u… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  5. arXiv:2209.08422  [pdf

    cs.LG stat.ML

    Computed Decision Weights and a New Learning Algorithm for Neural Classifiers

    Authors: Eugene Wong

    Abstract: In this paper we consider the possibility of computing rather than training the decision layer weights of a neural classifier. Such a possibility arises in two way, from making an appropriate choice of loss function and by solving a problem of constrained optimization. The latter formulation leads to a promising new learning process for pre-decision weights with both simplicity and efficacy.

    Submitted 17 September, 2022; originally announced September 2022.

  6. arXiv:2105.04857  [pdf, other

    cs.LG stat.ML

    Leveraging Sparse Linear Layers for Debuggable Deep Networks

    Authors: Eric Wong, Shibani Santurkar, Aleksander Mądry

    Abstract: We show how fitting sparse linear models over learned deep feature representations can lead to more debuggable neural networks. These networks remain highly accurate while also being more amenable to human interpretation, as we demonstrate quantiatively via numerical and human experiments. We further illustrate how the resulting sparse explanations can help to identify spurious correlations, expla… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

  7. arXiv:2007.08450  [pdf, other

    cs.LG stat.ML

    Learning perturbation sets for robust machine learning

    Authors: Eric Wong, J. Zico Kolter

    Abstract: Although much progress has been made towards robust deep learning, a significant gap in robustness remains between real-world perturbations and more narrowly defined sets typically studied in adversarial defenses. In this paper, we aim to bridge this gap by learning perturbation sets from data, in order to characterize real-world effects for robust training and evaluation. Specifically, we use a c… ▽ More

    Submitted 8 October, 2020; v1 submitted 16 July, 2020; originally announced July 2020.

  8. arXiv:2007.00147  [pdf, other

    cs.LG stat.ML

    Neural Network Virtual Sensors for Fuel Injection Quantities with Provable Performance Specifications

    Authors: Eric Wong, Tim Schneider, Joerg Schmitt, Frank R. Schmidt, J. Zico Kolter

    Abstract: Recent work has shown that it is possible to learn neural networks with provable guarantees on the output of the model when subject to input perturbations, however these works have focused primarily on defending against adversarial examples for image classifiers. In this paper, we study how these provable guarantees can be naturally applied to other real world settings, namely getting performance… ▽ More

    Submitted 30 June, 2020; originally announced July 2020.

  9. arXiv:2002.11569  [pdf, other

    cs.LG stat.ML

    Overfitting in adversarially robust deep learning

    Authors: Leslie Rice, Eric Wong, J. Zico Kolter

    Abstract: It is common practice in deep learning to use overparameterized networks and train for as long as possible; there are numerous studies that show, both theoretically and empirically, that such practices surprisingly do not unduly harm the generalization performance of the classifier. In this paper, we empirically study this phenomenon in the setting of adversarially trained deep networks, which are… ▽ More

    Submitted 4 March, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

  10. arXiv:2001.03994  [pdf, other

    cs.LG stat.ML

    Fast is better than free: Revisiting adversarial training

    Authors: Eric Wong, Leslie Rice, J. Zico Kolter

    Abstract: Adversarial training, a method for learning robust deep networks, is typically assumed to be more expensive than traditional training due to the necessity of constructing adversarial examples via a first-order method like projected gradient decent (PGD). In this paper, we make the surprising discovery that it is possible to train empirically robust models using a much weaker and cheaper adversary,… ▽ More

    Submitted 12 January, 2020; originally announced January 2020.

  11. arXiv:1910.11987  [pdf

    physics.geo-ph stat.AP

    A tighter constraint on Earth-system sensitivity from long-term temperature and carbon-cycle observations

    Authors: Tony E. Wong, Ying Cui, Dana L. Royer, Klaus Keller

    Abstract: The long-term temperature response to a given change in CO2 forcing, or Earth-system sensitivity (ESS), is a key parameter quantifying our understanding about the relationship between changes in Earth's radiative forcing and the resulting long-term Earth-system response. Current ESS estimates are subject to sizable uncertainties. Long-term carbon cycle models can provide a useful avenue to constra… ▽ More

    Submitted 1 March, 2021; v1 submitted 25 October, 2019; originally announced October 2019.

  12. arXiv:1910.10122  [pdf

    cs.LG stat.ML

    Class Mean Vectors, Self Monitoring and Self Learning for Neural Classifiers

    Authors: Eugene Wong

    Abstract: In this paper we explore the role of sample mean in building a neural network for classification. This role is surprisingly extensive and includes: direct computation of weights without training, performance monitoring for samples without known classification, and self-training for unlabeled data. Experimental computation on a CIFAR-10 data set provides promising empirical evidence on the efficacy… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

  13. arXiv:1909.04068  [pdf, other

    cs.LG cs.AI stat.ML

    Adversarial Robustness Against the Union of Multiple Perturbation Models

    Authors: Pratyush Maini, Eric Wong, J. Zico Kolter

    Abstract: Owing to the susceptibility of deep learning systems to adversarial attacks, there has been a great deal of work in develo** (both empirically and certifiably) robust classifiers. While most work has defended against a single type of attack, recent work has looked at defending against multiple perturbation models using simple aggregations of multiple attacks. However, these methods can be diffic… ▽ More

    Submitted 28 July, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: ICML 2020 Final Version

  14. arXiv:1909.00306  [pdf, other

    stat.AP stat.ML

    Categorical Co-Frequency Analysis: Clustering Diagnosis Codes to Predict Hospital Readmissions

    Authors: Hallee E. Wong, Brianna C. Heggeseth, Steven J. Miller

    Abstract: Accurately predicting patients' risk of 30-day hospital readmission would enable hospitals to efficiently allocate resource-intensive interventions. We develop a new method, Categorical Co-Frequency Analysis (CoFA), for clustering diagnosis codes from the International Classification of Diseases (ICD) according to the similarity in relationships between covariates and readmission risk. CoFA measur… ▽ More

    Submitted 31 August, 2019; originally announced September 2019.

    Comments: 14 Pages

  15. arXiv:1902.07906  [pdf, other

    cs.LG stat.ML

    Wasserstein Adversarial Examples via Projected Sinkhorn Iterations

    Authors: Eric Wong, Frank R. Schmidt, J. Zico Kolter

    Abstract: A rapidly growing area of work has studied the existence of adversarial examples, datapoints which have been perturbed to fool a classifier, but the vast majority of these works have focused primarily on threat models defined by $\ell_p$ norm-bounded perturbations. In this paper, we propose a new threat model for adversarial attacks based on the Wasserstein distance. In the image classification se… ▽ More

    Submitted 18 January, 2020; v1 submitted 21 February, 2019; originally announced February 2019.

  16. arXiv:1809.06463  [pdf

    cs.LG cs.NE stat.ML

    Self Configuration in Machine Learning

    Authors: Eugene Wong

    Abstract: In this paper we first present a class of algorithms for training multi-level neural networks with a quadratic cost function one layer at a time starting from the input layer. The algorithm is based on the fact that for any layer to be trained, the effect of a direct connection to an optimized linear output layer can be computed without the connection being made. Thus, starting from the input laye… ▽ More

    Submitted 17 September, 2018; originally announced September 2018.

  17. arXiv:1808.06440  [pdf

    stat.AP

    An Integration and Assessment of Covariates of Nonstationary Storm Surge Statistical Behavior by Bayesian Model Averaging

    Authors: Tony E. Wong

    Abstract: Projections of storm surge return levels are a basic requirement for effective management of coastal risks. A common approach to estimate hazards posed by extreme sea levels is to use a statistical model, which may use a time series of a climate variable as a covariate to modulate the statistical model and account for potentially nonstationary storm surge behavior. Previous work using nonstationar… ▽ More

    Submitted 25 August, 2018; v1 submitted 20 August, 2018; originally announced August 2018.

  18. arXiv:1805.12514  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Scaling provable adversarial defenses

    Authors: Eric Wong, Frank R. Schmidt, Jan Hendrik Metzen, J. Zico Kolter

    Abstract: Recent work has developed methods for learning deep network classifiers that are provably robust to norm-bounded adversarial perturbation; however, these methods are currently only possible for relatively small feedforward networks. In this paper, in an effort to scale these approaches to substantially larger models, we extend previous work in three main directions. First, we present a technique f… ▽ More

    Submitted 21 November, 2018; v1 submitted 31 May, 2018; originally announced May 2018.

  19. Neglecting Model Structural Uncertainty Underestimates Upper Tails of Flood Hazard

    Authors: Tony E. Wong, Alexandra Klufas, Vivek Srikrishnan, Klaus Keller

    Abstract: Coastal flooding drives considerable risks to many communities, but projections of future flood risks are deeply uncertain. The paucity of observations of extreme events often motivates the use of statistical approaches to model the distribution of extreme storm surge events. A key deep uncertainty that is often overlooked is model structural uncertainty. There is currently no strong consensus amo… ▽ More

    Submitted 3 June, 2018; v1 submitted 25 September, 2017; originally announced September 2017.