Showing 1–4 of 4 results for author: Li, J D

  1. arXiv:2403.03218  [pdf, other]

    cs.LG cs.AI cs.CL cs.CY

    The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Authors: Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer, et al. (32 additional authors not shown)

    Abstract: The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in developing biological, cyber, and chemical weapons. To measure these risks of malicious use, government institutions and major AI labs are developing evaluations for hazardous capabilities in LLMs. However, current evaluations are private, preventing furthe…

    Submitted 15 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: See the project page at https://wmdp.ai

  2. arXiv:2306.07544  [pdf, other]

    cs.LG stat.ML

    On Achieving Optimal Adversarial Test Error

    Authors: Justin D. Li, Matus Telgarsky

    Abstract: We first elucidate various fundamental properties of optimal adversarial predictors: the structure of optimal adversarial convex predictors in terms of optimal adversarial zero-one predictors, bounds relating the adversarial convex loss to the adversarial zero-one loss, and the fact that continuous predictors can get arbitrarily close to the optimal adversarial error for both convex and zero-one l…

    Submitted 28 April, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: ICLR 2023; bugs fixed

  3. arXiv:2106.05932  [pdf, other]

    cs.LG stat.ML

    Early-stopped neural networks are consistent

    Authors: Ziwei Ji, Justin D. Li, Matus Telgarsky

    Abstract: This work studies the behavior of shallow ReLU networks trained with the logistic loss via gradient descent on binary classification data where the underlying data distribution is general, and the (optimal) Bayes risk is not necessarily zero. In this setting, it is shown that gradient descent with early stopping achieves population risk arbitrarily close to optimal in terms of not just logistic an…

    Submitted 4 November, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

  4. Computer Aided Detection of Transient Inflation Events at Alaskan Volcanoes using GPS Measurements from 2005-2015

    Authors: Justin D Li, Cody M Rude, David M Blair, Michael G Gowanlock, Thomas A Herring, Victor Pankratius

    Abstract: Analysis of transient deformation events in time series data observed via networks of continuous Global Positioning System (GPS) ground stations provides insight into the magmatic and tectonic processes that drive volcanic activity. Typical analyses of spatial positions originating from each station require careful tuning of algorithmic parameters and selection of time and spatial regions of intere…

    Submitted 14 February, 2017; originally announced February 2017.

    Comments: Published in the Journal of Volcanology and Geothermal Research. 9 pages, 7 figures

    Journal ref: Journal of Volcanology and Geothermal Research, 327, 634-642