Skip to main content

Showing 1–10 of 10 results for author: Izzo, Z

.
  1. arXiv:2403.07183  [pdf, other

    cs.CL cs.AI cs.LG cs.SI

    Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews

    Authors: Weixin Liang, Zachary Izzo, Yaohui Zhang, Haley Lepp, Hancheng Cao, Xuandong Zhao, Lingjiao Chen, Haotian Ye, Sheng Liu, Zhi Huang, Daniel A. McFarland, James Y. Zou

    Abstract: We present an approach for estimating the fraction of text in a large corpus which is likely to be substantially modified or produced by a large language model (LLM). Our maximum likelihood model leverages expert-written and AI-generated reference texts to accurately and efficiently examine real-world LLM-use at the corpus level. We apply this approach to a case study of scientific peer review in… ▽ More

    Submitted 15 June, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 46 pages, 31 figures, ICML '24

    ACM Class: I.2.7

  2. arXiv:2305.00195  [pdf, other

    cs.LG stat.ML

    Data-Driven Subgroup Identification for Linear Regression

    Authors: Zachary Izzo, Ruishan Liu, James Zou

    Abstract: Medical studies frequently require to extract the relationship between each covariate and the outcome with statistical confidence measures. To do this, simple parametric models are frequently used (e.g. coefficients of linear regression) but usually fitted on the whole dataset. However, it is common that the covariates may not have a uniform effect over the whole population and thus a unified simp… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

    Comments: Accepted at ICML 2023

  3. arXiv:2211.06582  [pdf, other

    cs.LG cs.CR stat.ML

    Provable Membership Inference Privacy

    Authors: Zachary Izzo, **sung Yoon, Sercan O. Arik, James Zou

    Abstract: In applications involving sensitive data, such as finance and healthcare, the necessity for preserving data privacy can be a significant barrier to machine learning model development. Differential privacy (DP) has emerged as one canonical standard for provable privacy. However, DP's strong theoretical guarantees often come at the cost of a large drop in its utility for machine learning, and DP gua… ▽ More

    Submitted 12 November, 2022; originally announced November 2022.

    Comments: 19 pages, 2 figures

  4. arXiv:2210.07513  [pdf, ps, other

    math.OC cs.LG math.NA stat.ML

    Continuous-in-time Limit for Bayesian Bandits

    Authors: Yuhua Zhu, Zachary Izzo, Lexing Ying

    Abstract: This paper revisits the bandit problem in the Bayesian setting. The Bayesian approach formulates the bandit problem as an optimization problem, and the goal is to find the optimal policy which minimizes the Bayesian regret. One of the main challenges facing the Bayesian approach is that computation of the optimal policy is often intractable, especially when the length of the problem horizon or the… ▽ More

    Submitted 29 September, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

  5. arXiv:2209.08745  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Importance Tempering: Group Robustness for Overparameterized Models

    Authors: Yi** Lu, Wenlong Ji, Zachary Izzo, Lexing Ying

    Abstract: Although overparameterized models have shown their success on many machine learning tasks, the accuracy could drop on the testing distribution that is different from the training one. This accuracy drop still limits applying machine learning in the wild. At the same time, importance weighting, a traditional technique to handle distribution shifts, has been demonstrated to have less or even no effe… ▽ More

    Submitted 27 September, 2022; v1 submitted 18 September, 2022; originally announced September 2022.

  6. arXiv:2112.07042  [pdf, other

    cs.LG stat.ML

    How to Learn when Data Gradually Reacts to Your Model

    Authors: Zachary Izzo, James Zou, Lexing Ying

    Abstract: A recent line of work has focused on training machine learning (ML) models in the performative setting, i.e. when the data distribution reacts to the deployed model. The goal in this setting is to learn a model which both induces a favorable data distribution and performs well on the induced distribution, thereby minimizing the test loss. Previous work on finding an optimal model assumes that the… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 40 pages, 8 figures

  7. arXiv:2110.08991  [pdf, other

    cs.DS cs.LG math.PR

    Dimensionality Reduction for Wasserstein Barycenter

    Authors: Zachary Izzo, Sandeep Silwal, Samson Zhou

    Abstract: The Wasserstein barycenter is a geometric construct which captures the notion of centrality among probability distributions, and which has found many applications in machine learning. However, most algorithms for finding even an approximate barycenter suffer an exponential dependence on the dimension $d$ of the underlying space of the distributions. In order to cope with this "curse of dimensional… ▽ More

    Submitted 18 October, 2021; v1 submitted 17 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper in NeurIPS 2021

  8. arXiv:2102.07698  [pdf, other

    cs.LG stat.ML

    How to Learn when Data Reacts to Your Model: Performative Gradient Descent

    Authors: Zachary Izzo, Lexing Ying, James Zou

    Abstract: Performative distribution shift captures the setting where the choice of which ML model is deployed changes the data distribution. For example, a bank which uses the number of open credit lines to determine a customer's risk of default on a loan may induce customers to open more credit lines in order to improve their chances of being approved. Because of the interactions between the model and data… ▽ More

    Submitted 16 February, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: 21 pages, 5 figures

  9. arXiv:2006.06173  [pdf, other

    math.OC cs.LG stat.ML

    Borrowing From the Future: Addressing Double Sampling in Model-free Control

    Authors: Yuhua Zhu, Zach Izzo, Lexing Ying

    Abstract: In model-free reinforcement learning, the temporal difference method and its variants become unstable when combined with nonlinear function approximations. Bellman residual minimization with stochastic gradient descent (SGD) is more stable, but it suffers from the double sampling problem: given the current state, two independent samples for the next state are required, but often only one sample is… ▽ More

    Submitted 10 June, 2020; originally announced June 2020.

  10. arXiv:2002.10077  [pdf, other

    cs.LG stat.ML

    Approximate Data Deletion from Machine Learning Models

    Authors: Zachary Izzo, Mary Anne Smart, Kamalika Chaudhuri, James Zou

    Abstract: Deleting data from a trained machine learning (ML) model is a critical task in many applications. For example, we may want to remove the influence of training points that might be out of date or outliers. Regulations such as EU's General Data Protection Regulation also stipulate that individuals can request to have their data deleted. The naive approach to data deletion is to retrain the ML model… ▽ More

    Submitted 23 February, 2021; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: 20 pages, 1 figure, accepted for publication at AISTATS 2021