Skip to main content

Showing 1–24 of 24 results for author: Cooper, A F

.
  1. arXiv:2406.09548  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Between Randomness and Arbitrariness: Some Lessons for Reliable Machine Learning at Scale

    Authors: A. Feder Cooper

    Abstract: To develop rigorous knowledge about ML models -- and the systems in which they are embedded -- we need reliable measurements. But reliable measurement is fundamentally challenging, and touches on issues of reproducibility, scalability, uncertainty quantification, epistemology, and more. This dissertation addresses criteria needed to take reliability seriously: both criteria for designing meaningfu… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Ph.D. Dissertation

  2. arXiv:2404.12590  [pdf

    cs.CY

    The Files are in the Computer: Copyright, Memorization, and Generative-AI Systems

    Authors: A. Feder Cooper, James Grimmelmann

    Abstract: A central issue in copyright lawsuits against companies that produce generative-AI systems is the degree to which a generative-AI model does or does not "memorize" the data it was trained on. Unfortunately, the debate has been clouded by ambiguity over what "memorization" is, leading to legal debates in which participants often talk past one another. In this Essay, we attempt to bring clarity to t… ▽ More

    Submitted 2 July, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Forthcoming, Chicago-Kent Law Review

  3. arXiv:2403.06634  [pdf, other

    cs.CR

    Stealing Part of a Production Language Model

    Authors: Nicholas Carlini, Daniel Paleka, Krishnamurthy Dj Dvijotham, Thomas Steinke, Jonathan Hayase, A. Feder Cooper, Katherine Lee, Matthew Jagielski, Milad Nasr, Arthur Conmy, Eric Wallace, David Rolnick, Florian Tramèr

    Abstract: We introduce the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI's ChatGPT or Google's PaLM-2. Specifically, our attack recovers the embedding projection layer (up to symmetries) of a transformer model, given typical API access. For under \… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  4. arXiv:2402.05979  [pdf, other

    cs.SE cs.AI

    On the Standardization of Behavioral Use Clauses and Their Adoption for Responsible Licensing of AI

    Authors: Daniel McDuff, Tim Korjakow, Scott Cambo, Jesse Josua Benjamin, Jenny Lee, Yacine Jernite, Carlos Muñoz Ferrandis, Aaron Gokaslan, Alek Tarkowski, Joseph Lindley, A. Feder Cooper, Danish Contractor

    Abstract: Growing concerns over negligent or malicious uses of AI have increased the appetite for tools that help manage the risks of the technology. In 2018, licenses with behaviorial-use clauses (commonly referred to as Responsible AI Licenses) were proposed to give developers a framework for releasing AI assets while specifying their users to mitigate negative applications. As of the end of 2023, on the… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  5. arXiv:2311.17035  [pdf, other

    cs.LG cs.CL cs.CR

    Scalable Extraction of Training Data from (Production) Language Models

    Authors: Milad Nasr, Nicholas Carlini, Jonathan Hayase, Matthew Jagielski, A. Feder Cooper, Daphne Ippolito, Christopher A. Choquette-Choo, Eric Wallace, Florian Tramèr, Katherine Lee

    Abstract: This paper studies extractable memorization: training data that an adversary can efficiently extract by querying a machine learning model without prior knowledge of the training dataset. We show an adversary can extract gigabytes of training data from open-source language models like Pythia or GPT-Neo, semi-open models like LLaMA or Falcon, and closed models like ChatGPT. Existing techniques from… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  6. arXiv:2311.06477  [pdf, other

    cs.CY

    Report of the 1st Workshop on Generative AI and Law

    Authors: A. Feder Cooper, Katherine Lee, James Grimmelmann, Daphne Ippolito, Christopher Callison-Burch, Christopher A. Choquette-Choo, Niloofar Mireshghallah, Miles Brundage, David Mimno, Madiha Zahrah Choksi, Jack M. Balkin, Nicholas Carlini, Christopher De Sa, Jonathan Frankle, Deep Ganguli, Bryant Gipson, Andres Guadamuz, Swee Leng Harris, Abigail Z. Jacobs, Elizabeth Joh, Gautam Kamath, Mark Lemley, Cass Matthews, Christine McLeavey, Corynne McSherry , et al. (10 additional authors not shown)

    Abstract: This report presents the takeaways of the inaugural Workshop on Generative AI and Law (GenLaw), held in July 2023. A cross-disciplinary group of practitioners and scholars from computer science and law convened to discuss the technical, doctrinal, and policy challenges presented by law for Generative AI, and by Generative AI for law, with an emphasis on U.S. law in particular. We begin the report… ▽ More

    Submitted 2 December, 2023; v1 submitted 10 November, 2023; originally announced November 2023.

  7. arXiv:2310.16825  [pdf, other

    cs.CV cs.CY

    CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images

    Authors: Aaron Gokaslan, A. Feder Cooper, Jasmine Collins, Landan Seguin, Austin Jacobson, Mihir Patel, Jonathan Frankle, Cory Stephenson, Volodymyr Kuleshov

    Abstract: We assemble a dataset of Creative-Commons-licensed (CC) images, which we use to train a set of open diffusion models that are qualitatively competitive with Stable Diffusion 2 (SD2). This task presents two challenges: (1) high-resolution CC images lack the captions necessary to train text-to-image generative models; (2) CC images are relatively scarce. In turn, to address these challenges, we use… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  8. arXiv:2309.08133  [pdf

    cs.CY

    Talkin' 'Bout AI Generation: Copyright and the Generative-AI Supply Chain

    Authors: Katherine Lee, A. Feder Cooper, James Grimmelmann

    Abstract: "Does generative AI infringe copyright?" is an urgent question. It is also a difficult question, for two reasons. First, "generative AI" is not just one product from one company. It is a catch-all name for a massive ecosystem of loosely related technologies, including conversational text chatbots like ChatGPT, image generators like Midjourney and DALL-E, coding assistants like GitHub Copilot, and… ▽ More

    Submitted 1 March, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Forthcoming, Journal of the Copyright Society of the USA '24

  9. arXiv:2302.00845  [pdf, other

    cs.LG cs.DC math.OC

    Coordinating Distributed Example Orders for Provably Accelerated Training

    Authors: A. Feder Cooper, Wentao Guo, Khiem Pham, Tiancheng Yuan, Charlie F. Ruan, Yucheng Lu, Christopher De Sa

    Abstract: Recent research on online Gradient Balancing (GraB) has revealed that there exist permutation-based example orderings for SGD that are guaranteed to outperform random reshuffling (RR). Whereas RR arbitrarily permutes training examples, GraB leverages stale gradients from prior epochs to order examples -- achieving a provably faster convergence rate than RR. However, GraB is limited by design: whil… ▽ More

    Submitted 21 December, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: NeurIPS 2023

  10. arXiv:2301.11562  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Arbitrariness and Social Prediction: The Confounding Role of Variance in Fair Classification

    Authors: A. Feder Cooper, Katherine Lee, Madiha Zahrah Choksi, Solon Barocas, Christopher De Sa, James Grimmelmann, Jon Kleinberg, Siddhartha Sen, Baobao Zhang

    Abstract: Variance in predictions across different trained models is a significant, under-explored source of error in fair binary classification. In practice, the variance on some data examples is so large that decisions can be effectively arbitrary. To investigate this problem, we take an experimental approach and make four overarching contributions: We: 1) Define a metric called self-consistency, derived… ▽ More

    Submitted 6 March, 2024; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: AAAI '24 (received a Best Paper Honorable Mention designation)

  11. arXiv:2208.02056  [pdf

    cs.CY

    Fast or Accurate? Governing Conflicting Goals in Highly Autonomous Vehicles

    Authors: A. Feder Cooper, Karen Levy

    Abstract: The tremendous excitement around the deployment of autonomous vehicles (AVs) comes from their purported promise. In addition to decreasing accidents, AVs are projected to usher in a new era of equity in human autonomy by providing affordable, accessible, and widespread mobility for disabled, elderly, and low-income populations. However, to realize this promise, it is necessary to ensure that AVs a… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Vol. 20, pp. 249-277

    Journal ref: Colorado Technology Law Journal 2022

  12. Non-Determinism and the Lawlessness of Machine Learning Code

    Authors: A. Feder Cooper, Jonathan Frankle, Christopher De Sa

    Abstract: Legal literature on machine learning (ML) tends to focus on harms, and thus tends to reason about individual model outcomes and summary error rates. This focus has masked important aspects of ML that are rooted in its reliance on randomness -- namely, stochasticity and non-determinism. While some recent work has begun to reason about the relationship between stochasticity and arbitrariness in lega… ▽ More

    Submitted 5 October, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: Proceedings of the 2022 Symposium on Computer Science and Law (CSLAW '22)

  13. arXiv:2206.06738  [pdf, other

    cs.CY cs.DL cs.SI

    Four Years of FAccT: A Reflexive, Mixed-Methods Analysis of Research Contributions, Shortcomings, and Future Prospects

    Authors: Benjamin Laufer, Sameer Jain, A. Feder Cooper, Jon Kleinberg, Hoda Heidari

    Abstract: Fairness, Accountability, and Transparency (FAccT) for socio-technical systems has been a thriving area of research in recent years. An ACM conference bearing the same name has been the central venue for scholars in this area to come together, provide peer feedback to one another, and publish their work. This reflexive study aims to shed light on FAccT's activities to date and identify major gaps… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: 26 pages, 5 figures, to be published in 2022 ACM Conference on Fairness, Accountability, and Transparency

  14. arXiv:2203.07490  [pdf, other

    cs.LG cs.CY

    Repairing Regressors for Fair Binary Classification at Any Decision Threshold

    Authors: Kweku Kwegyir-Aggrey, A. Feder Cooper, Jessica Dai, John Dickerson, Keegan Hines, Suresh Venkatasubramanian

    Abstract: We study the problem of post-processing a supervised machine-learned regressor to maximize fair binary classification at all decision thresholds. By decreasing the statistical distance between each group's score distributions, we show that we can increase fair performance across all thresholds at once, and that we can do so without a large decrease in accuracy. To this end, we introduce a formal m… ▽ More

    Submitted 10 December, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

  15. arXiv:2202.05338  [pdf, ps, other

    cs.CY cs.AI cs.LG

    Accountability in an Algorithmic Society: Relationality, Responsibility, and Robustness in Machine Learning

    Authors: A. Feder Cooper, Emanuel Moss, Benjamin Laufer, Helen Nissenbaum

    Abstract: In 1996, Accountability in a Computerized Society [95] issued a clarion call concerning the erosion of accountability in society due to the ubiquitous delegation of consequential functions to computerized systems. Nissenbaum [95] described four barriers to accountability that computerization presented, which we revisit in relation to the ascendance of data-driven algorithmic systems--i.e., machine… ▽ More

    Submitted 13 May, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Journal ref: 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT '22)

  16. Making the Unaccountable Internet: The Changing Meaning of Accounting in the Early ARPANET

    Authors: A. Feder Cooper, Gili Vidan

    Abstract: Contemporary concerns over the governance of technological systems often run up against narratives about the technical infeasibility of designing mechanisms for accountability. While in recent AI ethics literature these concerns have been deliberated predominantly in relation to ML, other instances in computing history also presented circumstances in which computer scientists needed to un-muddle w… ▽ More

    Submitted 11 May, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Journal ref: 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT '22)

  17. Tecnologica cosa: Modeling Storyteller Personalities in Boccaccio's Decameron

    Authors: A. Feder Cooper, Maria Antoniak, Christopher De Sa, Marilyn Migiel, David Mimno

    Abstract: We explore Boccaccio's Decameron to see how digital humanities tools can be used for tasks that have limited data in a language no longer in contemporary use: medieval Italian. We focus our analysis on the question: Do the different storytellers in the text exhibit distinct personalities? To answer this question, we curate and release a dataset based on the authoritative edition of the text. We us… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: The 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (co-located with EMNLP 2021)

  18. arXiv:2104.00606  [pdf, other

    cs.LG cs.AI cs.CY

    Model Selection's Disparate Impact in Real-World Deep Learning Applications

    Authors: Jessica Zosa Forde, A. Feder Cooper, Kweku Kwegyir-Aggrey, Chris De Sa, Michael Littman

    Abstract: Algorithmic fairness has emphasized the role of biased data in automated decision outcomes. Recently, there has been a shift in attention to sources of bias that implicate fairness in other stages in the ML pipeline. We contend that one source of such bias, human preferences in model selection, remains under-explored in terms of its role in disparate impact across demographic groups. Using a deep… ▽ More

    Submitted 7 September, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: Science and Engineering of Deep Learning Workshop, ICLR 2021

  19. arXiv:2102.03034  [pdf, other

    cs.LG cs.LO

    Hyperparameter Optimization Is Deceiving Us, and How to Stop It

    Authors: A. Feder Cooper, Yucheng Lu, Jessica Zosa Forde, Christopher De Sa

    Abstract: Recent empirical work shows that inconsistent results based on choice of hyperparameter optimization (HPO) configuration are a widespread problem in ML research. When comparing two algorithms J and K searching one subspace can yield the conclusion that J outperforms K, whereas searching another can entail the opposite. In short, the way we choose hyperparameters can deceive us. We provide a theore… ▽ More

    Submitted 25 October, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: To appear, NeurIPS 2021

    Journal ref: Advances in Neural Information Processing Systems 34 pre-proceedings (NeurIPS 2021)

  20. arXiv:2102.01203  [pdf, other

    cs.CY cs.AI cs.LG

    Emergent Unfairness in Algorithmic Fairness-Accuracy Trade-Off Research

    Authors: A. Feder Cooper, Ellen Abrams

    Abstract: Across machine learning (ML) sub-disciplines, researchers make explicit mathematical assumptions in order to facilitate proof-writing. We note that, specifically in the area of fairness-accuracy trade-off optimization scholarship, similar attention is not paid to the normative assumptions that ground this approach. Such assumptions presume that 1) accuracy and fairness are in inherent opposition t… ▽ More

    Submitted 7 September, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

    Comments: AIES 2021 (Oral)

  21. arXiv:2010.10407  [pdf, ps, other

    cs.CY cs.AI cs.LG

    Where Is the Normative Proof? Assumptions and Contradictions in ML Fairness Research

    Authors: A. Feder Cooper

    Abstract: Across machine learning (ML) sub-disciplines researchers make mathematical assumptions to facilitate proof-writing. While such assumptions are necessary for providing mathematical guarantees for how algorithms behave, they also necessarily limit the applicability of these algorithms to different problem settings. This practice is known--in fact, obvious--and accepted in ML research. However, simil… ▽ More

    Submitted 3 November, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

  22. Accuracy-Efficiency Trade-Offs and Accountability in Distributed ML Systems

    Authors: A. Feder Cooper, Karen Levy, Christopher De Sa

    Abstract: Trade-offs between accuracy and efficiency pervade law, public health, and other non-computing domains, which have developed policies to guide how to balance the two in conditions of uncertainty. While computer science also commonly studies accuracy-efficiency trade-offs, their policy implications remain poorly examined. Drawing on risk assessment practices in the US, we argue that, since examinin… ▽ More

    Submitted 2 October, 2021; v1 submitted 4 July, 2020; originally announced July 2020.

    Journal ref: Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO 2021)

  23. arXiv:2006.11677  [pdf, other

    cs.LG stat.ML

    Asymptotically Optimal Exact Minibatch Metropolis-Hastings

    Authors: Ruqi Zhang, A. Feder Cooper, Christopher De Sa

    Abstract: Metropolis-Hastings (MH) is a commonly-used MCMC algorithm, but it can be intractable on large datasets due to requiring computations over the whole dataset. In this paper, we study minibatch MH methods, which instead use subsamples to enable scaling. We observe that most existing minibatch MH methods are inexact (i.e. they may change the target distribution), and show that this inexactness can ca… ▽ More

    Submitted 22 October, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Journal ref: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  24. arXiv:2003.00193  [pdf, other

    cs.LG stat.ML

    AMAGOLD: Amortized Metropolis Adjustment for Efficient Stochastic Gradient MCMC

    Authors: Ruqi Zhang, A. Feder Cooper, Christopher De Sa

    Abstract: Stochastic gradient Hamiltonian Monte Carlo (SGHMC) is an efficient method for sampling from continuous distributions. It is a faster alternative to HMC: instead of using the whole dataset at each iteration, SGHMC uses only a subsample. This improves performance, but introduces bias that can cause SGHMC to converge to the wrong distribution. One can prevent this using a step size that decays to ze… ▽ More

    Submitted 29 February, 2020; originally announced March 2020.

    Comments: Published at AISTATS 2020

    Journal ref: Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics (AISTATS 2020)