Skip to main content

Showing 1–50 of 198 results for author: Roth, A

.
  1. arXiv:2405.20272  [pdf, other

    cs.LG cs.CR

    Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable

    Authors: Martin Bertran, Shuai Tang, Michael Kearns, Jamie Morgenstern, Aaron Roth, Zhiwei Steven Wu

    Abstract: Machine unlearning is motivated by desire for data autonomy: a person can request to have their data's influence removed from deployed models, and those models should be updated as if they were retrained without the person's data. We show that, counter-intuitively, these updates expose individuals to high-accuracy reconstruction attacks which allow the attacker to recover their data in its entiret… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2405.16752  [pdf, other

    cs.LG cs.AI

    Model Ensembling for Constrained Optimization

    Authors: Ira Globus-Harris, Varun Gupta, Michael Kearns, Aaron Roth

    Abstract: There is a long history in machine learning of model ensembling, beginning with boosting and bagging and continuing to the present day. Much of this history has focused on combining models for classification and regression, but recently there is interest in more complex settings such as ensembling policies in reinforcement learning. Strong connections have also emerged between ensembling and multi… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  3. arXiv:2405.16739  [pdf, other

    cs.LG cs.AI eess.SY

    Oracle-Efficient Reinforcement Learning for Max Value Ensembles

    Authors: Marcel Hussing, Michael Kearns, Aaron Roth, Sikata Bela Sengupta, Jessica Sorrell

    Abstract: Reinforcement learning (RL) in large or infinite state spaces is notoriously challenging, both theoretically (where worst-case sample and computational complexities must scale with state space cardinality) and experimentally (where function approximation and policy gradient techniques often scale poorly and suffer from instability and high variance). One line of research attempting to address thes… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  4. arXiv:2405.02225  [pdf, other

    stat.ML cs.AI cs.CY cs.LG stat.ME

    Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

    Authors: Lu**g Zhang, Aaron Roth, Linjun Zhang

    Abstract: This paper introduces a framework for post-processing machine learning models so that their predictions satisfy multi-group fairness guarantees. Based on the celebrated notion of multicalibration, we introduce $(\mathbf{s},\mathcal{G}, α)-$GMC (Generalized Multi-Dimensional Multicalibration) for multi-dimensional map**s $\mathbf{s}$, constraint set $\mathcal{G}$, and a pre-specified threshold le… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 28 pages, 8 figures, accepted by ICML2024

  5. Hot Jupiter Diversity and the Onset of TiO/VO Revealed by a Large Grid of Non-Grey Global Circulation Models

    Authors: Alexander Roth, Vivien Parmentier, Mark Hammond

    Abstract: The population of hot Jupiters is extremely diverse, with large variations in their irradiation, period, gravity and chemical composition. To understand the intrinsic planet diversity through the observed population level trends, we explore the a-priori scatter in the population created by the different responses of atmospheric circulation to planetary parameters. We use the SPARC/MITgcm 3D global… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 28 pages, 25 figures, accepted in MNRAS

  6. arXiv:2404.04689  [pdf, other

    stat.ML cs.CL cs.LG

    Multicalibration for Confidence Scoring in LLMs

    Authors: Gianluca Detommaso, Martin Bertran, Riccardo Fogliato, Aaron Roth

    Abstract: This paper proposes the use of "multicalibration" to yield interpretable and reliable confidence scores for outputs generated by large language models (LLMs). Multicalibration asks for calibration not just marginally, but simultaneously across various intersecting grou**s of the data. We show how to form grou**s for prompt/completion pairs that are correlated with the probability of correctnes… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  7. arXiv:2402.17108  [pdf, ps, other

    cs.GT cs.DS cs.LG

    Repeated Contracting with Multiple Non-Myopic Agents: Policy Regret and Limited Liability

    Authors: Natalie Collina, Varun Gupta, Aaron Roth

    Abstract: We study a repeated contracting setting in which a Principal adaptively chooses amongst $k$ Agents at each of $T$ rounds. The Agents are non-myopic, and so a mechanism for the Principal induces a $T$-round extensive form game amongst the Agents. We give several results aimed at understanding an under-explored aspect of contract theory -- the game induced when choosing an Agent to contract with. Fi… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  8. arXiv:2402.11410  [pdf, ps, other

    cs.LG cs.DS stat.ML

    An Elementary Predictor Obtaining $2\sqrt{T}$ Distance to Calibration

    Authors: Eshwar Ram Arunachaleswaran, Natalie Collina, Aaron Roth, Mirah Shi

    Abstract: Blasiok et al. [2023] proposed distance to calibration as a natural measure of calibration error that unlike expected calibration error (ECE) is continuous. Recently, Qiao and Zheng [2024] gave a non-constructive argument establishing the existence of an online predictor that can obtain $O(\sqrt{T})$ distance to calibration in the adversarial setting, which is known to be impossible for ECE. They… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  9. arXiv:2402.10795  [pdf, other

    cs.LG cs.CY cs.HC

    Diversified Ensembling: An Experiment in Crowdsourced Machine Learning

    Authors: Ira Globus-Harris, Declan Harrison, Michael Kearns, Pietro Perona, Aaron Roth

    Abstract: Crowdsourced machine learning on competition platforms such as Kaggle is a popular and often effective method for generating accurate models. Typically, teams vie for the most accurate model, as measured by overall error on a holdout set, and it is common towards the end of such competitions for teams at the top of the leaderboard to ensemble or average their models outside the platform mechanism… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  10. arXiv:2402.08753  [pdf, ps, other

    cs.GT cs.LG

    Forecasting for Swap Regret for All Downstream Agents

    Authors: Aaron Roth, Mirah Shi

    Abstract: We study the problem of making predictions so that downstream agents who best respond to them will be guaranteed diminishing swap regret, no matter what their utility functions are. It has been known since Foster and Vohra (1997) that agents who best-respond to calibrated forecasts have no swap regret. Unfortunately, the best known algorithms for guaranteeing calibrated forecasts in sequential adv… ▽ More

    Submitted 15 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  11. arXiv:2312.06589  [pdf, other

    econ.GN

    Power sector impacts of a simultaneous European heat pump rollout

    Authors: Alexander Roth

    Abstract: The decarbonization of buildings requires the phase-out of fossil fuel heating systems. Heat pumps are considered a crucial technology to supply a substantial part of heating energy for buildings. Yet, their introduction is not without challenges, as heat pumps generate additional electricity demand as well as peak loads. To better understand these challenges, an ambitious simultaneous heat pump r… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  12. arXiv:2312.05140  [pdf, other

    cs.LG cs.CR

    Membership Inference Attacks on Diffusion Models via Quantile Regression

    Authors: Shuai Tang, Zhiwei Steven Wu, Sergul Aydore, Michael Kearns, Aaron Roth

    Abstract: Recently, diffusion models have become popular tools for image synthesis because of their high-quality outputs. However, like other large-scale models, they may leak private information about their training data. Here, we demonstrate a privacy vulnerability of diffusion models through a \emph{membership inference (MI) attack}, which aims to identify whether a target example belongs to the training… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  13. arXiv:2311.07754  [pdf, other

    cs.GT cs.DS econ.TH

    Efficient Prior-Free Mechanisms for No-Regret Agents

    Authors: Natalie Collina, Aaron Roth, Han Shao

    Abstract: We study a repeated Principal Agent problem between a long lived Principal and Agent pair in a prior free setting. In our setting, the sequence of realized states of nature may be adversarially chosen, the Agent is non-myopic, and the Principal aims for a strong form of policy regret. Following Camara, Hartline, and Johnson, we model the Agent's long-run behavior with behavioral assumptions that r… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  14. arXiv:2310.17651  [pdf, other

    cs.LG

    High-Dimensional Prediction for Sequential Decision Making

    Authors: Georgy Noarov, Ramya Ramalingam, Aaron Roth, Stephan Xie

    Abstract: We study the problem of making predictions of an adversarially chosen high-dimensional state that are unbiased subject to an arbitrary collection of conditioning events, with the goal of tailoring these events to downstream decision makers. We give efficient algorithms for solving this problem, as well as a number of applications that stem from choosing an appropriate set of conditioning events.… ▽ More

    Submitted 27 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Added references, Arxiv abstract edited

  15. arXiv:2310.05693  [pdf, other

    astro-ph.HE astro-ph.GA

    CONGRuENTS (COsmic-ray, Neutrino, Gamma-ray and Radio Non-Thermal Spectra). II. Population-level correlations between galactic infrared, radio, and γ-ray emission

    Authors: Matt A. Roth, Mark R. Krumholz, Roland M. Crocker, Todd A. Thompson

    Abstract: Galaxies obey a number of empirical correlations between their radio, γ-ray, and infrared emission, but the physical origins of these correlations remain uncertain. Here we use the CONGRuENTS model for broadband non-thermal emission from star-forming galaxies, which self-consistently calculates energy-dependent transport and non-thermal emission from cosmic ray hadrons and leptons, to predict radi… ▽ More

    Submitted 15 June, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 17 pages, 14 figures

    Journal ref: MNRAS, Volume 530, Issue 2, May 2024, Pages 1849-1865

  16. arXiv:2310.04652  [pdf, other

    cs.LG

    Oracle Efficient Algorithms for Groupwise Regret

    Authors: Krishna Acharya, Eshwar Ram Arunachaleswaran, Sampath Kannan, Aaron Roth, Juba Ziani

    Abstract: We study the problem of online prediction, in which at each time step $t$, an individual $x_t$ arrives, whose label we must predict. Each individual is associated with various groups, defined based on their features such as age, sex, race etc., which may intersect. Our goal is to make predictions that have regret guarantees not just overall but also simultaneously on each sub-sequence comprised of… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  17. arXiv:2310.00946  [pdf, other

    cs.LG cs.AI

    Distilling Influences to Mitigate Prediction Churn in Graph Neural Networks

    Authors: Andreas Roth, Thomas Liebig

    Abstract: Models with similar performances exhibit significant disagreement in the predictions of individual samples, referred to as prediction churn. Our work explores this phenomenon in graph neural networks by investigating differences between models differing only in their initializations in their utilized features for predictions. We propose a novel metric called Influence Difference (ID) to quantify t… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted at ACML 2023

  18. arXiv:2309.06000  [pdf, other

    cs.RO

    Gait Design of a Novel Arboreal Concertina Locomotion for Snake-like Robots

    Authors: Shuoqi Chen, Aaron Roth

    Abstract: In this paper, we propose a novel strategy for a snake robot to move straight up a cylindrical surface. Prior works on pole-climbing for a snake robot mainly utilized a rolling helix gait, and although proven to be efficient, it does not reassemble movements made by a natural snake. We take inspiration from nature and seek to imitate the Arboreal Concertina Locomotion (ACL) from real-life serpents… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 4 pages, 3 figures

  19. arXiv:2308.16800  [pdf, other

    cs.LG cs.AI

    Rank Collapse Causes Over-Smoothing and Over-Correlation in Graph Neural Networks

    Authors: Andreas Roth, Thomas Liebig

    Abstract: Our study reveals new theoretical insights into over-smoothing and feature over-correlation in deep graph neural networks. We show the prevalence of invariant subspaces, demonstrating a fixed relative behavior that is unaffected by feature transformations. Our work clarifies recent observations related to convergence to a constant state and a potential over-separation of node states, as the amplif… ▽ More

    Submitted 21 February, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Published at LoG 2023

  20. arXiv:2308.16516  [pdf, other

    cs.LG cs.AI

    Curvature-based Pooling within Graph Neural Networks

    Authors: Cedric Sanders, Andreas Roth, Thomas Liebig

    Abstract: Over-squashing and over-smoothing are two critical issues, that limit the capabilities of graph neural networks (GNNs). While over-smoothing eliminates the differences between nodes making them indistinguishable, over-squashing refers to the inability of GNNs to propagate information over long distances, as exponentially many node states are squashed into fixed-size representations. Both phenomena… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: ECMLPKDD 2023 - Workshop on Mining and Learning with Graphs

  21. arXiv:2307.12918  [pdf, other

    econ.GN

    Flexible heat pumps: must-have or nice to have in a power sector with renewables?

    Authors: Alexander Roth, Dana Kirchem, Carlos Gaete-Morales, Wolf-Peter Schill

    Abstract: Heat pumps are a key technology for reducing fossil fuel use in the heating sector. However, the transition to heat pumps implies an increase in electricity demand, especially in the cold winter months. Therefore, the flexible operation of heat pumps will be of high importance to the power sector. Using an open-source power sector model, we examine the power sector impacts of three different expan… ▽ More

    Submitted 25 June, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

  22. arXiv:2307.08999  [pdf, ps, other

    cs.LG stat.ML

    Oracle Efficient Online Multicalibration and Omniprediction

    Authors: Sumegha Garg, Christopher Jung, Omer Reingold, Aaron Roth

    Abstract: A recent line of work has shown a surprising connection between multicalibration, a multi-group fairness notion, and omniprediction, a learning paradigm that provides simultaneous loss minimization guarantees for a large family of loss functions. Prior work studies omniprediction in the batch setting. We initiate the study of omniprediction in the online adversarial setting. Although there exist a… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  23. arXiv:2307.03694  [pdf, other

    cs.LG cs.AI cs.CR

    Scalable Membership Inference Attacks via Quantile Regression

    Authors: Martin Bertran, Shuai Tang, Michael Kearns, Jamie Morgenstern, Aaron Roth, Zhiwei Steven Wu

    Abstract: Membership inference attacks are designed to determine, using black box access to trained models, whether a particular example was used in training or not. Membership inference can be formalized as a hypothesis testing problem. The most effective existing attacks estimate the distribution of some test statistic (usually the model's confidence on the true label) on points that were (and were not) u… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  24. Balanced Filtering via Disclosure-Controlled Proxies

    Authors: Siqi Deng, Emily Diana, Michael Kearns, Aaron Roth

    Abstract: We study the problem of collecting a cohort or set that is balanced with respect to sensitive groups when group membership is unavailable or prohibited from use at deployment time. Specifically, our deployment-time collection mechanism does not reveal significantly more about the group membership of any individual sample than can be ascertained from base rates alone. To do this, we study a learner… ▽ More

    Submitted 17 June, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Journal ref: 5th Symposium on Foundations of Responsible Computing (FORC 2024)

  25. Awesome SOSS: Atmospheric Characterisation of WASP-96 b using the JWST Early Release Observations

    Authors: Jake Taylor, Michael Radica, Luis Welbanks, Ryan J. MacDonald, Jasmina Blecic, Maria Zamyatina, Alexander Roth, Jacob L. Bean, Vivien Parmentier, Louis-Philippe Coulombe, Adina D. Feinstein, Néstor Espinoza, Björn Benneke, David Lafrenière, René Doyon, Eva-Maria Ahrer

    Abstract: The newly operational JWST offers the potential to study the atmospheres of distant worlds with precision that has not been achieved before. One of the first exoplanets observed by JWST in the summer of 2022 was WASP-96 b, a hot-Saturn orbiting a G8 star. As part of the Early Release Observations program, one transit of WASP-96 b was observed with NIRISS/SOSS to capture its transmission spectrum f… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 12 pages, 5 Figures. Accepted for publication in MNRAS. Companion paper to Radica et al., 2023

  26. arXiv:2303.03451  [pdf, other

    cs.LG cs.CR

    Improved Differentially Private Regression via Gradient Boosting

    Authors: Shuai Tang, Sergul Aydore, Michael Kearns, Saeyoung Rho, Aaron Roth, Yichen Wang, Yu-Xiang Wang, Zhiwei Steven Wu

    Abstract: We revisit the problem of differentially private squared error linear regression. We observe that existing state-of-the-art methods are sensitive to the choice of hyperparameters -- including the ``clip** threshold'' that cannot be set optimally in a data-independent way. We give a new algorithm for private linear regression based on gradient boosting. We show that our method consistently improv… ▽ More

    Submitted 20 May, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  27. arXiv:2302.08507  [pdf, ps, other

    cs.LG cs.DS math.ST

    The Scope of Multicalibration: Characterizing Multicalibration via Property Elicitation

    Authors: Georgy Noarov, Aaron Roth

    Abstract: We make a connection between multicalibration and property elicitation and show that (under mild technical conditions) it is possible to produce a multicalibrated predictor for a continuous scalar distributional property $Γ$ if and only if $Γ$ is elicitable. On the negative side, we show that for non-elicitable continuous properties there exist simple data distributions on which even the true di… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  28. arXiv:2301.13767  [pdf, other

    cs.LG cs.DS

    Multicalibration as Boosting for Regression

    Authors: Ira Globus-Harris, Declan Harrison, Michael Kearns, Aaron Roth, Jessica Sorrell

    Abstract: We study the connection between multicalibration and boosting for squared error regression. First we prove a useful characterization of multicalibration in terms of a ``swap regret'' like condition on squared error. Using this characterization, we give an exceedingly simple algorithm that can be analyzed both as a boosting algorithm for regression and as a multicalibration algorithm for a class H… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: Code available here: https://github.com/Declancharrison/Level-Set-Boosting

  29. arXiv:2212.09428  [pdf, other

    astro-ph.HE astro-ph.GA

    CONGRuENTS (COsmic-ray, Neutrino, Gamma-ray and Radio Non-Thermal Spectra). I. A predictive model for galactic non-thermal emission

    Authors: Matt A. Roth, Mark R. Krumholz, Roland M. Crocker, Todd A. Thompson

    Abstract: The total luminosity and spectral shape of the non-thermal emission produced by cosmic rays depends on their interstellar environment, a dependence that gives rise to correlations between galaxies' bulk properties -- star formation rate, stellar mass, and others -- and their non-thermal spectra. Understanding the physical mechanisms of cosmic ray transport, loss, and emission is key to understandi… ▽ More

    Submitted 16 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 23 pages, 14 figures, 1 table, accepted for publication in MNRAS

  30. Geographical balancing of wind power decreases storage needs in a 100% renewable European power sector

    Authors: Alexander Roth, Wolf-Peter Schill

    Abstract: To reduce greenhouse gas emissions, many countries plan to massively expand wind power and solar photovoltaic capacities. These variable renewable energy sources require additional flexibility in the power sector. Both geographical balancing enabled by interconnection and electricity storage can provide such flexibility. In a 100% renewable energy scenario of twelve central European countries, we… ▽ More

    Submitted 21 June, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

  31. arXiv:2211.11596  [pdf, other

    cs.LG

    Forecasting Unobserved Node States with spatio-temporal Graph Neural Networks

    Authors: Andreas Roth, Thomas Liebig

    Abstract: Forecasting future states of sensors is key to solving tasks like weather prediction, route planning, and many others when dealing with networks of sensors. But complete spatial coverage of sensors is generally unavailable and would practically be infeasible due to limitations in budget and other resources during deployment and maintenance. Currently existing approaches using machine learning are… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  32. arXiv:2211.03128  [pdf, other

    cs.CY cs.CR cs.LG

    Confidence-Ranked Reconstruction of Census Microdata from Published Statistics

    Authors: Travis Dick, Cynthia Dwork, Michael Kearns, Terrance Liu, Aaron Roth, Giuseppe Vietri, Zhiwei Steven Wu

    Abstract: A reconstruction attack on a private dataset $D$ takes as input some publicly accessible information about the dataset and produces a list of candidate elements of $D$. We introduce a new class of data reconstruction attacks based on randomized methods for non-convex optimization. We empirically demonstrate that our attacks can not only reconstruct full rows of $D$ from aggregate query statistics… ▽ More

    Submitted 6 February, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

  33. arXiv:2209.15145  [pdf, other

    cs.LG math.ST

    Batch Multivalid Conformal Prediction

    Authors: Christopher Jung, Georgy Noarov, Ramya Ramalingam, Aaron Roth

    Abstract: We develop fast distribution-free conformal prediction algorithms for obtaining multivalid coverage on exchangeable data in the batch setting. Multivalid coverage guarantees are stronger than marginal coverage guarantees in two ways: (1) They hold even conditional on group membership -- that is, the target coverage level $1-α$ holds conditionally on membership in each of an arbitrary (potentially… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Code to replicate all of our experiments can be found at https://github.com/ProgBelarus/BatchMultivalidConformal

  34. arXiv:2209.09079  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    MSVIPER: Improved Policy Distillation for Reinforcement-Learning-Based Robot Navigation

    Authors: Aaron M. Roth, **g Liang, Ram Sriram, Elham Tabassi, Dinesh Manocha

    Abstract: We present Multiple Scenario Verifiable Reinforcement Learning via Policy Extraction (MSVIPER), a new method for policy distillation to decision trees for improved robot navigation. MSVIPER learns an "expert" policy using any Reinforcement Learning (RL) technique involving learning a state-action map** and then uses imitation learning to learn a decision-tree policy from it. We demonstrate that… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 6 pages main paper, 2 pages of references, 5 page appendix (13 pages total) 5 tables, 9 algorithms, 4 figures

  35. arXiv:2209.07400  [pdf, other

    cs.LG

    Private Synthetic Data for Multitask Learning and Marginal Queries

    Authors: Giuseppe Vietri, Cedric Archambeau, Sergul Aydore, William Brown, Michael Kearns, Aaron Roth, Ankit Siva, Shuai Tang, Zhiwei Steven Wu

    Abstract: We provide a differentially private algorithm for producing synthetic data simultaneously useful for multiple tasks: marginal queries and multitask machine learning (ML). A key innovation in our algorithm is the ability to directly handle numerical features, in contrast to a number of related prior approaches which require numerical features to be first converted into {high cardinality} categorica… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: The short version of this paper appears in the proceedings of NeurIPS-22

  36. arXiv:2209.07375  [pdf, other

    cs.GT

    Wealth Dynamics Over Generations: Analysis and Interventions

    Authors: Krishna Acharya, Eshwar Ram Arunachaleswaran, Sampath Kannan, Aaron Roth, Juba Ziani

    Abstract: We present a stylized model with feedback loops for the evolution of a population's wealth over generations. Individuals have both talent and wealth: talent is a random variable distributed identically for everyone, but wealth is a random variable that is dependent on the population one is born into. Individuals then apply to a downstream agent, which we treat as a university throughout the paper… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  37. arXiv:2209.07312  [pdf, other

    cs.LG cs.DS

    Multicalibrated Regression for Downstream Fairness

    Authors: Ira Globus-Harris, Varun Gupta, Christopher Jung, Michael Kearns, Jamie Morgenstern, Aaron Roth

    Abstract: We show how to take a regression function $\hat{f}$ that is appropriately ``multicalibrated'' and efficiently post-process it into an approximately error minimizing classifier satisfying a large variety of fairness constraints. The post-processing requires no labeled data, and only a modest amount of unlabeled data and computation. The computational and sample complexity requirements of computing… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  38. arXiv:2209.01687  [pdf, ps, other

    cs.LG cs.DS math.ST

    Reconciling Individual Probability Forecasts

    Authors: Aaron Roth, Alexander Tolbert, Scott Weinstein

    Abstract: Individual probabilities refer to the probabilities of outcomes that are realized only once: the probability that it will rain tomorrow, the probability that Alice will die within the next 12 months, the probability that Bob will be arrested for a violent crime in the next 18 months, etc. Individual probabilities are fundamentally unknowable. Nevertheless, we show that two parties who agree on the… ▽ More

    Submitted 6 May, 2023; v1 submitted 4 September, 2022; originally announced September 2022.

    Comments: This is the full version of a paper that appears in the proceedings of FAccT 2023: The Sixth Annual ACM Conference on Fairness, Accountability, and Transparency, 2023

  39. arXiv:2208.05916  [pdf, other

    quant-ph

    Multi-disk clutch optimization using quantum annealing

    Authors: John D. Malcolm, Alexander Roth, Mladjan Radic, Pablo Martin-Ramiro, Jon Oillarburu, Borja Aizpurua, Roman Orus, Samuel Mugel

    Abstract: In this work, we develop a new quantum algorithm to solve a combinatorial problem with significant practical relevance occurring in clutch manufacturing. It is demonstrated how quantum optimization can play a role in real industrial applications in the manufacturing sector. Using the quantum annealer provided by D-Wave Systems, we analyze the performance of the quantum and quantum-classical hybrid… ▽ More

    Submitted 5 April, 2024; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: 11 pages, 4 figures

  40. arXiv:2207.00684  [pdf, other

    cs.LG

    Transforming PageRank into an Infinite-Depth Graph Neural Network

    Authors: Andreas Roth, Thomas Liebig

    Abstract: Popular graph neural networks are shallow models, despite the success of very deep architectures in other application domains of deep learning. This reduces the modeling capacity and leaves models unable to capture long-range relationships. The primary reason for the shallow design results from over-smoothing, which leads node states to become more similar with increased depth. We build on the clo… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted at ECML-PKDD 2022

    ACM Class: I.2.6; I.0

  41. arXiv:2206.04475  [pdf, ps, other

    cs.LG stat.ML

    Individually Fair Learning with One-Sided Feedback

    Authors: Yahav Bechavod, Aaron Roth

    Abstract: We consider an online learning problem with one-sided feedback, in which the learner is able to observe the true label only for positively predicted instances. On each round, $k$ instances arrive and receive classification outcomes according to a randomized policy deployed by the learner, whose goal is to maximize accuracy while deploying individually fair policies. We first extend the framework o… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

  42. arXiv:2206.01067  [pdf, other

    cs.LG

    Practical Adversarial Multivalid Conformal Prediction

    Authors: Osbert Bastani, Varun Gupta, Christopher Jung, Georgy Noarov, Ramya Ramalingam, Aaron Roth

    Abstract: We give a simple, generic conformal prediction method for sequential prediction that achieves target empirical coverage guarantees against adversarially chosen data. It is computationally lightweight -- comparable to split conformal prediction -- but does not require having a held-out validation set, and so all data can be used for training models from which to derive a conformal score. It gives s… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: Code for our experiments can be found at: https://github.com/ProgBelarus/MultiValidPrediction

  43. arXiv:2205.04698  [pdf, other

    quant-ph physics.atom-ph

    Correlated steady states and Raman lasing in continuously pumped and probed atomic ensembles

    Authors: Alexander Roth, Klemens Hammerer, Kirill S. Tikhonov

    Abstract: Spin-polarised atomic ensembles probed by light based on the Faraday interaction are a versatile platform for numerous applications in quantum metrology and quantum information processing. Here we consider an ensemble of Alkali atoms that are continuously optically pumped and probed. Due to the collective scattering of photons at large optical depth, the steady state of atoms does not correspond t… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: 13 pages

  44. arXiv:2203.11481  [pdf, other

    cs.CV cs.CR

    Mixed Differential Privacy in Computer Vision

    Authors: Aditya Golatkar, Alessandro Achille, Yu-Xiang Wang, Aaron Roth, Michael Kearns, Stefano Soatto

    Abstract: We introduce AdaMix, an adaptive differentially private algorithm for training deep neural network classifiers using both private and public image data. While pre-training language models on large public datasets has enabled strong differential privacy (DP) guarantees with minor loss of accuracy, a similar practice yields punishing trade-offs in vision tasks. A few-shot or even zero-shot learning… ▽ More

    Submitted 28 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR 2022

  45. arXiv:2201.10408  [pdf, other

    cs.LG cs.CY cs.DS

    An Algorithmic Framework for Bias Bounties

    Authors: Ira Globus-Harris, Michael Kearns, Aaron Roth

    Abstract: We propose and analyze an algorithmic framework for "bias bounties": events in which external participants are invited to propose improvements to a trained model, akin to bug bounty events in software and security. Our framework allows participants to submit arbitrary subgroup improvements, which are then algorithmically incorporated into an updated model. Our algorithm has the property that there… ▽ More

    Submitted 9 May, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

  46. The diffuse $γ$-ray background is dominated by star-forming galaxies

    Authors: Matt A. Roth, Mark R. Krumholz, Roland M. Crocker, Silvia Celli

    Abstract: The Fermi Gamma-ray Space Telescope has revealed a diffuse $γ$-ray background at energies from 0.1 GeV to 1 TeV, which can be separated into Galactic emission and an isotropic, extragalactic component. Previous efforts to understand the latter have been hampered by the lack of physical models capable of predicting the $γ$-ray emission produced by the many candidate sources, primarily active galact… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: 18 pages, 10 figures. This work has been published in Nature. The version deposited here is the author's pre-print and may not reflect post-acceptance corrections or formatting related changes. The published version (Version of Record) of this manuscript is available at https://www.nature.com/articles/s41586-021-03802-x

    Journal ref: Nature 597, 341-344 (2021)

  47. arXiv:2108.03837  [pdf, ps, other

    cs.LG cs.DS cs.GT

    Online Minimax Multiobjective Optimization: Multicalibeating and Other Applications

    Authors: Daniel Lee, Georgy Noarov, Mallesh Pai, Aaron Roth

    Abstract: We introduce a simple but general online learning framework in which a learner plays against an adversary in a vector-valued game that changes every round. Even though the learner's objective is not convex-concave (and so the minimax theorem does not apply), we give a simple algorithm that can compete with the setting in which the adversary must announce their action first, with optimally diminish… ▽ More

    Submitted 13 October, 2022; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: Appears in NeurIPS 2022

  48. arXiv:2107.04423  [pdf, other

    cs.LG cs.DS

    Multiaccurate Proxies for Downstream Fairness

    Authors: Emily Diana, Wesley Gill, Michael Kearns, Krishnaram Kenthapadi, Aaron Roth, Saeed Sharifi-Malvajerdi

    Abstract: We study the problem of training a model that must obey demographic fairness conditions when the sensitive features are not available at training time -- in other words, how can we train a model to be fair by race when we don't have data about race? We adopt a fairness pipeline perspective, in which an "upstream" learner that does have access to the sensitive features will learn a proxy model for… ▽ More

    Submitted 25 January, 2022; v1 submitted 9 July, 2021; originally announced July 2021.

  49. arXiv:2106.16207  [pdf, other

    cs.SI cs.CY

    When the Echo Chamber Shatters: Examining the Use of Community-Specific Language Post-Subreddit Ban

    Authors: Milo Z. Trujillo, Samuel F. Rosenblatt, Guillermo de Anda Jáuregui, Emily Moog, Briane Paul V. Samson, Laurent Hébert-Dufresne, Allison M. Roth

    Abstract: Community-level bans are a common tool against groups that enable online harassment and harmful speech. Unfortunately, the efficacy of community bans has only been partially studied and with mixed results. Here, we provide a flexible unsupervised methodology to identify in-group language and track user activity on Reddit both before and after the ban of a community (subreddit). We use a simple wor… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: 15 pages (including references and appendix), 5 figures

  50. arXiv:2106.04378  [pdf, other

    cs.LG stat.ML

    Adaptive Machine Unlearning

    Authors: Varun Gupta, Christopher Jung, Seth Neel, Aaron Roth, Saeed Sharifi-Malvajerdi, Chris Waites

    Abstract: Data deletion algorithms aim to remove the influence of deleted data points from trained models at a cheaper computational cost than fully retraining those models. However, for sequences of deletions, most prior work in the non-convex setting gives valid guarantees only for sequences that are chosen independently of the models that are published. If people choose to delete their data as a function… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.