Showing 1–19 of 19 results for author: McMillan, A

  1. arXiv:2406.19566  [pdf, other]

    cs.LG cs.CR cs.DS math.ST stat.ML

    Instance-Optimal Private Density Estimation in the Wasserstein Distance

    Authors: Vitaly Feldman, Audra McMillan, Satchit Sivakumar, Kunal Talwar

    Abstract: Estimating the density of a distribution from samples is a fundamental problem in statistics. In many practical settings, the Wasserstein distance is an appropriate error metric for density estimation. For example, when estimating population densities in a geographic region, a small Wasserstein distance means that the estimate is able to capture roughly where the population mass is. In this work w…

    Submitted 27 June, 2024; originally announced June 2024.

  2. Anatomy and Physiology of Artificial Intelligence in PET Imaging

    Authors: Tyler J. Bradshaw, Alan B. McMillan

    Abstract: The influence of artificial intelligence (AI) within the field of nuclear medicine has been rapidly growing. Many researchers and clinicians are seeking to apply AI within PET, and clinicians will soon find themselves engaging with AI-based applications all along the chain of molecular imaging, from image reconstruction to enhanced reporting. This expanding presence of AI in PET imaging will resul…

    Submitted 30 November, 2023; originally announced November 2023.

    Journal ref: PET Clin; 16(4):471-482 (2021)

  3. arXiv:2307.15835  [pdf, ps, other]

    cs.CR cs.DS cs.LG stat.ML

    Mean Estimation with User-level Privacy under Data Heterogeneity

    Authors: Rachel Cummings, Vitaly Feldman, Audra McMillan, Kunal Talwar

    Abstract: A key challenge in many modern data analysis tasks is that user data are heterogeneous. Different users may possess vastly different numbers of data points. More importantly, it cannot be assumed that all users sample from the same underlying distribution. This is true, for example, in language data, where different speech styles result in data heterogeneity. In this work we propose a simple model…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Conference version published at NeurIPS 2022

  4. arXiv:2307.15017  [pdf, other]

    cs.CR cs.LG

    Samplable Anonymous Aggregation for Private Federated Data Analysis

    Authors: Kunal Talwar, Shan Wang, Audra McMillan, Vojta Jina, Vitaly Feldman, Bailey Basile, Aine Cahill, Yi Sheng Chan, Mike Chatzidakis, Junye Chen, Oliver Chick, Mona Chitnis, Suman Ganta, Yusuf Goren, Filip Granqvist, Kristine Guo, Frederic Jacobs, Omid Javidbakht, Albert Liu, Richard Low, Dan Mascenik, Steve Myers, David Park, Wonhee Park, Gianni Parsa, et al. (11 additional authors not shown)

    Abstract: We revisit the problem of designing scalable protocols for private statistics and private federated learning when each device holds its private data. Our first contribution is to propose a simple primitive that allows for efficient implementation of several commonly used algorithms, and allows for privacy accounting that is close to that in the central setting without requiring the strong trust as…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 24 pages

  5. arXiv:2307.11749  [pdf, other]

    cs.LG cs.CR

    Differentially Private Heavy Hitter Detection using Federated Analytics

    Authors: Karan Chadha, Junye Chen, John Duchi, Vitaly Feldman, Hanieh Hashemi, Omid Javidbakht, Audra McMillan, Kunal Talwar

    Abstract: In this work, we study practical heuristics to improve the performance of prefix-tree based algorithms for differentially private heavy hitter detection. Our model assumes each user has multiple data points and the goal is to learn as many of the most frequent data points as possible across all users' data with aggregate and local differential privacy. We propose an adaptive hyperparameter tuning…

    Submitted 21 July, 2023; originally announced July 2023.

  6. arXiv:2211.10082  [pdf, other]

    cs.CR

    Private Federated Statistics in an Interactive Setting

    Authors: Audra McMillan, Omid Javidbakht, Kunal Talwar, Elliot Briggs, Mike Chatzidakis, Junye Chen, John Duchi, Vitaly Feldman, Yusuf Goren, Michael Hesse, Vojta Jina, Anil Katti, Albert Liu, Cheney Lyford, Joey Meyer, Alex Palmer, David Park, Wonhee Park, Gianni Parsa, Paul Pelzl, Rehan Rishi, Congzheng Song, Shan Wang, Shundong Zhou

    Abstract: Privately learning statistics of events on devices can enable improved user experience. Differentially private algorithms for such problems can benefit significantly from interactivity. We argue that an aggregation protocol can enable an interactive private federated statistics system where users' devices maintain control of the privacy assurance. We describe the architecture of such a system, and…

    Submitted 18 November, 2022; originally announced November 2022.

  7. arXiv:2210.15819  [pdf, other]

    math.ST cs.CR cs.LG

    Instance-Optimal Differentially Private Estimation

    Authors: Audra McMillan, Adam Smith, Jon Ullman

    Abstract: In this work, we study local minimax convergence estimation rates subject to $ε$-differential privacy. Unlike worst-case rates, which may be conservative, algorithms that are locally minimax optimal must adapt to easy instances of the problem. We construct locally minimax differentially private estimators for one-parameter exponential families and estimating the tail rate of a distribution. In the…

    Submitted 27 October, 2022; originally announced October 2022.

  8. arXiv:2208.04591  [pdf, other]

    cs.CR cs.DS cs.LG stat.ML

    Stronger Privacy Amplification by Shuffling for Rényi and Approximate Differential Privacy

    Authors: Vitaly Feldman, Audra McMillan, Kunal Talwar

    Abstract: The shuffle model of differential privacy has gained significant interest as an intermediate trust model between the standard local and central models [EFMRTT19; CSUZZ19]. A key result in this model is that randomly shuffling locally randomized data amplifies differential privacy guarantees. Such amplification implies substantially stronger privacy guarantees for systems in which data is contribut…

    Submitted 30 October, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: Errata added. 14 pages, 4 figures

  9. arXiv:2106.10333  [pdf, other]

    cs.CR cs.LG stat.ME stat.ML

    Non-parametric Differentially Private Confidence Intervals for the Median

    Authors: Joerg Drechsler, Ira Globus-Harris, Audra McMillan, Jayshree Sarathy, Adam Smith

    Abstract: Differential privacy is a restriction on data processing algorithms that provides strong confidentiality guarantees for individual records in the data. However, research on proper statistical inference, that is, research on properly quantifying the uncertainty of the (noisy) sample estimate regarding the true value in the population, is currently still limited. This paper proposes and evaluates se…

    Submitted 3 July, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: 44 pages, 15 figures

  10. arXiv:2012.12803  [pdf, other]

    cs.LG cs.CR cs.DS stat.ML

    Hiding Among the Clones: A Simple and Nearly Optimal Analysis of Privacy Amplification by Shuffling

    Authors: Vitaly Feldman, Audra McMillan, Kunal Talwar

    Abstract: Recent work of Erlingsson, Feldman, Mironov, Raghunathan, Talwar, and Thakurta [EFMRTT19] demonstrates that random shuffling amplifies differential privacy guarantees of locally randomized data. Such amplification implies substantially stronger privacy guarantees for systems in which data is contributed anonymously [BEMMRLRKTS17] and has led to significant interest in the shuffle model of privacy…

    Submitted 7 September, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: Updated to include numerical experiments for Renyi differential privacy

  11. arXiv:2007.12674  [pdf, other]

    stat.ME cs.CR cs.LG

    Controlling Privacy Loss in Sampling Schemes: an Analysis of Stratified and Cluster Sampling

    Authors: Mark Bun, Jörg Drechsler, Marco Gaboardi, Audra McMillan, Jayshree Sarathy

    Abstract: Sampling schemes are fundamental tools in statistics, survey design, and algorithm design. A fundamental result in differential privacy is that a differentially private mechanism run on a simple random sample of a population provides stronger privacy guarantees than the same algorithm run on the entire population. However, in practice, sampling designs are often more complex than the simple, data-…

    Submitted 21 June, 2023; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: Appeared at FORC 2022

  12. arXiv:2007.05157  [pdf, other]

    cs.LG cs.CR stat.ME stat.ML

    Differentially Private Simple Linear Regression

    Authors: Daniel Alabi, Audra McMillan, Jayshree Sarathy, Adam Smith, Salil Vadhan

    Abstract: Economics and social science research often require analyzing datasets of sensitive personal information at fine granularity, with models fit to small subsets of the data. Unfortunately, such fine-grained analysis can easily reveal sensitive individual information. We study algorithms for simple linear regression that satisfy differential privacy, a constraint which guarantees that an algorithm's…

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: 20 pages, 18 figures

  13. arXiv:1908.00656  [pdf]

    eess.IV cs.CV

    Robustifying deep networks for image segmentation

    Authors: Zheng Liu, Jinnian Zhang, Varun Jog, Po-Ling Loh, Alan B McMillan

    Abstract: Purpose: The purpose of this study is to investigate the robustness of a commonly-used convolutional neural network for image segmentation with respect to visually-subtle adversarial perturbations, and suggest new methods to make these networks more robust to such perturbations. Materials and Methods: In this retrospective study, the accuracy of brain tumor segmentation was studied in subjects wit…

    Submitted 1 August, 2019; originally announced August 2019.

  14. arXiv:1905.11947  [pdf, ps, other]

    cs.DS cs.CR cs.IT cs.LG stat.ML

    Private Identity Testing for High-Dimensional Distributions

    Authors: Clément L. Canonne, Gautam Kamath, Audra McMillan, Jonathan Ullman, Lydia Zakynthinou

    Abstract: In this work we present novel differentially private identity (goodness-of-fit) testers for natural and widely studied classes of multivariate product distributions: Gaussians in $\mathbb{R}^d$ with known covariance and product distributions over $\{\pm 1\}^{d}$. Our testers have improved sample complexity compared to those derived from previous techniques, and are the first testers whose sample c…

    Submitted 3 March, 2022; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Discussing a mistake in the proof of one of the algorithms (Theorem 1.2, computationally inefficient tester), and pointing to follow-up work by Narayanan (2022) who improves upon our results and fixes this mistake

  15. arXiv:1811.11148  [pdf, ps, other]

    cs.DS cs.CR cs.IT cs.LG stat.ML

    The Structure of Optimal Private Tests for Simple Hypotheses

    Authors: Clément L. Canonne, Gautam Kamath, Audra McMillan, Adam Smith, Jonathan Ullman

    Abstract: Hypothesis testing plays a central role in statistical inference, and is used in many settings where privacy concerns are paramount. This work answers a basic question about privately testing simple hypotheses: given two distributions $P$ and $Q$, and a privacy level $\varepsilon$, how many i.i.d. samples are needed to distinguish $P$ from $Q$ subject to $\varepsilon$-differential privacy, and wha…

    Submitted 2 April, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: To appear in STOC 2019

  16. arXiv:1806.06427  [pdf, ps, other]

    cs.CR cs.DS

    Property Testing for Differential Privacy

    Authors: Anna Gilbert, Audra McMillan

    Abstract: We consider the problem of property testing for differential privacy: with black-box access to a purportedly private algorithm, can we verify its privacy guarantees? In particular, we show that any privacy guarantee that can be efficiently verified is also efficiently breakable in the sense that there exist two databases between which we can efficiently distinguish. We give lower bounds on the que…

    Submitted 13 February, 2019; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: Allerton, 2018

  17. arXiv:1711.10019  [pdf, ps, other]

    cs.LG

    Online Learning via the Differential Privacy Lens

    Authors: Jacob Abernethy, Young Hun Jung, Chansoo Lee, Audra McMillan, Ambuj Tewari

    Abstract: In this paper, we use differential privacy as a lens to examine online learning in both full and partial information settings. The differential privacy framework is, at heart, less about privacy and more about algorithmic stability, and thus has found application in domains well beyond those where information security is central. Here we develop an algorithmic property called one-step differential…

    Submitted 28 October, 2019; v1 submitted 27 November, 2017; originally announced November 2017.

  18. arXiv:1706.05916  [pdf, other]

    cs.CR cs.DB

    Local Differential Privacy for Physical Sensor Data and Sparse Recovery

    Authors: Anna C. Gilbert, Audra McMillan

    Abstract: In this work we explore the utility of locally differentially private thermal sensor data. We design a locally differentially private recovery algorithm for the 1-dimensional, discrete heat source location problem and analyse its performance in terms of the Earth Mover Distance error. Our work indicates that it is possible to produce locally private sensor measurements that both keep the exact loc…

    Submitted 23 March, 2018; v1 submitted 30 May, 2017; originally announced June 2017.

    Comments: appeared at CISS 2018

  19. arXiv:1604.01871  [pdf, ps, other]

    math.ST cs.LG

    When is Nontrivial Estimation Possible for Graphons and Stochastic Block Models?

    Authors: Audra McMillan, Adam Smith

    Abstract: Block graphons (also called stochastic block models) are an important and widely-studied class of models for random networks. We provide a lower bound on the accuracy of estimators for block graphons with a large number of blocks. We show that, given only the number $k$ of blocks and an upper bound $ρ$ on the values (connection probabilities) of the graphon, every estimator incurs error at least o…

    Submitted 7 April, 2016; originally announced April 2016.

    Comments: 11 pages