Skip to main content

Showing 1–4 of 4 results for author: Siva, A

.
  1. arXiv:2209.07400  [pdf, other

    cs.LG

    Private Synthetic Data for Multitask Learning and Marginal Queries

    Authors: Giuseppe Vietri, Cedric Archambeau, Sergul Aydore, William Brown, Michael Kearns, Aaron Roth, Ankit Siva, Shuai Tang, Zhiwei Steven Wu

    Abstract: We provide a differentially private algorithm for producing synthetic data simultaneously useful for multiple tasks: marginal queries and multitask machine learning (ML). A key innovation in our algorithm is the ability to directly handle numerical features, in contrast to a number of related prior approaches which require numerical features to be first converted into {high cardinality} categorica… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: The short version of this paper appears in the proceedings of NeurIPS-22

  2. Amazon SageMaker Clarify: Machine Learning Bias Detection and Explainability in the Cloud

    Authors: Michaela Hardt, Xiaoguang Chen, Xiaoyi Cheng, Michele Donini, Jason Gelman, Satish Gollaprolu, John He, Pedro Larroy, Xinyu Liu, Nick McCarthy, Ashish Rathi, Scott Rees, Ankit Siva, ErhYuan Tsai, Keerthan Vasist, Pinar Yilmaz, Muhammad Bilal Zafar, Sanjiv Das, Kevin Haas, Tyler Hill, Krishnaram Kenthapadi

    Abstract: Understanding the predictions made by machine learning (ML) models and their potential biases remains a challenging and labor-intensive task that depends on the application, the dataset, and the specific model. We present Amazon SageMaker Clarify, an explainability feature for Amazon SageMaker that launched in December 2020, providing insights into data and ML models by identifying biases and expl… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

    Journal ref: In Proc. ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2974-2983 (2021)

  3. arXiv:2103.06641  [pdf, other

    cs.LG cs.CR

    Differentially Private Query Release Through Adaptive Projection

    Authors: Sergul Aydore, William Brown, Michael Kearns, Krishnaram Kenthapadi, Luca Melis, Aaron Roth, Ankit Siva

    Abstract: We propose, implement, and evaluate a new algorithm for releasing answers to very large numbers of statistical queries like $k$-way marginals, subject to differential privacy. Our algorithm makes adaptive use of a continuous relaxation of the Projection Mechanism, which answers queries on the private dataset using simple perturbation, and then attempts to find the synthetic dataset that most close… ▽ More

    Submitted 23 June, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

  4. arXiv:1912.03250  [pdf, other

    cs.LG cs.CR stat.ML

    Differentially Private Synthetic Mixed-Type Data Generation For Unsupervised Learning

    Authors: Uthaipon Tantipongpipat, Chris Waites, Digvijay Boob, Amaresh Ankit Siva, Rachel Cummings

    Abstract: We introduce the DP-auto-GAN framework for synthetic data generation, which combines the low dimensional representation of autoencoders with the flexibility of Generative Adversarial Networks (GANs). This framework can be used to take in raw sensitive data and privately train a model for generating synthetic data that will satisfy similar statistical properties as the original data. This learned m… ▽ More

    Submitted 9 December, 2020; v1 submitted 6 December, 2019; originally announced December 2019.