Skip to main content

Showing 1–9 of 9 results for author: Sivakumar, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19566  [pdf, other

    cs.LG cs.CR cs.DS math.ST stat.ML

    Instance-Optimal Private Density Estimation in the Wasserstein Distance

    Authors: Vitaly Feldman, Audra McMillan, Satchit Sivakumar, Kunal Talwar

    Abstract: Estimating the density of a distribution from samples is a fundamental problem in statistics. In many practical settings, the Wasserstein distance is an appropriate error metric for density estimation. For example, when estimating population densities in a geographic region, a small Wasserstein distance means that the estimate is able to capture roughly where the population mass is. In this work w… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2401.06336  [pdf

    cs.IR cs.DB

    TRACE: A Time-Relational Approximate Cubing Engine for Fast Data Insights

    Authors: Suharsh Sivakumar, Jonathan Shen, Rajat Monga

    Abstract: A large class of data questions can be modeled as identifying important slices of data driven by user defined metrics. This paper presents TRACE, a Time-Relational Approximate Cubing Engine that enables interactive analysis on such slices with a low upfront cost - both in space and computation. It does this by materializing the most important parts of the cube over time enabling interactive queryi… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    ACM Class: H.3.3

  3. arXiv:2306.06723  [pdf, ps, other

    cs.DS cs.CR

    Counting Distinct Elements in the Turnstile Model with Differential Privacy under Continual Observation

    Authors: Palak Jain, Iden Kalemaj, Sofya Raskhodnikova, Satchit Sivakumar, Adam Smith

    Abstract: Privacy is a central challenge for systems that learn from sensitive data sets, especially when a system's outputs must be continuously updated to reflect changing data. We consider the achievable error for differentially private continual release of a basic statistic -- the number of distinct items -- in a stream where items may be both inserted and deleted (the turnstile model). With only insert… ▽ More

    Submitted 30 October, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

  4. arXiv:2304.09133  [pdf

    eess.IV cs.CV cs.LG

    Detection and Classification of Glioblastoma Brain Tumor

    Authors: Utkarsh Maurya, Appisetty Krishna Kalyan, Swapnil Bohidar, Dr. S. Sivakumar

    Abstract: Glioblastoma brain tumors are highly malignant and often require early detection and accurate segmentation for effective treatment. We are proposing two deep learning models in this paper, namely UNet and Deeplabv3, for the detection and segmentation of glioblastoma brain tumors using preprocessed brain MRI images. The performance evaluation is done for these models in terms of accuracy and comput… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 12 pages, 8 figures

  5. arXiv:2303.12921  [pdf, ps, other

    cs.LG cs.CR cs.DS

    Stability is Stable: Connections between Replicability, Privacy, and Adaptive Generalization

    Authors: Mark Bun, Marco Gaboardi, Max Hopkins, Russell Impagliazzo, Rex Lei, Toniann Pitassi, Satchit Sivakumar, Jessica Sorrell

    Abstract: The notion of replicable algorithms was introduced in Impagliazzo et al. [STOC '22] to describe randomized algorithms that are stable under the resampling of their inputs. More precisely, a replicable algorithm gives the same output with high probability when its randomness is fixed and it is run on a new i.i.d. sample drawn from the same distribution. Using replicable algorithms for data analysis… ▽ More

    Submitted 24 March, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: STOC 2023, minor typos fixed

  6. arXiv:2211.08193  [pdf, ps, other

    cs.LG cs.CR

    Differentially Private Sampling from Distributions

    Authors: Sofya Raskhodnikova, Satchit Sivakumar, Adam Smith, Marika Swanberg

    Abstract: We initiate an investigation of private sampling from distributions. Given a dataset with $n$ independent observations from an unknown distribution $P$, a sampling algorithm must output a single observation from a distribution that is close in total variation distance to $P$ while satisfying differential privacy. Sampling abstracts the goal of generating small amounts of realistic-looking data. We… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 44 pages, preliminary version in NeurIPS 2021

  7. arXiv:2112.00828  [pdf, other

    cs.DS cs.CR

    The Price of Differential Privacy under Continual Observation

    Authors: Palak Jain, Sofya Raskhodnikova, Satchit Sivakumar, Adam Smith

    Abstract: We study the accuracy of differentially private mechanisms in the continual release model. A continual release mechanism receives a sensitive dataset as a stream of $T$ inputs and produces, after receiving each input, an accurate output on the obtained inputs. In contrast, a batch algorithm receives the data as one batch and produces a single output. We provide the first strong lower bounds on t… ▽ More

    Submitted 11 January, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: 28 pages

  8. arXiv:2109.03200  [pdf, other

    cs.AI cs.CL cs.LG

    ExCode-Mixed: Explainable Approaches towards Sentiment Analysis on Code-Mixed Data using BERT models

    Authors: Aman Priyanshu, Aleti Vardhan, Sudarshan Sivakumar, Supriti Vijay, Nipuna Chhabra

    Abstract: The increasing use of social media sites in countries like India has given rise to large volumes of code-mixed data. Sentiment analysis of this data can provide integral insights into people's perspectives and opinions. Develo** robust explainability techniques which explain why models make their predictions becomes essential. In this paper, we propose an adequate methodology to integrate explai… ▽ More

    Submitted 25 September, 2021; v1 submitted 7 September, 2021; originally announced September 2021.

    Comments: 3 pages, 1 figure

  9. arXiv:2107.10870  [pdf, other

    cs.LG cs.DS

    Multiclass versus Binary Differentially Private PAC Learning

    Authors: Mark Bun, Marco Gaboardi, Satchit Sivakumar

    Abstract: We show a generic reduction from multiclass differentially private PAC learning to binary private PAC learning. We apply this transformation to a recently proposed binary private PAC learner to obtain a private multiclass learner with sample complexity that has a polynomial dependence on the multiclass Littlestone dimension and a poly-logarithmic dependence on the number of classes. This yields an… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.