Showing 1–4 of 4 results for author: Silwal, S

  1. arXiv:2306.11312  [pdf, other]

    cs.DS cs.LG stat.ML

    Data Structures for Density Estimation

    Authors: Anders Aamand, Alexandr Andoni, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal

    Abstract: We study statistical/computational tradeoffs for the following density estimation problem: given $k$ distributions $v_1, \ldots, v_k$ over a discrete domain of size $n$, and sampling access to a distribution $p$, identify $v_i$ that is "close" to $p$. Our main result is the first data structure that, given a sublinear (in $n$) number of samples from $p$, identifies $v_i$ in time sublinear in $k$.…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: To appear at ICML'23

  2. arXiv:2211.03232  [pdf, other]

    cs.LG stat.ML

    Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks

    Authors: Anders Aamand, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Ronitt Rubinfeld, Nicholas Schiefer, Sandeep Silwal, Tal Wagner

    Abstract: Recent work shows that the expressive power of Graph Neural Networks (GNNs) in distinguishing non-isomorphic graphs is exactly the same as that of the Weisfeiler-Lehman (WL) graph test. In particular, they show that the WL test can be simulated by GNNs. However, those simulations involve neural networks for the 'combine' function of size polynomial or even exponential in the number of graph nodes…

    Submitted 21 December, 2022; v1 submitted 6 November, 2022; originally announced November 2022.

    Comments: 22 pages, 5 figures, published at NeurIPS 2022. Updated funding statements

  3. arXiv:1912.01098  [pdf, ps, other]

    cs.LG stat.ML

    Using Dimensionality Reduction to Optimize t-SNE

    Authors: Rikhav Shah, Sandeep Silwal

    Abstract: t-SNE is a popular tool for embedding multi-dimensional datasets into two or three dimensions. However, it has a large computational cost, especially when the input data has many dimensions. Many use t-SNE to embed the output of a neural network, which is generally of much lower dimension than the original data. This limits the use of t-SNE in unsupervised scenarios. We propose using \textit{rando…

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: 11th Annual Workshop on Optimization for Machine Learning (OPT2019)

  4. arXiv:1911.07324  [pdf, ps, other]

    cs.DS cs.DM cs.LG stat.ML

    Testing Properties of Multiple Distributions with Few Samples

    Authors: Maryam Aliakbarpour, Sandeep Silwal

    Abstract: We propose a new setting for testing properties of distributions while receiving samples from several distributions, but few samples per distribution. Given samples from $s$ distributions, $p_1, p_2, \ldots, p_s$, we design testers for the following problems: (1) Uniformity Testing: Testing whether all the $p_i$'s are uniform or $\varepsilon$-far from being uniform in $\ell_1$-distance (2) Identity Testing:…

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: ITCS 2020