Skip to main content

Showing 1–11 of 11 results for author: Somani, R

.
  1. arXiv:2308.09214  [pdf, other

    math.PR stat.ML

    Path convergence of Markov chains on large graphs

    Authors: Siva Athreya, Soumik Pal, Raghav Somani, Raghavendra Tripathi

    Abstract: We consider two classes of natural stochastic processes on finite unlabeled graphs. These are Euclidean stochastic optimization algorithms on the adjacency matrix of weighted graphs and a modified version of the Metropolis MCMC algorithm on stochastic block models over unweighted graphs. In both cases we show that, as the size of the graph goes to infinity, the random trajectories of the stochasti… ▽ More

    Submitted 15 October, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: Improved presentation, added Non-asymptotic rate of convergence in main results. 45 pages+references, 1 figure, 1 table

    MSC Class: 05C80; 60K35; 65C05

  2. arXiv:2210.00422  [pdf, ps, other

    math.PR cs.LG stat.ML

    Stochastic optimization on matrices and a graphon McKean-Vlasov limit

    Authors: Zaid Harchaoui, Sewoong Oh, Soumik Pal, Raghav Somani, Raghavendra Tripathi

    Abstract: We consider stochastic gradient descents on the space of large symmetric matrices of suitable functions that are invariant under permuting the rows and columns using the same permutation. We establish deterministic limits of these random curves as the dimensions of the matrices go to infinity while the entries remain bounded. Under a ``small noise'' assumption the limit is shown to be the gradient… ▽ More

    Submitted 27 May, 2024; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: 37 pages+ references, introduction modified and new examples added. Improved presentation

    MSC Class: 05C60; 05C63; 05C80; 68R10; 60K35; 60G09

  3. arXiv:2111.09459  [pdf, other

    math.PR cs.LG stat.ML

    Gradient flows on graphons: existence, convergence, continuity equations

    Authors: Sewoong Oh, Soumik Pal, Raghav Somani, Raghavendra Tripathi

    Abstract: Wasserstein gradient flows on probability measures have found a host of applications in various optimization problems. They typically arise as the continuum limit of exchangeable particle systems evolving by some mean-field interaction involving a gradient-type potential. However, in many problems, such as in multi-layer neural networks, the so-called particles are edge weights on large graphs who… ▽ More

    Submitted 29 June, 2023; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: 43+3 pages, 2 figures (Accepted version for publication in Journal of Theoretical Probability)

    MSC Class: 05C60; 05C80; 68R10; 60K35

  4. arXiv:2106.01487  [pdf, other

    cs.LG cs.CV

    LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes

    Authors: Aditya Kusupati, Matthew Wallingford, Vivek Ramanujan, Raghav Somani, Jae Sung Park, Krishna Pillutla, Prateek Jain, Sham Kakade, Ali Farhadi

    Abstract: Learning binary representations of instances and classes is a classical problem with several high potential applications. In modern settings, the compression of high-dimensional neural representations to low-dimensional binary codes is a challenging task and often require large bit-codes to be accurate. In this work, we propose a novel method for Learning Low-dimensional binary Codes (LLC) for ins… ▽ More

    Submitted 6 October, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 Camera Ready. 19 pages, 6 figures

  5. arXiv:2104.11315  [pdf, other

    cs.LG cs.AI stat.ML

    SPECTRE: Defending Against Backdoor Attacks Using Robust Statistics

    Authors: Jonathan Hayase, Weihao Kong, Raghav Somani, Sewoong Oh

    Abstract: Modern machine learning increasingly requires training on a large collection of data from multiple sources, not all of which can be trusted. A particularly concerning scenario is when a small fraction of poisoned data changes the behavior of the trained model when triggered by an attacker-specified watermark. Such a compromised model will be deployed unnoticed as the model is accurate otherwise. T… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: 29 pages 19 figures

  6. arXiv:2006.09702  [pdf, other

    cs.LG stat.ML

    Robust Meta-learning for Mixed Linear Regression with Small Batches

    Authors: Weihao Kong, Raghav Somani, Sham Kakade, Sewoong Oh

    Abstract: A common challenge faced in practical supervised learning, such as medical image processing and robotic interactions, is that there are plenty of tasks but each task cannot afford to collect enough labeled examples to be learned in isolation. However, by exploiting the similarities across those tasks, one can hope to overcome such data scarcity. Under a canonical scenario where each task is drawn… ▽ More

    Submitted 18 June, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: 52 pages, 2 figures

  7. arXiv:2002.08936  [pdf, other

    cs.LG stat.ML

    Meta-learning for mixed linear regression

    Authors: Weihao Kong, Raghav Somani, Zhao Song, Sham Kakade, Sewoong Oh

    Abstract: In modern supervised learning, there are a large number of tasks, but many of them are associated with only a small amount of labeled data. These include data from medical image processing and robotic interaction. Even though each individual task cannot be meaningfully trained in isolation, one seeks to meta-learn across the tasks from past experiences by exploiting some similarities. We study a f… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

  8. arXiv:2002.03231  [pdf, other

    cs.LG cs.CV stat.ML

    Soft Threshold Weight Reparameterization for Learnable Sparsity

    Authors: Aditya Kusupati, Vivek Ramanujan, Raghav Somani, Mitchell Wortsman, Prateek Jain, Sham Kakade, Ali Farhadi

    Abstract: Sparsity in Deep Neural Networks (DNNs) is studied extensively with the focus of maximizing prediction accuracy given an overall parameter budget. Existing methods rely on uniform or heuristic non-uniform sparsity budgets which have sub-optimal layer-wise parameter allocation resulting in a) lower prediction accuracy or b) higher inference cost (FLOPs). This work proposes Soft Threshold Reparamete… ▽ More

    Submitted 22 June, 2020; v1 submitted 8 February, 2020; originally announced February 2020.

    Comments: 19 pages, 10 figures, Published at International Conference on Machine Learning (ICML) 2020

  9. arXiv:1910.09626  [pdf, other

    cs.LG stat.ML

    Non-Gaussianity of Stochastic Gradient Noise

    Authors: Abhishek Panigrahi, Raghav Somani, Navin Goyal, Praneeth Netrapalli

    Abstract: What enables Stochastic Gradient Descent (SGD) to achieve better generalization than Gradient Descent (GD) in Neural Network training? This question has attracted much attention. In this paper, we study the distribution of the Stochastic Gradient Noise (SGN) vectors during the training. We observe that for batch sizes 256 and above, the distribution is best described as Gaussian at-least in the ea… ▽ More

    Submitted 25 October, 2019; v1 submitted 21 October, 2019; originally announced October 2019.

  10. arXiv:1811.00159  [pdf, other

    cs.IR cs.LG stat.ML

    Clustered Monotone Transforms for Rating Factorization

    Authors: Gaurush Hiranandani, Raghav Somani, Oluwasanmi Koyejo, Sreangsu Acharyya

    Abstract: Exploiting low-rank structure of the user-item rating matrix has been the crux of many recommendation engines. However, existing recommendation engines force raters with heterogeneous behavior profiles to map their intrinsic rating scales to a common rating scale (e.g. 1-5). This non-linear transformation of the rating scale shatters the low-rank structure of the rating matrix, therefore resulting… ▽ More

    Submitted 31 October, 2018; originally announced November 2018.

    Comments: The first two authors contributed equally to the paper. The paper to appear in WSDM 2019

  11. arXiv:1707.02294  [pdf, ps, other

    stat.ML cs.LG stat.CO

    A case study of Empirical Bayes in User-Movie Recommendation system

    Authors: Arabin Kumar Dey, Raghav Somani, Sreangsu Acharyya

    Abstract: In this article we provide a formulation of empirical bayes described by Atchade (2011) to tune the hyperparameters of priors used in bayesian set up of collaborative filter. We implement the same in MovieLens small dataset. We see that it can be used to get a good initial choice for the parameters. It can also be used to guess an initial choice for hyper-parameters in grid search procedure even f… ▽ More

    Submitted 7 July, 2017; originally announced July 2017.

    Comments: 14 pages, 3 figures, 4 subfigures