
Showing 1–31 of 31 results for author: Dubey, A

  1. arXiv:2302.00787  [pdf, other]

    cs.LG stat.ML

    FAVOR#: Sharp Attention Kernel Approximations via New Classes of Positive Random Features

    Authors: Valerii Likhosherstov, Krzysztof Choromanski, Avinava Dubey, Frederick Liu, Tamas Sarlos, Adrian Weller

    Abstract: The problem of efficient approximation of a linear operator induced by the Gaussian or softmax kernel is often addressed using random features (RFs) which yield an unbiased approximation of the operator's result. Such operators emerge in important applications ranging from kernel methods to efficient Transformers. We propose parameterized, positive, non-trigonometric RFs which approximate Gaussian…

    Submitted 1 February, 2023; originally announced February 2023.

  2. arXiv:2210.02415  [pdf, other]

    cs.LG cs.DS stat.ML

    A Fourier Approach to Mixture Learning

    Authors: Mingda Qiao, Guru Guruganesh, Ankit Singh Rawat, Avinava Dubey, Manzil Zaheer

    Abstract: We revisit the problem of learning mixtures of spherical Gaussians. Given samples from mixture $\frac{1}{k}\sum_{j=1}^{k}\mathcal{N}(\mu_j, I_d)$, the goal is to estimate the means $\mu_1, \mu_2, \ldots, \mu_k \in \mathbb{R}^d$ up to a small error. The hardness of this learning problem can be measured by the separation $\Delta$ defined as the minimum distance between all pairs of means. Regev and Vijayaraghava…

    Submitted 5 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: To appear at NeurIPS 2022; v2 corrected author information

  3. arXiv:2205.14174  [pdf, other]

    stat.ML cs.CR cs.LG cs.MA

    Private and Byzantine-Proof Cooperative Decision-Making

    Authors: Abhimanyu Dubey, Alex Pentland

    Abstract: The cooperative bandit problem is a multi-agent decision problem involving a group of agents that interact simultaneously with a multi-armed bandit, while communicating over a network with delays. The central idea in this problem is to design algorithms that can efficiently leverage communication to obtain improvements over acting in isolation. In this paper, we investigate the stochastic bandit p…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: Full version of AAMAS 2020 paper uploaded to arXiv

  4. arXiv:2112.02012  [pdf, other]

    cs.LG cs.AI stat.ML

    Practitioner-Centric Approach for Early Incident Detection Using Crowdsourced Data for Emergency Services

    Authors: Yasas Senarath, Ayan Mukhopadhyay, Sayyed Mohsen Vazirizade, Hemant Purohit, Saideep Nannapaneni, Abhishek Dubey

    Abstract: Emergency response is highly dependent on the time of incident reporting. Unfortunately, the traditional approach to receiving incident reports (e.g., calling 911 in the USA) has time delays. Crowdsourcing platforms such as Waze provide an opportunity for early identification of incidents. However, detecting incidents from crowdsourced data streams is difficult due to the challenges of noise and u…

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: Accepted at IEEE International Conference on Data Mining (ICDM) 2021

  5. arXiv:2111.12482  [pdf, other]

    stat.ML cs.LG

    One More Step Towards Reality: Cooperative Bandits with Imperfect Communication

    Authors: Udari Madhushani, Abhimanyu Dubey, Naomi Ehrich Leonard, Alex Pentland

    Abstract: The cooperative bandit problem is increasingly becoming relevant due to its applications in large-scale decision-making. However, most research for this problem focuses exclusively on the setting with perfect communication, whereas in most real-world distributed settings, communication is often over stochastic networks, with arbitrary corruptions and delays. In this paper, we study cooperative ban…

    Submitted 24 November, 2021; originally announced November 2021.

    Journal ref: Conference on Neural Information Processing Systems, 2021

  6. arXiv:2104.07061  [pdf, other]

    cs.LG cs.DS physics.data-an stat.ML

    Exact and Approximate Hierarchical Clustering Using A*

    Authors: Craig S. Greenberg, Sebastian Macaluso, Nicholas Monath, Avinava Dubey, Patrick Flaherty, Manzil Zaheer, Amr Ahmed, Kyle Cranmer, Andrew McCallum

    Abstract: Hierarchical clustering is a critical task in numerous domains. Many approaches are based on heuristics and the properties of the resulting clusterings are studied post hoc. However, in several applications, there is a natural cost function that can be used to characterize the quality of the clustering. In those cases, hierarchical clustering can be seen as a combinatorial optimization problem. To…

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: 30 pages, 9 figures

  7. arXiv:2103.04972  [pdf, ps, other]

    cs.LG cs.MA stat.ML

    Provably Efficient Cooperative Multi-Agent Reinforcement Learning with Function Approximation

    Authors: Abhimanyu Dubey, Alex Pentland

    Abstract: Reinforcement learning in cooperative multi-agent settings has recently advanced significantly in its scope, with applications in cooperative estimation for advertising, dynamic treatment regimes, distributed control, and federated learning. In this paper, we discuss the problem of cooperative multi-agent RL with function approximation, where a group of agents communicates with each other to joint…

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: 53 pages including Appendix

  8. arXiv:2102.12467  [pdf, other]

    stat.ML cs.CR cs.LG

    No-Regret Algorithms for Private Gaussian Process Bandit Optimization

    Authors: Abhimanyu Dubey

    Abstract: The widespread proliferation of data-driven decision-making has ushered in a recent interest in the design of privacy-preserving algorithms. In this paper, we consider the ubiquitous problem of Gaussian process (GP) bandit optimization from the lens of privacy-preserving statistics. We propose a solution for differentially private GP bandit optimization that combines a uniform kernel approximator…

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: AISTATS21 Camera Ready v1

  9. arXiv:2010.11425  [pdf, other]

    cs.LG cs.CR cs.MA stat.ML

    Differentially-Private Federated Linear Bandits

    Authors: Abhimanyu Dubey, Alex Pentland

    Abstract: The rapid proliferation of decentralized learning systems mandates the need for differentially-private cooperative learning. In this paper, we study this in context of the contextual linear bandit: we consider a collection of agents cooperating to solve a common contextual bandit, while ensuring that their communication remains private. For this problem, we devise FedUCB, a multiagent pri…

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: 22 pages. Camera-ready for NeurIPS 2020

  10. arXiv:2008.06244  [pdf, other]

    cs.LG cs.MA stat.ML

    Cooperative Multi-Agent Bandits with Heavy Tails

    Authors: Abhimanyu Dubey, Alex Pentland

    Abstract: We study the heavy-tailed stochastic bandit problem in the cooperative multi-agent setting, where a group of agents interact with a common bandit problem, while communicating on a network with delays. Existing algorithms for the stochastic bandit in this setting utilize confidence intervals arising from an averaging-based communication protocol known as running consensus, that does not le…

    Submitted 14 August, 2020; originally announced August 2020.

    Comments: 26 pages including appendix, camera-ready for ICML 2020

  11. arXiv:2008.06220  [pdf, other]

    cs.LG cs.MA stat.ML

    Kernel Methods for Cooperative Multi-Agent Contextual Bandits

    Authors: Abhimanyu Dubey, Alex Pentland

    Abstract: Cooperative multi-agent decision making involves a group of agents cooperatively solving learning problems while communicating over a network with delays. In this paper, we consider the kernelised contextual bandit problem, where the reward obtained by an agent is an arbitrary linear function of the contexts' images in the related reproducing kernel Hilbert space (RKHS), and a group of agents must…

    Submitted 14 August, 2020; originally announced August 2020.

    Comments: 19 pages including supplement, camera-ready at ICML 2020

  12. arXiv:2007.14062  [pdf, other]

    cs.LG cs.CL stat.ML

    Big Bird: Transformers for Longer Sequences

    Authors: Manzil Zaheer, Guru Guruganesh, Avinava Dubey, Joshua Ainslie, Chris Alberti, Santiago Ontanon, Philip Pham, Anirudh Ravula, Qifan Wang, Li Yang, Amr Ahmed

    Abstract: Transformers-based models, such as BERT, have been one of the most successful deep learning models for NLP. Unfortunately, one of their core limitations is the quadratic dependency (mainly in terms of memory) on the sequence length due to their full attention mechanism. To remedy this, we propose, BigBird, a sparse attention mechanism that reduces this quadratic dependency to linear. We show that…

    Submitted 8 January, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

    Journal ref: Neural Information Processing Systems (NeurIPS) 2020

  13. arXiv:2001.05591  [pdf, other]

    stat.ML cs.LG

    Distributed, partially collapsed MCMC for Bayesian Nonparametrics

    Authors: Avinava Dubey, Michael Minyi Zhang, Eric P. Xing, Sinead A. Williamson

    Abstract: Bayesian nonparametric (BNP) models provide elegant methods for discovering underlying latent features within a data set, but inference in such models can be slow. We exploit the fact that completely random measures, which commonly used models like the Dirichlet process and the beta-Bernoulli process can be expressed as, are decomposable into independent sub-measures. We use this decomposition to…

    Submitted 4 March, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: To appear in the 23rd International Conference on Artificial Intelligence and Statistics

    Journal ref: Artificial Intelligence and Statistics, 108:3685-3695, 2020

  14. arXiv:1912.03718  [pdf, other]

    stat.ME cs.LG eess.SP

    Improved Covariance Matrix Estimator using Shrinkage Transformation and Random Matrix Theory

    Authors: Samruddhi Deshmukh, Amartansh Dubey

    Abstract: One of the major challenges in multivariate analysis is the estimation of population covariance matrix from sample covariance matrix (SCM). Most recent covariance matrix estimators use either shrinkage transformations or asymptotic results from Random Matrix Theory (RMT). Shrinkage techniques help in pulling extreme correlation values towards certain target values whereas tools from RMT help in re…

    Submitted 8 December, 2019; originally announced December 2019.

  15. arXiv:1912.02574  [pdf, other]

    cs.NE cs.LG stat.ML

    Data-Driven Optimization of Public Transit Schedule

    Authors: Sanchita Basak, Fangzhou Sun, Saptarshi Sengupta, Abhishek Dubey

    Abstract: Bus transit systems are the backbone of public transportation in the United States. An important indicator of the quality of service in such infrastructures is on-time performance at stops, with published transit schedules playing an integral role governing the level of success of the service. However there are relatively few optimization architectures leveraging stochastic search that focus on op…

    Submitted 29 November, 2019; originally announced December 2019.

    Comments: 20 pages, 6 figures, 2 tables

  16. arXiv:1908.11250  [pdf, other]

    cs.LG stat.ML

    Smaller Models, Better Generalization

    Authors: Mayank Sharma, Suraj Tripathi, Abhimanyu Dubey, Jayadeva, Sai Guruju, Nihal Goalla

    Abstract: Reducing network complexity has been a major research focus in recent years with the advent of mobile technology. Convolutional Neural Networks that perform various vision tasks without memory overhaul is the need of the hour. This paper focuses on qualitative and quantitative analysis of reducing the network complexity using an upper bound on the Vapnik-Chervonenkis dimension, pruning, and quanti…

    Submitted 29 August, 2019; originally announced August 2019.

    Comments: 10 pages, 3 figures, In Review

  17. arXiv:1907.03821  [pdf, other]

    cs.LG stat.ML

    Thompson Sampling on Symmetric $\alpha$-Stable Bandits

    Authors: Abhimanyu Dubey, Alex Pentland

    Abstract: Thompson Sampling provides an efficient technique to introduce prior knowledge in the multi-armed bandit problem, along with providing remarkable empirical performance. In this paper, we revisit the Thompson Sampling algorithm under rewards drawn from symmetric $\alpha$-stable distributions, which are a class of heavy-tailed probability distributions utilized in finance and economics, in problems such…

    Submitted 5 December, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: IJCAI 2019 Camera Ready with appendix, updated Theorem 1

  18. arXiv:1902.06740  [pdf, other]

    cs.LG stat.ML

    Leveraging Communication Topologies Between Learning Agents in Deep Reinforcement Learning

    Authors: Dhaval Adjodah, Dan Calacci, Abhimanyu Dubey, Anirudh Goyal, Peter Krafft, Esteban Moro, Alex Pentland

    Abstract: A common technique to improve learning performance in deep reinforcement learning (DRL) and many other machine learning algorithms is to run multiple learning agents in parallel. A neglected component in the development of these algorithms has been how best to arrange the learning agents involved to improve distributed search. Here we draw upon results from the networked optimization literatures s…

    Submitted 11 March, 2020; v1 submitted 16 February, 2019; originally announced February 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1811.12556

    Journal ref: AAMAS 2020

  19. arXiv:1812.10782  [pdf, other]

    cs.LG stat.ML

    Evaluating Generative Adversarial Networks on Explicitly Parameterized Distributions

    Authors: Shayne O'Brien, Matt Groh, Abhimanyu Dubey

    Abstract: The true distribution parameterizations of commonly used image datasets are inaccessible. Rather than designing metrics for feature spaces with unknown characteristics, we propose to measure GAN performance by evaluating on explicitly parameterized, synthetic data distributions. As a case study, we examine the performance of 16 GAN variants on six multivariate distributions of varying dimensionali…

    Submitted 27 December, 2018; originally announced December 2018.

    Comments: Presented at the NeurIPS 2018 Workshop on Critiquing and Correcting Trends in Machine Learning

  20. arXiv:1812.03288  [pdf, other]

    cs.LG cs.DC stat.ML

    No Peek: A Survey of private distributed deep learning

    Authors: Praneeth Vepakomma, Tristan Swedish, Ramesh Raskar, Otkrist Gupta, Abhimanyu Dubey

    Abstract: We survey distributed deep learning models for training or inference without accessing raw data from clients. These methods aim to protect confidential patterns in data while still allowing servers to train models. The distributed deep learning methods of federated learning, split learning and large batch stochastic gradient descent are compared in addition to private and secure approaches of diff…

    Submitted 8 December, 2018; originally announced December 2018.

    Comments: 21 pages

  21. arXiv:1811.12556   

    cs.LG cs.AI stat.ML

    How to Organize your Deep Reinforcement Learning Agents: The Importance of Communication Topology

    Authors: Dhaval Adjodah, Dan Calacci, Abhimanyu Dubey, Peter Krafft, Esteban Moro, Alex 'Sandy' Pentland

    Abstract: In this empirical paper, we investigate how learning agents can be arranged in more efficient communication topologies for improved learning. This is an important problem because a common technique to improve speed and robustness of learning in deep reinforcement learning and many other machine learning algorithms is to run multiple learning agents in parallel. The standard communication architect…

    Submitted 2 March, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: please refer to arXiv:1902.06740 for updated paper

  22. arXiv:1810.08985  [pdf, other]

    cs.LG stat.ML

    Mechanisms for Integrated Feature Normalization and Remaining Useful Life Estimation Using LSTMs Applied to Hard-Disks

    Authors: Sanchita Basak, Saptarshi Sengupta, Abhishek Dubey

    Abstract: With emerging smart communities, improving overall system availability is becoming a major concern. In order to improve the reliability of the components in a system we propose an inference model to predict Remaining Useful Life (RUL) of those components. In this paper we work with components of backend data servers such as hard disks, that are subject to degradation. A Deep Long-Short Term Memory…

    Submitted 16 June, 2019; v1 submitted 21 October, 2018; originally announced October 2018.

    Comments: 9 pages, 13 figures, 2 tables

    Journal ref: Proceedings of IEEE Smartcomp 2019

  23. arXiv:1802.00002  [pdf, other]

    cs.LG stat.ML

    DxNAT - Deep Neural Networks for Explaining Non-Recurring Traffic Congestion

    Authors: Fangzhou Sun, Abhishek Dubey, Jules White

    Abstract: Non-recurring traffic congestion is caused by temporary disruptions, such as accidents, sports games, adverse weather, etc. We use data related to real-time traffic speed, jam factors (a traffic congestion indicator), and events collected over a year from Nashville, TN to train a multi-layered deep neural network. The traffic dataset contains over 900 million data records. The network is thereafte…

    Submitted 30 January, 2018; originally announced February 2018.

  24. arXiv:1801.09819  [pdf, other]

    stat.ML

    Transformation Autoregressive Networks

    Authors: Junier B. Oliva, Avinava Dubey, Manzil Zaheer, Barnabás Póczos, Ruslan Salakhutdinov, Eric P. Xing, Jeff Schneider

    Abstract: The fundamental task of general density estimation $p(x)$ has been of keen interest to machine learning. In this work, we attempt to systematically characterize methods for density estimation. Broadly speaking, most of the existing methods can be categorized into either using: a) autoregressive models to estimate the conditional factors of the chain rule, $p(x_{i}\, |\, x_{i-1}, \ldots)$;…

    Submitted 23 October, 2018; v1 submitted 29 January, 2018; originally announced January 2018.

    Journal ref: ICML 2018

  25. arXiv:1705.10750  [pdf, other]

    cs.LG stat.ML

    Recurrent Estimation of Distributions

    Authors: Junier B. Oliva, Kumar Avinava Dubey, Barnabas Poczos, Eric Xing, Jeff Schneider

    Abstract: This paper presents the recurrent estimation of distributions (RED) for modeling real-valued data in a semiparametric fashion. RED models make two novel uses of recurrent neural networks (RNNs) for density estimation of general real-valued data. First, RNNs are used to transform input covariates into a latent space to better capture conditional dependencies in inputs. After, an RNN is used to comp…

    Submitted 30 May, 2017; originally announced May 2017.

  26. arXiv:1705.10301  [pdf, other]

    cs.LG cs.AI stat.ML

    Contextual Explanation Networks

    Authors: Maruan Al-Shedivat, Avinava Dubey, Eric P. Xing

    Abstract: Modern learning algorithms excel at producing accurate but complex models of the data. However, deploying such models in the real-world requires extra care: we must ensure their reliability, robustness, and absence of undesired biases. This motivates the development of models that are equally accurate but can be also easily inspected and assessed beyond their predictive performance. To this end, w…

    Submitted 9 September, 2020; v1 submitted 29 May, 2017; originally announced May 2017.

    Comments: 48 pages, 18 figures, to appear in JMLR

  27. arXiv:1703.03457  [pdf, other]

    stat.ML

    Parallel Markov Chain Monte Carlo for the Indian Buffet Process

    Authors: Michael M. Zhang, Avinava Dubey, Sinead A. Williamson

    Abstract: Indian Buffet Process based models are an elegant way for discovering underlying features within a data set, but inference in such models can be slow. Inferring underlying features using Markov chain Monte Carlo either relies on an uncollapsed representation, which leads to poor mixing, or on a collapsed representation, which leads to a quadratic increase in computational complexity. Existing atte…

    Submitted 9 March, 2017; originally announced March 2017.

    Comments: Workshop paper in Bayesian Nonparametrics: The Next Generation, NIPS 2015

  28. arXiv:1506.08776  [pdf, ps, other]

    stat.ML

    Bayesian Nonparametric Kernel-Learning

    Authors: Junier Oliva, Avinava Dubey, Andrew G. Wilson, Barnabas Poczos, Jeff Schneider, Eric P. Xing

    Abstract: Kernel methods are ubiquitous tools in machine learning. However, there is often little reason for the common practice of selecting a kernel a priori. Even if a universal approximating kernel is selected, the quality of the finite sample estimator may be greatly affected by the choice of kernel. Furthermore, when directly applying kernel methods, one typically needs to compute a $N \times N$ Gram…

    Submitted 29 January, 2018; v1 submitted 29 June, 2015; originally announced June 2015.

  29. arXiv:1409.2617  [pdf, other]

    math.OC stat.ML

    Large-scale randomized-coordinate descent methods with non-separable linear constraints

    Authors: Sashank Reddi, Ahmed Hefny, Carlton Downey, Avinava Dubey, Suvrit Sra

    Abstract: We develop randomized (block) coordinate descent (CD) methods for linearly constrained convex optimization. Unlike most CD methods, we do not assume the constraints to be separable, but let them be coupled linearly. To our knowledge, ours is the first CD method that allows linear coupling constraints, without making the global iteration complexity have an exponential dependence on the number of co…

    Submitted 10 June, 2015; v1 submitted 9 September, 2014; originally announced September 2014.

  30. arXiv:1211.7120  [pdf, other]

    stat.ML

    Exact and Efficient Parallel Inference for Nonparametric Mixture Models

    Authors: Sinead A. Williamson, Avinava Dubey, Eric P. Xing

    Abstract: Nonparametric mixture models based on the Dirichlet process are an elegant alternative to finite models when the number of underlying components is unknown, but inference in such models can be slow. Existing attempts to parallelize inference in such models have relied on introducing approximations, which can lead to inaccuracies in the posterior estimate. In this paper, we describe auxiliary varia…

    Submitted 29 November, 2012; originally announced November 2012.

  31. arXiv:1208.4411  [pdf, other]

    stat.ML

    A non-parametric mixture model for topic modeling over time

    Authors: Avinava Dubey, Ahmed Hefny, Sinead Williamson, Eric P. Xing

    Abstract: A single, stationary topic model such as latent Dirichlet allocation is inappropriate for modeling corpora that span long time periods, as the popularity of topics is likely to change over time. A number of models that incorporate time have been proposed, but in general they either exhibit limited forms of temporal variation, or require computationally expensive inference methods. In this paper we…

    Submitted 21 August, 2012; originally announced August 2012.

    Comments: 9 pages