Skip to main content

Showing 1–17 of 17 results for author: Sankararaman, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2306.09548  [pdf, other

    stat.ML cs.LG

    Online Heavy-tailed Change-point detection

    Authors: Abishek Sankararaman, Balakrishnan, Narayanaswamy

    Abstract: We study algorithms for online change-point detection (OCPD), where samples that are potentially heavy-tailed, are presented one at a time and a change in the underlying mean must be detected as early as possible. We present an algorithm based on clipped Stochastic Gradient Descent (SGD), that works even if we only assume that the second moment of the data generating process is bounded. We derive… ▽ More

    Submitted 3 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: UAI 2023

  2. arXiv:2211.07484  [pdf, ps, other

    cs.LG stat.ML

    Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression

    Authors: Aleksandrs Slivkins, Xingyu Zhou, Karthik Abinav Sankararaman, Dylan J. Foster

    Abstract: We consider contextual bandits with linear constraints (CBwLC), a variant of contextual bandits in which the algorithm consumes multiple resources subject to linear constraints on total consumption. This problem generalizes contextual bandits with knapsacks (CBwK), allowing for packing and covering constraints, as well as positive and negative resource consumption. We provide the first algorithm f… ▽ More

    Submitted 29 June, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: A preliminary version of this paper, authored by A. Slivkins, K.A. Sankararaman and D.J. Foster, has been published at COLT 2023. The present version features an important improvement, due to Xingyu Zhou. Specifically, the $\sqrt{T}$-regret result in Theorem 3.6(a) holds under a much weaker assumption, and is now positioned as the main guarantee

  3. arXiv:2206.00120  [pdf, other

    stat.ML cs.IT cs.LG

    Decentralized Competing Bandits in Non-Stationary Matching Markets

    Authors: Avishek Ghosh, Abishek Sankararaman, Kannan Ramchandran, Tara Javidi, Arya Mazumdar

    Abstract: Understanding complex dynamics of two-sided online matching markets, where the demand-side agents compete to match with the supply-side (arms), has recently received substantial interest. To that end, in this paper, we introduce the framework of decentralized two-sided matching market under non stationary (dynamic) environments. We adhere to the serial dictatorship setting, where the demand-side a… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

  4. arXiv:2205.09899  [pdf, other

    stat.ML cs.AI cs.IT cs.LG

    Breaking the $\sqrt{T}$ Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits

    Authors: Avishek Ghosh, Abishek Sankararaman

    Abstract: We prove an instance independent (poly) logarithmic regret for stochastic contextual bandits with linear payoff. Previously, in \cite{chu2011contextual}, a lower bound of $\mathcal{O}(\sqrt{T})$ is shown for the contextual linear bandit problem with arbitrary (adversarily chosen) contexts. In this paper, we show that stochastic contexts indeed help to reduce the regret from $\sqrt{T}$ to… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: To appear in ICML 2022

  5. arXiv:2107.03455  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Model Selection for Generic Contextual Bandits

    Authors: Avishek Ghosh, Abishek Sankararaman, Kannan Ramchandran

    Abstract: We consider the problem of model selection for the general stochastic contextual bandits under the realizability assumption. We propose a successive refinement based algorithm called Adaptive Contextual Bandit ({\ttfamily ACB}), that works in phases and successively eliminates model classes that are too simple to fit the given instance. We prove that this algorithm is adaptive, i.e., the regret ra… ▽ More

    Submitted 20 July, 2023; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: Accepted at IEEE Transactions on Information Theory. arXiv admin note: text overlap with arXiv:2006.02612

  6. arXiv:2106.08902  [pdf, other

    stat.ML cs.LG

    Adaptive Clustering and Personalization in Multi-Agent Stochastic Linear Bandits

    Authors: Avishek Ghosh, Abishek Sankararaman, Kannan Ramchandran

    Abstract: We consider the problem of minimizing regret in an $N$ agent heterogeneous stochastic linear bandits framework, where the agents (users) are similar but not all identical. We model user heterogeneity using two popularly used ideas in practice; (i) A clustering framework where users are partitioned into groups with users in the same group being identical to each other, but different across groups,… ▽ More

    Submitted 2 February, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: 25 pages, 8 figures

  7. arXiv:2103.07501  [pdf, other

    cs.LG stat.ML

    Beyond $\log^2(T)$ Regret for Decentralized Bandits in Matching Markets

    Authors: Soumya Basu, Karthik Abinav Sankararaman, Abishek Sankararaman

    Abstract: We design decentralized algorithms for regret minimization in the two-sided matching market with one-sided bandit feedback that significantly improves upon the prior works (Liu et al. 2020a, 2020b, Sankararaman et al. 2020). First, for general markets, for any $\varepsilon > 0$, we design an algorithm that achieves a $O(\log^{1+\varepsilon}(T))$ regret to the agent-optimal stable matching, with un… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

  8. arXiv:2007.06869  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Robust Identifiability in Linear Structural Equation Models of Causal Inference

    Authors: Karthik Abinav Sankararaman, Anand Louis, Navin Goyal

    Abstract: In this work, we consider the problem of robust parameter estimation from observational data in the context of linear structural equation models (LSEMs). LSEMs are a popular and well-studied class of models for inferring causality in the natural and social sciences. One of the main problems related to LSEMs is to recover the model parameters from the observational data. Under various conditions on… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  9. arXiv:2007.01442  [pdf, other

    cs.LG cs.DC cs.SI stat.ML

    Multi-Agent Low-Dimensional Linear Bandits

    Authors: Ronshee Chawla, Abishek Sankararaman, Sanjay Shakkottai

    Abstract: We study a multi-agent stochastic linear bandit with side information, parameterized by an unknown vector $θ^* \in \mathbb{R}^d$. The side information consists of a finite collection of low-dimensional subspaces, one of which contains $θ^*$. In our setting, agents can collaborate to reduce regret by sending recommendations across a communication graph connecting them. We present a novel decentrali… ▽ More

    Submitted 25 May, 2022; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: To appear in IEEE Transactions on Automatic Control

  10. arXiv:2006.15166  [pdf, other

    cs.LG cs.DS cs.GT stat.ML

    Dominate or Delete: Decentralized Competing Bandits in Serial Dictatorship

    Authors: Abishek Sankararaman, Soumya Basu, Karthik Abinav Sankararaman

    Abstract: Online learning in a two-sided matching market, with demand side agents continuously competing to be matched with supply side (arms), abstracts the complex interactions under partial information on matching platforms (e.g. UpWork, TaskRabbit). We study the decentralized serial dictatorship setting, a two-sided matching market where the demand side agents have unknown and heterogeneous valuation ov… ▽ More

    Submitted 12 March, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: AISTATS, 2021

  11. arXiv:2006.02612  [pdf, ps, other

    stat.ML cs.LG

    Problem-Complexity Adaptive Model Selection for Stochastic Linear Bandits

    Authors: Avishek Ghosh, Abishek Sankararaman, Kannan Ramchandran

    Abstract: We consider the problem of model selection for two popular stochastic linear bandit settings, and propose algorithms that adapts to the unknown problem complexity. In the first setting, we consider the $K$ armed mixture bandits, where the mean reward of arm $i \in [K]$, is $μ_i+ \langle α_{i,t},θ^* \rangle $, with $α_{i,t} \in \mathbb{R}^d$ being the known context vector and $μ_i \in [-1,1]$ and… ▽ More

    Submitted 15 June, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: 24 pages, 8 figures

  12. arXiv:2002.00253  [pdf, other

    cs.LG cs.DS stat.ML

    Bandits with Knapsacks beyond the Worst-Case

    Authors: Karthik Abinav Sankararaman, Aleksandrs Slivkins

    Abstract: Bandits with Knapsacks (BwK) is a general model for multi-armed bandits under supply/budget constraints. While worst-case regret bounds for BwK are well-understood, we present three results that go beyond the worst-case perspective. First, we provide upper and lower bounds which amount to a full characterization for logarithmic, instance-dependent regret rates. Second, we consider "simple regret"… ▽ More

    Submitted 28 December, 2021; v1 submitted 1 February, 2020; originally announced February 2020.

    Comments: The initial version, titled "Advances in Bandits with Knapsacks", was published on arxiv.longhoe.net in Jan'20. The present version improves both upper and lower bounds, deriving Theorem 3.2(ii) and Theorem 4.2. Moreover, it simplifies the algorithm and analysis in the main result, and fixes several issues in the lower bounds

  13. arXiv:2001.05452  [pdf, other

    cs.LG cs.DC cs.NI cs.SI stat.ML

    The Gossi** Insert-Eliminate Algorithm for Multi-Agent Bandits

    Authors: Ronshee Chawla, Abishek Sankararaman, Ayalvadi Ganesh, Sanjay Shakkottai

    Abstract: We consider a decentralized multi-agent Multi Armed Bandit (MAB) setup consisting of $N$ agents, solving the same MAB instance to minimize individual cumulative regret. In our model, agents collaborate by exchanging messages through pairwise gossip style communications on an arbitrary connected graph. We develop two novel algorithms, where each agent only plays from a subset of all the arms. Agent… ▽ More

    Submitted 2 July, 2024; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: To Appear in AISTATS 2020. The first two authors contributed equally

  14. arXiv:1910.02100  [pdf, other

    cs.LG cs.DC cs.NI cs.SI math.PR stat.ML

    Social Learning in Multi Agent Multi Armed Bandits

    Authors: Abishek Sankararaman, Ayalvadi Ganesh, Sanjay Shakkottai

    Abstract: In this paper, we introduce a distributed version of the classical stochastic Multi-Arm Bandit (MAB) problem. Our setting consists of a large number of agents $n$ that collaboratively and simultaneously solve the same instance of $K$ armed MAB to minimize the average cumulative regret over all agents. The agents can communicate and collaborate among each other \emph{only} through a pairwise asynch… ▽ More

    Submitted 4 November, 2019; v1 submitted 4 October, 2019; originally announced October 2019.

    Comments: Minor Corrections from before

  15. arXiv:1905.06836  [pdf, other

    cs.LG stat.ML

    Stability of Linear Structural Equation Models of Causal Inference

    Authors: Karthik Abinav Sankararaman, Anand Louis, Navin Goyal

    Abstract: We consider the numerical stability of the parameter recovery problem in Linear Structural Equation Model ($\LSEM$) of causal inference. A long line of work starting from Wright (1920) has focused on understanding which sub-classes of $\LSEM$ allow for efficient parameter recovery. Despite decades of study, this question is not yet fully resolved. The goal of this paper is complementary to this li… ▽ More

    Submitted 17 August, 2020; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: To appear in UAI 2019

  16. arXiv:1904.06963  [pdf, other

    cs.LG stat.ML

    The Impact of Neural Network Overparameterization on Gradient Confusion and Stochastic Gradient Descent

    Authors: Karthik A. Sankararaman, Soham De, Zheng Xu, W. Ronny Huang, Tom Goldstein

    Abstract: This paper studies how neural network architecture affects the speed of training. We introduce a simple concept called gradient confusion to help formally analyze this. When gradient confusion is high, stochastic gradients produced by different data samples may be negatively correlated, slowing down convergence. But when gradient confusion is low, data samples interact harmoniously, and training p… ▽ More

    Submitted 6 July, 2020; v1 submitted 15 April, 2019; originally announced April 2019.

    Comments: ICML 2020 camera-ready version

  17. arXiv:1811.11881  [pdf, other

    cs.DS cs.LG stat.ML

    Adversarial Bandits with Knapsacks

    Authors: Nicole Immorlica, Karthik Abinav Sankararaman, Robert Schapire, Aleksandrs Slivkins

    Abstract: We consider Bandits with Knapsacks (henceforth, BwK), a general model for multi-armed bandits under supply/budget constraints. In particular, a bandit algorithm needs to solve a well-known knapsack problem: find an optimal packing of items into a limited-size knapsack. The BwK problem is a common generalization of numerous motivating examples, which range from dynamic pricing to repeated auctions… ▽ More

    Submitted 6 March, 2023; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: The extended abstract appeared in FOCS 2019. The definitive version was published in JACM '22. V8 is the latest version with all technical changes. Subsequent versions fixes minor LATEX presentation issues