Skip to main content

Showing 1–8 of 8 results for author: Salgia, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.13182  [pdf, other

    cs.LG cs.DC stat.ML

    Order-Optimal Regret in Distributed Kernel Bandits using Uniform Sampling with Shared Randomness

    Authors: Nikola Pavlovic, Sudeep Salgia, Qing Zhao

    Abstract: We consider distributed kernel bandits where $N$ agents aim to collaboratively maximize an unknown reward function that lies in a reproducing kernel Hilbert space. Each agent sequentially queries the function to obtain noisy observations at the query points. Agents can share information through a central server, with the objective of minimizing regret that is accumulating over time $T$ and aggrega… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  2. arXiv:2310.15351  [pdf, other

    cs.LG stat.ML

    Random Exploration in Bayesian Optimization: Order-Optimal Regret and Computational Efficiency

    Authors: Sudeep Salgia, Sattar Vakili, Qing Zhao

    Abstract: We consider Bayesian optimization using Gaussian Process models, also referred to as kernel-based bandit optimization. We study the methodology of exploring the domain using random samples drawn from a distribution. We show that this random exploration approach achieves the optimal error rates. Our analysis is based on novel concentration bounds in an infinite dimensional Hilbert space established… ▽ More

    Submitted 2 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  3. arXiv:2207.07948  [pdf, other

    stat.ML cs.LG

    Collaborative Learning in Kernel-based Bandits for Distributed Users

    Authors: Sudeep Salgia, Sattar Vakili, Qing Zhao

    Abstract: We study collaborative learning among distributed clients facilitated by a central server. Each client is interested in maximizing a personalized objective function that is a weighted sum of its local objective and a global objective. Each client has direct access to random bandit feedback on its local objective, but only has a partial view of the global objective and relies on information exchang… ▽ More

    Submitted 17 April, 2023; v1 submitted 16 July, 2022; originally announced July 2022.

  4. arXiv:2206.00099  [pdf, other

    stat.ML cs.LG

    Provably and Practically Efficient Neural Contextual Bandits

    Authors: Sudeep Salgia, Sattar Vakili, Qing Zhao

    Abstract: We consider the neural contextual bandit problem. In contrast to the existing work which primarily focuses on ReLU neural nets, we consider a general set of smooth activation functions. Under this more general setting, (i) we derive non-asymptotic error bounds on the difference between an overparameterized neural net and its corresponding neural tangent kernel, (ii) we propose an algorithm with a… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

  5. arXiv:2010.13997  [pdf, other

    stat.ML cs.LG

    A Domain-Shrinking based Bayesian Optimization Algorithm with Order-Optimal Regret Performance

    Authors: Sudeep Salgia, Sattar Vakili, Qing Zhao

    Abstract: We consider sequential optimization of an unknown function in a reproducing kernel Hilbert space. We propose a Gaussian process-based algorithm and establish its order-optimal regret performance (up to a poly-logarithmic factor). This is the first GP-based algorithm with an order-optimal regret guarantee. The proposed algorithm is rooted in the methodology of domain shrinking realized through a se… ▽ More

    Submitted 29 October, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: Accepted to NeurIPS 2021

  6. arXiv:2003.05482  [pdf, other

    stat.ML cs.LG math.OC

    Stochastic Coordinate Minimization with Progressive Precision for Stochastic Convex Optimization

    Authors: Sudeep Salgia, Qing Zhao, Sattar Vakili

    Abstract: A framework based on iterative coordinate minimization (CM) is developed for stochastic convex optimization. Given that exact coordinate minimization is impossible due to the unknown stochastic nature of the objective function, the crux of the proposed optimization algorithm is an optimal control of the minimization precision in each iteration. We establish the optimal precision control and the re… ▽ More

    Submitted 11 March, 2020; originally announced March 2020.

  7. arXiv:1904.09056  [pdf, other

    cs.LG stat.ML

    Disagreement-based Active Learning in Online Settings

    Authors: Boshuang Huang, Sudeep Salgia, Qing Zhao

    Abstract: We study online active learning for classifying streaming instances within the framework of statistical learning theory. At each time, the learner either queries the label of the current instance or predicts the label based on past seen examples. The objective is to minimize the number of queries while constraining the number of prediction errors over a horizon of length $T$. We develop a disagree… ▽ More

    Submitted 16 November, 2020; v1 submitted 18 April, 2019; originally announced April 2019.

  8. arXiv:1901.05947  [pdf, other

    stat.ML cs.LG math.OC

    Stochastic Gradient Descent on a Tree: an Adaptive and Robust Approach to Stochastic Convex Optimization

    Authors: Sattar Vakili, Sudeep Salgia, Qing Zhao

    Abstract: Online minimization of an unknown convex function over the interval $[0,1]$ is considered under first-order stochastic bandit feedback, which returns a random realization of the gradient of the function at each query point. Without knowing the distribution of the random gradients, a learning algorithm sequentially chooses query points with the objective of minimizing regret defined as the expected… ▽ More

    Submitted 20 February, 2020; v1 submitted 17 January, 2019; originally announced January 2019.