Showing 1–5 of 5 results for author: Bollapragada, R

Searching in archive cs.
  1. arXiv:2206.07553  [pdf, other]

    cs.LG cs.DS math.NA math.OC stat.ML

    On the fast convergence of minibatch heavy ball momentum

    Authors: Raghu Bollapragada, Tyler Chen, Rachel Ward

    Abstract: Simple stochastic momentum methods are widely used in machine learning optimization, but their good practical performance is at odds with an absence of theoretical guarantees of acceleration in the literature. In this work, we aim to close the gap between theory and practice by showing that stochastic heavy ball momentum retains the fast linear rate of (deterministic) heavy ball momentum on quadra…

    Submitted 12 December, 2023; v1 submitted 15 June, 2022; originally announced June 2022.

    MSC Class: 65K05; 90C06; 90C30; 65F10; 68W20

  2. arXiv:2110.15442  [pdf, other]

    cs.LG

    Scalable Unidirectional Pareto Optimality for Multi-Task Learning with Constraints

    Authors: Soumyajit Gupta, Gurpreet Singh, Raghu Bollapragada, Matthew Lease

    Abstract: Multi-objective optimization (MOO) problems require balancing competing objectives, often under constraints. The Pareto optimal solution set defines all possible optimal trade-offs over such objectives. In this work, we present a novel method for Pareto-front learning: inducing the full Pareto manifold at train-time so users can pick any desired optimal trade-off point at run-time. Our key insight…

    Submitted 16 April, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

  3. arXiv:2109.12213  [pdf, other]

    math.OC cs.AI stat.ML

    Adaptive Sampling Quasi-Newton Methods for Zeroth-Order Stochastic Optimization

    Authors: Raghu Bollapragada, Stefan M. Wild

    Abstract: We consider unconstrained stochastic optimization problems with no available gradient information. Such problems arise in settings from derivative-free simulation optimization to reinforcement learning. We propose an adaptive sampling quasi-Newton method where we estimate the gradients of a stochastic function using finite differences within a common random number framework. We develop modified ve…

    Submitted 24 September, 2021; originally announced September 2021.

  4. arXiv:1802.05374  [pdf, other]

    math.OC cs.LG stat.ML

    A Progressive Batching L-BFGS Method for Machine Learning

    Authors: Raghu Bollapragada, Dheevatsa Mudigere, Jorge Nocedal, Hao-Jun Michael Shi, Ping Tak Peter Tang

    Abstract: The standard L-BFGS method relies on gradient approximations that are not dominated by noise, so that search directions are descent directions, the line search is reliable, and quasi-Newton updating yields useful quadratic models of the objective function. All of this appears to call for a full batch approach, but since small batch sizes give rise to faster algorithms with better generalization pr…

    Submitted 30 May, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

    Comments: ICML 2018. 25 pages, 17 figures, 2 tables

  5. arXiv:1705.06211  [pdf, other]

    math.OC cs.LG stat.ML

    An Investigation of Newton-Sketch and Subsampled Newton Methods

    Authors: Albert S. Berahas, Raghu Bollapragada, Jorge Nocedal

    Abstract: Sketching, a dimensionality reduction technique, has received much attention in the statistics community. In this paper, we study sketching in the context of Newton's method for solving finite-sum optimization problems in which the number of variables and data points are both large. We study two forms of sketching that perform dimensionality reduction in data space: Hessian subsampling and randomi…

    Submitted 30 May, 2019; v1 submitted 17 May, 2017; originally announced May 2017.

    Comments: 36 pages, 22 figures