-
Adaptive First- and Second-Order Algorithms for Large-Scale Machine Learning
Abstract: In this paper, we consider both first- and second-order techniques to address continuous optimization problems arising in machine learning. In the first-order case, we propose a framework of transition from deterministic or semi-deterministic to stochastic quadratic regularization methods. We leverage the two-phase nature of stochastic optimization to propose a novel first-order algorithm with ada…
Submitted 29 November, 2021; originally announced November 2021.
Comments: 29 pages, 8 figures. arXiv admin note: text overlap with arXiv:2012.05783
MSC Class: 68T07; 90C15; 90C30; 90C53
ACM Class: G.1.6; G.3; G.4; I.2.6
-
Stochastic Damped L-BFGS with Controlled Norm of the Hessian Approximation
Abstract: We propose a new stochastic variance-reduced damped L-BFGS algorithm, where we leverage estimates of bounds on the largest and smallest eigenvalues of the Hessian approximation to balance its quality and conditioning. Our algorithm, VARCHEN, draws from previous work that proposed a novel stochastic damped L-BFGS algorithm called SdLBFGS. We establish almost sure convergence to a stationary point a…
Submitted 10 December, 2020; originally announced December 2020.
Comments: 14 pages, 4 figures
Report number: Cahier du GERAD G-2020-52
MSC Class: 68T07; 90C15; 90C30; 90C53
ACM Class: G.1.6; G.3; G.4; I.2.6