On the Convergence of AdaGrad(Norm) on $\R^{d}$: Beyond Convexity, Non-Asymptotic Rate and Acceleration

Liu, Zijian; Nguyen, Ta Duy; Ene, Alina; Nguyen, Huy L.

Computer Science > Machine Learning

arXiv:2209.14827 (cs)

[Submitted on 29 Sep 2022 (v1), last revised 4 Oct 2023 (this version, v4)]

Title:On the Convergence of AdaGrad(Norm) on $\R^{d}$: Beyond Convexity, Non-Asymptotic Rate and Acceleration

Authors:Zijian Liu, Ta Duy Nguyen, Alina Ene, Huy L. Nguyen

View PDF

Abstract:Existing analysis of AdaGrad and other adaptive methods for smooth convex optimization is typically for functions with bounded domain diameter. In unconstrained problems, previous works guarantee an asymptotic convergence rate without an explicit constant factor that holds true for the entire function class. Furthermore, in the stochastic setting, only a modified version of AdaGrad, different from the one commonly used in practice, in which the latest gradient is not used to update the stepsize, has been analyzed. Our paper aims at bridging these gaps and develo** a deeper understanding of AdaGrad and its variants in the standard setting of smooth convex functions as well as the more general setting of quasar convex functions. First, we demonstrate new techniques to explicitly bound the convergence rate of the vanilla AdaGrad for unconstrained problems in both deterministic and stochastic settings. Second, we propose a variant of AdaGrad for which we can show the convergence of the last iterate, instead of the average iterate. Finally, we give new accelerated adaptive algorithms and their convergence guarantee in the deterministic setting with explicit dependency on the problem parameters, improving upon the asymptotic rate shown in previous works.

Comments:	Updated manuscript from ICLR 2023 with fixed typos
Subjects:	Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
Cite as:	arXiv:2209.14827 [cs.LG]
	(or arXiv:2209.14827v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2209.14827

Submission history

From: Ta Duy Nguyen [view email]
[v1] Thu, 29 Sep 2022 14:44:40 UTC (29 KB)
[v2] Fri, 24 Mar 2023 21:40:43 UTC (147 KB)
[v3] Wed, 19 Apr 2023 04:37:25 UTC (175 KB)
[v4] Wed, 4 Oct 2023 04:06:35 UTC (175 KB)

Computer Science > Machine Learning

Title:On the Convergence of AdaGrad(Norm) on $\R^{d}$: Beyond Convexity, Non-Asymptotic Rate and Acceleration

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On the Convergence of AdaGrad(Norm) on $\R^{d}$: Beyond Convexity, Non-Asymptotic Rate and Acceleration

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators