Showing 1–1 of 1 results for author: Anita, S

Search v0.5.6 released 2020-02-24

arXiv:2402.06388 [pdf, ps, other]

stat.ML cs.AI cs.DS cs.LG math.NA

On the Convergence Rate of the Stochastic Gradient Descent (SGD) and application to a modified policy gradient for the Multi Armed Bandit

Authors: Stefana Anita, Gabriel Turinici

Abstract: We present a self-contained proof of the convergence rate of the Stochastic Gradient Descent (SGD) when the learning rate follows an inverse time decays schedule; we next apply the results to the convergence of a modified form of policy gradient Multi-Armed Bandit (MAB) with $L2$ regularization. We present a self-contained proof of the convergence rate of the Stochastic Gradient Descent (SGD) when the learning rate follows an inverse time decays schedule; we next apply the results to the convergence of a modified form of policy gradient Multi-Armed Bandit (MAB) with $L2$ regularization. △ Less

Submitted 9 February, 2024; originally announced February 2024.

Search v0.5.6 released 2020-02-24