Showing 1–1 of 1 results for author: Anita, S

  1. arXiv:2402.06388

    stat.ML cs.AI cs.DS cs.LG math.NA

    On the Convergence Rate of the Stochastic Gradient Descent (SGD) and application to a modified policy gradient for the Multi Armed Bandit

    Authors: Stefana Anita, Gabriel Turinici

    Abstract: We present a self-contained proof of the convergence rate of the Stochastic Gradient Descent (SGD) when the learning rate follows an inverse time decay schedule; we next apply the results to the convergence of a modified form of policy gradient Multi-Armed Bandit (MAB) with $L2$ regularization.

    Submitted 9 February, 2024; originally announced February 2024.
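
    To illustrate the kind of schedule the abstract refers to, here is a minimal sketch of SGD with an inverse time decay learning rate on a one-dimensional quadratic. The schedule form eta_n = eta_0 / (1 + decay * n), the test objective, and the names `inverse_time_decay` and `sgd_quadratic` are assumptions made for illustration; they are not taken from the paper, whose exact schedule and setting are not given in this listing.

```python
import random


def inverse_time_decay(step, initial_rate=1.0, decay=1.0):
    """Assumed schedule: eta_n = initial_rate / (1 + decay * step)."""
    return initial_rate / (1.0 + decay * step)


def sgd_quadratic(num_steps=20000, target=3.0, noise_std=1.0, seed=0):
    """Minimize f(x) = 0.5 * (x - target)**2 using noisy gradient samples.

    The exact gradient is (x - target); zero-mean Gaussian noise is added
    to mimic the stochastic gradient oracle.
    """
    rng = random.Random(seed)
    x = 0.0
    for step in range(num_steps):
        noisy_grad = (x - target) + rng.gauss(0.0, noise_std)
        x -= inverse_time_decay(step) * noisy_grad
    return x


if __name__ == "__main__":
    print(sgd_quadratic())
```

    With these illustrative defaults (initial_rate = 1, decay = 1) the iterate equals the target minus the running average of the noise samples, so its error shrinks roughly like 1/sqrt(n). Other constants change the behaviour, and the rate actually proved in the paper should be taken from the paper itself.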