
Showing 1–1 of 1 results for author: Gomes, D M

  1. arXiv:2405.16397  [pdf, other]

    cs.LG math.OC

    AdaFisher: Adaptive Second Order Optimization via Fisher Information

    Authors: Damien Martins Gomes, Yanlei Zhang, Eugene Belilovsky, Guy Wolf, Mahdi S. Hosseini

    Abstract: First-order optimization methods are currently the mainstream in training deep neural networks (DNNs). Optimizers like Adam incorporate limited curvature information by applying diagonal matrix preconditioning to the stochastic gradient during training. Despite the widespread use of first-order methods, second-order optimization algorithms exhibit superior convergence properties compared to their first-order coun…

    Submitted 25 May, 2024; originally announced May 2024.