Skip to main content

Showing 1–1 of 1 results for author: Xiao, K L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11733  [pdf, other

    stat.ML cs.LG

    A Clipped Trip: the Dynamics of SGD with Gradient Clip** in High-Dimensions

    Authors: Noah Marshall, Ke Liang Xiao, Atish Agarwala, Elliot Paquette

    Abstract: The success of modern machine learning is due in part to the adaptive optimization methods that have been developed to deal with the difficulties of training large models over complex datasets. One such method is gradient clip**: a practical procedure with limited theoretical underpinnings. In this work, we study clip** in a least squares problem under streaming SGD. We develop a theoretical a… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.