Skip to main content

Showing 1–3 of 3 results for author: Gomez-Uribe, C A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.13669  [pdf, other

    stat.ML cs.LG

    Studying Generalization Through Data Averaging

    Authors: Carlos A. Gomez-Uribe

    Abstract: The generalization of machine learning models has a complex dependence on the data, model and learning algorithm. We study train and test performance, as well as the generalization gap given by the mean of their difference over different data set samples to understand their ``typical" behavior. We derive an expression for the gap as a function of the covariance between the model parameter distribu… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  2. arXiv:2108.09507  [pdf, other

    stat.ML cs.LG

    Shift-Curvature, SGD, and Generalization

    Authors: Arwen V. Bradley, Carlos Alberto Gomez-Uribe, Manish Reddy Vuyyuru

    Abstract: A longstanding debate surrounds the related hypotheses that low-curvature minima generalize better, and that SGD discourages curvature. We offer a more complete and nuanced view in support of both. First, we show that curvature harms test performance through two new mechanisms, the shift-curvature and bias-curvature, in addition to a known parameter-covariance mechanism. The three curvature-mediat… ▽ More

    Submitted 27 July, 2022; v1 submitted 21 August, 2021; originally announced August 2021.

  3. arXiv:1806.09976  [pdf, other

    stat.ML cs.LG

    The decoupled extended Kalman filter for dynamic exponential-family factorization models

    Authors: Carlos Alberto Gomez-Uribe, Brian Karrer

    Abstract: Motivated by the needs of online large-scale recommender systems, we specialize the decoupled extended Kalman filter (DEKF) to factorization models, including factorization machines, matrix and tensor factorization, and illustrate the effectiveness of the approach through numerical experiments on synthetic and on real-world data. Online learning of model parameters through the DEKF makes factoriza… ▽ More

    Submitted 24 February, 2021; v1 submitted 26 June, 2018; originally announced June 2018.

    Comments: 29 pages, 4 figures

    Journal ref: Journal of Machine Learning Research (JMLR), 22(5):1-25, 2021