Skip to main content

Showing 1–1 of 1 results for author: Shlapentokh-Rothman, M M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2010.11029  [pdf, other

    cs.LG cs.CV stat.ML

    Learning Curves for Analysis of Deep Networks

    Authors: Derek Hoiem, Tanmay Gupta, Zhizhong Li, Michal M. Shlapentokh-Rothman

    Abstract: Learning curves model a classifier's test error as a function of the number of training samples. Prior works show that learning curves can be used to select model parameters and extrapolate performance. We investigate how to use learning curves to evaluate design choices, such as pretraining, architecture, and data augmentation. We propose a method to robustly estimate learning curves, abstract th… ▽ More

    Submitted 5 April, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Improved text and figure organization, additional experiments on optimization