Skip to main content

Showing 1–1 of 1 results for author: Bauer, J P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17467  [pdf, other

    cs.LG

    Early learning of the optimal constant solution in neural networks and humans

    Authors: Jirko Rubruck, Jan P. Bauer, Andrew Saxe, Christopher Summerfield

    Abstract: Deep neural networks learn increasingly complex functions over the course of training. Here, we show both empirically and theoretically that learning of the target function is preceded by an early phase in which networks learn the optimal constant solution (OCS) - that is, initial model responses mirror the distribution of target labels, while entirely ignoring information provided in the input. U… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.