Showing 1–3 of 3 results for author: Baldock, R J N

  1. arXiv:2106.09647

    cs.LG stat.ML

    Deep Learning Through the Lens of Example Difficulty

    Authors: Robert J. N. Baldock, Hartmut Maennel, Behnam Neyshabur

    Abstract: Existing work on understanding deep learning often employs measures that compress all data-dependent information into a few numbers. In this work, we adopt a perspective based on the role of individual examples. We introduce a measure of the computational difficulty of making a prediction for a given input: the (effective) prediction depth. Our extensive investigation reveals surprising yet simple…

    Submitted 18 June, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: Main paper: 15 pages, 8 figures. Appendix: 31 pages, 40 figures

  2. arXiv:2006.10455

    stat.ML cs.LG

    What Do Neural Networks Learn When Trained With Random Labels?

    Authors: Hartmut Maennel, Ibrahim Alabdulmohsin, Ilya Tolstikhin, Robert J. N. Baldock, Olivier Bousquet, Sylvain Gelly, Daniel Keysers

    Abstract: We study deep neural networks (DNNs) trained on natural image data with entirely random labels. Despite its popularity in the literature, where it is often used to study memorization, generalization, and other phenomena, little is known about what DNNs learn in this setting. In this paper, we show analytically for convolutional and fully connected networks that an alignment between the principal c…

    Submitted 11 November, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Accepted at NeurIPS 2020

  3. arXiv:1904.04154

    stat.ML cond-mat.dis-nn cs.LG

    Bayesian Neural Networks at Finite Temperature

    Authors: Robert J. N. Baldock, Nicola Marzari

    Abstract: We recapitulate the Bayesian formulation of neural network based classifiers and show that, while sampling from the posterior does indeed lead to better generalisation than is obtained by standard optimisation of the cost function, even better performance can in general be achieved by sampling finite temperature ($T$) distributions derived from the posterior. Taking the example of two different de…

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: 11 pages, 4 figures