Skip to main content

Showing 1–2 of 2 results for author: Kanwal, M S

Searching in archive cs. Search in all archives.
.
  1. arXiv:1706.09667  [pdf, other

    cs.IT cs.NE q-bio.NC

    Comparing Information-Theoretic Measures of Complexity in Boltzmann Machines

    Authors: Maxinder S. Kanwal, Joshua A. Grochow, Nihat Ay

    Abstract: In the past three decades, many theoretical measures of complexity have been proposed to help understand complex systems. In this work, for the first time, we place these measures on a level playing field, to explore the qualitative similarities and differences between them, and their shortcomings. Specifically, using the Boltzmann machine architecture (a fully connected recurrent neural network)… ▽ More

    Submitted 29 July, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

    Comments: 16 pages, 7 figures; Appears in Entropy, Special Issue "Information Geometry II"

    Journal ref: Entropy (2017), 19(7), 310

  2. arXiv:1706.05394  [pdf, other

    stat.ML cs.LG

    A Closer Look at Memorization in Deep Networks

    Authors: Devansh Arpit, Stanisław Jastrzębski, Nicolas Ballas, David Krueger, Emmanuel Bengio, Maxinder S. Kanwal, Tegan Maharaj, Asja Fischer, Aaron Courville, Yoshua Bengio, Simon Lacoste-Julien

    Abstract: We examine the role of memorization in deep learning, drawing connections to capacity, generalization, and adversarial robustness. While deep networks are capable of memorizing noise data, our results suggest that they tend to prioritize learning simple patterns first. In our experiments, we expose qualitative differences in gradient-based optimization of deep neural networks (DNNs) on noise vs. r… ▽ More

    Submitted 1 July, 2017; v1 submitted 16 June, 2017; originally announced June 2017.

    Comments: Appears in Proceedings of the 34th International Conference on Machine Learning (ICML 2017), Devansh Arpit, Stanisław Jastrzębski, Nicolas Ballas, and David Krueger contributed equally to this work