Showing 1–5 of 5 results for author: Zeiler, M

Searching in archive cs.

  1. arXiv:1905.11116  [pdf, other]

    cs.CV cs.AI

    Finding Task-Relevant Features for Few-Shot Learning by Category Traversal

    Authors: Hongyang Li, David Eigen, Samuel Dodge, Matthew Zeiler, Xiaogang Wang

    Abstract: Few-shot learning is an important area of research. Conceptually, humans are readily able to understand new concepts given just a few examples, while in more pragmatic terms, limited-example training situations are common in practice. Recent effective approaches to few-shot learning employ a metric-learning framework to learn a feature similarity comparison between a query (test) example, and the…

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: CVPR 2019

  2. arXiv:1311.2901  [pdf, other]

    cs.CV

    Visualizing and Understanding Convolutional Networks

    Authors: Matthew D Zeiler, Rob Fergus

    Abstract: Large Convolutional Network models have recently demonstrated impressive classification performance on the ImageNet benchmark. However there is no clear understanding of why they perform so well, or how they might be improved. In this paper we address both issues. We introduce a novel visualization technique that gives insight into the function of intermediate feature layers and the operation of t…

    Submitted 28 November, 2013; v1 submitted 12 November, 2013; originally announced November 2013.

  3. arXiv:1301.3557  [pdf, other]

    cs.LG cs.NE stat.ML

    Stochastic Pooling for Regularization of Deep Convolutional Neural Networks

    Authors: Matthew D. Zeiler, Rob Fergus

    Abstract: We introduce a simple and effective method for regularizing large convolutional neural networks. We replace the conventional deterministic pooling operations with a stochastic procedure, randomly picking the activation within each pooling region according to a multinomial distribution, given by the activities within the pooling region. The approach is hyper-parameter free and can be combined with…

    Submitted 15 January, 2013; originally announced January 2013.

    Comments: 9 pages

  4. arXiv:1212.5701  [pdf, other]

    cs.LG

    ADADELTA: An Adaptive Learning Rate Method

    Authors: Matthew D. Zeiler

    Abstract: We present a novel per-dimension learning rate method for gradient descent called ADADELTA. The method dynamically adapts over time using only first order information and has minimal computational overhead beyond vanilla stochastic gradient descent. The method requires no manual tuning of a learning rate and appears robust to noisy gradient information, different model architecture choices, variou…

    Submitted 22 December, 2012; originally announced December 2012.

    Comments: 6 pages

  5. arXiv:1207.0151  [pdf, other]

    cs.CV cs.LG

    Differentiable Pooling for Hierarchical Feature Learning

    Authors: Matthew D. Zeiler, Rob Fergus

    Abstract: We introduce a parametric form of pooling, based on a Gaussian, which can be optimized alongside the features in a single global objective function. By contrast, existing pooling schemes are based on heuristics (e.g. local maximum) and have no clear link to the cost function of the model. Furthermore, the variables of the Gaussian explicitly store location information, distinct from the appearance…

    Submitted 30 June, 2012; originally announced July 2012.

    Comments: 12 pages