Skip to main content

Showing 1–1 of 1 results for author: Chassang, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:1412.6550  [pdf, ps, other

    cs.LG cs.NE

    FitNets: Hints for Thin Deep Nets

    Authors: Adriana Romero, Nicolas Ballas, Samira Ebrahimi Kahou, Antoine Chassang, Carlo Gatta, Yoshua Bengio

    Abstract: While depth tends to improve network performances, it also makes gradient-based training more difficult since deeper networks tend to be more non-linear. The recently proposed knowledge distillation approach is aimed at obtaining small and fast-to-execute models, and it has shown that a student network could imitate the soft output of a larger teacher network or ensemble of networks. In this paper… ▽ More

    Submitted 27 March, 2015; v1 submitted 19 December, 2014; originally announced December 2014.