Showing 1–4 of 4 results for author: Matsukawa, A

  1. arXiv:1906.02994

    stat.ML cs.LG

    Detecting Out-of-Distribution Inputs to Deep Generative Models Using Typicality

    Authors: Eric Nalisnick, Akihiro Matsukawa, Yee Whye Teh, Balaji Lakshminarayanan

    Abstract: Recent work has shown that deep generative models can assign higher likelihood to out-of-distribution data sets than to their training data (Nalisnick et al., 2019; Choi et al., 2019). We posit that this phenomenon is caused by a mismatch between the model's typical set and its areas of high probability density. In-distribution inputs should reside in the former but not necessarily in the latter,…

    Submitted 16 October, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

  2. arXiv:1902.03393

    cs.LG cs.AI stat.ML

    Improved Knowledge Distillation via Teacher Assistant

    Authors: Seyed-Iman Mirzadeh, Mehrdad Farajtabar, Ang Li, Nir Levine, Akihiro Matsukawa, Hassan Ghasemzadeh

    Abstract: Despite the fact that deep neural networks are powerful models and achieve appealing results on many tasks, they are too large to be deployed on edge devices like smartphones or embedded sensor nodes. There have been efforts to compress these networks, and a popular method is knowledge distillation, where a large (teacher) pre-trained network is used to train a smaller (student) network. However,…

    Submitted 16 December, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: AAAI 2020

  3. arXiv:1902.02767

    cs.LG stat.ML

    Hybrid Models with Deep and Invertible Features

    Authors: Eric Nalisnick, Akihiro Matsukawa, Yee Whye Teh, Dilan Gorur, Balaji Lakshminarayanan

    Abstract: We propose a neural hybrid model consisting of a linear model defined on a set of features computed by a deep, invertible transformation (i.e. a normalizing flow). An attractive property of our model is that both p(features), the density of the features, and p(targets | features), the predictive distribution, can be computed exactly in a single feed-forward pass. We show that our hybrid model, des…

    Submitted 29 May, 2019; v1 submitted 7 February, 2019; originally announced February 2019.

    Comments: ICML 2019

  4. arXiv:1810.09136

    stat.ML cs.LG

    Do Deep Generative Models Know What They Don't Know?

    Authors: Eric Nalisnick, Akihiro Matsukawa, Yee Whye Teh, Dilan Gorur, Balaji Lakshminarayanan

    Abstract: A neural network deployed in the wild may be asked to make predictions for inputs that were drawn from a different distribution than that of the training data. A plethora of work has demonstrated that it is easy to find or synthesize inputs for which a neural network is highly confident yet wrong. Generative models are widely viewed to be robust to such mistaken confidence as modeling the density…

    Submitted 24 February, 2019; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: ICLR 2019