Showing 1–6 of 6 results for author: Frosst, N

  1. arXiv:2004.13912  [pdf, other]

    cs.LG cs.AI stat.ML

    Neural Additive Models: Interpretable Machine Learning with Neural Nets

    Authors: Rishabh Agarwal, Levi Melnick, Nicholas Frosst, Xuezhou Zhang, Ben Lengerich, Rich Caruana, Geoffrey Hinton

    Abstract: Deep neural networks (DNNs) are powerful black-box predictors that have achieved impressive performance on a wide variety of tasks. However, their accuracy comes at the cost of intelligibility: it is usually unclear how they make their decisions. This hinders their applicability to high stakes decision-making domains such as healthcare. We propose Neural Additive Models (NAMs) which combine some o…

    Submitted 24 October, 2021; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: Spotlight (Top 3%) at NeurIPS 2021

  2. arXiv:2002.07405  [pdf, other]

    cs.LG cs.CV stat.ML

    Deflecting Adversarial Attacks

    Authors: Yao Qin, Nicholas Frosst, Colin Raffel, Garrison Cottrell, Geoffrey Hinton

    Abstract: There has been an ongoing cycle where stronger defenses against adversarial attacks are subsequently broken by a more advanced defense-aware attack. We present a new approach towards ending this cycle where we "deflect" adversarial attacks by causing the attacker to produce an input that semantically resembles the attack's target class. To this end, we first propose a stronger defense based on Ca…

    Submitted 18 February, 2020; originally announced February 2020.

  3. arXiv:1907.02957  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

    Authors: Yao Qin, Nicholas Frosst, Sara Sabour, Colin Raffel, Garrison Cottrell, Geoffrey Hinton

    Abstract: Adversarial examples raise questions about whether neural network models are sensitive to the same visual features as humans. In this paper, we first detect adversarial examples or otherwise corrupted images based on a class-conditional reconstruction of the input. To specifically attack our detection mechanism, we propose the Reconstructive Attack which seeks both to cause a misclassification and…

    Submitted 18 February, 2020; v1 submitted 5 July, 2019; originally announced July 2019.

    Journal ref: ICLR 2020

  4. arXiv:1902.01889  [pdf, other]

    stat.ML cs.LG

    Analyzing and Improving Representations with the Soft Nearest Neighbor Loss

    Authors: Nicholas Frosst, Nicolas Papernot, Geoffrey Hinton

    Abstract: We explore and expand the $\textit{Soft Nearest Neighbor Loss}$ to measure the $\textit{entanglement}$ of class manifolds in representation space: i.e., how close pairs of points from the same class are relative to pairs of points from different classes. We demonstrate several use cases of the loss. As an analytical tool, it provides insights into the evolution of class similarity structures durin…

    Submitted 5 February, 2019; originally announced February 2019.

  5. arXiv:1811.06969  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    DARCCC: Detecting Adversaries by Reconstruction from Class Conditional Capsules

    Authors: Nicholas Frosst, Sara Sabour, Geoffrey Hinton

    Abstract: We present a simple technique that allows capsule models to detect adversarial images. In addition to being trained to classify images, the capsule model is trained to reconstruct the images from the pose parameters and identity of the correct top-level capsule. Adversarial images do not look like a typical member of the predicted class and they have much larger reconstruction errors when the reco…

    Submitted 16 November, 2018; originally announced November 2018.

    Comments: To be presented at NIPS 2018 Workshop on Security in Machine Learning

  6. arXiv:1711.09784  [pdf, other]

    cs.LG cs.AI stat.ML

    Distilling a Neural Network Into a Soft Decision Tree

    Authors: Nicholas Frosst, Geoffrey Hinton

    Abstract: Deep neural networks have proved to be a very effective way to perform classification tasks. They excel when the input data is high dimensional, the relationship between the input and the output is complicated, and the number of labeled training examples is large. But it is hard to explain why a learned network makes a particular classification decision on a particular test case. This is due to th…

    Submitted 27 November, 2017; originally announced November 2017.

    Comments: Presented at the CEX workshop at the AI*IA 2017 conference