Showing 1–8 of 8 results for author: Cottrell, G

  1. arXiv:2003.04887  [pdf, other]

    cs.LG cs.CL stat.ML

    ReZero is All You Need: Fast Convergence at Large Depth

    Authors: Thomas Bachlechner, Bodhisattwa Prasad Majumder, Huanru Henry Mao, Garrison W. Cottrell, Julian McAuley

    Abstract: Deep networks often suffer from vanishing or exploding gradients due to inefficient signal propagation, leading to long training times or convergence difficulties. Various architecture designs, sophisticated residual-style networks, and initialization schemes have been shown to improve deep signal propagation. Recently, Pennington et al. used free probability theory to show that dynamical isometry…

    Submitted 24 June, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

  2. arXiv:2002.07405  [pdf, other]

    cs.LG cs.CV stat.ML

    Deflecting Adversarial Attacks

    Authors: Yao Qin, Nicholas Frosst, Colin Raffel, Garrison Cottrell, Geoffrey Hinton

    Abstract: There has been an ongoing cycle where stronger defenses against adversarial attacks are subsequently broken by a more advanced defense-aware attack. We present a new approach towards ending this cycle where we "deflect" adversarial attacks by causing the attacker to produce an input that semantically resembles the attack's target class. To this end, we first propose a stronger defense based on Ca…

    Submitted 18 February, 2020; originally announced February 2020.

  3. arXiv:1908.09451  [pdf, ps, other]

    cs.LG cs.CL stat.ML

    Improving Neural Story Generation by Targeted Common Sense Grounding

    Authors: Huanru Henry Mao, Bodhisattwa Prasad Majumder, Julian McAuley, Garrison W. Cottrell

    Abstract: Stories generated with neural language models have shown promise in grammatical and stylistic consistency. However, the generated stories are still lacking in common sense reasoning, e.g., they often contain sentences deprived of world knowledge. We propose a simple multi-task learning scheme to achieve quantitatively better common sense reasoning in language models by leveraging auxiliary trainin…

    Submitted 27 February, 2020; v1 submitted 25 August, 2019; originally announced August 2019.

  4. arXiv:1907.04868  [pdf, other]

    cs.SD cs.LG cs.MM eess.AS stat.ML

    LakhNES: Improving multi-instrumental music generation with cross-domain pre-training

    Authors: Chris Donahue, Huanru Henry Mao, Yiting Ethan Li, Garrison W. Cottrell, Julian McAuley

    Abstract: We are interested in the task of generating multi-instrumental music scores. The Transformer architecture has recently shown great promise for the task of piano score generation; here we adapt it to the multi-instrumental setting. Transformers are complex, high-dimensional language models which are capable of capturing long-term structure in sequence data, but require large amounts of data to fit.…

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: Published as a conference paper at ISMIR 2019

  5. arXiv:1907.02957  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

    Authors: Yao Qin, Nicholas Frosst, Sara Sabour, Colin Raffel, Garrison Cottrell, Geoffrey Hinton

    Abstract: Adversarial examples raise questions about whether neural network models are sensitive to the same visual features as humans. In this paper, we first detect adversarial examples or otherwise corrupted images based on a class-conditional reconstruction of the input. To specifically attack our detection mechanism, we propose the Reconstructive Attack which seeks both to cause a misclassification and…

    Submitted 18 February, 2020; v1 submitted 5 July, 2019; originally announced July 2019.

    Journal ref: ICLR 2020

  6. arXiv:1903.10346  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition

    Authors: Yao Qin, Nicholas Carlini, Ian Goodfellow, Garrison Cottrell, Colin Raffel

    Abstract: Adversarial examples are inputs to machine learning models designed by an adversary to cause an incorrect output. So far, adversarial examples have been studied most extensively in the image domain. In this domain, adversarial examples can be constructed by imperceptibly modifying images to cause misclassification, and are practical in the physical world. In contrast, current targeted adversarial…

    Submitted 7 June, 2019; v1 submitted 22 March, 2019; originally announced March 2019.

    Comments: International Conference on Machine Learning (ICML), 2019

  7. arXiv:1704.02971  [pdf, other]

    cs.LG stat.ML

    A Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction

    Authors: Yao Qin, Dongjin Song, Haifeng Chen, Wei Cheng, Guofei Jiang, Garrison Cottrell

    Abstract: The Nonlinear autoregressive exogenous (NARX) model, which predicts the current value of a time series based upon its previous values as well as the current and past values of multiple driving (exogenous) series, has been studied for decades. Despite the fact that various NARX models have been developed, few of them can capture the long-term temporal dependencies appropriately and select the relev…

    Submitted 14 August, 2017; v1 submitted 7 April, 2017; originally announced April 2017.

    Comments: International Joint Conference on Artificial Intelligence (IJCAI), 2017

  8. arXiv:1412.6177  [pdf, other]

    cs.LG cs.AI stat.ML

    Example Selection For Dictionary Learning

    Authors: Tomoki Tsuchida, Garrison W. Cottrell

    Abstract: In unsupervised learning, an unbiased uniform sampling strategy is typically used, in order that the learned features faithfully encode the statistical structure of the training data. In this work, we explore whether active example selection strategies - algorithms that select which examples to use, based on the current estimate of the features - can accelerate learning. Specifically, we investiga…

    Submitted 31 March, 2015; v1 submitted 18 December, 2014; originally announced December 2014.