Skip to main content

Showing 1–5 of 5 results for author: van der Westhuizen, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:1804.04849  [pdf, other

    cs.NE cs.LG stat.ML

    The unreasonable effectiveness of the forget gate

    Authors: Jos van der Westhuizen, Joan Lasenby

    Abstract: Given the success of the gated recurrent unit, a natural question is whether all the gates of the long short-term memory (LSTM) network are necessary. Previous research has shown that the forget gate is one of the most important gates in the LSTM. Here we show that a forget-gate-only version of the LSTM with chrono-initialized biases, not only provides computational savings but outperforms the sta… ▽ More

    Submitted 13 September, 2018; v1 submitted 13 April, 2018; originally announced April 2018.

    Comments: Corrected LSTM gradient derivations. Added link to code

  2. arXiv:1712.01664  [pdf, other

    stat.ML cs.LG

    Learning a Generative Model for Validity in Complex Discrete Structures

    Authors: David Janz, Jos van der Westhuizen, Brooks Paige, Matt J. Kusner, José Miguel Hernández-Lobato

    Abstract: Deep generative models have been successfully used to learn representations for high-dimensional discrete spaces by representing discrete objects as sequences and employing powerful sequence-based deep models. Unfortunately, these sequence-based models often produce invalid sequences: sequences which do not represent any underlying discrete structure; invalid sequences hinder the utility of such m… ▽ More

    Submitted 1 November, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

    Comments: Conference paper at ICLR 2018. Code available online

  3. arXiv:1708.04465  [pdf, ps, other

    stat.ML cs.LG

    Actively Learning what makes a Discrete Sequence Valid

    Authors: David Janz, Jos van der Westhuizen, José Miguel Hernández-Lobato

    Abstract: Deep learning techniques have been hugely successful for traditional supervised and unsupervised machine learning problems. In large part, these techniques solve continuous optimization problems. Recently however, discrete generative deep learning models have been successfully used to efficiently search high-dimensional discrete spaces. These methods work by representing discrete objects as sequen… ▽ More

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: 6 pages, 2 figures

  4. arXiv:1706.01242  [pdf, other

    stat.ML cs.LG stat.AP

    Bayesian LSTMs in medicine

    Authors: Jos van der Westhuizen, Joan Lasenby

    Abstract: The medical field stands to see significant benefits from the recent advances in deep learning. Knowing the uncertainty in the decision made by any machine learning algorithm is of utmost importance for medical practitioners. This study demonstrates the utility of using Bayesian LSTMs for classification of medical time series. Four medical time series datasets are used to show the accuracy improve… ▽ More

    Submitted 5 June, 2017; originally announced June 2017.

    Comments: 11 pages, 8 figures

  5. arXiv:1705.08153  [pdf, other

    stat.ML cs.LG

    Techniques for visualizing LSTMs applied to electrocardiograms

    Authors: Jos van der Westhuizen, Joan Lasenby

    Abstract: This paper explores four different visualization techniques for long short-term memory (LSTM) networks applied to continuous-valued time series. On the datasets analysed, we find that the best visualization technique is to learn an input deletion mask that optimally reduces the true class score. With a specific focus on single-lead electrocardiograms from the MIT-BIH arrhythmia dataset, we show th… ▽ More

    Submitted 15 June, 2018; v1 submitted 23 May, 2017; originally announced May 2017.

    Comments: presented at 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018), Stockholm, Sweden