Skip to main content

Showing 1–9 of 9 results for author: Munkhdalai, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2009.01803  [pdf, other

    cs.NE cs.AI cs.CL cs.LG stat.ML

    Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling

    Authors: Tsendsuren Munkhdalai

    Abstract: Training a deep neural network requires a large amount of single-task data and involves a long time-consuming optimization phase. This is not scalable to complex, realistic environments with new unexpected changes. Humans can perform fast incremental learning on the fly and memory systems in the brain play a critical role. We introduce Sparse Meta Networks -- a meta-learning approach to learn onli… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: 9 pages, 4 figures, 2 tables

  2. arXiv:2005.03350  [pdf

    stat.ML cs.AI cs.LG

    A Locally Adaptive Interpretable Regression

    Authors: Lkhagvadorj Munkhdalai, Tsendsuren Munkhdalai, Keun Ho Ryu

    Abstract: Machine learning models with both good predictability and high interpretability are crucial for decision support systems. Linear regression is one of the most interpretable prediction models. However, the linearity in a simple linear regression worsens its predictability. In this work, we introduce a locally adaptive interpretable regression (LoAIR). In LoAIR, a metamodel parameterized by neural n… ▽ More

    Submitted 28 April, 2022; v1 submitted 7 May, 2020; originally announced May 2020.

  3. arXiv:1907.09720  [pdf, other

    cs.NE cs.LG stat.ML

    Metalearned Neural Memory

    Authors: Tsendsuren Munkhdalai, Alessandro Sordoni, Tong Wang, Adam Trischler

    Abstract: We augment recurrent neural networks with an external memory mechanism that builds upon recent progress in metalearning. We conceptualize this memory as a rapidly adaptable function that we parameterize as a deep neural network. Reading from the neural memory function amounts to pushing an input (the key vector) through the function to produce an output (the value vector). Writing to memory means… ▽ More

    Submitted 3 December, 2019; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: NeurIPS 2019

  4. arXiv:1807.05076  [pdf, other

    cs.NE cs.AI cs.LG stat.ML

    Metalearning with Hebbian Fast Weights

    Authors: Tsendsuren Munkhdalai, Adam Trischler

    Abstract: We unify recent neural approaches to one-shot learning with older ideas of associative memory in a model for metalearning. Our model learns jointly to represent data and to bind class labels to representations in a single shot. It builds representations via slow weights, learned across tasks through SGD, while fast weights constructed by a Hebbian learning rule implement one-shot binding for each… ▽ More

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: 8 pages, 3 figures, 4 tables. arXiv admin note: text overlap with arXiv:1712.09926

  5. arXiv:1712.09926  [pdf, other

    cs.LG cs.NE stat.ML

    Rapid Adaptation with Conditionally Shifted Neurons

    Authors: Tsendsuren Munkhdalai, Xingdi Yuan, Soroush Mehri, Adam Trischler

    Abstract: We describe a mechanism by which artificial neural networks can learn rapid adaptation - the ability to adapt on the fly, with little data, to new tasks - that we call conditionally shifted neurons. We apply this mechanism in the framework of metalearning, where the aim is to replicate some of the flexibility of human learning in machines. Conditionally shifted neurons modify their activation valu… ▽ More

    Submitted 3 July, 2018; v1 submitted 28 December, 2017; originally announced December 2017.

    Comments: ICML 2018; Added: additional ablation and speed comparison with MetaNet

  6. arXiv:1703.00837  [pdf, other

    cs.LG stat.ML

    Meta Networks

    Authors: Tsendsuren Munkhdalai, Hong Yu

    Abstract: Neural networks have been successfully applied in applications with a large amount of labeled data. However, the task of rapid generalization on new concepts with small training data while preserving performances on previously learned ones still presents a significant challenge to neural network models. In this work, we introduce a novel meta learning method, Meta Networks (MetaNet), that learns a… ▽ More

    Submitted 8 June, 2017; v1 submitted 2 March, 2017; originally announced March 2017.

    Comments: Accepted at ICML 2017 - rewrote: the main section; added: MetaNet algorithmic procedure; performed: Mini-ImageNet evaluation

  7. arXiv:1610.06454  [pdf, other

    cs.CL cs.AI cs.NE stat.ML

    Reasoning with Memory Augmented Neural Networks for Language Comprehension

    Authors: Tsendsuren Munkhdalai, Hong Yu

    Abstract: Hypothesis testing is an important cognitive process that supports human reasoning. In this paper, we introduce a computational hypothesis testing approach based on memory augmented neural networks. Our approach involves a hypothesis testing loop that reconsiders and progressively refines a previously formed hypothesis in order to generate new hypotheses to test. We apply the proposed approach to… ▽ More

    Submitted 28 February, 2017; v1 submitted 20 October, 2016; originally announced October 2016.

    Comments: Accepted at ICLR 2017

  8. arXiv:1607.04492  [pdf, other

    cs.CL cs.LG stat.ML

    Neural Tree Indexers for Text Understanding

    Authors: Tsendsuren Munkhdalai, Hong Yu

    Abstract: Recurrent neural networks (RNNs) process input text sequentially and model the conditional transition between word tokens. In contrast, the advantages of recursive networks include that they explicitly model the compositionality and the recursive structure of natural language. However, the current recursive architecture is limited by its dependence on syntactic tree. In this paper, we introduce a… ▽ More

    Submitted 28 February, 2017; v1 submitted 15 July, 2016; originally announced July 2016.

    Comments: Accepted at EACL 2017

  9. arXiv:1607.04315  [pdf, other

    cs.LG cs.CL stat.ML

    Neural Semantic Encoders

    Authors: Tsendsuren Munkhdalai, Hong Yu

    Abstract: We present a memory augmented neural network for natural language understanding: Neural Semantic Encoders. NSE is equipped with a novel memory update rule and has a variable sized encoding memory that evolves over time and maintains the understanding of input sequences through read}, compose and write operations. NSE can also access multiple and shared memories. In this paper, we demonstrated the… ▽ More

    Submitted 5 January, 2017; v1 submitted 14 July, 2016; originally announced July 2016.

    Comments: Accepted in EACL 2017, added: comparison with NTM, qualitative analysis and memory visualization