Skip to main content

Showing 1–2 of 2 results for author: Grachev, A M

Searching in archive cs. Search in all archives.
.
  1. Compression of Recurrent Neural Networks for Efficient Language Modeling

    Authors: Artem M. Grachev, Dmitry I. Ignatov, Andrey V. Savchenko

    Abstract: Recurrent neural networks have proved to be an effective method for statistical language modeling. However, in practice their memory and run-time complexity are usually too large to be implemented in real-time offline mobile applications. In this paper we consider several compression techniques for recurrent neural networks including Long-Short Term Memory models. We make particular attention to t… ▽ More

    Submitted 6 February, 2019; originally announced February 2019.

    Comments: 25 pages, 3 tables, 4 figures

  2. arXiv:1708.05963  [pdf, ps, other

    stat.ML cs.CL cs.LG cs.NE

    Neural Networks Compression for Language Modeling

    Authors: Artem M. Grachev, Dmitry I. Ignatov, Andrey V. Savchenko

    Abstract: In this paper, we consider several compression techniques for the language modeling problem based on recurrent neural networks (RNNs). It is known that conventional RNNs, e.g, LSTM-based networks in language modeling, are characterized with either high space complexity or substantial inference time. This problem is especially crucial for mobile applications, in which the constant interaction with… ▽ More

    Submitted 20 August, 2017; originally announced August 2017.

    Comments: Keywords: LSTM, RNN, language modeling, low-rank factorization, pruning, quantization. Published by Springer in the LNCS series, 7th International Conference on Pattern Recognition and Machine Intelligence, 2017

    MSC Class: 62M45; 68T50 ACM Class: I.2.7, I.2.6, I.5.1, I.5.4