Skip to main content

Showing 1–8 of 8 results for author: Courbariaux, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2002.03090  [pdf, other

    cs.LG stat.ML

    BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization

    Authors: Miloš Nikolić, Ghouthi Boukli Hacene, Ciaran Bannon, Alberto Delmas Lascorz, Matthieu Courbariaux, Yoshua Bengio, Vincent Gripon, Andreas Moshovos

    Abstract: Neural networks have demonstrably achieved state-of-the art accuracy using low-bitlength integer quantization, yielding both execution time and energy benefits on existing hardware designs that support short bitlengths. However, the question of finding the minimum bitlength for a desired accuracy remains open. We introduce a training method for minimizing inference bitlength at any granularity whi… ▽ More

    Submitted 11 August, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  2. arXiv:1905.12300  [pdf, ps, other

    cs.NE

    Attention Based Pruning for Shift Networks

    Authors: Ghouthi Boukli Hacene, Carlos Lassance, Vincent Gripon, Matthieu Courbariaux, Yoshua Bengio

    Abstract: In many application domains such as computer vision, Convolutional Layers (CLs) are key to the accuracy of deep learning methods. However, it is often required to assemble a large number of CLs, each containing thousands of parameters, in order to reach state-of-the-art accuracy, thus resulting in complex and demanding systems that are poorly fitted to resource-limited devices. Recently, methods h… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

  3. arXiv:1812.11800  [pdf, other

    cs.LG cs.CV

    Regularized Binary Network Training

    Authors: Sajad Darabi, Mouloud Belbahri, Matthieu Courbariaux, Vahid Partovi Nia

    Abstract: There is a significant performance gap between Binary Neural Networks (BNNs) and floating point Deep Neural Networks (DNNs). We propose to improve the binary training method, by introducing a new regularization function that encourages training weights around binary values. In addition, we add trainable scaling factors to our regularization functions. Additionally, an improved approximation of the… ▽ More

    Submitted 21 April, 2020; v1 submitted 31 December, 2018; originally announced December 2018.

    Comments: NeurIPS19 Workshop on Energy Efficient Machine Learning and Cognitive Computing (2019)

  4. arXiv:1609.07061  [pdf, other

    cs.NE cs.LG

    Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations

    Authors: Itay Hubara, Matthieu Courbariaux, Daniel Soudry, Ran El-Yaniv, Yoshua Bengio

    Abstract: We introduce a method to train Quantized Neural Networks (QNNs) --- neural networks with extremely low precision (e.g., 1-bit) weights and activations, at run-time. At train-time the quantized weights and activations are used for computing the parameter gradients. During the forward pass, QNNs drastically reduce memory size and accesses, and replace most arithmetic operations with bit-wise operati… ▽ More

    Submitted 22 September, 2016; originally announced September 2016.

    Comments: arXiv admin note: text overlap with arXiv:1602.02830

  5. arXiv:1602.02830  [pdf, other

    cs.LG

    Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1

    Authors: Matthieu Courbariaux, Itay Hubara, Daniel Soudry, Ran El-Yaniv, Yoshua Bengio

    Abstract: We introduce a method to train Binarized Neural Networks (BNNs) - neural networks with binary weights and activations at run-time. At training-time the binary weights and activations are used for computing the parameters gradients. During the forward pass, BNNs drastically reduce memory size and accesses, and replace most arithmetic operations with bit-wise operations, which is expected to substan… ▽ More

    Submitted 17 March, 2016; v1 submitted 8 February, 2016; originally announced February 2016.

    Comments: 11 pages and 3 figures

  6. arXiv:1511.00363  [pdf, other

    cs.LG cs.CV cs.NE

    BinaryConnect: Training Deep Neural Networks with binary weights during propagations

    Authors: Matthieu Courbariaux, Yoshua Bengio, Jean-Pierre David

    Abstract: Deep Neural Networks (DNN) have achieved state-of-the-art results in a wide range of tasks, with the best results obtained with large training sets and large models. In the past, GPUs enabled these breakthroughs because of their greater computational speed. In the future, faster computation at both training and test time is likely to be crucial for further progress and for consumer applications on… ▽ More

    Submitted 18 April, 2016; v1 submitted 1 November, 2015; originally announced November 2015.

    Comments: Accepted at NIPS 2015, 9 pages, 3 figures

  7. arXiv:1510.03009  [pdf, other

    cs.LG cs.NE

    Neural Networks with Few Multiplications

    Authors: Zhouhan Lin, Matthieu Courbariaux, Roland Memisevic, Yoshua Bengio

    Abstract: For most deep learning algorithms training is notoriously time consuming. Since most of the computation in training neural networks is typically spent on floating point multiplications, we investigate an approach to training that eliminates the need for most of these. Our method consists of two parts: First we stochastically binarize weights to convert multiplications involved in computing hidden… ▽ More

    Submitted 26 February, 2016; v1 submitted 11 October, 2015; originally announced October 2015.

    Comments: Published as a conference paper at ICLR 2016. 9 pages, 3 figures

  8. arXiv:1412.7024  [pdf, other

    cs.LG cs.CV cs.NE

    Training deep neural networks with low precision multiplications

    Authors: Matthieu Courbariaux, Yoshua Bengio, Jean-Pierre David

    Abstract: Multipliers are the most space and power-hungry arithmetic operators of the digital implementation of deep neural networks. We train a set of state-of-the-art neural networks (Maxout networks) on three benchmark datasets: MNIST, CIFAR-10 and SVHN. They are trained with three distinct formats: floating point, fixed point and dynamic fixed point. For each of those datasets and for each of those form… ▽ More

    Submitted 22 September, 2015; v1 submitted 22 December, 2014; originally announced December 2014.

    Comments: 10 pages, 5 figures, Accepted as a workshop contribution at ICLR 2015