Showing 1–5 of 5 results for author: Manzagol, P

  1. arXiv:1911.01205

    cs.LG cs.AI cs.SE stat.ML

    Learning to Fix Build Errors with Graph2Diff Neural Networks

    Authors: Daniel Tarlow, Subhodeep Moitra, Andrew Rice, Zimin Chen, Pierre-Antoine Manzagol, Charles Sutton, Edward Aftandilian

    Abstract: Professional software developers spend a significant amount of time fixing builds, but this has received little attention as a problem in automatic program repair. We present a new deep learning architecture, called Graph2Diff, for automatically localizing and fixing build errors. We represent source code, build configuration files, and compiler diagnostic messages as a graph, and then use a Graph…

    Submitted 4 November, 2019; originally announced November 2019.

    Comments: Submitted for review on Aug 23, 2019

  2. arXiv:1906.03532

    cs.LG math.OC stat.ML

    Reducing the variance in online optimization by transporting past gradients

    Authors: Sébastien M. R. Arnold, Pierre-Antoine Manzagol, Reza Babanezhad, Ioannis Mitliagkas, Nicolas Le Roux

    Abstract: Most stochastic optimization methods use gradients once before discarding them. While variance reduction methods have shown that reusing past gradients can be beneficial when there is a finite number of datapoints, they do not easily extend to the online setting. One issue is the staleness due to using past gradients. We propose to correct this staleness using the idea of implicit gradient transpo…

    Submitted 18 June, 2019; v1 submitted 8 June, 2019; originally announced June 2019.

    Comments: Open-source implementation available at: https://github.com/seba-1511/igt.pth

  3. arXiv:1903.03096

    cs.LG stat.ML

    Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples

    Authors: Eleni Triantafillou, Tyler Zhu, Vincent Dumoulin, Pascal Lamblin, Utku Evci, Kelvin Xu, Ross Goroshin, Carles Gelada, Kevin Swersky, Pierre-Antoine Manzagol, Hugo Larochelle

    Abstract: Few-shot classification refers to learning a classifier for new classes given only a few examples. While a plethora of models have emerged to tackle it, we find the procedure and datasets that are used to assess their progress lacking. To address this limitation, we propose Meta-Dataset: a new benchmark for training and evaluating models that is large-scale, consists of diverse datasets, and prese…

    Submitted 8 April, 2020; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: Code available at https://github.com/google-research/meta-dataset

    Journal ref: International Conference on Learning Representations (2020)

  4. arXiv:1902.02366

    cs.LG math.OC stat.ML

    Negative eigenvalues of the Hessian in deep neural networks

    Authors: Guillaume Alain, Nicolas Le Roux, Pierre-Antoine Manzagol

    Abstract: The loss function of deep networks is known to be non-convex but the precise nature of this nonconvexity is still an active area of research. In this work, we study the loss landscape of deep networks through the eigendecompositions of their Hessian matrix. In particular, we examine how important the negative eigenvalues are and the benefits one can observe in handling them appropriately.

    Submitted 6 February, 2019; originally announced February 2019.

  5. arXiv:1605.02688

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano, et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu…

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures