Skip to main content

Showing 1–10 of 10 results for author: Vasilache, N

.
  1. arXiv:2205.14479  [pdf, other

    cs.PL

    TinyIREE: An ML Execution Environment for Embedded Systems from Compilation to Deployment

    Authors: Hsin-I Cindy Liu, Marius Brehler, Mahesh Ravishankar, Nicolas Vasilache, Ben Vanik, Stella Laurenzo

    Abstract: Machine learning model deployment for training and execution has been an important topic for industry and academic research in the last decade. Much of the attention has been focused on develo** specific toolchains to support acceleration hardware. In this paper, we present IREE, a unified compiler and runtime stack with the explicit goal to scale down machine learning programs to the smallest f… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

    Comments: 9 pages, 3 figures, to be published in IEEE Micro

  2. Compiler Support for Sparse Tensor Computations in MLIR

    Authors: Aart J. C. Bik, Penporn Koanantakool, Tatiana Shpeisman, Nicolas Vasilache, Bixia Zheng, Fredrik Kjolstad

    Abstract: Sparse tensors arise in problems in science, engineering, machine learning, and data analytics. Programs that operate on such tensors can exploit sparsity to reduce storage requirements and computational time. Develo** and maintaining sparse software by hand, however, is a complex and error-prone task. Therefore, we propose treating sparsity as a property of tensors, not a tedious implementation… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

  3. arXiv:2202.03293  [pdf, other

    cs.PL

    Composable and Modular Code Generation in MLIR: A Structured and Retargetable Approach to Tensor Compiler Construction

    Authors: Nicolas Vasilache, Oleksandr Zinenko, Aart J. C. Bik, Mahesh Ravishankar, Thomas Raoux, Alexander Belyaev, Matthias Springer, Tobias Gysi, Diego Caballero, Stephan Herhut, Stella Laurenzo, Albert Cohen

    Abstract: Despite significant investment in software infrastructure, machine learning systems, runtimes and compilers do not compose properly. We propose a new design aiming at providing unprecedented degrees of modularity, composability and genericity. This paper discusses a structured approach to the construction of domain-specific code generators for tensor compilers, with the stated goal of improving th… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

  4. arXiv:2002.11054  [pdf, other

    cs.PL cs.LG

    MLIR: A Compiler Infrastructure for the End of Moore's Law

    Authors: Chris Lattner, Mehdi Amini, Uday Bondhugula, Albert Cohen, Andy Davis, Jacques Pienaar, River Riddle, Tatiana Shpeisman, Nicolas Vasilache, Oleksandr Zinenko

    Abstract: This work presents MLIR, a novel approach to building reusable and extensible compiler infrastructure. MLIR aims to address software fragmentation, improve compilation for heterogeneous hardware, significantly reduce the cost of building domain specific compilers, and aid in connecting existing compilers together. MLIR facilitates the design and implementation of code generators, translators and o… ▽ More

    Submitted 29 February, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

  5. arXiv:1802.04730  [pdf, other

    cs.PL cs.LG

    Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions

    Authors: Nicolas Vasilache, Oleksandr Zinenko, Theodoros Theodoridis, Priya Goyal, Zachary DeVito, William S. Moses, Sven Verdoolaege, Andrew Adams, Albert Cohen

    Abstract: Deep learning models with convolutional and recurrent networks are now ubiquitous and analyze massive amounts of audio, image, video, text and graph data, with applications in automatic translation, speech-to-text, scene understanding, ranking user preferences, ad placement, etc. Competing frameworks for building these networks such as TensorFlow, Chainer, CNTK, Torch/PyTorch, Caffe1/2, MXNet and… ▽ More

    Submitted 28 June, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

  6. arXiv:1705.09319  [pdf, other

    cs.LG stat.ML

    Diagonal Rescaling For Neural Networks

    Authors: Jean Lafond, Nicolas Vasilache, Léon Bottou

    Abstract: We define a second-order neural network stochastic gradient training algorithm whose block-diagonal structure effectively amounts to normalizing the unit activations. Investigating why this algorithm lacks in robustness then reveals two interesting insights. The first insight suggests a new way to scale the stepsizes, clarifying popular algorithms such as RMSProp as well as old neural network tric… ▽ More

    Submitted 25 May, 2017; originally announced May 2017.

  7. arXiv:1702.04770  [pdf, other

    cs.CL cs.LG cs.NE

    Training Language Models Using Target-Propagation

    Authors: Sam Wiseman, Sumit Chopra, Marc'Aurelio Ranzato, Arthur Szlam, Ruoyu Sun, Soumith Chintala, Nicolas Vasilache

    Abstract: While Truncated Back-Propagation through Time (BPTT) is the most popular approach to training Recurrent Neural Networks (RNNs), it suffers from being inherently sequential (making parallelization difficult) and from truncating gradient flow between distant time-steps. We investigate whether Target Propagation (TPROP) style approaches can address these shortcomings. Unfortunately, extensive experim… ▽ More

    Submitted 15 February, 2017; originally announced February 2017.

  8. arXiv:1511.02251  [pdf, other

    cs.CV

    Learning Visual Features from Large Weakly Supervised Data

    Authors: Armand Joulin, Laurens van der Maaten, Allan Jabri, Nicolas Vasilache

    Abstract: Convolutional networks trained on large supervised dataset produce visual features which form the basis for the state-of-the-art in many computer-vision problems. Further improvements of these visual features will likely require even larger manually labeled data sets, which severely limits the pace at which progress can be made. In this paper, we explore the potential of leveraging massive, weakly… ▽ More

    Submitted 6 November, 2015; originally announced November 2015.

  9. arXiv:1412.7580  [pdf, ps, other

    cs.LG cs.DC cs.NE

    Fast Convolutional Nets With fbfft: A GPU Performance Evaluation

    Authors: Nicolas Vasilache, Jeff Johnson, Michael Mathieu, Soumith Chintala, Serkan Piantino, Yann LeCun

    Abstract: We examine the performance profile of Convolutional Neural Network training on the current generation of NVIDIA Graphics Processing Units. We introduce two new Fast Fourier Transform convolution implementations: one based on NVIDIA's cuFFT library, and another based on a Facebook authored FFT implementation, fbfft, that provides significant speedups over cuFFT (over 1.5x) for whole CNNs. Both of t… ▽ More

    Submitted 10 April, 2015; v1 submitted 23 December, 2014; originally announced December 2014.

    Comments: Camera ready for ICLR2015

  10. arXiv:1409.1914  [pdf, ps, other

    cs.DC cs.PL

    A Tale of Three Runtimes

    Authors: Nicolas Vasilache, Muthu Baskaran, Tom Henretty, Benoit Meister, M. Harper Langston, Sanket Tavarageri, Richard Lethin

    Abstract: This contribution discusses the automatic generation of event-driven, tuple-space based programs for task-oriented execution models from a sequential C specification. We developed a hierarchical map** solution using auto-parallelizing compiler technology to target three different runtimes relying on event-driven tasks (EDTs). Our solution benefits from the important observation that loop types e… ▽ More

    Submitted 5 September, 2014; originally announced September 2014.