Showing 1–9 of 9 results for author: Eyraud-Dubois, L

  1. arXiv:2404.16443  [pdf, ps, other]

    cs.CC cs.DC

    Tightening I/O Lower Bounds through the Hourglass Dependency Pattern

    Authors: Lionel Eyraud-Dubois, Guillaume Iooss, Julien Langou, Fabrice Rastello

    Abstract: When designing an algorithm, one cares about arithmetic/computational complexity, but data movement (I/O) complexity plays an increasingly important role that highly impacts performance and energy consumption. For a given algorithm and a given I/O model, scheduling strategies such as loop tiling can reduce the required I/O down to a limit, called the I/O complexity, inherent to the algorithm itsel…

    Submitted 25 April, 2024; originally announced April 2024.

    Journal ref: 36th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA '24), Jun 2024, Nantes, France

  2. arXiv:2202.10435  [pdf, ps, other]

    cs.LG cs.AI

    Survey on Large Scale Neural Network Training

    Authors: Julia Gusak, Daria Cherniuk, Alena Shilova, Alexander Katrutsa, Daniel Bershatsky, Xunyi Zhao, Lionel Eyraud-Dubois, Oleg Shlyazhko, Denis Dimitrov, Ivan Oseledets, Olivier Beaumont

    Abstract: Modern Deep Neural Networks (DNNs) require significant memory to store weights, activations, and other intermediate tensors during training. Hence, many models do not fit on one GPU device or can be trained using only a small per-GPU batch size. This survey provides a systematic overview of the approaches that enable more efficient DNN training. We analyze techniques that save memory and make good us…

    Submitted 21 February, 2022; originally announced February 2022.

  3. arXiv:2202.10217  [pdf, ps, other]

    cs.DC

    I/O-Optimal Algorithms for Symmetric Linear Algebra Kernels

    Authors: Olivier Beaumont, Lionel Eyraud-Dubois, Mathieu Vérité, Julien Langou

    Abstract: In this paper, we consider two fundamental symmetric kernels in linear algebra: the Cholesky factorization and the symmetric rank-$k$ update (SYRK), with the classical three nested loops algorithms for these kernels. In addition, we consider a machine model with a fast memory of size $S$ and an unbounded slow memory. In this model, all computations must be performed on operands in fast memory, and…

    Submitted 21 February, 2022; originally announced February 2022.

  4. arXiv:1911.13214  [pdf, other]

    cs.LG cs.DC cs.NE

    Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory

    Authors: Julien Herrmann, Olivier Beaumont, Lionel Eyraud-Dubois, Julien Hermann, Alexis Joly, Alena Shilova

    Abstract: This paper introduces a new activation checkpointing method that significantly decreases memory usage when training Deep Neural Networks with the back-propagation algorithm. Similarly to checkpointing techniques coming from the literature on Automatic Differentiation, it consists in dynamically selecting the forward activations that are saved during the training phase, and then automati…

    Submitted 27 November, 2019; originally announced November 2019.

  5. Scheduling on Two Types of Resources: a Survey

    Authors: Olivier Beaumont, Louis-Claude Canon, Lionel Eyraud-Dubois, Giorgio Lucarelli, Loris Marchal, Clément Mommessin, Bertrand Simon, Denis Trystram

    Abstract: The evolution in the design of modern parallel platforms leads us to revisit the scheduling of jobs on distributed heterogeneous resources. The goal of this survey is to present the main existing algorithms, to classify them based on their underlying principles and to propose unified implementations to enable their fair comparison, both in terms of running time and quality of schedules, on a large set o…

    Submitted 30 July, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

    Journal ref: ACM Computing Surveys, Vol. 53, No. 3, 2020

  6. Performance Models for Data Transfers: A Case Study with Molecular Chemistry Kernels

    Authors: Suraj Kumar, Lionel Eyraud-Dubois, Sriram Krishnamoorthy

    Abstract: With the increasing complexity of hardware, systems with different memory nodes are ubiquitous in High Performance Computing (HPC). It is paramount to develop strategies to overlap the data transfers between memory nodes with computations in order to exploit the full potential of these systems. In this article, we consider the problem of deciding the order of data transfers between two memory nodes f…

    Submitted 14 April, 2019; originally announced April 2019.

    Journal ref: https://www.hpcs.cs.tsukuba.ac.jp/icpp2019/

  7. arXiv:1410.0329  [pdf, other]

    cs.DC

    Parallel scheduling of task trees with limited memory

    Authors: Lionel Eyraud-Dubois, Loris Marchal, Oliver Sinnen, Frédéric Vivien

    Abstract: This paper investigates the execution of tree-shaped task graphs using multiple processors. Each edge of such a tree represents some large data. A task can only be executed if all input and output data fit into memory, and data can only be removed from memory after the completion of the task that uses it as an input. Such trees arise, for instance, in the multifrontal method of sparse matri…

    Submitted 1 October, 2014; originally announced October 2014.

    Comments: arXiv admin note: substantial text overlap with arXiv:1210.2580

    Report number: RR-8606

    Journal ref: N° RR-8606 (2014)

  8. arXiv:1310.5255  [pdf, other]

    cs.DC

    Efficient and Robust Allocation Algorithms in Clouds under Memory Constraints

    Authors: Olivier Beaumont, Lionel Eyraud-Dubois, Paul Renaud-Goud

    Abstract: We consider robust resource allocation of services in Clouds. More specifically, we consider the case of a large public or private Cloud platform that runs a relatively small set of large and independent services. These services are characterized by their demand along several dimensions (CPU, memory, ...) and by their quality of service requirements, which have been defined through an SLA in the cas…

    Submitted 19 October, 2013; originally announced October 2013.

  9. arXiv:cs/0702076  [pdf, ps, other]

    cs.DC

    A First Step Towards Automatically Building Network Representations

    Authors: Lionel Eyraud-Dubois, Arnaud Legrand, Martin Quinson, Frédéric Vivien

    Abstract: To fully harness Grids, users or middleware must have some knowledge of the topology of the platform interconnection network. As such knowledge is usually not available, one must use tools which automatically build a topological network model through some measurements. In this article, we define a methodology to assess the quality of these network model building tools, and we apply this method…

    Submitted 28 June, 2007; v1 submitted 13 February, 2007; originally announced February 2007.