Skip to main content

Showing 1–14 of 14 results for author: Vivien, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:1410.7249  [pdf, other

    cs.DC

    Scheduling Trees of Malleable Tasks for Sparse Linear Algebra

    Authors: Abdou Guermouche, Loris Marchal, Bertrand Simon, Frédéric Vivien

    Abstract: Scientific workloads are often described as directed acyclic task graphs. In this paper, we focus on the multifrontal factorization of sparse matrices, whose task graph is structured as a tree of parallel tasks. Among the existing models for parallel tasks, the concept of malleable tasks is especially powerful as it allows each task to be processed on a time-varying number of processors. Following… ▽ More

    Submitted 4 June, 2015; v1 submitted 27 October, 2014; originally announced October 2014.

    Comments: Paper accepted for publication at EuroPar 2015

  2. arXiv:1410.0329  [pdf, other

    cs.DC

    Parallel scheduling of task trees with limited memory

    Authors: Lionel Eyraud-Dubois, Loris Marchal, Oliver Sinnen, Frédéric Vivien

    Abstract: This paper investigates the execution of tree-shaped task graphs using multiple processors. Each edge of such a tree represents some large data. A task can only be executed if all input and output data fit into memory, and a data can only be removed from memory after the completion of the task that uses it as an input data. Such trees arise, for instance, in the multifrontal method of sparse matri… ▽ More

    Submitted 1 October, 2014; originally announced October 2014.

    Comments: arXiv admin note: substantial text overlap with arXiv:1210.2580

    Report number: RR-8606

    Journal ref: N° RR-8606 (2014)

  3. arXiv:1310.8486  [pdf, other

    cs.DC

    On the Combination of Silent Error Detection and Checkpointing

    Authors: Guillaume Aupy, Anne Benoit, Thomas Hérault, Yves Robert, Frédéric Vivien, Dounia Zaidouni

    Abstract: In this paper, we revisit traditional checkpointing and rollback recovery strategies, with a focus on silent data corruption errors. Contrarily to fail-stop failures, such latent errors cannot be detected immediately, and a mechanism to detect them must be provided. We consider two models: (i) errors are detected after some delays following a probability distribution (typically, an Exponential dis… ▽ More

    Submitted 31 October, 2013; originally announced October 2013.

    Comments: This work was accepted to be published in PRDC'13. Work supported by ANR Rescue

    Report number: INRIA RR-8319

  4. arXiv:1302.4558  [pdf, other

    cs.DC

    Checkpointing strategies with prediction windows

    Authors: Guillaume Aupy, Yves Robert, Frédéric Vivien, Dounia Zaidouni

    Abstract: This paper deals with the impact of fault prediction techniques on checkpointing strategies. We suppose that the fault-prediction system provides prediction windows instead of exact predictions, which dramatically complicates the analysis of the checkpointing strategies. We propose a new approach based upon two periodic modes, a regular mode outside prediction windows, and a proactive mode inside… ▽ More

    Submitted 19 February, 2013; originally announced February 2013.

    Comments: 35 pages, work supported by ANR Rescue. arXiv admin note: substantial text overlap with arXiv:1207.6936, arXiv:1302.3752

    Report number: INRIA RR-8239

  5. Checkpointing algorithms and fault prediction

    Authors: Guillaume Aupy, Yves Robert, Frédéric Vivien, Dounia Zaidouni

    Abstract: This paper deals with the impact of fault prediction techniques on checkpointing strategies. We extend the classical first-order analysis of Young and Daly in the presence of a fault prediction system, characterized by its recall and its precision. In this framework, we provide an optimal algorithm to decide when to take predictions into account, and we derive the optimal value of the checkpointin… ▽ More

    Submitted 3 December, 2013; v1 submitted 15 February, 2013; originally announced February 2013.

    Comments: Supported in part by ANR Rescue. Published in Journal of Parallel and Distributed Computing. arXiv admin note: text overlap with arXiv:1207.6936

    Report number: INRIA RR-8237

    Journal ref: Journal of Parallel and Distributed Computing, Available online 7 November 2013, ISSN 0743-7315

  6. arXiv:1210.2580  [pdf, ps, other

    cs.DC

    Scheduling tree-shaped task graphs to minimize memory and makespan

    Authors: Loris Marchal, Oliver Sinnen, Frédéric Vivien

    Abstract: This paper investigates the execution of tree-shaped task graphs using multiple processors. Each edge of such a tree represents a large IO file. A task can only be executed if all input and output files fit into memory, and a file can only be removed from memory after it has been consumed. Such trees arise, for instance, in the multifrontal method of sparse matrix factorization. The maximum amount… ▽ More

    Submitted 9 October, 2012; originally announced October 2012.

  7. arXiv:1207.6936  [pdf, other

    cs.DC cs.DS

    Impact of fault prediction on checkpointing strategies

    Authors: Guillaume Aupy, Yves Robert, Frédéric Vivien, Dounia Zaidouni

    Abstract: This paper deals with the impact of fault prediction techniques on checkpointing strategies. We extend the classical analysis of Young and Daly in the presence of a fault prediction system, which is characterized by its recall and its precision, and which provides either exact or window-based time predictions. We succeed in deriving the optimal value of the checkpointing period (thereby minimizing… ▽ More

    Submitted 9 October, 2012; v1 submitted 30 July, 2012; originally announced July 2012.

    Comments: 20 pages

    Report number: INRIA Report 8023

  8. arXiv:1106.4985  [pdf, other

    cs.DC

    Dynamic Fractional Resource Scheduling vs. Batch Scheduling

    Authors: Henri Casanova, Mark Stillwell, Frédéric Vivien

    Abstract: We propose a novel job scheduling approach for homogeneous cluster computing platforms. Its key feature is the use of virtual machine technology to share fractional node resources in a precise and controlled manner. Other VM-based scheduling approaches have focused primarily on technical issues or on extensions to existing batch scheduling systems, while we take a more aggressive approach and seek… ▽ More

    Submitted 24 June, 2011; originally announced June 2011.

    Comments: N° RR-7659 (2011)

    Report number: RR-7659

  9. arXiv:1006.5376  [pdf, other

    cs.DC

    Resource Allocation using Virtual Clusters

    Authors: Mark Stillwell, David Schanzenbach, Frédéric Vivien, Henri Casanova

    Abstract: In this report we demonstrate the potential utility of resource allocation management systems that use virtual machine technology for sharing parallel computing resources among competing jobs. We formalize the resource allocation problem with a number of underlying assumptions, determine its complexity, propose several heuristic algorithms to find near-optimal solutions, and evaluate these algorit… ▽ More

    Submitted 28 June, 2010; originally announced June 2010.

    Comments: University of Hawai'i at M{ā}noa Department of Information and Computer Sciences Technical Report

    Report number: ICS2008-09-01

  10. arXiv:0706.4038  [pdf, ps, other

    cs.DC

    Scheduling multiple divisible loads on a linear processor network

    Authors: Matthieu Gallet, Yves Robert, Frédéric Vivien

    Abstract: Min, Veeravalli, and Barlas have recently proposed strategies to minimize the overall execution time of one or several divisible loads on a heterogeneous linear network, using one or more installments. We show on a very simple example that their approach does not always produce a solution and that, when it does, the solution is often suboptimal. We also show how to find an optimal schedule for a… ▽ More

    Submitted 28 June, 2007; v1 submitted 27 June, 2007; originally announced June 2007.

  11. arXiv:cs/0702076  [pdf, ps, other

    cs.DC

    A First Step Towards Automatically Building Network Representations

    Authors: Lionel Eyraud-Dubois, Arnaud Legrand, Martin Quinson, Frédéric Vivien

    Abstract: To fully harness Grids, users or middlewares must have some knowledge on the topology of the platform interconnection network. As such knowledge is usually not available, one must uses tools which automatically build a topological network model through some measurements. In this article, we define a methodology to assess the quality of these network model building tools, and we apply this method… ▽ More

    Submitted 28 June, 2007; v1 submitted 13 February, 2007; originally announced February 2007.

  12. arXiv:cs/0702066  [pdf, ps, other

    cs.DC

    Comments on "Design and performance evaluation of load distribution strategies for multiple loads on heterogeneous linear daisy chain networks''

    Authors: Matthieu Gallet, Yves Robert, Frédéric Vivien

    Abstract: Min, Veeravalli, and Barlas proposed strategies to minimize the overall execution time of one or several divisible loads on a heterogeneous linear network, using one or more installments. We show on a very simple example that the proposed approach does not always produce a solution and that, when it does, the solution is often suboptimal. We also show how to find an optimal scheduling for any in… ▽ More

    Submitted 10 February, 2007; originally announced February 2007.

  13. arXiv:cs/0612036  [pdf, ps, other

    cs.DC cs.MS

    Revisiting Matrix Product on Master-Worker Platforms

    Authors: Jack Dongarra, Jean-Francois Pineau, Yves Robert, Zhiao Shi, Frederic Vivien

    Abstract: This paper is aimed at designing efficient parallel matrix-product algorithms for heterogeneous master-worker platforms. While matrix-product is well-understood for homogeneous 2D-arrays of processors (e.g., Cannon algorithm and ScaLAPACK outer product algorithm), there are three key hypotheses that render our work original and innovative: - Centralized data. We assume that all matrix files or… ▽ More

    Submitted 6 December, 2006; originally announced December 2006.

    ACM Class: F.2.2

  14. arXiv:cs/0610131  [pdf, ps, other

    cs.DC

    Scheduling and data redistribution strategies on star platforms

    Authors: Loris Marchal, Veronika Rehn, Yves Robert, Frédéric Vivien

    Abstract: In this work we are interested in the problem of scheduling and redistributing data on master-slave platforms. We consider the case were the workers possess initial loads, some of which having to be redistributed in order to balance their completion times. We examine two different scenarios. The first model assumes that the data consists of independent and identical tasks. We prove the NP-comple… ▽ More

    Submitted 23 October, 2006; originally announced October 2006.