Skip to main content

Showing 1–3 of 3 results for author: Kristensen, M R B

Searching in archive cs. Search in all archives.
.
  1. Fusion of Array Operations at Runtime

    Authors: Mads R. B. Kristensen, Simon A. F. Lund, Troels Blum, James Avery

    Abstract: We address the problem of fusing array operations based on criteria such as shape compatibility, data reusability, and communication. We formulate the problem as a graph partition problem that is general enough to handle loop fusion, combinator fusion, and other types of subroutines.

    Submitted 21 January, 2016; v1 submitted 20 January, 2016; originally announced January 2016.

    Comments: Preprint

    Journal ref: Proceeding PACT '16 Proceedings of the 2016 International Conference on Parallel Architectures and Compilation Pages 71-85

  2. arXiv:1210.7774  [pdf, other

    cs.PL cs.DC

    cphVB: A System for Automated Runtime Optimization and Parallelization of Vectorized Applications

    Authors: Mads Ruben Burgdorff Kristensen, Simon Andreas Frimann Lund, Troels Blum, Brian Vinter

    Abstract: Modern processor architectures, in addition to having still more cores, also require still more consideration to memory-layout in order to run at full capacity. The usefulness of most languages is deprecating as their abstractions, structures or objects are hard to map onto modern processor architectures efficiently. The work in this paper introduces a new abstract machine framework, cphVB, that… ▽ More

    Submitted 25 March, 2013; v1 submitted 26 October, 2012; originally announced October 2012.

    Journal ref: Proceedings of The 11th Python In Science Conference (SciPy 2012)

  3. Managing Communication Latency-Hiding at Runtime for Parallel Programming Languages and Libraries

    Authors: Mads Ruben Burgdorff Kristensen, Brian Vinter

    Abstract: This work introduces a runtime model for managing communication with support for latency-hiding. The model enables non-computer science researchers to exploit communication latency-hiding techniques seamlessly. For compiled languages, it is often possible to create efficient schedules for communication, but this is not the case for interpreted languages. By maintaining data dependencies between sc… ▽ More

    Submitted 18 January, 2012; originally announced January 2012.

    Comments: PREPRINT

    Journal ref: Proceeding HPCC '12 Proceedings of the 2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems