Skip to main content

Showing 1–12 of 12 results for author: Ahrens, W

.
  1. arXiv:2407.01742  [pdf, other

    cs.PL

    The Continuous Tensor Abstraction: Where Indices are Real

    Authors: Jaeyeon Won, Willow Ahrens, Joel S. Emer, Saman Amarasinghe

    Abstract: This paper introduces the continuous tensor abstraction, allowing indices to take real-number values (e.g., A[3.14]), and provides a continuous loop construct that iterates over the infinitely large set of real numbers. This paper expands the existing tensor abstraction to include continuous tensors that exhibit a piecewise-constant property, enabling the transformation of an infinite amount of co… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.09266  [pdf, other

    cs.MS

    SySTeC: A Symmetric Sparse Tensor Compiler

    Authors: Radha Patel, Willow Ahrens, Saman Amarasinghe

    Abstract: Symmetric and sparse tensors arise naturally in many domains including linear algebra, statistics, physics, chemistry, and graph theory. Symmetric tensors are equal to their transposes, so in the $n$-dimensional case we can save up to a factor of $n!$ by avoiding redundant operations. Sparse tensors, on the other hand, are mostly zero, and we can save asymptotically by processing only nonzeros. Un… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2404.16730  [pdf, other

    cs.MS

    Finch: Sparse and Structured Array Programming with Control Flow

    Authors: Willow Ahrens, Teodoro Fields Collin, Radha Patel, Kyle Deeds, Changwan Hong, Saman Amarasinghe

    Abstract: From FORTRAN to NumPy, arrays have revolutionized how we express computation. However, arrays in these, and almost all prominent systems, can only handle dense rectilinear integer grids. Real world arrays often contain underlying structure, such as sparsity, runs of repeated values, or symmetry. Support for structured data is fragmented and incomplete. Existing frameworks limit the array structure… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  4. arXiv:2404.06477  [pdf, other

    cs.PL cs.LO

    Mechanised Hypersafety Proofs about Structured Data: Extended Version

    Authors: Vladimir Gladshtein, Qiyuan Zhao, Willow Ahrens, Saman Amarasinghe, Ilya Sergey

    Abstract: Arrays are a fundamental abstraction to represent collections of data. It is often possible to exploit structural properties of the data stored in an array (e.g., repetition or sparsity) to develop a specialised representation optimised for space efficiency. Formally reasoning about correctness of manipulations with such structured data is challenging, as they are often composed of multiple loops… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Extended version of the paper accepted at PLDI'24

  5. Verbesserung des Record Linkage für die Gesundheitsforschung in Deutschland

    Authors: Timm Intemann, Knut Kaulke, Dennis-Kenji Kipker, Vanessa Lettieri, Christoph Stallmann, Carsten O. Schmidt, Lars Geidel, Martin Bialke, Christopher Hampf, Dana Stahl, Martin Lablans, Florens Rohde, Martin Franke, Klaus Kraywinkel, Joachim Kieschke, Sebastian Bartholomäus, Anatol-Fiete Näher, Galina Tremper, Mohamed Lambarki, Stefanie March, Fabian Prasser, Anna Christine Haber, Johannes Drepper, Irene Schlünder, Toralf Kirsten , et al. (5 additional authors not shown)

    Abstract: Record linkage means linking data from multiple sources. This approach enables the answering of scientific questions that cannot be addressed using single data sources due to limited variables. The potential of linked data for health research is enormous, as it can enhance prevention, treatment, and population health policies. Due the sensitivity of health data, there are strict legal requirements… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: in German language

  6. arXiv:2209.05250  [pdf

    cs.PL cs.MS

    Looplets: A Language For Structured Coiteration

    Authors: Willow Ahrens, Daniel Donenfeld, Fredrik Kjolstad, Saman Amarasinghe

    Abstract: Real world arrays often contain underlying structure, such as sparsity, runs of repeated values, or symmetry. Specializing for structure yields significant speedups. But automatically generating efficient code for structured data is challenging, especially when arrays with different structure interact. We show how to abstract over array structures so that the compiler can generate code to coiter… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

  7. arXiv:2111.14947  [pdf, other

    cs.MS cs.PL

    An Asymptotic Cost Model for Autoscheduling Sparse Tensor Programs

    Authors: Willow Ahrens, Fredrik Kjolstad, Saman Amarasinghe

    Abstract: While loop reordering and fusion can make big impacts on the constant-factor performance of dense tensor programs, the effects on sparse tensor programs are asymptotic, often leading to orders of magnitude performance differences in practice. Sparse tensors also introduce a choice of compressed storage formats that can have asymptotic effects. Research into sparse tensor compilers has led to sim… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 14 pages, 7 figures

  8. arXiv:2007.16192  [pdf, other

    cs.DS

    Contiguous Graph Partitioning For Optimal Total Or Bottleneck Communication

    Authors: Willow Ahrens

    Abstract: Graph partitioning schedules parallel calculations like sparse matrix-vector multiply (SpMV). We consider contiguous partitions, where the $m$ rows (or columns) of a sparse matrix with $N$ nonzeros are split into $K$ parts without reordering. We propose the first near-linear time algorithms for several graph partitioning problems in the contiguous regime. Traditional objectives such as the sim… ▽ More

    Submitted 21 June, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

    Comments: 20 pages; added total partitioning algorithm, added total costs, added experimental results, added lazy near-linear bisection algorithm, simplified presentation. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  9. arXiv:2005.12414  [pdf, other

    cs.DS

    On Optimal Partitioning For Sparse Matrices In Variable Block Row Format

    Authors: Willow Ahrens, Erik G. Boman

    Abstract: The Variable Block Row (VBR) format is an influential blocked sparse matrix format designed for matrices with shared sparsity structure between adjacent rows and columns. VBR groups adjacent rows and columns, storing the resulting blocks that contain nonzeros in a dense format. This reduces the memory footprint and enables optimizations such as register blocking and instruction-level parallelism… ▽ More

    Submitted 25 May, 2021; v1 submitted 25 May, 2020; originally announced May 2020.

    Comments: 22 pages; added experimental results for VBR, updated presentation of results

  10. Sparse Tensor Transpositions

    Authors: Suzanne Mueller, Willow Ahrens, Stephen Chou, Fredrik Kjolstad, Saman Amarasinghe

    Abstract: We present a new algorithm for transposing sparse tensors called Quesadilla. The algorithm converts the sparse tensor data structure to a list of coordinates and sorts it with a fast multi-pass radix algorithm that exploits knowledge of the requested transposition and the tensors input partial coordinate ordering to provably minimize the number of parallel partial sorting passes. We evaluate bot… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: This work will be the subject of a brief announcement at the 32nd ACM Symposium on Parallelism in Algorithms and Architectures (SPAA '20)

  11. arXiv:1807.05642  [pdf, other

    cs.CL cs.DC

    LATE Ain'T Earley: A Faster Parallel Earley Parser

    Authors: Willow Ahrens, John Feser, Robin Hui

    Abstract: We present the LATE algorithm, an asynchronous variant of the Earley algorithm for parsing context-free grammars. The Earley algorithm is naturally task-based, but is difficult to parallelize because of dependencies between the tasks. We present the LATE algorithm, which uses additional data structures to maintain information about the state of the parse so that work items may be processed in an… ▽ More

    Submitted 15 July, 2018; originally announced July 2018.

  12. arXiv:1802.10574  [pdf, other

    cs.MS cs.PL

    Sparse Tensor Algebra Optimizations with Workspaces

    Authors: Fredrik Kjolstad, Willow Ahrens, Shoaib Kamil, Saman Amarasinghe

    Abstract: This paper shows how to optimize sparse tensor algebraic expressions by introducing temporary tensors, called workspaces, into the resulting loop nests. We develop a new intermediate language for tensor operations called concrete index notation that extends tensor index notation. Concrete index notation expresses when and where sub-computations occur and what tensor they are stored into. We then… ▽ More

    Submitted 24 April, 2018; v1 submitted 28 February, 2018; originally announced February 2018.

    Comments: 25 pages