Skip to main content

Showing 1–3 of 3 results for author: Ganapathi, P

.
  1. arXiv:2105.06676  [pdf, other

    cs.DS cs.DC

    Fast Stencil Computations using Fast Fourier Transforms

    Authors: Zafar Ahmad, Rezaul Chowdhury, Rathish Das, Pramod Ganapathi, Aaron Gregory, Yimin Zhu

    Abstract: Stencil computations are widely used to simulate the change of state of physical systems across a multidimensional grid over multiple timesteps. The state-of-the-art techniques in this area fall into three groups: cache-aware tiled loo** algorithms, cache-oblivious divide-and-conquer trapezoidal algorithms, and Krylov subspace methods. In this paper, we present two efficient parallel algorithm… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: This paper will appear in the proceedings of SPAA 2021

  2. arXiv:2008.13292  [pdf, other

    cs.DS cs.DC

    Low-Depth Parallel Algorithms for the Binary-Forking Model without Atomics

    Authors: Zafar Ahmad, Rezaul Chowdhury, Rathish Das, Pramod Ganapathi, Aaron Gregory, Mohammad Mahdi Javanmard

    Abstract: The binary-forking model is a parallel computation model, formally defined by Blelloch et al. very recently, in which a thread can fork a concurrent child thread, recursively and asynchronously. The model incurs a cost of $Θ(\log n)$ to spawn or synchronize $n$ tasks or threads. The binary-forking model realistically captures the performance of parallel algorithms implemented using modern multithr… ▽ More

    Submitted 2 September, 2020; v1 submitted 30 August, 2020; originally announced August 2020.

  3. arXiv:1009.4214  [pdf, ps, other

    cs.DS

    A Versatile Algorithm to Generate Various Combinatorial Structures

    Authors: Pramod Ganapathi, Rama B

    Abstract: Algorithms to generate various combinatorial structures find tremendous importance in computer science. In this paper, we begin by reviewing an algorithm proposed by Rohl that generates all unique permutations of a list of elements which possibly contains repetitions, taking some or all of the elements at a time, in any imposed order. The algorithm uses an auxiliary array that maintains the number… ▽ More

    Submitted 30 September, 2010; v1 submitted 21 September, 2010; originally announced September 2010.