Skip to main content

Showing 1–16 of 16 results for author: Sarkar, V

.
  1. arXiv:2407.06051  [pdf

    cond-mat.mtrl-sci cond-mat.soft

    Foams with 3D Spatially Programmed Mechanics Enabled by Autonomous Active Learning on Viscous Thread Printing

    Authors: Brett Emery, Kelsey L. Snapp, Daniel Revier, Vivek Sarkar, Masa Nakura, Keith A. Brown, Jeffrey Ian Lipton

    Abstract: Foams are versatile by nature and ubiquitous in a wide range of applications, including padding, insulation, and acoustic dampening. Previous work established that foams 3D printed via Viscous Thread Printing (VTP) can in principle combine the flexibility of 3D printing with the mechanical properties of conventional foams. However, the generality of prior work is limited due to the lack of predict… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 23 pages, 7 figures

    MSC Class: 74Q99

  2. Enabling Multi-threading in Heterogeneous Quantum-Classical Programming Models

    Authors: Akihiro Hayashi, Austin Adams, Jeffrey Young, Alexander McCaskey, Eugene Dumitrescu, Vivek Sarkar, Thomas M. Conte

    Abstract: In this paper, we address some of the key limitations to realizing a generic heterogeneous parallel programming model for quantum-classical heterogeneous platforms. We discuss our experience in enabling user-level multi-threading in QCOR as well as challenges that need to be addressed for programming future quantum-classical systems. Specifically, we discuss our design and implementation of introd… ▽ More

    Submitted 15 March, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  3. arXiv:2203.06233  [pdf, other

    cs.DC

    Automatic Parallelization of Python Programs for Distributed Heterogeneous Computing

    Authors: Jun Shirako, Akihiro Hayashi, Sri Raj Paul, Alexey Tumanov, Vivek Sarkar

    Abstract: This paper introduces a novel approach to automatic ahead-of-time (AOT) parallelization and optimization of sequential Python programs for execution on distributed heterogeneous platforms. Our approach enables AOT source-to-source transformation of Python programs, driven by the inclusion of type hints for function parameters and return values. These hints can be supplied by the programmer or obta… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 14 pages, 10 figures, under submission to Euro-Par 2022 conference (https://2022.euro-par.org)

  4. arXiv:2107.05516  [pdf, other

    cs.DC

    A Scalable Actor-based Programming System for PGAS Runtimes

    Authors: Sri Raj Paul, Akihiro Hayashi, Kun Chen, Vivek Sarkar

    Abstract: The PGAS model is well suited for executing irregular applications on cluster-based systems, due to its efficient support for short, one-sided messages. However, there are currently two major limitations faced by PGAS applications. The first relates to scalability: despite the availability of APIs that support non-blocking operations in special cases, many PGAS operations on remote locations are s… ▽ More

    Submitted 18 June, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

  5. An Ownership Policy and Deadlock Detector for Promises

    Authors: Caleb Voss, Vivek Sarkar

    Abstract: Task-parallel programs often enjoy deadlock freedom under certain restrictions, such as the use of structured join operations, as in Cilk and X10, or the use of asynchronous task futures together with deadlock-avoiding policies such as Known Joins or Transitive Joins. However, the promise, a popular synchronization primitive for parallel tasks, does not enjoy deadlock-freedom guarantees. Promises… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

    Journal ref: Principles and Practice of Parallel Programming, 2021, ACM, pp. 348-361

  6. arXiv:2011.03196  [pdf, other

    cs.DC

    Task-Graph Scheduling Extensions for Efficient Synchronization and Communication

    Authors: Seonmyeong Bak, Oscar Hernandez, Mark Gates, Piotr Luszczek, Vivek Sarkar

    Abstract: Task graphs have been studied for decades as a foundation for scheduling irregular parallel applications and incorporated in programming models such as OpenMP. While many high-performance parallel libraries are based on task graphs, they also have additional scheduling requirements, such as synchronization from inner levels of data parallelism and internal blocking communications. In this paper, w… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

  7. arXiv:2009.05949  [pdf, other

    cs.PL cs.LG

    Advanced Graph-Based Deep Learning for Probabilistic Type Inference

    Authors: Fangke Ye, Jisheng Zhao, Vivek Sarkar

    Abstract: Dynamically typed languages such as JavaScript and Python have emerged as the most popular programming languages in use. Important benefits can accrue from including type annotations in dynamically typed programs. This approach to gradual ty** is exemplified by the TypeScript programming system which allows programmers to specify partially typed programs, and then uses static analysis to infer t… ▽ More

    Submitted 14 November, 2021; v1 submitted 13 September, 2020; originally announced September 2020.

  8. arXiv:2006.05265  [pdf, other

    cs.LG cs.SE stat.ML

    MISIM: A Neural Code Semantics Similarity System Using the Context-Aware Semantics Structure

    Authors: Fangke Ye, Shengtian Zhou, Anand Venkat, Ryan Marcus, Nesime Tatbul, Jesmin Jahan Tithi, Niranjan Hasabnis, Paul Petersen, Timothy Mattson, Tim Kraska, Pradeep Dubey, Vivek Sarkar, Justin Gottschlich

    Abstract: Code semantics similarity can be used for many tasks such as code recommendation, automated software defect correction, and clone detection. Yet, the accuracy of such systems has not yet reached a level of general purpose reliability. To help address this, we present Machine Inferred Code Similarity (MISIM), a neural code semantics similarity system consisting of two core components: (i)MISIM uses… ▽ More

    Submitted 2 June, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: text overlap with arXiv:2003.11118

  9. arXiv:2006.01331  [pdf, other

    cs.DC

    Vyasa: A High-Performance Vectorizing Compiler for Tensor Convolutions on the Xilinx AI Engine

    Authors: Prasanth Chatarasi, Stephen Neuendorffer, Samuel Bayliss, Kees Vissers, Vivek Sarkar

    Abstract: Xilinx's AI Engine is a recent industry example of energy-efficient vector processing that includes novel support for 2D SIMD datapaths and shuffle interconnection network. The current approach to programming the AI Engine relies on a C/C++ API for vector intrinsics. While an advance over assembly-level programming, it requires the programmer to specify a number of low-level operations based on de… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

  10. arXiv:2003.11118  [pdf, ps, other

    cs.PL cs.AI

    Context-Aware Parse Trees

    Authors: Fangke Ye, Shengtian Zhou, Anand Venkat, Ryan Marcus, Paul Petersen, Jesmin Jahan Tithi, Tim Mattson, Tim Kraska, Pradeep Dubey, Vivek Sarkar, Justin Gottschlich

    Abstract: The simplified parse tree (SPT) presented in Aroma, a state-of-the-art code recommendation system, is a tree-structured representation used to infer code semantics by capturing program \emph{structure} rather than program \emph{syntax}. This is a departure from the classical abstract syntax tree, which is principally driven by programming language syntax. While we believe a semantics-driven repres… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

  11. arXiv:2002.07752  [pdf, other

    cs.DC cs.LG cs.PF

    Marvel: A Data-centric Compiler for DNN Operators on Spatial Accelerators

    Authors: Prasanth Chatarasi, Hyoukjun Kwon, Natesh Raina, Saurabh Malik, Vaisakh Haridas, Angshuman Parashar, Michael Pellauer, Tushar Krishna, Vivek Sarkar

    Abstract: The efficiency of a spatial DNN accelerator depends heavily on the compiler and its cost model ability to generate optimized map**s for various operators of DNN models on to the accelerator's compute and memory resources. But, existing cost models lack a formal boundary over the operators for precise and tractable analysis, which poses adaptability challenges for new DNN operators. To address th… ▽ More

    Submitted 11 June, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

  12. arXiv:1805.02566  [pdf, other

    cs.DC cs.LG

    Understanding Reuse, Performance, and Hardware Cost of DNN Dataflows: A Data-Centric Approach Using MAESTRO

    Authors: Hyoukjun Kwon, Prasanth Chatarasi, Michael Pellauer, Angshuman Parashar, Vivek Sarkar, Tushar Krishna

    Abstract: The data partitioning and scheduling strategies used by DNN accelerators to leverage reuse and perform staging are known as dataflow, and they directly impact the performance and energy efficiency of DNN accelerator designs. An accelerator microarchitecture dictates the dataflow(s) that can be employed to execute a layer or network. Selecting an optimal dataflow for a layer shape can have a large… ▽ More

    Submitted 11 May, 2020; v1 submitted 4 May, 2018; originally announced May 2018.

  13. arXiv:1608.00636  [pdf, ps, other

    cs.PF cs.DC math.NA

    A survey of sparse matrix-vector multiplication performance on large matrices

    Authors: Max Grossman, Christopher Thiele, Mauricio Araya-Polo, Florian Frank, Faruk O. Alpak, Vivek Sarkar

    Abstract: We contribute a third-party survey of sparse matrix-vector (SpMV) product performance on industrial-strength, large matrices using: (1) The SpMV implementations in Intel MKL, the Trilinos project (Tpetra subpackage), the CUSPARSE library, and the CUSP library, each running on modern architectures. (2) NVIDIA GPUs and Intel multi-core CPUs (supported by each software package). (3) The CSR, BSR, COO… ▽ More

    Submitted 1 August, 2016; originally announced August 2016.

    Comments: Rice Oil & Gas High Performance Computing Workshop. March 2016

  14. Formalization of Phase Ordering

    Authors: Tiago Cogumbreiro, Jun Shirako, Vivek Sarkar

    Abstract: Phasers pose an interesting synchronization mechanism that generalizes many collective synchronization patterns seen in parallel programming languages, including barriers, clocks, and point-to-point synchronization using latches or semaphores. This work characterizes scheduling constraints on phaser operations, by relating the execution state of two tasks that operate on the same phaser. We propos… ▽ More

    Submitted 19 June, 2016; originally announced June 2016.

    Comments: In Proceedings PLACES 2016, arXiv:1606.05403

    ACM Class: D.1.3; D.2.4; D.3.3; D.4.1

    Journal ref: EPTCS 211, 2016, pp. 13-24

  15. arXiv:1504.05967  [pdf, other

    cs.CR

    Finding Tizen security bugs through whole-system static analysis

    Authors: Daniel Song, Jisheng Zhao, Michael Burke, Dragoş Sbîrlea, Dan Wallach, Vivek Sarkar

    Abstract: Tizen is a new Linux-based open source platform for consumer devices including smartphones, televisions, vehicles, and wearables. While Tizen provides kernel-level mandatory policy enforcement, it has a large collection of libraries, implemented in a mix of C and C++, which make their own security checks. In this research, we describe the design and engineering of a static analysis engine which dr… ▽ More

    Submitted 22 April, 2015; originally announced April 2015.

  16. arXiv:1407.4859  [pdf, other

    cs.DC

    ADHA: Automatic Data layout framework for Heterogeneous Architectures

    Authors: Deepak Majeti, Kuldeep S. Meel, Rajkishore Barik, Vivek Sarkar

    Abstract: Data layouts play a crucial role in determining the performance of a given application running on a given architecture. Existing parallel programming frameworks for both multicore and heterogeneous systems leave the onus of selecting a data layout to the programmer. Therefore, shifting the burden of data layout selection to optimizing compilers can greatly enhance programmer productivity and appli… ▽ More

    Submitted 17 July, 2014; originally announced July 2014.