Showing 1–5 of 5 results for author: Grubisic, D

Searching in archive cs.
  1. arXiv:2403.14714  [pdf, other]

    cs.PL cs.LG

    Compiler generated feedback for Large Language Models

    Authors: Dejan Grubisic, Chris Cummins, Volker Seeker, Hugh Leather

    Abstract: We introduce a novel paradigm in compiler optimization powered by Large Language Models with compiler feedback to optimize the code size of LLVM assembly. The model takes unoptimized LLVM IR as input and produces optimized IR, the best optimization passes, and instruction counts of both unoptimized and optimized IRs. Then we compile the input with generated optimization passes and evaluate if the…

    Submitted 18 March, 2024; originally announced March 2024.

  2. arXiv:2402.18734  [pdf, other]

    cs.LG cs.CL cs.PF

    Priority Sampling of Large Language Models for Compilers

    Authors: Dejan Grubisic, Chris Cummins, Volker Seeker, Hugh Leather

    Abstract: Large language models show great potential in generating and optimizing code. Widely used sampling methods such as Nucleus Sampling increase the diversity of generation but often produce repeated samples for low temperatures and incoherent samples for high temperatures. Furthermore, the temperature coefficient has to be tuned for each task, limiting its usability. We present Priority Sampling, a s…

    Submitted 28 February, 2024; originally announced February 2024.

  3. arXiv:2309.07062  [pdf, other]

    cs.PL cs.AI cs.CL cs.LG

    Large Language Models for Compiler Optimization

    Authors: Chris Cummins, Volker Seeker, Dejan Grubisic, Mostafa Elhoushi, Youwei Liang, Baptiste Roziere, Jonas Gehring, Fabian Gloeckle, Kim Hazelwood, Gabriel Synnaeve, Hugh Leather

    Abstract: We explore the novel application of Large Language Models to code optimization. We present a 7B-parameter transformer model trained from scratch to optimize LLVM assembly for code size. The model takes as input unoptimized assembly and outputs a list of compiler options to best optimize the program. Crucially, during training, we ask the model to predict the instruction counts before and after opt…

    Submitted 11 September, 2023; originally announced September 2023.

  4. arXiv:2309.01825  [pdf, other]

    cs.LG cs.PL

    LoopTune: Optimizing Tensor Computations with Reinforcement Learning

    Authors: Dejan Grubisic, Bram Wasti, Chris Cummins, John Mellor-Crummey, Aleksandar Zlateski

    Abstract: Advanced compiler technology is crucial for enabling machine learning applications to run on novel hardware, but traditional compilers fail to deliver performance, popular auto-tuners have long search times and expert-optimized libraries introduce unsustainable costs. To address this, we developed LoopTune, a deep reinforcement learning compiler that optimizes tensor computations in deep learning…

    Submitted 8 November, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

  5. Measurement and Analysis of GPU-accelerated Applications with HPCToolkit

    Authors: Keren Zhou, Laksono Adhianto, Jonathon Anderson, Aaron Cherian, Dejan Grubisic, Mark Krentel, Yumeng Liu, Xiaozhu Meng, John Mellor-Crummey

    Abstract: To address the challenge of performance analysis on the US DOE's forthcoming exascale supercomputers, Rice University has been extending its HPCToolkit performance tools to support measurement and analysis of GPU-accelerated applications. To help developers understand the performance of accelerated applications as a whole, HPCToolkit's measurement and analysis tools attribute metrics to calling co…

    Submitted 14 September, 2021; originally announced September 2021.

    Journal ref: Parallel Computing 2021