Skip to main content

Showing 1–21 of 21 results for author: Leather, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.14714  [pdf, other

    cs.PL cs.LG

    Compiler generated feedback for Large Language Models

    Authors: Dejan Grubisic, Chris Cummins, Volker Seeker, Hugh Leather

    Abstract: We introduce a novel paradigm in compiler optimization powered by Large Language Models with compiler feedback to optimize the code size of LLVM assembly. The model takes unoptimized LLVM IR as input and produces optimized IR, the best optimization passes, and instruction counts of both unoptimized and optimized IRs. Then we compile the input with generated optimization passes and evaluate if the… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  2. arXiv:2403.11522  [pdf

    cs.PL cs.DC cs.LG

    LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

    Authors: Massinissa Merouani, Khaled Afif Boudaoud, Iheb Nassim Aouadj, Nassim Tchoulak, Islem Kara Bernou, Hamza Benyamina, Fatima Benbouzid-Si Tayeb, Karima Benatchba, Hugh Leather, Riyadh Baghdadi

    Abstract: While polyhedral compilers have shown success in implementing advanced code transformations, they still have challenges in selecting the most profitable transformations that lead to the best speedups. This has motivated the use of machine learning to build cost models to guide the search for polyhedral optimizations. State-of-the-art polyhedral compilers have demonstrated a viable proof-of-concept… ▽ More

    Submitted 22 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  3. arXiv:2402.18734  [pdf, other

    cs.LG cs.CL cs.PF

    Priority Sampling of Large Language Models for Compilers

    Authors: Dejan Grubisic, Chris Cummins, Volker Seeker, Hugh Leather

    Abstract: Large language models show great potential in generating and optimizing code. Widely used sampling methods such as Nucleus Sampling increase the diversity of generation but often produce repeated samples for low temperatures and incoherent samples for high temperatures. Furthermore, the temperature coefficient has to be tuned for each task, limiting its usability. We present Priority Sampling, a s… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  4. arXiv:2401.03065  [pdf, other

    cs.SE cs.AI cs.LG

    CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

    Authors: Alex Gu, Baptiste Rozière, Hugh Leather, Armando Solar-Lezama, Gabriel Synnaeve, Sida I. Wang

    Abstract: We present CRUXEval (Code Reasoning, Understanding, and eXecution Evaluation), a benchmark consisting of 800 Python functions (3-13 lines). Each function comes with an input-output pair, leading to two natural tasks: input prediction and output prediction. First, we propose a generic recipe for generating our execution benchmark which can be used to create future variation of the benchmark. Second… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 71 pages, 29 figures

  5. arXiv:2310.02110  [pdf, other

    cs.CV

    Sieve: Multimodal Dataset Pruning Using Image Captioning Models

    Authors: Anas Mahmoud, Mostafa Elhoushi, Amro Abbas, Yu Yang, Newsha Ardalani, Hugh Leather, Ari Morcos

    Abstract: Vision-Language Models (VLMs) are pretrained on large, diverse, and noisy web-crawled datasets. This underscores the critical need for dataset pruning, as the quality of these datasets is strongly correlated with the performance of VLMs on downstream tasks. Using CLIPScore from a pretrained model to only train models using highly-aligned samples is one of the most successful methods for pruning. W… ▽ More

    Submitted 10 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted in CVPR 2024

  6. arXiv:2309.07062  [pdf, other

    cs.PL cs.AI cs.CL cs.LG

    Large Language Models for Compiler Optimization

    Authors: Chris Cummins, Volker Seeker, Dejan Grubisic, Mostafa Elhoushi, Youwei Liang, Baptiste Roziere, Jonas Gehring, Fabian Gloeckle, Kim Hazelwood, Gabriel Synnaeve, Hugh Leather

    Abstract: We explore the novel application of Large Language Models to code optimization. We present a 7B-parameter transformer model trained from scratch to optimize LLVM assembly for code size. The model takes as input unoptimized assembly and outputs a list of compiler options to best optimize the program. Crucially, during training, we ask the model to predict the instruction counts before and after opt… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  7. arXiv:2303.01557  [pdf, other

    cs.LG cs.AI

    BenchDirect: A Directed Language Model for Compiler Benchmarks

    Authors: Foivos Tsimpourlas, Pavlos Petoumenos, Min Xu, Chris Cummins, Kim Hazelwood, Ajitha Rajan, Hugh Leather

    Abstract: The exponential increase of hardware-software complexity has made it impossible for compiler engineers to find the right optimization heuristics manually. Predictive models have been shown to find near optimal heuristics with little human effort but they are limited by a severe lack of diverse benchmarks to train on. Generative AI has been used by researchers to synthesize benchmarks into existing… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.06555

  8. arXiv:2301.05104  [pdf, other

    cs.PL cs.AI cs.LG

    Learning Compiler Pass Orders using Coreset and Normalized Value Prediction

    Authors: Youwei Liang, Kevin Stone, Ali Shameli, Chris Cummins, Mostafa Elhoushi, Jiadong Guo, Benoit Steiner, Xiaomeng Yang, Pengtao Xie, Hugh Leather, Yuandong Tian

    Abstract: Finding the optimal pass sequence of compilation can lead to a significant reduction in program size and/or improvement in program efficiency. Prior works on compilation pass ordering have two major drawbacks. They either require an excessive budget (in terms of compilation steps) at compile time or fail to generalize to unseen programs. In this paper, for code-size reduction tasks, we propose a n… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

  9. arXiv:2212.11353  [pdf, other

    cs.CL cs.LG

    Contrastive Distillation Is a Sample-Efficient Self-Supervised Loss Policy for Transfer Learning

    Authors: Chris Lengerich, Gabriel Synnaeve, Amy Zhang, Hugh Leather, Kurt Shuster, François Charton, Charysse Redwood

    Abstract: Traditional approaches to RL have focused on learning decision policies directly from episodic decisions, while slowly and implicitly learning the semantics of compositional representations needed for generalization. While some approaches have been adopted to refine representations via auxiliary self-supervised losses while simultaneously learning decision policies, learning compositional represen… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

  10. arXiv:2208.06555  [pdf, other

    cs.AI

    BenchPress: A Deep Active Benchmark Generator

    Authors: Foivos Tsimpourlas, Pavlos Petoumenos, Min Xu, Chris Cummins, Kim Hazelwood, Ajitha Rajan, Hugh Leather

    Abstract: We develop BenchPress, the first ML benchmark generator for compilers that is steerable within feature space representations of source code. BenchPress synthesizes compiling functions by adding new code in any part of an empty or existing sequence by jointly observing its left and right context, achieving excellent compilation rate. BenchPress steers benchmark generation towards desired target fea… ▽ More

    Submitted 15 August, 2022; v1 submitted 12 August, 2022; originally announced August 2022.

    Comments: To appear in PACT 2022

  11. arXiv:2207.03578  [pdf, other

    cs.PL cs.CL cs.LG

    Code Translation with Compiler Representations

    Authors: Marc Szafraniec, Baptiste Roziere, Hugh Leather, Francois Charton, Patrick Labatut, Gabriel Synnaeve

    Abstract: In this paper, we leverage low-level compiler intermediate representations (IR) to improve code translation. Traditional transpilers rely on syntactic information and handcrafted rules, which limits their applicability and produces unnatural-looking code. Applying neural machine translation (NMT) approaches to code has successfully broadened the set of programs on which one can get a natural-looki… ▽ More

    Submitted 24 April, 2023; v1 submitted 30 June, 2022; originally announced July 2022.

    Comments: 9 pages

  12. arXiv:2206.03684  [pdf

    cs.PL cs.LG

    Progress Report: A Deep Learning Guided Exploration of Affine Unimodular Loop Transformations

    Authors: Massinissa Merouani, Khaled Afif Boudaoud, Iheb Nassim Aouadj, Nassim Tchoulak, Fatima Benbouzid-Sitayeb, Karima Benatchba, Hugh Leather, Riyadh Baghdadi

    Abstract: In this paper, we present a work in progress about a deep learning based approach for automatic code optimization in polyhedral compilers. The proposed technique explores combinations of affine and non-affine loop transformations to find the sequence of transformations that minimizes the execution time of a given program. This exploration is guided by a deep learning based cost model that evaluate… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Journal ref: In Proceedings of 12th International Workshop on Polyhedral Compilation Techniques (IMPACT 2022)

  13. arXiv:2205.00618  [pdf, other

    cs.LG cs.PF cs.SC

    LoopStack: a Lightweight Tensor Algebra Compiler Stack

    Authors: Bram Wasti, José Pablo Cambronero, Benoit Steiner, Hugh Leather, Aleksandar Zlateski

    Abstract: We present LoopStack, a domain specific compiler stack for tensor operations, composed of a frontend, LoopTool, and an efficient optimizing code generator, LoopNest. This stack enables us to compile entire neural networks and generate code targeting the AVX2, AVX512, NEON, and NEONfp16 instruction sets while incorporating optimizations often missing from other machine learning compiler backends. W… ▽ More

    Submitted 1 May, 2022; originally announced May 2022.

  14. arXiv:2111.12116  [pdf, other

    cs.PL

    Caviar: An E-graph Based TRS for Automatic Code Optimization

    Authors: Smail Kourta, Adel Namani, Fatima Benbouzid-Si Tayeb, Kim Hazelwood, Chris Cummins, Hugh Leather, Riyadh Baghdadi

    Abstract: Term Rewriting Systems (TRSs) are used in compilers to simplify and prove expressions. State-of-the-art TRSs in compilers use a greedy algorithm that applies a set of rewriting rules in a predefined order (where some of the rules are not axiomatic). This leads to a loss of the ability to simplify certain expressions. E-graphs and equality saturation sidestep this issue by representing the differen… ▽ More

    Submitted 27 February, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: Accepted in the 31st Conference on Compiler Construction (CC 2022)

  15. arXiv:2109.08267  [pdf, other

    cs.PL cs.AI cs.LG cs.PF

    CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research

    Authors: Chris Cummins, Bram Wasti, Jiadong Guo, Brandon Cui, Jason Ansel, Sahir Gomez, Somya Jain, Jia Liu, Olivier Teytaud, Benoit Steiner, Yuandong Tian, Hugh Leather

    Abstract: Interest in applying Artificial Intelligence (AI) techniques to compiler optimizations is increasing rapidly, but compiler research has a high entry barrier. Unlike in other domains, compiler and AI researchers do not have access to the datasets and frameworks that enable fast iteration and development of ideas, and getting started requires a significant engineering investment. What is needed is a… ▽ More

    Submitted 22 December, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: 12 pages. Source code available at https://github.com/facebookresearch/CompilerGym

  16. arXiv:2108.12489  [pdf, ps, other

    cs.LG cs.SE

    Using Graph Neural Networks to model the performance of Deep Neural Networks

    Authors: Shikhar Singh, Benoit Steiner, James Hegarty, Hugh Leather

    Abstract: With the unprecedented proliferation of machine learning software, there is an ever-increasing need to generate efficient code for such applications. State-of-the-art deep-learning compilers like TVM and Halide incorporate a learning-based performance model to search the space of valid implementations of a given deep learning algorithm. For a given application, the model generates a performance me… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

  17. arXiv:2012.01470  [pdf, other

    cs.PL cs.LG

    Deep Data Flow Analysis

    Authors: Chris Cummins, Hugh Leather, Zacharias Fisches, Tal Ben-Nun, Torsten Hoefler, Michael O'Boyle

    Abstract: Compiler architects increasingly look to machine learning when building heuristics for compiler optimization. The promise of automatic heuristic design, freeing the compiler engineer from the complex interactions of program, architecture, and other optimizations, is alluring. However, most machine learning methods cannot replicate even the simplest of the abstract interpretations of data flow anal… ▽ More

    Submitted 20 November, 2020; originally announced December 2020.

    Comments: 9 pages, plus appendices. arXiv admin note: text overlap with arXiv:2003.10536

  18. arXiv:2011.14486  [pdf, other

    cs.LG cs.AI

    Value Function Based Performance Optimization of Deep Learning Workloads

    Authors: Benoit Steiner, Chris Cummins, Horace He, Hugh Leather

    Abstract: As machine learning techniques become ubiquitous, the efficiency of neural network implementations is becoming correspondingly paramount. Frameworks, such as Halide and TVM, separate out the algorithmic representation of the network from the schedule that determines its implementation. Finding good schedules, however, remains extremely challenging. We model this scheduling problem as a sequence of… ▽ More

    Submitted 29 November, 2020; originally announced November 2020.

  19. arXiv:2003.10536  [pdf, other

    cs.LG cs.PF cs.PL stat.ML

    ProGraML: Graph-based Deep Learning for Program Optimization and Analysis

    Authors: Chris Cummins, Zacharias V. Fisches, Tal Ben-Nun, Torsten Hoefler, Hugh Leather

    Abstract: The increasing complexity of computing systems places a tremendous burden on optimizing compilers, requiring ever more accurate and aggressive optimizations. Machine learning offers significant benefits for constructing optimization heuristics but there remains a gap between what state-of-the-art methods achieve and the performance of an optimal heuristic. Closing this gap requires improvements in… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: 20 pages, author preprint

  20. arXiv:1511.02603  [pdf, other

    cs.PL

    Iterative compilation on mobile devices

    Authors: Paschalis Mpeis, Pavlos Petoumenos, Hugh Leather

    Abstract: The abundance of poorly optimized mobile applications coupled with their increasing centrality in our digital lives make a framework for mobile app optimization an imperative. While tuning strategies for desktop and server applications have a long history, it is difficult to adapt them for use on mobile phones. Reference inputs which trigger behavior similar to a mobile application's typical are… ▽ More

    Submitted 6 January, 2016; v1 submitted 9 November, 2015; originally announced November 2015.

    Comments: 8 pages, 8 figures

  21. arXiv:1511.02490  [pdf, other

    cs.DC

    Autotuning OpenCL Workgroup Size for Stencil Patterns

    Authors: Chris Cummins, Pavlos Petoumenos, Michel Steuwer, Hugh Leather

    Abstract: Selecting an appropriate workgroup size is critical for the performance of OpenCL kernels, and requires knowledge of the underlying hardware, the data being operated on, and the implementation of the kernel. This makes portable performance of OpenCL programs a challenging goal, since simple heuristics and statically chosen values fail to exploit the available performance. To address this, we propo… ▽ More

    Submitted 6 January, 2016; v1 submitted 8 November, 2015; originally announced November 2015.

    Comments: 8 pages, 6 figures, presented at the 6th International Workshop on Adaptive Self-tuning Computing Systems (ADAPT '16)