Skip to main content

Showing 1–4 of 4 results for author: Ashouri, A H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.09982  [pdf, other

    cs.PL cs.AI cs.LG cs.PF

    ACPO: AI-Enabled Compiler-Driven Program Optimization

    Authors: Amir H. Ashouri, Muhammad Asif Manzoor, Duc Minh Vu, Raymond Zhang, Ziwen Wang, Angel Zhang, Bryan Chan, Tomasz S. Czajkowski, Yaoqing Gao

    Abstract: The key to performance optimization of a program is to decide correctly when a certain transformation should be applied by a compiler. This is an ideal opportunity to apply machine-learning models to speed up the tuning process; while this realization has been around since the late 90s, only recent advancements in ML enabled a practical application of ML to compilers as an end-to-end framework.… ▽ More

    Submitted 11 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Preprint version of ACPO (12 pages)

    ACM Class: I.2.5; D.3.0; I.2.6

  2. arXiv:2207.08389  [pdf, other

    cs.PL cs.AI cs.LG cs.NE cs.PF

    MLGOPerf: An ML Guided Inliner to Optimize Performance

    Authors: Amir H. Ashouri, Mostafa Elhoushi, Yuzhe Hua, Xiang Wang, Muhammad Asif Manzoor, Bryan Chan, Yaoqing Gao

    Abstract: For the past 25 years, we have witnessed an extensive application of Machine Learning to the Compiler space; the selection and the phase-ordering problem. However, limited works have been upstreamed into the state-of-the-art compilers, i.e., LLVM, to seamlessly integrate the former into the optimization pipeline of a compiler to be readily deployed by the user. MLGO was among the first of such pro… ▽ More

    Submitted 19 July, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: Version 2: Added the missing Table 6. The short version of this work is accepted at ACM/IEEE CASES 2022

    ACM Class: I.2.5; D.3.0; I.2.6

  3. Fast On-the-fly Retraining-free Sparsification of Convolutional Neural Networks

    Authors: Amir H. Ashouri, Tarek S. Abdelrahman, Alwyn Dos Remedios

    Abstract: Modern Convolutional Neural Networks (CNNs) are complex, encompassing millions of parameters. Their deployment exerts computational, storage and energy demands, particularly on embedded platforms. Existing approaches to prune or sparsify CNNs require retraining to maintain inference accuracy. Such retraining is not feasible in some contexts. In this paper, we explore the sparsification of CNNs by… ▽ More

    Submitted 8 September, 2019; v1 submitted 10 November, 2018; originally announced November 2018.

    Comments: Extended Version of Our Accepted Paper in NIPS 2018, CDNNRIA Workshop: (https://nips.cc/Conferences/2018/Schedule?showEvent=10941)- Reviews are available at OpenReview (https://openreview.net/forum?id=rkz1YD0vjm)

    Journal ref: Elsevier Neurocomputing, 2019

  4. arXiv:1801.04405  [pdf, other

    cs.PL cs.LG

    A Survey on Compiler Autotuning using Machine Learning

    Authors: Amir H. Ashouri, William Killian, John Cavazos, Gianluca Palermo, Cristina Silvano

    Abstract: Since the mid-1990s, researchers have been trying to use machine-learning based approaches to solve a number of different compiler optimization problems. These techniques primarily enhance the quality of the obtained results and, more importantly, make it feasible to tackle two main compiler optimization problems: optimization selection (choosing which optimizations to apply) and phase-ordering (c… ▽ More

    Submitted 3 September, 2018; v1 submitted 13 January, 2018; originally announced January 2018.

    Comments: version 5.0 (updated on September 2018)- Preprint Version For our Accepted Journal @ ACM CSUR 2018 (42 pages) - This survey will be updated quarterly here (Send me your new published papers to be added in the subsequent version) History: Received November 2016; Revised August 2017; Revised February 2018; Accepted March 2018-