Skip to main content

Showing 1–19 of 19 results for author: Hasabnis, N

.
  1. arXiv:2406.03003  [pdf, other

    cs.PL

    Verified Code Transpilation with LLMs

    Authors: Sahil Bhatia, Jie Qiu, Niranjan Hasabnis, Sanjit A. Seshia, Alvin Cheung

    Abstract: Domain-specific languages (DSLs) are integral to various software workflows. Such languages offer domain-specific optimizations and abstractions that improve code readability and maintainability. However, leveraging these languages requires developers to rewrite existing code using the specific DSL's API. While large language models (LLMs) have shown some success in automatic code transpilation, n… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2404.18249  [pdf, other

    cs.PL

    Tenspiler: A Verified Lifting-Based Compiler for Tensor Operations

    Authors: Jie Qiu, Colin Cai, Sahil Bhatia, Niranjan Hasabnis, Sanjit A. Seshia, Alvin Cheung

    Abstract: Tensor processing infrastructures such as deep learning frameworks and specialized hardware accelerators have revolutionized how computationally intensive code from domains such as deep learning and image processing is executed and optimized. These infrastructures provide powerful and expressive abstractions while ensuring high performance. However, to utilize them, code must be written specifical… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  3. arXiv:2404.15204  [pdf, other

    cs.PL cs.AI cs.AR cs.DC cs.LG

    Towards a high-performance AI compiler with upstream MLIR

    Authors: Renato Golin, Lorenzo Chelini, Adam Siemieniuk, Kavitha Madhu, Niranjan Hasabnis, Hans Pabst, Evangelos Georganas, Alexander Heinecke

    Abstract: This work proposes a compilation flow using open-source compiler passes to build a framework to achieve ninja performance from a generic linear algebra high-level abstraction. We demonstrate this flow with a proof-of-concept MLIR project that uses input IR in Linalg-on-Tensor from TensorFlow and PyTorch, performs cache-level optimizations and lowering to micro-kernels for efficient vectorization,… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 13 pages, 8 figures, presented at CGO C4ML 2024 & MLIR Workshop EuroLLVM 2024

  4. arXiv:2402.09126  [pdf, other

    cs.DC cs.AI cs.CL cs.LG cs.SE

    MPIrigen: MPI Code Generation through Domain-Specific Language Models

    Authors: Nadav Schneider, Niranjan Hasabnis, Vy A. Vo, Tal Kadosh, Neva Krien, Mihai Capotă, Guy Tamir, Ted Willke, Nesreen Ahmed, Yuval Pinter, Timothy Mattson, Gal Oren

    Abstract: The imperative need to scale computation across numerous nodes highlights the significance of efficient parallel computing, particularly in the realm of Message Passing Interface (MPI) integration. The challenging parallel programming task of generating MPI-based parallel programs has remained unexplored. This study first investigates the performance of state-of-the-art language models in generati… ▽ More

    Submitted 23 April, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  5. arXiv:2402.02018  [pdf, other

    cs.LG

    The Landscape and Challenges of HPC Research and LLMs

    Authors: Le Chen, Nesreen K. Ahmed, Akash Dutta, Arijit Bhattacharjee, Sixing Yu, Quazi Ishtiaque Mahmud, Waqwoya Abebe, Hung Phan, Aishwarya Sarkar, Branden Butler, Niranjan Hasabnis, Gal Oren, Vy A. Vo, Juan Pablo Munoz, Theodore L. Willke, Tim Mattson, Ali Jannesari

    Abstract: Recently, language models (LMs), especially large language models (LLMs), have revolutionized the field of deep learning. Both encoder-decoder models and prompt-based techniques have shown immense potential for natural language processing and code-based tasks. Over the past several years, many research labs and institutions have invested heavily in high-performance computing, approaching or breach… ▽ More

    Submitted 6 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  6. arXiv:2401.16445  [pdf, other

    cs.SE cs.DC cs.LG

    OMPGPT: A Generative Pre-trained Transformer Model for OpenMP

    Authors: Le Chen, Arijit Bhattacharjee, Nesreen Ahmed, Niranjan Hasabnis, Gal Oren, Vy Vo, Ali Jannesari

    Abstract: Large language models (LLMs)such as ChatGPT have significantly advanced the field of Natural Language Processing (NLP). This trend led to the development of code-based large language models such as StarCoder, WizardCoder, and CodeLlama, which are trained extensively on vast repositories of code and programming languages. While the generic abilities of these code LLMs are useful for many programmer… ▽ More

    Submitted 21 June, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  7. arXiv:2312.13322  [pdf, other

    cs.PL cs.AI cs.LG cs.SE

    Domain-Specific Code Language Models: Unraveling the Potential for HPC Codes and Tasks

    Authors: Tal Kadosh, Niranjan Hasabnis, Vy A. Vo, Nadav Schneider, Neva Krien, Mihai Capota, Abdul Wasay, Nesreen Ahmed, Ted Willke, Guy Tamir, Yuval Pinter, Timothy Mattson, Gal Oren

    Abstract: With easier access to powerful compute resources, there is a growing trend in AI for software development to develop larger language models (LLMs) to address a variety of programming tasks. Even LLMs applied to tasks from the high-performance computing (HPC) domain are huge in size and demand expensive compute resources for training. This is partly because these LLMs for HPC tasks are obtained by… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  8. arXiv:2311.06505  [pdf, other

    cs.LG

    CompCodeVet: A Compiler-guided Validation and Enhancement Approach for Code Dataset

    Authors: Le Chen, Arijit Bhattacharjee, Nesreen K. Ahmed, Niranjan Hasabnis, Gal Oren, Bin Lei, Ali Jannesari

    Abstract: Large language models (LLMs) have become increasingly prominent in academia and industry due to their remarkable performance in diverse applications. As these models evolve with increasing parameters, they excel in tasks like sentiment analysis and machine translation. However, even models with billions of parameters face challenges in tasks demanding multi-step reasoning. Code generation and comp… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

  9. arXiv:2308.09440  [pdf, other

    cs.CL cs.PL

    Scope is all you need: Transforming LLMs for HPC Code

    Authors: Tal Kadosh, Niranjan Hasabnis, Vy A. Vo, Nadav Schneider, Neva Krien, Abdul Wasay, Nesreen Ahmed, Ted Willke, Guy Tamir, Yuval Pinter, Timothy Mattson, Gal Oren

    Abstract: With easier access to powerful compute resources, there is a growing trend in the field of AI for software development to develop larger and larger language models (LLMs) to address a variety of programming tasks. Even LLMs applied to tasks from the high-performance computing (HPC) domain are huge in size (e.g., billions of parameters) and demand expensive compute resources for training. We found… ▽ More

    Submitted 29 September, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

  10. arXiv:2308.08002  [pdf, ps, other

    cs.DC cs.DB

    Quantifying OpenMP: Statistical Insights into Usage and Adoption

    Authors: Tal Kadosh, Niranjan Hasabnis, Timothy Mattson, Yuval Pinter, Gal Oren

    Abstract: In high-performance computing (HPC), the demand for efficient parallel programming models has grown dramatically since the end of Dennard Scaling and the subsequent move to multi-core CPUs. OpenMP stands out as a popular choice due to its simplicity and portability, offering a directive-driven approach for shared-memory parallel programming. Despite its wide adoption, however, there is a lack of c… ▽ More

    Submitted 17 August, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

  11. arXiv:2305.11999  [pdf, other

    cs.DC cs.AI cs.LG cs.PF

    Advising OpenMP Parallelization via a Graph-Based Approach with Transformers

    Authors: Tal Kadosh, Nadav Schneider, Niranjan Hasabnis, Timothy Mattson, Yuval Pinter, Gal Oren

    Abstract: There is an ever-present need for shared memory parallelization schemes to exploit the full potential of multi-core architectures. The most common parallelization API addressing this need today is OpenMP. Nevertheless, writing parallel code manually is complex and effort-intensive. Thus, many deterministic source-to-source (S2S) compilers have emerged, intending to automate the process of translat… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  12. arXiv:2305.09438  [pdf, other

    cs.DC cs.CL cs.LG

    MPI-rical: Data-Driven MPI Distributed Parallelism Assistance with Transformers

    Authors: Nadav Schneider, Tal Kadosh, Niranjan Hasabnis, Timothy Mattson, Yuval Pinter, Gal Oren

    Abstract: Message Passing Interface (MPI) plays a crucial role in distributed memory parallelization across multiple nodes. However, parallelizing MPI code manually, and specifically, performing domain decomposition, is a challenging, error-prone task. In this paper, we address this problem by develo** MPI-RICAL, a novel data-driven, programming-assistance tool that assists programmers in writing domain d… ▽ More

    Submitted 30 August, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

  13. arXiv:2211.15739  [pdf, other

    cs.DC cs.LG

    CWD: A Machine Learning based Approach to Detect Unknown Cloud Workloads

    Authors: Mohammad Hossain, Derssie Mebratu, Niranjan Hasabnis, Jun **, Gaurav Chaudhary, Noah Shen

    Abstract: Workloads in modern cloud data centers are becoming increasingly complex. The number of workloads running in cloud data centers has been growing exponentially for the last few years, and cloud service providers (CSP) have been supporting on-demand services in real-time. Realizing the growing complexity of cloud environment and cloud workloads, hardware vendors such as Intel and AMD are increasingl… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: 7 pages, 4 figures, Appeared at The MLSys'22 Workshop on Cloud Intelligence(AIOps), In conjunction with the 5th Conference on Machine Learning and Systems

  14. arXiv:2209.11946  [pdf, other

    cs.SE cs.AI cs.LG

    Are Machine Programming Systems using Right Source-Code Measures to Select Code Repositories?

    Authors: Niranjan Hasabnis

    Abstract: Machine programming (MP) is an emerging field at the intersection of deterministic and probabilistic computing, and it aims to assist software and hardware engineers, among other applications. Along with powerful compute resources, MP systems often rely on vast amount of open-source code to learn interesting properties about code and programming and solve problems in the areas of debugging, code r… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

    Comments: 6 pages, 1 figure, to be presented at MaLTeSQuE 2022 workshop to be held with ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC-FSE) 2022, November 18, Singapore,

  15. arXiv:2205.02360  [pdf, other

    cs.SE cs.AI cs.LG

    GitRank: A Framework to Rank GitHub Repositories

    Authors: Niranjan Hasabnis

    Abstract: Open-source repositories provide wealth of information and are increasingly being used to build artificial intelligence (AI) based systems to solve problems in software engineering. Open-source repositories could be of varying quality levels, and bad-quality repositories could degrade performance of these systems. Evaluating quality of open-source repositories, which is not available directly on c… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: 3 pages, 1 figure; to be published in Mining Software Repositories 2022 conference (hackathon)

  16. arXiv:2109.06266  [pdf, other

    cs.LG cs.AI cs.DC

    Automatic Tuning of Tensorflow's CPU Backend using Gradient-Free Optimization Algorithms

    Authors: Derssie Mebratu, Niranjan Hasabnis, Pietro Mercati, Gaurit Sharma, Shamima Najnin

    Abstract: Modern deep learning (DL) applications are built using DL libraries and frameworks such as TensorFlow and PyTorch. These frameworks have complex parameters and tuning them to obtain good training and inference performance is challenging for typical users, such as DL developers and data scientists. Manual tuning requires deep knowledge of the user-controllable parameters of DL frameworks as well as… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: To appear in the Proceedings of the Machine Learning on HPC Systems (MLHPCS) workshop held in conjunction with International Supercomputing Conference (ISC), July 2, 2021

  17. arXiv:2011.03616  [pdf, other

    cs.SE cs.AI cs.PL

    ControlFlag: A Self-Supervised Idiosyncratic Pattern Detection System for Software Control Structures

    Authors: Niranjan Hasabnis, Justin Gottschlich

    Abstract: Software debugging has been shown to utilize upwards of half of developers' time. Yet, machine programming (MP), the field concerned with the automation of software (and hardware) development, has recently made strides in both research and production-quality automated debugging systems. In this paper we present ControlFlag, a self-supervised MP system that aims to improve debugging by attempting t… ▽ More

    Submitted 17 May, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: To appear in Proceedings of the 5th ACM SIGPLAN International Symposium on Machine Programming (MAPS '21)

  18. arXiv:2006.05265  [pdf, other

    cs.LG cs.SE stat.ML

    MISIM: A Neural Code Semantics Similarity System Using the Context-Aware Semantics Structure

    Authors: Fangke Ye, Shengtian Zhou, Anand Venkat, Ryan Marcus, Nesime Tatbul, Jesmin Jahan Tithi, Niranjan Hasabnis, Paul Petersen, Timothy Mattson, Tim Kraska, Pradeep Dubey, Vivek Sarkar, Justin Gottschlich

    Abstract: Code semantics similarity can be used for many tasks such as code recommendation, automated software defect correction, and clone detection. Yet, the accuracy of such systems has not yet reached a level of general purpose reliability. To help address this, we present Machine Inferred Code Similarity (MISIM), a neural code semantics similarity system consisting of two core components: (i)MISIM uses… ▽ More

    Submitted 2 June, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: text overlap with arXiv:2003.11118

  19. arXiv:1812.01665  [pdf, other

    cs.DC cs.AI cs.LG

    Auto-tuning TensorFlow Threading Model for CPU Backend

    Authors: Niranjan Hasabnis

    Abstract: TensorFlow is a popular deep learning framework used by data scientists to solve a wide-range of machine learning and deep learning problems such as image classification and speech recognition. It also operates at a large scale and in heterogeneous environments --- it allows users to train neural network models or deploy them for inference using GPUs, CPUs and deep learning specific custom-designe… ▽ More

    Submitted 4 December, 2018; originally announced December 2018.

    Comments: Paper presented at Machine Learning in HPC Environments workshop held along with SuperComputing 2018, Dallas, Texas