Skip to main content

Showing 1–50 of 85 results for author: Ray, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06461  [pdf, other

    cs.CL

    Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies

    Authors: Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar, Ben Athiwaratkun

    Abstract: A diverse array of reasoning strategies has been proposed to elicit the capabilities of large language models. However, in this paper, we point out that traditional evaluations which focus solely on performance metrics miss a key factor: the increased effectiveness due to additional compute. By overlooking this aspect, a skewed view of strategy efficiency is often presented. This paper introduces… ▽ More

    Submitted 14 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2406.06435  [pdf, other

    cs.CL cs.AI

    Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain

    Authors: Brian Hu, Bill Ray, Alice Leung, Amy Summerville, David Joy, Christopher Funk, Arslan Basharat

    Abstract: In difficult decision-making scenarios, it is common to have conflicting opinions among expert human decision-makers as there may not be a single right answer. Such decisions may be guided by different attributes that can be used to characterize an individual's decision. We introduce a novel dataset for medical triage decision-making, labeled with a set of decision-maker attributes (DMAs). This da… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 15 pages total (including appendix), NAACL 2024 Industry Track

  3. arXiv:2406.01006  [pdf, other

    cs.CL cs.AI cs.SE

    SemCoder: Training Code Language Models with Comprehensive Semantics

    Authors: Yangruibo Ding, **jun Peng, Marcus J. Min, Gail Kaiser, Junfeng Yang, Baishakhi Ray

    Abstract: Code Large Language Models (Code LLMs) have excelled at tasks like code completion but often miss deeper semantics such as execution effects and dynamic states. This paper aims to bridge the gap between Code LLMs' reliance on static text data and the need for thorough semantic understanding for complex tasks like debugging and program repair. We introduce a novel strategy to train Code LLMs with c… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2405.18649  [pdf, other

    cs.CL cs.AI cs.SE

    Training LLMs to Better Self-Debug and Explain Code

    Authors: Nan Jiang, Xiaopeng Li, Shiqi Wang, Qiang Zhou, Soneya Binta Hossain, Baishakhi Ray, Varun Kumar, Xiaofei Ma, Anoop Deoras

    Abstract: In the domain of code generation, self-debugging is crucial. It allows LLMs to refine their generated code based on execution feedback. This is particularly important because generating correct solutions in one attempt proves challenging for complex tasks. Prior works on self-debugging mostly focus on prompting methods by providing LLMs with few-shot examples, which work poorly on small open-sourc… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  5. arXiv:2405.18574  [pdf, other

    cs.SE

    SpecTra: Enhancing the Code Translation Ability of Language Models by Generating Multi-Modal Specifications

    Authors: Vikram Nitin, Baishakhi Ray

    Abstract: Large language models (LLMs) are increasingly being used for the task of automated code translation, which has important real-world applications. However, most existing approaches use only the source code of a program as an input to an LLM, and do not consider the different kinds of specifications that can be extracted from a program. In this paper, we propose SpecTra, a multi-stage approach that… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  6. arXiv:2405.15805  [pdf, other

    q-bio.NC cs.AI cs.LG

    DSAM: A Deep Learning Framework for Analyzing Temporal and Spatial Dynamics in Brain Networks

    Authors: Bishal Thapaliya, Robyn Miller, Jiayu Chen, Yu-** Wang, Esra Akbas, Ram Sapkota, Bhaskar Ray, Pranav Suresh, Santosh Ghimire, Vince Calhoun, **gyu Liu

    Abstract: Resting-state functional magnetic resonance imaging (rs-fMRI) is a noninvasive technique pivotal for understanding human neural mechanisms of intricate cognitive processes. Most rs-fMRI studies compute a single static functional connectivity matrix across brain regions of interest, or dynamic functional connectivity matrices with a sliding window approach. These approaches are at risk of oversimpl… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 18 Pages, 4 figures

  7. arXiv:2405.02213  [pdf, other

    cs.SE cs.AI cs.LG

    Automatic Programming: Large Language Models and Beyond

    Authors: Michael R. Lyu, Baishakhi Ray, Abhik Roychoudhury, Shin Hwei Tan, Patanamon Thongtanunam

    Abstract: Automatic programming has seen increasing popularity due to the emergence of tools like GitHub Copilot which rely on Large Language Models (LLMs). At the same time, automatically generated code faces challenges during deployment due to concerns around quality and trust. In this article, we study automated coding in a general sense and study the concerns around code quality, security and related is… ▽ More

    Submitted 15 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  8. arXiv:2405.01567  [pdf, other

    cs.SE cs.AI

    CodeFort: Robust Training for Code Generation Models

    Authors: Yuhao Zhang, Shiqi Wang, Haifeng Qian, Zijian Wang, Mingyue Shang, Linbo Liu, Sanjay Krishna Gouda, Baishakhi Ray, Murali Krishna Ramanathan, Xiaofei Ma, Anoop Deoras

    Abstract: Code generation models are not robust to small perturbations, which often lead to inconsistent and incorrect generations and significantly degrade the performance of these models. Improving the robustness of code generation models is crucial to better user experience when these models are deployed in real-world applications. However, existing efforts have not addressed this issue for code generati… ▽ More

    Submitted 11 April, 2024; originally announced May 2024.

  9. arXiv:2403.18746  [pdf, other

    cs.SE cs.CL

    CYCLE: Learning to Self-Refine the Code Generation

    Authors: Yangruibo Ding, Marcus J. Min, Gail Kaiser, Baishakhi Ray

    Abstract: Pre-trained code language models have achieved promising performance in code generation and improved the programming efficiency of human developers. However, their self-refinement capability is typically overlooked by the existing evaluations of code LMs, which focus only on the accuracy of the one-time prediction. For the cases when code LMs fail to implement the correct program, developers actua… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Camera-ready for OOPSLA'24

  10. arXiv:2403.18624  [pdf, other

    cs.SE cs.CL

    Vulnerability Detection with Code Language Models: How Far Are We?

    Authors: Yangruibo Ding, Yanjun Fu, Omniyyah Ibrahim, Chawin Sitawarin, Xinyun Chen, Basel Alomair, David Wagner, Baishakhi Ray, Yizheng Chen

    Abstract: In the context of the rising interest in code language models (code LMs) and vulnerability detection, we study the effectiveness of code LMs for detecting vulnerabilities. Our analysis reveals significant shortcomings in existing vulnerability datasets, including poor data quality, low label accuracy, and high duplication rates, leading to unreliable model performance in realistic vulnerability de… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  11. arXiv:2403.16921  [pdf, other

    cs.CV

    PropTest: Automatic Property Testing for Improved Visual Programming

    Authors: Jaywon Koo, Ziyan Yang, Paola Cascante-Bonilla, Baishakhi Ray, Vicente Ordonez

    Abstract: Visual Programming has emerged as an alternative to end-to-end black-box visual reasoning models. This type of methods leverage Large Language Models (LLMs) to decompose a problem and generate the source code for an executable computer program. This strategy has the advantage of offering an interpretable reasoning path and does not require finetuning a model with task-specific data. We propose Pro… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project Page: https://jaywonkoo17.github.io/PropTest/

  12. arXiv:2402.00097  [pdf, other

    cs.SE cs.LG

    Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM

    Authors: Gabriel Ryan, Siddhartha Jain, Mingyue Shang, Shiqi Wang, Xiaofei Ma, Murali Krishna Ramanathan, Baishakhi Ray

    Abstract: Testing plays a pivotal role in ensuring software quality, yet conventional Search Based Software Testing (SBST) methods often struggle with complex software units, achieving suboptimal test coverage. Recent works using large language models (LLMs) for test generation have focused on improving generation quality through optimizing the test generation context and correcting errors in model outputs,… ▽ More

    Submitted 2 April, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

  13. arXiv:2311.03520  [pdf, other

    cs.LG cs.AI q-bio.NC

    Brain Networks and Intelligence: A Graph Neural Network Based Approach to Resting State fMRI Data

    Authors: Bishal Thapaliya, Esra Akbas, Jiayu Chen, Raam Sapkota, Bhaskar Ray, Pranav Suresh, Vince Calhoun, **gyu Liu

    Abstract: Resting-state functional magnetic resonance imaging (rsfMRI) is a powerful tool for investigating the relationship between brain function and cognitive processes as it allows for the functional organization of the brain to be captured without relying on a specific task or stimuli. In this paper, we present a novel modeling architecture called BrainRGIN for predicting intelligence (fluid, crystalli… ▽ More

    Submitted 26 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  14. arXiv:2310.14053  [pdf, other

    cs.LG cs.CL cs.SE

    Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain

    Authors: Marcus J. Min, Yangruibo Ding, Luca Buratti, Saurabh Pujar, Gail Kaiser, Suman Jana, Baishakhi Ray

    Abstract: Code Large Language Models (Code LLMs) are being increasingly employed in real-life applications, so evaluating them is critical. While the conventional accuracy evaluates the performance of Code LLMs on a set of individual tasks, their self-consistency across different tasks is overlooked. Intuitively, a trustworthy model should be self-consistent when generating natural language specifications f… ▽ More

    Submitted 26 February, 2024; v1 submitted 21 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

    MSC Class: 68 ACM Class: I.2; D.2

  15. arXiv:2310.08507  [pdf, other

    cs.SE

    Yuga: Automatically Detecting Lifetime Annotation Bugs in the Rust Language

    Authors: Vikram Nitin, Anne Mulhern, Sanjay Arora, Baishakhi Ray

    Abstract: The Rust programming language is becoming increasingly popular among systems programmers due to its efficient performance and robust memory safety guarantees. Rust employs an ownership model to ensure this guarantee by allowing each value to be owned by only one identifier at a time. Additionally, it introduces the concept of borrowing and lifetimes to enable other variables to borrow the values u… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  16. arXiv:2310.07958  [pdf, other

    cs.SE cs.CR cs.LG stat.ME

    Towards Causal Deep Learning for Vulnerability Detection

    Authors: Md Mahbubur Rahman, Ira Ceka, Chengzhi Mao, Saikat Chakraborty, Baishakhi Ray, Wei Le

    Abstract: Deep learning vulnerability detection has shown promising results in recent years. However, an important challenge that still blocks it from being very useful in practice is that the model is not robust under perturbation and it cannot generalize well over the out-of-distribution (OOD) data, e.g., applying a trained model to unseen projects in real world. We hypothesize that this is because the mo… ▽ More

    Submitted 14 January, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: ICSE 2024, Camera Ready Version

  17. arXiv:2306.07888  [pdf, other

    cs.PF cs.SE eess.SY

    CAMEO: A Causal Transfer Learning Approach for Performance Optimization of Configurable Computer Systems

    Authors: Md Shahriar Iqbal, Ziyuan Zhong, Iftakhar Ahmad, Baishakhi Ray, Pooyan Jamshidi

    Abstract: Modern computer systems are highly configurable, with hundreds of configuration options that interact, resulting in an enormous configuration space. As a result, optimizing performance goals (e.g., latency) in such systems is challenging due to frequent uncertainties in their environments (e.g., workload fluctuations). Recently, transfer learning has been applied to address this problem by reusing… ▽ More

    Submitted 3 October, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  18. arXiv:2306.07487  [pdf, other

    cs.SE

    TRACED: Execution-aware Pre-training for Source Code

    Authors: Yangruibo Ding, Ben Steenhoek, Kexin Pei, Gail Kaiser, Wei Le, Baishakhi Ray

    Abstract: Most existing pre-trained language models for source code focus on learning the static code text, typically augmented with static code structures (abstract syntax tree, dependency graphs, etc.). However, program semantics will not be fully exposed before the real execution. Without an understanding of the program execution, statically pre-trained models fail to comprehensively capture the dynamic… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Accepted by ICSE 2024 (Early Cycle). Camera-ready is in preparation

  19. arXiv:2306.06490  [pdf, other

    cs.SE cs.PL

    Automated Code Editing with Search-Generate-Modify

    Authors: Changshu Liu, Pelin Cetin, Yogesh Patodia, Saikat Chakraborty, Yangruibo Ding, Baishakhi Ray

    Abstract: Code editing is essential in evolving software development. Many automated code editing tools have been proposed that leverage both Information Retrieval-based techniques and Machine Learning-based code generation and code editing models. Each technique comes with its own promises and perils, and they are often used together to complement their strengths and compensate for their weaknesses. This p… ▽ More

    Submitted 26 February, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

    Comments: 12 pages, 10 figures

  20. arXiv:2306.06344  [pdf, other

    cs.RO cs.AI cs.LG

    Language-Guided Traffic Simulation via Scene-Level Diffusion

    Authors: Ziyuan Zhong, Davis Rempe, Yuxiao Chen, Boris Ivanovic, Yulong Cao, Danfei Xu, Marco Pavone, Baishakhi Ray

    Abstract: Realistic and controllable traffic simulation is a core capability that is necessary to accelerate autonomous vehicle (AV) development. However, current approaches for controlling learning-based traffic models require significant domain expertise and are difficult for practitioners to use. To remedy this, we present CTG++, a scene-level conditional diffusion model that can be guided by language in… ▽ More

    Submitted 18 October, 2023; v1 submitted 10 June, 2023; originally announced June 2023.

  21. arXiv:2306.03234  [pdf, other

    cs.SE

    CONCORD: Clone-aware Contrastive Learning for Source Code

    Authors: Yangruibo Ding, Saikat Chakraborty, Luca Buratti, Saurabh Pujar, Alessandro Morari, Gail Kaiser, Baishakhi Ray

    Abstract: Deep Learning (DL) models to analyze source code have shown immense promise during the past few years. More recently, self-supervised pre-training has gained traction for learning generic code representations valuable for many downstream SE tasks, such as clone and bug detection. While previous work successfully learned from different code abstractions (e.g., token, AST, graph), we argue that it… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Camera-ready for 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 23)

  22. arXiv:2306.03203  [pdf, other

    cs.CL cs.SE

    A Static Evaluation of Code Completion by Large Language Models

    Authors: Hantian Ding, Varun Kumar, Yuchen Tian, Zijian Wang, Rob Kwiatkowski, Xiaopeng Li, Murali Krishna Ramanathan, Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, Bing Xiang

    Abstract: Large language models trained on code have shown great potential to increase productivity of software developers. Several execution-based benchmarks have been proposed to evaluate functional correctness of model-generated code on simple programming problems. Nevertheless, it is expensive to perform the same evaluation on complex real-world projects considering the execution cost. On the contrary,… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by ACL 2023 industry track

  23. arXiv:2304.12743  [pdf, other

    cs.SE

    TraceFixer: Execution Trace-Driven Program Repair

    Authors: Islem Bouzenia, Yangruibo Ding, Kexin Pei, Baishakhi Ray, Michael Pradel

    Abstract: When debugging unintended program behavior, developers can often identify the point in the execution where the actual behavior diverges from the desired behavior. For example, a variable may get assigned a wrong value, which then negatively influences the remaining computation. Once a developer identifies such a divergence, how to fix the code so that it provides the desired behavior? This paper p… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

  24. arXiv:2303.07615  [pdf, other

    cs.CV

    Variation of Gender Biases in Visual Recognition Models Before and After Finetuning

    Authors: Jaspreet Ranjit, Tianlu Wang, Baishakhi Ray, Vicente Ordonez

    Abstract: We introduce a framework to measure how biases change before and after fine-tuning a large scale visual recognition model for a downstream task. Deep learning models trained on increasing amounts of data are known to encode societal biases. Many computer vision systems today rely on models typically pretrained on large scale datasets. While bias mitigation techniques have been developed for tuning… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 10 pages, 3 Figures

  25. arXiv:2303.05378  [pdf, other

    cs.LG cs.SE

    Greener yet Powerful: Taming Large Code Generation Models with Quantization

    Authors: Xiaokai Wei, Sujan Gonugondla, Wasi Ahmad, Shiqi Wang, Baishakhi Ray, Haifeng Qian, Xiaopeng Li, Varun Kumar, Zijian Wang, Yuchen Tian, Qing Sun, Ben Athiwaratkun, Mingyue Shang, Murali Krishna Ramanathan, Parminder Bhatia, Bing Xiang

    Abstract: ML-powered code generation aims to assist developers to write code in a more productive manner, by intelligently generating code blocks based on natural language prompts. Recently, large pretrained deep learning models have substantially pushed the boundary of code generation and achieved impressive performance. Despite their great power, the huge number of model parameters poses a significant thr… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: 10 pages, 7 figures, 10 tables

  26. arXiv:2302.10812  [pdf, other

    cs.PL cs.AI cs.SE

    On ML-Based Program Translation: Perils and Promises

    Authors: Aniketh Malyala, Katelyn Zhou, Baishakhi Ray, Saikat Chakraborty

    Abstract: With the advent of new and advanced programming languages, it becomes imperative to migrate legacy software to new programming languages. Unsupervised Machine Learning-based Program Translation could play an essential role in such migration, even without a sufficiently sizeable reliable corpus of parallel source code. However, these translators are far from perfect due to their statistical nature.… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: 5 pages, 2 figures. Accepted at ICSE 2023 NIER - New Ideas and Emerging Results

  27. arXiv:2212.10264  [pdf, other

    cs.LG cs.CL cs.SE

    ReCode: Robustness Evaluation of Code Generation Models

    Authors: Shiqi Wang, Zheng Li, Haifeng Qian, Chenghao Yang, Zijian Wang, Mingyue Shang, Varun Kumar, Samson Tan, Baishakhi Ray, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Dan Roth, Bing Xiang

    Abstract: Code generation models have achieved impressive performance. However, they tend to be brittle as slight edits to a prompt could lead to very different generations; these robustness properties, critical for user experience when deployed in real-life applications, are not well understood. Most existing works on robustness in text or code tasks have focused on classification, while robustness in gene… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Code and data available at https://github.com/amazon-science/recode

  28. arXiv:2210.17366  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    Guided Conditional Diffusion for Controllable Traffic Simulation

    Authors: Ziyuan Zhong, Davis Rempe, Danfei Xu, Yuxiao Chen, Sushant Veer, Tong Che, Baishakhi Ray, Marco Pavone

    Abstract: Controllable and realistic traffic simulation is critical for develo** and verifying autonomous vehicles. Typical heuristic-based traffic models offer flexible control to make vehicles follow specific trajectories and traffic rules. On the other hand, data-driven approaches generate realistic and human-like behaviors, improving transfer from simulated to real-world traffic. However, to the best… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

  29. arXiv:2210.14868  [pdf, other

    cs.LG cs.CL

    Multi-lingual Evaluation of Code Generation Models

    Authors: Ben Athiwaratkun, Sanjay Krishna Gouda, Zijian Wang, Xiaopeng Li, Yuchen Tian, Ming Tan, Wasi Uddin Ahmad, Shiqi Wang, Qing Sun, Mingyue Shang, Sujan Kumar Gonugondla, Hantian Ding, Varun Kumar, Nathan Fulton, Arash Farahani, Siddhartha Jain, Robert Giaquinto, Haifeng Qian, Murali Krishna Ramanathan, Ramesh Nallapati, Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, Bing Xiang

    Abstract: We present new benchmarks on evaluation code generation models: MBXP and Multilingual HumanEval, and MathQA-X. These datasets cover over 10 programming languages and are generated using a scalable conversion framework that transpiles prompts and test cases from the original Python datasets into the corresponding data in the target language. Using these benchmarks, we are able to assess the perform… ▽ More

    Submitted 28 March, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Code and data release: https://github.com/amazon-research/mxeval

  30. arXiv:2210.14250  [pdf, other

    cs.CL

    Exploring Document-Level Literary Machine Translation with Parallel Paragraphs from World Literature

    Authors: Katherine Thai, Marzena Karpinska, Kalpesh Krishna, Bill Ray, Moira Inghilleri, John Wieting, Mohit Iyyer

    Abstract: Literary translation is a culturally significant task, but it is bottlenecked by the small number of qualified literary translators relative to the many untranslated works published around the world. Machine translation (MT) holds potential to complement the work of human translators by improving both training procedures and their overall efficiency. Literary translation is less constrained than m… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  31. arXiv:2210.02853  [pdf, other

    cs.CR cs.LG cs.PL cs.SE

    NeuDep: Neural Binary Memory Dependence Analysis

    Authors: Kexin Pei, Dongdong She, Michael Wang, Scott Geng, Zhou Xuan, Yaniv David, Junfeng Yang, Suman Jana, Baishakhi Ray

    Abstract: Determining whether multiple instructions can access the same memory location is a critical task in binary analysis. It is challenging as statically computing precise alias information is undecidable in theory. The problem aggravates at the binary level due to the presence of compiler optimizations and the absence of symbols and types. Existing approaches either produce significant spurious depend… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: ESEC/FSE 2022

  32. arXiv:2210.01185  [pdf, other

    cs.CL

    ContraCLM: Contrastive Learning For Causal Language Model

    Authors: Nihal Jain, Dejiao Zhang, Wasi Uddin Ahmad, Zijian Wang, Feng Nan, Xiaopeng Li, Ming Tan, Ramesh Nallapati, Baishakhi Ray, Parminder Bhatia, Xiaofei Ma, Bing Xiang

    Abstract: Despite exciting progress in causal language models, the expressiveness of the representations is largely limited due to poor discrimination ability. To remedy this issue, we present ContraCLM, a novel contrastive learning framework at both token-level and sequence-level. We assess ContraCLM on a variety of downstream tasks. We show that ContraCLM enhances discrimination of the representations and… ▽ More

    Submitted 2 May, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 10 pages

    Journal ref: ACL 2023

  33. arXiv:2209.14921  [pdf

    cs.CR

    IvySyn: Automated Vulnerability Discovery in Deep Learning Frameworks

    Authors: Neophytos Christou, Di **, Vaggelis Atlidakis, Baishakhi Ray, Vasileios P. Kemerlis

    Abstract: We present IvySyn, the first fully-automated framework for discovering memory error vulnerabilities in Deep Learning (DL) frameworks. IvySyn leverages the statically-typed nature of native APIs in order to automatically perform type-aware mutation-based fuzzing on low-level kernel code. Given a set of offending inputs that trigger memory safety (and runtime) errors in low-level, native DL (C/C++)… ▽ More

    Submitted 27 April, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted at USENIX Security 2023

  34. arXiv:2207.11784  [pdf, other

    cs.SE

    CARGO: AI-Guided Dependency Analysis for Migrating Monolithic Applications to Microservices Architecture

    Authors: Vikram Nitin, Shubhi Asthana, Baishakhi Ray, Rahul Krishna

    Abstract: Microservices Architecture (MSA) has become a de-facto standard for designing cloud-native enterprise applications due to its efficient infrastructure setup, service availability, elastic scalability, dependability, and better security. Existing (monolithic) systems must be decomposed into microservices to harness these characteristics. Since manual decomposition of large scale applications can be… ▽ More

    Submitted 6 October, 2022; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: ACM Distinguished Paper ASE '22, October 10-14, 2022, Ann Arbor, MI, USA

    ACM Class: D.2.11

  35. arXiv:2206.09357  [pdf, other

    cs.SE

    Automatic Map Generation for Autonomous Driving System Testing

    Authors: Yun Tang, Yuan Zhou, Kairui Yang, Ziyuan Zhong, Baishakhi Ray, Yang Liu, ** Zhang, Junbo Chen

    Abstract: High-definition (HD) maps are essential in testing autonomous driving systems (ADSs). HD maps essentially determine the potential diversity of the testing scenarios. However, the current HD maps suffer from two main limitations: lack of junction diversity in the publicly available HD maps and cost-consuming to build a new HD map. Hence, in this paper, we propose, FEAT2MAP, to automatically generat… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 7 pages, 7 figures

  36. arXiv:2206.07585  [pdf, other

    cs.PL cs.AI cs.LG cs.SE

    NatGen: Generative pre-training by "Naturalizing" source code

    Authors: Saikat Chakraborty, Toufique Ahmed, Yangruibo Ding, Premkumar Devanbu, Baishakhi Ray

    Abstract: Pre-trained Generative Language models (e.g. PLBART, CodeT5, SPT-Code) for source code yielded strong results on several tasks in the past few years, including code generation and translation. These models have adopted varying pre-training objectives to learn statistics of code construction from very large-scale corpora in a self-supervised fashion; the success of pre-trained models largely hinges… ▽ More

    Submitted 5 July, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: Accepted to be published in ESEC/FSE 2022

  37. arXiv:2205.11116  [pdf, other

    cs.CL cs.PL

    Summarize and Generate to Back-translate: Unsupervised Translation of Programming Languages

    Authors: Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

    Abstract: Back-translation is widely known for its effectiveness in neural machine translation when there is little to no parallel data. In this approach, a source-to-target model is coupled with a target-to-source model trained in parallel. The target-to-source model generates noisy sources, while the source-to-target model is trained to reconstruct the targets and vice versa. Recent developments of multil… ▽ More

    Submitted 11 February, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted to EACL 2023 (Main)

  38. arXiv:2203.13612  [pdf, other

    cs.LG cs.AI cs.CV cs.SE

    Repairing Group-Level Errors for DNNs Using Weighted Regularization

    Authors: Ziyuan Zhong, Yuchi Tian, Conor J. Sweeney, Vicente Ordonez, Baishakhi Ray

    Abstract: Deep Neural Networks (DNNs) have been widely used in software making decisions impacting people's lives. However, they have been found to exhibit severe erroneous behaviors that may lead to unfortunate outcomes. Previous work shows that such misbehaviors often occur due to class property violations rather than errors on a single image. Although methods for detecting such errors have been proposed,… ▽ More

    Submitted 4 April, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

  39. arXiv:2201.08413  [pdf, other

    cs.LG cs.AI cs.AR cs.DC cs.PF

    Unicorn: Reasoning about Configurable System Performance through the lens of Causality

    Authors: Md Shahriar Iqbal, Rahul Krishna, Mohammad Ali Javidian, Baishakhi Ray, Pooyan Jamshidi

    Abstract: Modern computer systems are highly configurable, with the total variability space sometimes larger than the number of atoms in the universe. Understanding and reasoning about the performance behavior of highly configurable systems, over a vast and variable space, is challenging. State-of-the-art methods for performance modeling and analyses rely on predictive machine learning models, therefore, th… ▽ More

    Submitted 17 March, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: EuroSys 2022 (camera-ready)

  40. arXiv:2201.03758  [pdf, other

    cs.SE

    Predictive Synthesis of API-Centric Code

    Authors: Daye Nam, Baishakhi Ray, Seohyun Kim, Xianshan Qu, Satish Chandra

    Abstract: Today's programmers, especially data science practitioners, make heavy use of data-processing libraries (APIs) such as PyTorch, Tensorflow, NumPy, Pandas, and the like. Program synthesizers can provide significant coding assistance to this community of users; however program synthesis also can be slow due to enormous search spaces. In this work, we examine ways in which machine learning can be use… ▽ More

    Submitted 17 May, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

  41. arXiv:2112.10893  [pdf, other

    cs.SE cs.LG

    VELVET: a noVel Ensemble Learning approach to automatically locate VulnErable sTatements

    Authors: Yangruibo Ding, Sahil Suneja, Yunhui Zheng, Jim Laredo, Alessandro Morari, Gail Kaiser, Baishakhi Ray

    Abstract: Automatically locating vulnerable statements in source code is crucial to assure software security and alleviate developers' debugging efforts. This becomes even more important in today's software ecosystem, where vulnerable code can flow easily and unwittingly within and across software repositories like GitHub. Across such millions of lines of code, traditional static and dynamic approaches stru… ▽ More

    Submitted 12 January, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: Camera Ready for Research Track of 29th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER 2022)

  42. arXiv:2112.00964  [pdf, other

    cs.SE cs.AI cs.LG cs.RO

    A Survey on Scenario-Based Testing for Automated Driving Systems in High-Fidelity Simulation

    Authors: Ziyuan Zhong, Yun Tang, Yuan Zhou, Vania de Oliveira Neves, Yang Liu, Baishakhi Ray

    Abstract: Automated Driving Systems (ADSs) have seen rapid progress in recent years. To ensure the safety and reliability of these systems, extensive testings are being conducted before their future mass deployment. Testing the system on the road is the closest to real-world and desirable approach, but it is incredibly costly. Also, it is infeasible to cover rare corner cases using such real-world testing.… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  43. arXiv:2110.03868  [pdf, other

    cs.PL cs.AI cs.LG cs.SE

    Towards Learning (Dis)-Similarity of Source Code from Program Contrasts

    Authors: Yangruibo Ding, Luca Buratti, Saurabh Pujar, Alessandro Morari, Baishakhi Ray, Saikat Chakraborty

    Abstract: Understanding the functional (dis)-similarity of source code is significant for code modeling tasks such as software vulnerability and code clone detection. We present DISCO(DIS-similarity of COde), a novel self-supervised model focusing on identifying (dis)similar functionalities of source code. Different from existing works, our approach does not require a huge amount of randomly collected datas… ▽ More

    Submitted 20 March, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: ACL 2022 Camera-Ready

  44. arXiv:2109.06404  [pdf, other

    cs.RO cs.AI cs.LG cs.SE

    Detecting Multi-Sensor Fusion Errors in Advanced Driver-Assistance Systems

    Authors: Ziyuan Zhong, Zhisheng Hu, Shengjian Guo, Xinyang Zhang, Zhenyu Zhong, Baishakhi Ray

    Abstract: Advanced Driver-Assistance Systems (ADAS) have been thriving and widely deployed in recent years. In general, these systems receive sensor data, compute driving decisions, and output control signals to the vehicles. To smooth out the uncertainties brought by sensor outputs, they usually leverage multi-sensor fusion (MSF) to fuse the sensor outputs and produce a more reliable understanding of the s… ▽ More

    Submitted 25 May, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

  45. arXiv:2109.06126  [pdf, other

    cs.SE cs.LG cs.NE cs.RO

    Neural Network Guided Evolutionary Fuzzing for Finding Traffic Violations of Autonomous Vehicles

    Authors: Ziyuan Zhong, Gail Kaiser, Baishakhi Ray

    Abstract: Self-driving cars and trucks, autonomous vehicles (AVs), should not be accepted by regulatory bodies and the public until they have much higher confidence in their safety and reliability -- which can most practically and convincingly be achieved by testing. But existing testing methods are inadequate for checking the end-to-end behaviors of AV controllers against complex, real-world corner cases i… ▽ More

    Submitted 21 July, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

  46. arXiv:2108.11601  [pdf, other

    cs.SE cs.CL

    Retrieval Augmented Code Generation and Summarization

    Authors: Md Rizwan Parvez, Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

    Abstract: Software developers write a lot of source code and documentation during software development. Intrinsically, developers often recall parts of source code or code summaries that they had written in the past while implementing software or documenting them. To mimic developers' code or summary generation behavior, we propose a retrieval augmented framework, REDCODER, that retrieves relevant code or s… ▽ More

    Submitted 10 September, 2021; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: accepted in EMNLP-Findings 2021

  47. arXiv:2108.06645  [pdf, other

    cs.SE cs.LG cs.PL

    On Multi-Modal Learning of Editing Source Code

    Authors: Saikat Chakraborty, Baishakhi Ray

    Abstract: In recent years, Neural Machine Translator (NMT) has shown promise in automatically editing source code. Typical NMT based code editor only considers the code that needs to be changed as input and suggests developers with a ranked list of patched code to choose from - where the correct one may not always be at the top of the list. While NMT based code editing systems generate a broad spectrum of p… ▽ More

    Submitted 14 August, 2021; originally announced August 2021.

    Comments: Accepted for publication in 36th IEEE/ACM conference on Automated Software Engineering (ASE-2021)

  48. arXiv:2103.06333  [pdf, other

    cs.CL cs.PL

    Unified Pre-training for Program Understanding and Generation

    Authors: Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

    Abstract: Code summarization and generation empower conversion between programming language (PL) and natural language (NL), while code translation avails the migration of legacy code from one PL to another. This paper introduces PLBART, a sequence-to-sequence model capable of performing a broad spectrum of program and language understanding and generation tasks. PLBART is pre-trained on an extensive collect… ▽ More

    Submitted 10 April, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: NAACL 2021 (camera ready)

  49. arXiv:2012.08680  [pdf, other

    cs.CR cs.LG cs.SE

    Trex: Learning Execution Semantics from Micro-Traces for Binary Similarity

    Authors: Kexin Pei, Zhou Xuan, Junfeng Yang, Suman Jana, Baishakhi Ray

    Abstract: Detecting semantically similar functions -- a crucial analysis capability with broad real-world security usages including vulnerability detection, malware lineage, and forensics -- requires understanding function behaviors and intentions. This task is challenging as semantically similar functions can be implemented differently, run on different architectures, and compiled with diverse compiler opt… ▽ More

    Submitted 26 April, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

  50. arXiv:2010.06080  [pdf, other

    stat.ML cs.LG q-bio.QM

    Point Process Modeling of Drug Overdoses with Heterogeneous and Missing Data

    Authors: Xueying Liu, Jeremy Carter, Brad Ray, George Mohler

    Abstract: Opioid overdose rates have increased in the United States over the past decade and reflect a major public health crisis. Modeling and prediction of drug and opioid hotspots, where a high percentage of events fall in a small percentage of space-time, could help better focus limited social and health services. In this work we present a spatial-temporal point process model for drug overdose clusterin… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.