Skip to main content

Showing 1–28 of 28 results for author: Rajan, H

.
  1. Data-Driven Evidence-Based Syntactic Sugar Design

    Authors: David OBrien, Robert Dyer, Tien N. Nguyen, Hridesh Rajan

    Abstract: Programming languages are essential tools for developers, and their evolution plays a crucial role in supporting the activities of developers. One instance of programming language evolution is the introduction of syntactic sugars, which are additional syntax elements that provide alternative, more readable code constructs. However, the process of designing and evolving a programming language has t… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 12 pages, 12 figures, to be published in ICSE'24

  2. arXiv:2401.14628  [pdf, other

    cs.SE cs.LG

    Inferring Data Preconditions from Deep Learning Models for Trustworthy Prediction in Deployment

    Authors: Shibbir Ahmed, Hongyang Gao, Hridesh Rajan

    Abstract: Deep learning models are trained with certain assumptions about the data during the development stage and then used for prediction in the deployment stage. It is important to reason about the trustworthiness of the model's predictions with unseen data during deployment. Existing methods for specifying and verifying traditional software are insufficient for this task, as they cannot handle the comp… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted for publication at the 46th International Conference on Software Engineering (ICSE 2024)

  3. arXiv:2309.05067  [pdf, other

    cs.SE cs.LG

    Mutation-based Fault Localization of Deep Neural Networks

    Authors: Ali Ghanbari, Deepak-George Thomas, Muhammad Arbab Arshad, Hridesh Rajan

    Abstract: Deep neural networks (DNNs) are susceptible to bugs, just like other types of software systems. A significant uptick in using DNN, and its applications in wide-ranging areas, including safety-critical systems, warrant extensive research on software engineering tools for improving the reliability of DNN-based systems. One such tool that has gained significant attention in the recent years is DNN fa… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)

  4. arXiv:2307.14465  [pdf, other

    cs.SE cs.LG cs.PL

    What Kinds of Contracts Do ML APIs Need?

    Authors: Samantha Syeda Khairunnesa, Shibbir Ahmed, Sayem Mohammad Imtiaz, Hridesh Rajan, Gary T. Leavens

    Abstract: Recent work has shown that Machine Learning (ML) programs are error-prone and called for contracts for ML code. Contracts, as in the design by contract methodology, help document APIs and aid API users in writing correct code. The question is: what kinds of contracts would provide the most help to API users? We are especially interested in what kinds of contracts help API users catch errors at ear… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Accepted for publication at EMSE (Empirical Software Engineering) Journal, 2023

  5. arXiv:2307.08947  [pdf, other

    cs.SE

    An Effective Data-Driven Approach for Localizing Deep Learning Faults

    Authors: Mohammad Wardat, Breno Dantas Cruz, Wei Le, Hridesh Rajan

    Abstract: Deep Learning (DL) applications are being used to solve problems in critical domains (e.g., autonomous driving or medical diagnosis systems). Thus, developers need to debug their systems to ensure that the expected behavior is delivered. However, it is hard and expensive to debug DNNs. When the failure symptoms or unsatisfied accuracies are reported after training, we lose the traceability as to w… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  6. Fix Fairness, Don't Ruin Accuracy: Performance Aware Fairness Repair using AutoML

    Authors: Giang Nguyen, Sumon Biswas, Hridesh Rajan

    Abstract: Machine learning (ML) is increasingly being used in critical decision-making software, but incidents have raised questions about the fairness of ML predictions. To address this issue, new tools and methods are needed to mitigate bias in ML-based software. Previous studies have proposed bias mitigation algorithms that only work in specific situations and often result in a loss of accuracy. Our prop… ▽ More

    Submitted 28 August, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: In Proceedings of The 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023)

  7. arXiv:2306.08632  [pdf, other

    cs.SE

    Characterizing Bugs in Python and R Data Analytics Programs

    Authors: Shibbir Ahmed, Mohammad Wardat, Hamid Bagheri, Breno Dantas Cruz, Hridesh Rajan

    Abstract: R and Python are among the most popular languages used in many critical data analytics tasks. However, we still do not fully understand the capabilities of these two languages w.r.t. bugs encountered in data analytics tasks. What type of bugs are common? What are the main root causes? What is the relation between bugs and root causes? How to mitigate these bugs? We present a comprehensive study of… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  8. arXiv:2212.06140  [pdf, other

    cs.LG cs.AI

    Fairify: Fairness Verification of Neural Networks

    Authors: Sumon Biswas, Hridesh Rajan

    Abstract: Fairness of machine learning (ML) software has become a major concern in the recent past. Although recent research on testing and improving fairness have demonstrated impact on real-world software, providing fairness guarantee in practice is still lacking. Certification of ML models is challenging because of the complex decision-making process of the models. In this paper, we proposed Fairify, an… ▽ More

    Submitted 13 December, 2022; v1 submitted 8 December, 2022; originally announced December 2022.

    Journal ref: ICSE 2023: The 45th International Conference on Software Engineering, Melbourne, Australia

  9. arXiv:2212.05970  [pdf, other

    cs.SE cs.CL cs.LG

    Decomposing a Recurrent Neural Network into Modules for Enabling Reusability and Replacement

    Authors: Sayem Mohammad Imtiaz, Fraol Batole, Astha Singh, Rangeet Pan, Breno Dantas Cruz, Hridesh Rajan

    Abstract: Can we take a recurrent neural network (RNN) trained to translate between languages and augment it to support a new natural language without retraining the model from scratch? Can we fix the faulty behavior of the RNN by replacing portions associated with the faulty behavior? Recent works on decomposing a fully connected neural network (FCNN) and convolutional neural network (CNN) into modules hav… ▽ More

    Submitted 9 February, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: Accepted at 45th international conference on software engineering (ICSE'2023)

  10. Towards Understanding Fairness and its Composition in Ensemble Machine Learning

    Authors: Usman Gohar, Sumon Biswas, Hridesh Rajan

    Abstract: Machine Learning (ML) software has been widely adopted in modern society, with reported fairness implications for minority groups based on race, sex, age, etc. Many recent works have proposed methods to measure and mitigate algorithmic bias in ML models. The existing approaches focus on single classifier-based ML models. However, real-world ML models are often composed of multiple independent or d… ▽ More

    Submitted 25 April, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: Accepted at ICSE 2023

    Journal ref: ICSE 2023: The 45th International Conference on Software Engineering, Melbourne, Australia

  11. arXiv:2212.00105  [pdf, other

    cs.SE

    An Empirical Study on the Bugs Found while Reusing Pre-trained Natural Language Processing Models

    Authors: Rangeet Pan, Sumon Biswas, Mohna Chakraborty, Breno Dantas Cruz, Hridesh Rajan

    Abstract: In NLP, reusing pre-trained models instead of training from scratch has gained popularity; however, NLP models are mostly black boxes, very large, and often require significant resources. To ease, models trained with large corpora are made available, and developers reuse them for different problems. In contrast, developers mostly build their models from scratch for traditional DL-related problems.… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: 12 pages, 12 figures

    MSC Class: 68T50 ACM Class: D.6; D.2.5; D.2.13

  12. arXiv:2207.07817  [pdf, ps, other

    cs.DC

    On Scheduling Ring-All-Reduce Learning Jobs in Multi-Tenant GPU Clusters with Communication Contention

    Authors: Menglu Yu, Bo Ji, Hridesh Rajan, Jia Liu

    Abstract: Powered by advances in deep learning (DL) techniques, machine learning and artificial intelligence have achieved astonishing successes. However, the rapidly growing needs for DL also led to communication- and resource-intensive distributed training jobs for large-scale DL training, which are typically deployed over GPU clusters. To sustain the ever-increasing demand for DL training, the so-called… ▽ More

    Submitted 14 August, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted in Proc. ACM Mobihoc, July 2022

  13. arXiv:2202.01158  [pdf, other

    cs.DC

    GADGET: Online Resource Optimization for Scheduling Ring-All-Reduce Learning Jobs

    Authors: Menglu Yu, Ye Tian, Bo Ji, Chuan Wu, Hridesh Rajan, Jia Liu

    Abstract: Fueled by advances in distributed deep learning (DDL), recent years have witnessed a rapidly growing demand for resource-intensive distributed/parallel computing to process DDL computing jobs. To resolve network communication bottleneck and load balancing issues in distributed computing, the so-called ``ring-all-reduce'' decentralized architecture has been increasingly adopted to remove the need f… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

    Comments: Accepted in Proc. IEEE INFOCOM, Virtual Event, May 2022

  14. arXiv:2112.04036  [pdf, other

    cs.SE cs.LG

    DeepDiagnosis: Automatically Diagnosing Faults and Recommending Actionable Fixes in Deep Learning Programs

    Authors: Mohammad Wardat, Breno Dantas Cruz, Wei Le, Hridesh Rajan

    Abstract: Deep Neural Networks (DNNs) are used in a wide variety of applications. However, as in any software application, DNN-based apps are afflicted with bugs. Previous work observed that DNN bug fix patterns are different from traditional bug fix patterns. Furthermore, those buggy models are non-trivial to diagnose and fix due to inexplicit errors with several options to fix them. To support developers… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted at ICSE 2022

  15. Manas: Mining Software Repositories to Assist AutoML

    Authors: Giang Nguyen, Md Johir Islam, Rangeet Pan, Hridesh Rajan

    Abstract: Today deep learning is widely used for building software. A software engineering problem with deep learning is that finding an appropriate convolutional neural network (CNN) model for the task can be a challenge for developers. Recent work on AutoML, more precisely neural architecture search (NAS), embodied by tools like Auto-Keras aims to solve this problem by essentially viewing it as a search p… ▽ More

    Submitted 13 February, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  16. The Art and Practice of Data Science Pipelines: A Comprehensive Study of Data Science Pipelines In Theory, In-The-Small, and In-The-Large

    Authors: Sumon Biswas, Mohammad Wardat, Hridesh Rajan

    Abstract: Increasingly larger number of software systems today are including data science components for descriptive, predictive, and prescriptive analytics. The collection of data science stages from acquisition, to cleaning/curation, to modeling, and so on are referred to as data science pipelines. To facilitate research and practice on data science pipelines, it is essential to understand their nature. W… ▽ More

    Submitted 14 February, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    ACM Class: D.2.0

    Journal ref: ICSE 2022: The 44th International Conference on Software Engineering

  17. arXiv:2110.07720  [pdf, other

    cs.CV

    Decomposing Convolutional Neural Networks into Reusable and Replaceable Modules

    Authors: Rangeet Pan, Hridesh Rajan

    Abstract: Training from scratch is the most common way to build a Convolutional Neural Network (CNN) based model. What if we can build new CNN models by reusing parts from previously build CNN models? What if we can improve a CNN model by replacing (possibly faulty) parts with other parts? In both cases, instead of training, can we identify the part responsible for each output class (module) in the model(s)… ▽ More

    Submitted 20 December, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: Accepted at ICSE'22

  18. arXiv:2110.05645  [pdf, other

    cs.LG math.OC stat.ML

    A global convergence theory for deep ReLU implicit networks via over-parameterization

    Authors: Tianxiang Gao, Hailiang Liu, Jia Liu, Hridesh Rajan, Hongyang Gao

    Abstract: Implicit deep learning has received increasing attention recently due to the fact that it generalizes the recursive prediction rules of many commonly used neural network architectures. Its prediction rule is provided implicitly based on the solution of an equilibrium equation. Although a line of recent empirical studies has demonstrated its superior performances, the theoretical understanding of i… ▽ More

    Submitted 18 February, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: Accepted by ICLR 2022

  19. Fair Preprocessing: Towards Understanding Compositional Fairness of Data Transformers in Machine Learning Pipeline

    Authors: Sumon Biswas, Hridesh Rajan

    Abstract: In recent years, many incidents have been reported where machine learning models exhibited discrimination among people based on race, sex, age, etc. Research has been conducted to measure and mitigate unfairness in machine learning models. For a machine learning task, it is a common practice to build a pipeline that includes an ordered set of data preprocessing stages followed by a classifier. How… ▽ More

    Submitted 19 July, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: ESEC/FSE'2021: The 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, August 23-28, 2021

    ACM Class: D.2.0; I.2.5

    Journal ref: Proceedings of the ESEC/FSE 2021

  20. arXiv:2103.03376  [pdf, other

    cs.SE

    DeepLocalize: Fault Localization for Deep Neural Networks

    Authors: Mohammad Wardat, Wei Le, Hridesh Rajan

    Abstract: Deep neural networks (DNNs) are becoming an integral part of most software systems. Previous work has shown that DNNs have bugs. Unfortunately, existing debugging techniques do not support localizing DNN bugs because of the lack of understanding of model behaviors. The entire DNN model appears as a black box. To address these problems, we propose an approach that automatically determines whether t… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: Accepted at ICSE 2021

  21. arXiv:2005.12379  [pdf, other

    cs.LG cs.SE stat.ML

    Do the Machine Learning Models on a Crowd Sourced Platform Exhibit Bias? An Empirical Study on Model Fairness

    Authors: Sumon Biswas, Hridesh Rajan

    Abstract: Machine learning models are increasingly being used in important decision-making software such as approving bank loans, recommending criminal sentencing, hiring employees, and so on. It is important to ensure the fairness of these models so that no discrimination is made based on protected attribute (e.g., race, sex, age) while decision making. Algorithms have been developed to measure unfairness… ▽ More

    Submitted 22 September, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: To be appeared in ESEC/FSE 2020

    Journal ref: ESEC/FSE'2020: The 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Sacramento, California, United States, November 8-13, 2020

  22. BCFA: Bespoke Control Flow Analysis for CFA at Scale

    Authors: Ramanathan Ramu, Ganesha B Upadhyaya, Hoan Anh Nguyen, Hridesh Rajan

    Abstract: Many data-driven software engineering tasks such as discovering programming patterns, mining API specifications, etc., perform source code analysis over control flow graphs (CFGs) at scale. Analyzing millions of CFGs can be expensive and performance of the analysis heavily depends on the underlying CFG traversal strategy. State-of-the-art analysis frameworks use a fixed traversal strategy. We argu… ▽ More

    Submitted 3 May, 2020; originally announced May 2020.

    Comments: 12 pages

  23. arXiv:2005.00972  [pdf, other

    cs.SE

    Repairing Deep Neural Networks: Fix Patterns and Challenges

    Authors: Md Johirul Islam, Rangeet Pan, Giang Nguyen, Hridesh Rajan

    Abstract: Significant interest in applying Deep Neural Network (DNN) has fueled the need to support engineering of software that uses DNNs. Repairing software that uses DNNs is one such unmistakable SE need where automated tools could be beneficial; however, we do not fully understand challenges to repairing and patterns that are utilized when manually repairing DNNs. What challenges should automated repair… ▽ More

    Submitted 2 May, 2020; originally announced May 2020.

  24. arXiv:1906.11940  [pdf, other

    cs.SE

    What Do Developers Ask About ML Libraries? A Large-scale Study Using Stack Overflow

    Authors: Md Johirul Islam, Hoan Anh Nguyen, Rangeet Pan, Hridesh Rajan

    Abstract: Modern software systems are increasingly including machine learning (ML) as an integral component. However, we do not yet understand the difficulties faced by software developers when learning about ML libraries and using them within their systems. To that end, this work reports on a detailed (manual) examination of 3,243 highly-rated Q&A posts related to ten ML libraries, namely Tensorflow, Keras… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

  25. arXiv:1906.01388  [pdf, other

    cs.SE cs.LG

    A Comprehensive Study on Deep Learning Bug Characteristics

    Authors: Md Johirul Islam, Giang Nguyen, Rangeet Pan, Hridesh Rajan

    Abstract: Deep learning has gained substantial popularity in recent years. Developers mainly rely on libraries and tools to add deep learning capabilities to their software. What kinds of bugs are frequently found in such software? What are the root causes of such bugs? What impacts do such bugs have? Which stages of deep learning pipeline are more bug prone? Are there any antipatterns? Understanding such c… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

    Journal ref: The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) (Aug. 2019)

  26. arXiv:1905.13284  [pdf, other

    cs.LG cs.CR stat.ML

    Identifying Classes Susceptible to Adversarial Attacks

    Authors: Rangeet Pan, Md Johirul Islam, Shibbir Ahmed, Hridesh Rajan

    Abstract: Despite numerous attempts to defend deep learning based image classifiers, they remain susceptible to the adversarial attacks. This paper proposes a technique to identify susceptible classes, those classes that are more easily subverted. To identify the susceptible classes we use distance-based measures and apply them on a trained model. Based on the distance among original classes, we create mapp… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

  27. Inferring Concise Specifications of APIs

    Authors: John L. Singleton, Gary T. Leavens, Hridesh Rajan, David R. Cok

    Abstract: Modern software relies on libraries and uses them via application programming interfaces (APIs). Correct API usage as well as many software engineering tasks are enabled when APIs have formal specifications. In this work, we analyze the implementation of each method in an API to infer a formal postcondition. Conventional wisdom is that, if one has preconditions, then one can use the strongest post… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

  28. arXiv:1805.00105  [pdf, other

    cs.OH

    A Cyberinfrastructure for BigData Transportation Engineering

    Authors: Md Johirul Islam, Anuj Sharma, Hridesh Rajan

    Abstract: Big Data-driven transportation engineering has the potential to improve utilization of road infrastructure, decrease traffic fatalities, improve fuel consumption, decrease construction worker injuries, among others. Despite these benefits, research on Big Data-driven transportation engineering is difficult today due to the computational expertise required to get started. This work proposes BoaT, a… ▽ More

    Submitted 30 April, 2018; originally announced May 2018.