Skip to main content

Showing 1–5 of 5 results for author: Cruz, B D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.08947  [pdf, other

    cs.SE

    An Effective Data-Driven Approach for Localizing Deep Learning Faults

    Authors: Mohammad Wardat, Breno Dantas Cruz, Wei Le, Hridesh Rajan

    Abstract: Deep Learning (DL) applications are being used to solve problems in critical domains (e.g., autonomous driving or medical diagnosis systems). Thus, developers need to debug their systems to ensure that the expected behavior is delivered. However, it is hard and expensive to debug DNNs. When the failure symptoms or unsatisfied accuracies are reported after training, we lose the traceability as to w… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  2. arXiv:2306.08632  [pdf, other

    cs.SE

    Characterizing Bugs in Python and R Data Analytics Programs

    Authors: Shibbir Ahmed, Mohammad Wardat, Hamid Bagheri, Breno Dantas Cruz, Hridesh Rajan

    Abstract: R and Python are among the most popular languages used in many critical data analytics tasks. However, we still do not fully understand the capabilities of these two languages w.r.t. bugs encountered in data analytics tasks. What type of bugs are common? What are the main root causes? What is the relation between bugs and root causes? How to mitigate these bugs? We present a comprehensive study of… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  3. arXiv:2212.05970  [pdf, other

    cs.SE cs.CL cs.LG

    Decomposing a Recurrent Neural Network into Modules for Enabling Reusability and Replacement

    Authors: Sayem Mohammad Imtiaz, Fraol Batole, Astha Singh, Rangeet Pan, Breno Dantas Cruz, Hridesh Rajan

    Abstract: Can we take a recurrent neural network (RNN) trained to translate between languages and augment it to support a new natural language without retraining the model from scratch? Can we fix the faulty behavior of the RNN by replacing portions associated with the faulty behavior? Recent works on decomposing a fully connected neural network (FCNN) and convolutional neural network (CNN) into modules hav… ▽ More

    Submitted 9 February, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: Accepted at 45th international conference on software engineering (ICSE'2023)

  4. arXiv:2212.00105  [pdf, other

    cs.SE

    An Empirical Study on the Bugs Found while Reusing Pre-trained Natural Language Processing Models

    Authors: Rangeet Pan, Sumon Biswas, Mohna Chakraborty, Breno Dantas Cruz, Hridesh Rajan

    Abstract: In NLP, reusing pre-trained models instead of training from scratch has gained popularity; however, NLP models are mostly black boxes, very large, and often require significant resources. To ease, models trained with large corpora are made available, and developers reuse them for different problems. In contrast, developers mostly build their models from scratch for traditional DL-related problems.… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: 12 pages, 12 figures

    MSC Class: 68T50 ACM Class: D.6; D.2.5; D.2.13

  5. arXiv:2112.04036  [pdf, other

    cs.SE cs.LG

    DeepDiagnosis: Automatically Diagnosing Faults and Recommending Actionable Fixes in Deep Learning Programs

    Authors: Mohammad Wardat, Breno Dantas Cruz, Wei Le, Hridesh Rajan

    Abstract: Deep Neural Networks (DNNs) are used in a wide variety of applications. However, as in any software application, DNN-based apps are afflicted with bugs. Previous work observed that DNN bug fix patterns are different from traditional bug fix patterns. Furthermore, those buggy models are non-trivial to diagnose and fix due to inexplicit errors with several options to fix them. To support developers… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted at ICSE 2022