Showing 1–33 of 33 results for author: Hemmati, H

  1. arXiv:2402.11702  [pdf, other]

    cs.SE cs.AI cs.LG

    Can ChatGPT Support Developers? An Empirical Evaluation of Large Language Models for Code Generation

    Authors: Kailun Jin, Chung-Yu Wang, Hung Viet Pham, Hadi Hemmati

    Abstract: Large language models (LLMs) have demonstrated notable proficiency in code generation, with numerous prior studies showing their promising capabilities in various development scenarios. However, these studies mainly provide evaluations in research settings, which leaves a significant gap in understanding how effectively LLMs can support developers in the real world. To address this, we conducted an em…

    Submitted 16 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 4 pages, 3 figures, 21st International Conference on Mining Software Repositories (MSR '24), April 15-16, 2024, Lisbon, Portugal

    ACM Class: I.2.2

  2. arXiv:2401.16633  [pdf]

    cs.SE cs.AI

    I came, I saw, I certified: some perspectives on the safety assurance of cyber-physical systems

    Authors: Mithila Sivakumar, Alvine B. Belle, Kimya Khakzad Shahandashti, Oluwafemi Odu, Hadi Hemmati, Segla Kpodjedo, Song Wang, Opeyemi O. Adesina

    Abstract: The execution failure of cyber-physical systems (e.g., autonomous driving systems, unmanned aerial systems, and robotic systems) could result in the loss of life, severe injuries, large-scale environmental damage, property destruction, and major economic loss. Hence, such systems usually require a strong justification that they will effectively support critical requirements (e.g., safety, security…

    Submitted 29 January, 2024; originally announced January 2024.

  3. arXiv:2310.20492  [pdf, other]

    cs.LG cs.SE

    Log-based Anomaly Detection of Enterprise Software: An Empirical Study

    Authors: Nadun Wijesinghe, Hadi Hemmati

    Abstract: Most enterprise applications use logging as a mechanism to diagnose anomalies, which could help with reducing system downtime. Anomaly detection using software execution logs has been explored in several prior studies, using both classical and deep neural network-based machine learning models. In recent years, the research has largely focused on using variations of sequence-based deep neural netwo…

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 12 pages, 14 figures. Submitted to QRS 2023 - 23rd IEEE International Conference on Software Quality, Reliability and Security

    ACM Class: I.5.2; I.5.1; I.5.4; I.2.7; I.2.6; D.2.5

  4. arXiv:2310.10508  [pdf, other]

    cs.SE

    Prompt Engineering or Fine Tuning: An Empirical Assessment of Large Language Models in Automated Software Engineering Tasks

    Authors: Jiho Shin, Clark Tang, Tahmineh Mohati, Maleknaz Nayebi, Song Wang, Hadi Hemmati

    Abstract: In this paper, we investigate the effectiveness of a state-of-the-art LLM, i.e., GPT-4, with three different prompt engineering techniques (i.e., basic prompting, in-context learning, and task-specific prompting) against 18 fine-tuned LLMs on three typical ASE tasks, i.e., code generation, code summarization, and code translation. Our quantitative analysis of these prompting strategies suggests t…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 19 pages + reference

  5. arXiv:2310.07856  [pdf, ps, other]

    cs.CL cs.SE

    Assessing Evaluation Metrics for Neural Test Oracle Generation

    Authors: Jiho Shin, Hadi Hemmati, Moshi Wei, Song Wang

    Abstract: In this work, we revisit existing oracle generation studies plus ChatGPT to empirically investigate the current standing of their performance in both NLG-based and test adequacy metrics. Specifically, we train and run four state-of-the-art test oracle generation models on five NLG-based and two test adequacy metrics for our analysis. We apply two different correlation analyses between these two di…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 10 pages + reference

  6. arXiv:2309.14615  [pdf, other]

    cs.LG cs.CE q-fin.TR

    Gray-box Adversarial Attack of Deep Reinforcement Learning-based Trading Agents

    Authors: Foozhan Ataiefard, Hadi Hemmati

    Abstract: In recent years, deep reinforcement learning (Deep RL) has been successfully implemented as a smart agent in many systems such as complex games, self-driving cars, and chat-bots. One of the interesting use cases of Deep RL is its application as an automated stock trading agent. In general, any automated trading agent is prone to manipulations by adversaries in the trading environment. Thus studyin…

    Submitted 25 September, 2023; originally announced September 2023.

  7. Method-Level Bug Severity Prediction using Source Code Metrics and LLMs

    Authors: Ehsan Mashhadi, Hossein Ahmadvand, Hadi Hemmati

    Abstract: In the past couple of decades, significant research efforts have been devoted to the prediction of software bugs. However, most existing work in this domain treats all bugs the same, which is not the case in practice. It is important for a defect prediction method to estimate the severity of the identified bugs so that the higher-severity ones get immediate attention. In this study, we investigate sourc…

    Submitted 6 September, 2023; originally announced September 2023.

  8. arXiv:2308.08033  [pdf, other]

    cs.SE cs.AI

    Domain Adaptation for Deep Unit Test Case Generation

    Authors: Jiho Shin, Sepehr Hashtroudi, Hadi Hemmati, Song Wang

    Abstract: Recently, deep learning-based test case generation approaches have been proposed to automate the generation of unit test cases. In this study, we leverage Transformer-based code models to generate unit tests with the help of Domain Adaptation (DA) at a project level. Specifically, we use CodeT5, which is a relatively small language model trained on source code data, and fine-tune it on the test ge…

    Submitted 19 January, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: 10 pages + reference

  9. arXiv:2307.11298  [pdf, ps, other]

    cs.SE

    A First Look at Fairness of Machine Learning Based Code Reviewer Recommendation

    Authors: Mohammad Mahdi Mohajer, Alvine Boaye Belle, Nima Shiri Harzevili, Junjie Wang, Hadi Hemmati, Song Wang, Zhen Ming Jiang

    Abstract: The fairness of machine learning (ML) approaches is critical to the reliability of modern artificial intelligence systems. Despite extensive study on this topic, the fairness of ML models in the software engineering (SE) domain has not been well explored yet. As a result, many ML-powered software systems, particularly those utilized in the software engineering community, continue to be prone to fa…

    Submitted 20 July, 2023; originally announced July 2023.

  10. arXiv:2307.00012  [pdf, other]

    cs.SE cs.AI cs.LG

    FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categories and Test Code Repair

    Authors: Sakina Fatima, Hadi Hemmati, Lionel Briand

    Abstract: Flaky tests are problematic because they non-deterministically pass or fail for the same software version under test, causing confusion and wasting development effort. While machine learning models have been used to predict flakiness and its root causes, there is much less work on providing support to fix the problem. To address this gap, in this paper, we focus on predicting the type of fix that…

    Submitted 19 May, 2024; v1 submitted 21 June, 2023; originally announced July 2023.

    Comments: 25 pages, 19 Figures

  11. arXiv:2302.06065  [pdf, other]

    cs.SE

    A Systematic Literature Review of Explainable AI for Software Engineering

    Authors: Ahmad Haji Mohammadkhani, Nitin Sai Bommi, Mariem Daboussi, Onkar Sabnis, Chakkrit Tantithamthavorn, Hadi Hemmati

    Abstract: Context: In recent years, leveraging machine learning (ML) techniques has become one of the main solutions to tackle many software engineering (SE) tasks in research studies (ML4SE). This has been achieved by utilizing state-of-the-art models that tend to be more complex and black-box, which has led to less explainable solutions that reduce trust and uptake of ML4SE solutions by professionals in t…

    Submitted 12 February, 2023; originally announced February 2023.

  12. arXiv:2212.11414  [pdf, other]

    cs.SE cs.AI cs.LG

    Improving Automated Program Repair with Domain Adaptation

    Authors: Armin Zirak, Hadi Hemmati

    Abstract: Automated Program Repair (APR) is defined as the process of fixing a bug/defect in the source code by an automated tool. APR tools have recently achieved promising results by leveraging state-of-the-art Natural Language Processing (NLP) techniques. APR tools such as TFix and CodeXGLUE, which combine text-to-text transformers with software-specific techniques, currently outperform alternatives.…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: 43 pages

  13. arXiv:2211.14391  [pdf, other]

    cs.LG

    MDA: Availability-Aware Federated Learning Client Selection

    Authors: Amin Eslami Abyane, Steve Drew, Hadi Hemmati

    Abstract: Recently, a new distributed learning scheme called Federated Learning (FL) has been introduced. FL is designed so that the server never collects user-owned data, which makes it well suited to preserving privacy. FL's process starts with the server sending a model to clients; the clients then train that model using their data and send the updated model back to the server. Afterward, the server aggregates all t…

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: Submitted to IEEE Transactions on Reliability

  14. arXiv:2211.12821  [pdf, other]

    cs.SE

    Explainable AI for Pre-Trained Code Models: What Do They Learn? When They Do Not Work?

    Authors: Ahmad Haji Mohammadkhani, Chakkrit Tantithamthavorn, Hadi Hemmati

    Abstract: In recent years, there has been a wide interest in designing deep neural network-based models that automate downstream software engineering tasks on source code, such as code document generation, code search, and program repair. Although the main objective of these studies is to improve the effectiveness of the downstream task, many studies only attempt to employ the next best neural network model…

    Submitted 28 August, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: 10 pages, 7 figures, Accepted at SCAM 2023

  15. arXiv:2209.08625  [pdf, ps, other]

    cs.LG cs.PF

    Improving the Performance of DNN-based Software Services using Automated Layer Caching

    Authors: Mohammadamin Abedi, Yanni Iouannou, Pooyan Jamshidi, Hadi Hemmati

    Abstract: Deep Neural Networks (DNNs) have become an essential component in many application domains including web-based services. A variety of these services require high throughput and (close to) real-time features, for instance, to respond or react to users' requests or to process a stream of incoming data on time. However, the trend in DNN design is toward larger models with many layers and parameters t…

    Submitted 18 September, 2022; originally announced September 2022.

  16. arXiv:2206.15428  [pdf]

    cs.SE cs.AI

    Test2Vec: An Execution Trace Embedding for Test Case Prioritization

    Authors: Emad Jabbar, Soheila Zangeneh, Hadi Hemmati, Robert Feldt

    Abstract: Most automated software testing tasks can benefit from the abstract representation of test cases. Traditionally, this is done by encoding test cases based on their code coverage. Specification-level criteria can replace code coverage to better represent test cases' behavior, but they are often not cost-effective. In this paper, we hypothesize that execution traces of the test cases can be a good a…

    Submitted 28 June, 2022; originally announced June 2022.

  17. arXiv:2206.12927  [pdf, other]

    cs.SE

    An Empirical Study on Bug Severity Estimation Using Source Code Metrics and Static Analysis

    Authors: Ehsan Mashhadi, Shaiful Chowdhury, Somayeh Modaberi, Hadi Hemmati, Gias Uddin

    Abstract: In the past couple of decades, significant research efforts have been devoted to the prediction of software bugs (i.e., defects). These works leverage a diverse set of metrics, tools, and techniques to predict which classes, methods, lines, or commits are buggy. However, most existing work in this domain treats all bugs the same, which is not the case in practice. The more severe the bugs the higher the…

    Submitted 26 June, 2022; originally announced June 2022.

  18. arXiv:2201.01409  [pdf, other]

    cs.LG

    Towards Understanding Quality Challenges of the Federated Learning for Neural Networks: A First Look from the Lens of Robustness

    Authors: Amin Eslami Abyane, Derui Zhu, Roberto Souza, Lei Ma, Hadi Hemmati

    Abstract: Federated learning (FL) is a distributed learning paradigm that preserves users' data privacy while leveraging the entire dataset of all participants. In FL, multiple models are trained independently on the clients and aggregated centrally to update a global model in an iterative process. Although this approach is excellent at preserving privacy, FL still suffers from quality issues such as attack…

    Submitted 9 January, 2023; v1 submitted 4 January, 2022; originally announced January 2022.

    Comments: Accepted by the EMSE journal

  19. arXiv:2109.09869  [pdf, other]

    cs.LG cs.SE

    Robustness Analysis of Deep Learning Frameworks on Mobile Platforms

    Authors: Amin Eslami Abyane, Hadi Hemmati

    Abstract: With the recent increase in the computational power of modern mobile devices, machine learning-based heavy tasks such as face detection and speech recognition are now integral parts of such devices. This requires frameworks to execute machine learning models (e.g., Deep Neural Networks) on mobile devices. Although there exist studies on the accuracy and performance of these frameworks, the quality…

    Submitted 20 September, 2021; originally announced September 2021.

  20. arXiv:2103.11626  [pdf, ps, other]

    cs.SE

    Applying CodeBERT for Automated Program Repair of Java Simple Bugs

    Authors: Ehsan Mashhadi, Hadi Hemmati

    Abstract: Software debugging and program repair are among the most time-consuming and labor-intensive tasks in software engineering that would benefit a lot from automation. In this paper, we propose a novel automated program repair approach based on CodeBERT, which is a transformer-based neural architecture pre-trained on a large corpus of source code. We fine-tune our model on the ManySStuBs4J small and la…

    Submitted 30 March, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: Added the repository of source code

  21. arXiv:2102.05953  [pdf]

    physics.med-ph

    Dosimetric characterization of a new 192Ir pulse dose rate brachytherapy source with the Monte Carlo simulation and thermoluminescent dosimeter

    Authors: Vahid Lohrabian, Alireza Kamali-Asl, Hossein Arabi, Hamidreza Hemmati, Majid Pournezam Esfahani

    Abstract: In this study, recommendations of the AAPM TG-43 (U1) report have been followed to characterize the new 192Ir pulse dose rate source, provided by the Applied Radiation Research School, Nuclear Science and Technology Research Institute in Iran. Dose rate constant, radial dose function, geometry factors, and anisotropy function were calculated according to the relevant American Association of Physi…

    Submitted 11 February, 2021; originally announced February 2021.

  22. arXiv:2101.07910  [pdf, other]

    cs.SE

    A Search-Based Testing Framework for Deep Neural Networks of Source Code Embedding

    Authors: Maryam Vahdat Pour, Zhuo Li, Lei Ma, Hadi Hemmati

    Abstract: Over the past few years, deep neural networks (DNNs) have been continuously expanding their real-world applications for source code processing tasks across the software engineering domain, e.g., clone detection, code search, comment generation. Although quite a few recent works have been performed on testing of DNNs in the context of image and speech processing, limited progress has been achieved…

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: ICST 2021

  23. arXiv:2101.05862  [pdf, other]

    cs.SE

    GloBug: Using Global Data in Fault Localization

    Authors: Nima Miryeganeh, Sepehr Hashtroudi, Hadi Hemmati

    Abstract: Fault Localization (FL) is an important first step in software debugging and is mostly manual in the current practice. Many methods have been proposed over the years to automate the FL process, including information retrieval (IR)-based techniques. These methods localize the fault based on the similarity between the bug report and the source code. Newer variations of IR-based FL (IRFL) techniques…

    Submitted 14 January, 2021; originally announced January 2021.

  24. arXiv:2101.05738  [pdf, other]

    cs.SE

    A Pragmatic Approach for Hyper-Parameter Tuning in Search-based Test Case Generation

    Authors: Shayan Zamani, Hadi Hemmati

    Abstract: Search-based test case generation, which is the application of meta-heuristic search for generating test cases, has been widely studied in the recent literature. Since, in theory, the performance of meta-heuristic search methods is highly dependent on their hyper-parameters, there is a need to study hyper-parameter tuning in this domain. In this paper, we propose a new metric ("Tuning Gain"), whic…

    Submitted 14 January, 2021; originally announced January 2021.

  25. arXiv:2101.04948  [pdf, other]

    cs.LG cs.SE

    Deep State Inference: Toward Behavioral Model Inference of Black-box Software Systems

    Authors: Foozhan Ataiefard, Mohammad Jafar Mashhadi, Hadi Hemmati, Niel Walkinshaw

    Abstract: Many software engineering tasks, such as testing and anomaly detection, can benefit from the ability to infer a behavioral model of the software. Most existing inference approaches assume access to code to collect execution sequences. In this paper, we investigate a black-box scenario, where the system under analysis cannot be instrumented in this granular fashion. This scenario is particularly pre…

    Submitted 12 October, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: 17 pages, 9 figures. arXiv admin note: text overlap with arXiv:2008.11856

  26. arXiv:2012.09321  [pdf]

    physics.optics

    Perfectly-reflecting guided-mode-resonant photonic lattices possessing Mie modal memory

    Authors: Yeong Hwan Ko, Nasrin Razmjooei, Hafez Hemmati, Robert Magnusson

    Abstract: Resonant periodic nanostructures provide perfect reflection across small or large spectral bandwidths depending on the choice of materials and design parameters. This effect has been known for decades, observed theoretically and experimentally via one-dimensional and two-dimensional structures commonly known as resonant gratings, metamaterials, and metasurfaces. The physical cause of this extraord…

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: 9 pages, 5 figures

  27. arXiv:2008.11856  [pdf, other]

    cs.LG cs.SE stat.ML

    Hybrid Deep Neural Networks to Infer State Models of Black-Box Systems

    Authors: Mohammad Jafar Mashhadi, Hadi Hemmati

    Abstract: Inferring a behavior model of a running software system is quite useful for several automated software engineering tasks, such as program comprehension, anomaly detection, and testing. Most existing dynamic model inference techniques are white-box, i.e., they require source code to be instrumented to get run-time traces. However, in many systems, instrumenting the entire source code is not possible…

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: 11 Pages, ASE '20 conference

  28. arXiv:2006.04852  [pdf]

    physics.optics

    Applicability of the Rytov full effective-medium formalism to the physical description and design of resonant metasurfaces

    Authors: Hafez Hemmati, Robert Magnusson

    Abstract: Periodic photonic lattices constitute a fundamental pillar of physics supporting a plethora of scientific concepts and applications. The advent of metamaterials and metastructures is grounded in deep understanding of their properties. Based on the original 1956 formulation by Rytov, it is well known that a photonic lattice with deep subwavelength periodicity can be approximated with a homogeneous…

    Submitted 8 June, 2020; originally announced June 2020.

    Comments: 14 pages, 7 figures

  29. An IR-based Approach Towards Automated Integration of Geo-spatial Datasets in Map-based Software Systems

    Authors: Nima Miryeganeh, Mehdi Amoui, Hadi Hemmati

    Abstract: Data is arguably the most valuable asset of the modern world. In this era, the success of any data-intensive solution relies on the quality of data that drives it. Among the vast amounts of data that are captured, managed, and analyzed every day, geospatial data are one of the most interesting classes of data that hold geographical information of real-world phenomena and can be visualized as digital maps.…

    Submitted 27 June, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: ESEC/FSE 2019 - Industry track

  30. arXiv:1906.02349  [pdf, other]

    cs.SE

    Revisiting Hyper-Parameter Tuning for Search-based Test Data Generation

    Authors: Shayan Zamani, Hadi Hemmati

    Abstract: Search-based software testing (SBST) has been widely studied in the recent literature. Since, in theory, the performance of meta-heuristic search methods is highly dependent on their parameters, there is a need to study SBST tuning. In this study, we partially replicate a previous paper on SBST tool tuning and revisit some of the claims of that paper. In particular, unlike the previous work, our…

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: Accepted to SSBSE 2019 conference

  31. Interactive Semi-automated Specification Mining for Debugging: An Experience Report

    Authors: Mohammad Jafar Mashhadi, Taha R. Siddiqui, Hadi Hemmati, Howard Loewen

    Abstract: Context: Specification mining techniques are typically used to extract the specification of a software system in the absence of (up-to-date) specification documents. This is useful for program comprehension, testing, and anomaly detection. However, specification mining can also potentially be used for debugging, where a faulty behavior is abstracted to give developers a context about the bug and help the…

    Submitted 6 May, 2019; originally announced May 2019.

    Comments: Accepted at IST journal

  32. arXiv:1903.11242  [pdf, ps, other]

    cs.SE

    An Empirical Study on Practicality of Specification Mining Algorithms on a Real-world Application

    Authors: Mohammad Jafar Mashhadi, Hadi Hemmati

    Abstract: Dynamic model inference techniques have been the center of many research projects recently. There are now multiple open source implementations of state-of-the-art algorithms, which provide basic abstraction and merging capabilities. Most of these tools and algorithms have been developed with one particular application in mind, which is program comprehension. The output models can abstract away th…

    Submitted 28 March, 2019; v1 submitted 27 March, 2019; originally announced March 2019.

    Comments: This is the camera ready version to be published in proceedings of ICPC 2019, Montreal. The previous version was the initially submitted version

  33. arXiv:1003.4961  [pdf, ps, other]

    gr-qc physics.space-ph

    Advancing Tests of Relativistic Gravity via Laser Ranging to Phobos

    Authors: Slava G. Turyshev, William Farr, William M. Folkner, Andre R. Girerd, Hamid Hemmati, Thomas W. Murphy, Jr., James G. Williams, John J. Degnan

    Abstract: Phobos Laser Ranging (PLR) is a concept for a space mission designed to advance tests of relativistic gravity in the solar system. PLR's primary objective is to measure the curvature of space around the Sun, represented by the Eddington parameter $\gamma$, with an accuracy of two parts in $10^7$, thereby improving today's best result by two orders of magnitude. Other mission goals include measurements…

    Submitted 3 September, 2010; v1 submitted 25 March, 2010; originally announced March 2010.

    Comments: 25 pages, 10 figures, 9 tables

    Journal ref: Experimental Astronomy 28, 209-249 (2010)