Skip to main content

Showing 1–11 of 11 results for author: Tambon, F

.
  1. arXiv:2405.13932  [pdf, other

    cs.SE cs.AI

    Chain of Targeted Verification Questions to Improve the Reliability of Code Generated by LLMs

    Authors: Sylvain Kouemo Ngassom, Arghavan Moradi Dakhel, Florian Tambon, Foutse Khomh

    Abstract: LLM-based assistants, such as GitHub Copilot and ChatGPT, have the potential to generate code that fulfills a programming task described in a natural language description, referred to as a prompt. The widespread accessibility of these assistants enables users with diverse backgrounds to generate code and integrate it into software projects. However, studies show that code generated by LLMs is pron… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 10 pages, 2 figures

  2. arXiv:2403.08937  [pdf, other

    cs.SE cs.AI

    Bugs in Large Language Models Generated Code: An Empirical Study

    Authors: Florian Tambon, Arghavan Moradi Dakhel, Amin Nikanjam, Foutse Khomh, Michel C. Desmarais, Giuliano Antoniol

    Abstract: Large Language Models (LLMs) for code have gained significant attention recently. They can generate code in different programming languages based on provided prompts, fulfilling a long-lasting dream in Software Engineering (SE), i.e., automatic code generation. Similar to human-written code, LLM-generated code is prone to bugs, and these bugs have not yet been thoroughly examined by the community.… ▽ More

    Submitted 18 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 47 pages, 7 figures

  3. arXiv:2401.13177  [pdf, other

    cs.SE cs.CY cs.LG

    Deep Learning Model Reuse in the HuggingFace Community: Challenges, Benefit and Trends

    Authors: Mina Taraghi, Gianolli Dorcelus, Armstrong Foundjem, Florian Tambon, Foutse Khomh

    Abstract: The ubiquity of large-scale Pre-Trained Models (PTMs) is on the rise, sparking interest in model hubs, and dedicated platforms for hosting PTMs. Despite this trend, a comprehensive exploration of the challenges that users encounter and how the community leverages PTMs remains lacking. To address this gap, we conducted an extensive mixed-methods empirical study by focusing on discussion forums and… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted by IEEE SANER 2024

  4. arXiv:2311.00801  [pdf, other

    cs.LG cs.SE

    GIST: Generated Inputs Sets Transferability in Deep Learning

    Authors: Florian Tambon, Foutse Khomh, Giuliano Antoniol

    Abstract: To foster the verifiability and testability of Deep Neural Networks (DNN), an increasing number of methods for test case generation techniques are being developed. When confronted with testing DNN models, the user can apply any existing test generation technique. However, it needs to do so for each technique and each DNN model under test, which can be expensive. Therefore, a paradigm shift could… ▽ More

    Submitted 20 May, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: accepted for publication in the "ACM Transactions on Software Engineering and Methodology" journal

  5. arXiv:2310.09575  [pdf, other

    cs.SE

    Common Challenges of Deep Reinforcement Learning Applications Development: An Empirical Study

    Authors: Mohammad Mehdi Morovati, Florian Tambon, Mina Taraghi, Amin Nikanjam, Foutse Khomh

    Abstract: Machine Learning (ML) is increasingly being adopted in different industries. Deep Reinforcement Learning (DRL) is a subdomain of ML used to produce intelligent agents. Despite recent developments in DRL technology, the main challenges that developers face in the development of DRL applications are still unknown. To fill this gap, in this paper, we conduct a large-scale empirical study of 927 DRL-r… ▽ More

    Submitted 18 May, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Submitted to Empirical Software Engineering journal

  6. arXiv:2307.14512  [pdf, other

    cs.SE cs.LG

    Bug Characterization in Machine Learning-based Systems

    Authors: Mohammad Mehdi Morovati, Amin Nikanjam, Florian Tambon, Foutse Khomh, Zhen Ming, Jiang

    Abstract: Rapid growth of applying Machine Learning (ML) in different domains, especially in safety-critical areas, increases the need for reliable ML components, i.e., a software component operating based on ML. Understanding the bugs characteristics and maintenance challenges in ML-based systems can help developers of these systems to identify where to focus maintenance and testing efforts, by giving insi… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  7. arXiv:2301.05651  [pdf, other

    cs.LG cs.SE

    Mutation Testing of Deep Reinforcement Learning Based on Real Faults

    Authors: Florian Tambon, Vahid Majdinasab, Amin Nikanjam, Foutse Khomh, Giuliano Antonio

    Abstract: Testing Deep Learning (DL) systems is a complex task as they do not behave like traditional systems would, notably because of their stochastic nature. Nonetheless, being able to adapt existing testing techniques such as Mutation Testing (MT) to DL settings would greatly improve their potential verifiability. While some efforts have been made to extend MT to the Supervised Learning paradigm, little… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: Accepted to the International Conference of Software Testing (ICST2023)

  8. A Probabilistic Framework for Mutation Testing in Deep Neural Networks

    Authors: Florian Tambon, Foutse Khomh, Giuliano Antoniol

    Abstract: Context: Mutation Testing (MT) is an important tool in traditional Software Engineering (SE) white-box testing. It aims to artificially inject faults in a system to evaluate a test suite's capability to detect them, assuming that the test suite defects finding capability will then translate to real faults. If MT has long been used in SE, it is only recently that it started gaining the attention of… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: 23 pages, 5 figures, 4 tables. Submitted to Information and Software Technology Journal

    Journal ref: Information and Software Technology. 155 (2023) 107129

  9. arXiv:2112.13314  [pdf, other

    cs.SE cs.LG

    Silent Bugs in Deep Learning Frameworks: An Empirical Study of Keras and TensorFlow

    Authors: Florian Tambon, Amin Nikanjam, Le An, Foutse Khomh, Giuliano Antoniol

    Abstract: Deep Learning (DL) frameworks are now widely used, simplifying the creation of complex models as well as their integration to various applications even to non DL experts. However, like any other programs, they are prone to bugs. This paper deals with the subcategory of bugs named silent bugs: they lead to wrong behavior but they do not cause system crashes or hangs, nor show an error message to th… ▽ More

    Submitted 1 September, 2023; v1 submitted 25 December, 2021; originally announced December 2021.

  10. How to Certify Machine Learning Based Safety-critical Systems? A Systematic Literature Review

    Authors: Florian Tambon, Gabriel Laberge, Le An, Amin Nikanjam, Paulina Stevia Nouwou Mindom, Yann Pequignot, Foutse Khomh, Giulio Antoniol, Ettore Merlo, François Laviolette

    Abstract: Context: Machine Learning (ML) has been at the heart of many innovations over the past years. However, including it in so-called 'safety-critical' systems such as automotive or aeronautic has proven to be very challenging, since the shift in paradigm that ML brings completely changes traditional certification approaches. Objective: This paper aims to elucidate challenges related to the certifica… ▽ More

    Submitted 1 December, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: 60 pages (92 pages with references and complements), submitted to a journal (Automated Software Engineering). Changes: Emphasizing difference traditional software engineering / ML approach. Adding Related Works, Threats to Validity and Complementary Materials. Adding a table listing papers reference for each section/subsections

    Journal ref: Autom Softw Eng 29, 38 (2022)

  11. arXiv:2107.04863  [pdf, other

    cs.LG cs.SE

    HOMRS: High Order Metamorphic Relations Selector for Deep Neural Networks

    Authors: Florian Tambon, Giulio Antoniol, Foutse Khomh

    Abstract: Deep Neural Networks (DNN) applications are increasingly becoming a part of our everyday life, from medical applications to autonomous cars. Traditional validation of DNN relies on accuracy measures, however, the existence of adversarial examples has highlighted the limitations of these accuracy measures, raising concerns especially when DNN are integrated into safety-critical systems. In this p… ▽ More

    Submitted 21 December, 2021; v1 submitted 10 July, 2021; originally announced July 2021.

    Comments: 33 pages