Skip to main content

Showing 1–12 of 12 results for author: Quaranta, L

.
  1. arXiv:2401.11366  [pdf

    cs.SE

    A Multivocal Literature Review on the Benefits and Limitations of Automated Machine Learning Tools

    Authors: Kelly Azevedo, Luigi Quaranta, Fabio Calefato, Marcos Kalinowski

    Abstract: Context. Advancements in Machine Learning (ML) are revolutionizing every application domain, driving unprecedented transformations and fostering innovation. However, despite these advances, several organizations are experiencing friction in the adoption of ML-based technologies, mainly due to the shortage of ML professionals. In this context, Automated Machine Learning (AutoML) techniques have bee… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  2. Assessing the Use of AutoML for Data-Driven Software Engineering

    Authors: Fabio Calefato, Luigi Quaranta, Filippo Lanubile, Marcos Kalinowski

    Abstract: Background. Due to the widespread adoption of Artificial Intelligence (AI) and Machine Learning (ML) for building software applications, companies are struggling to recruit employees with a deep understanding of such technologies. In this scenario, AutoML is soaring as a promising solution to fill the AI/ML skills gap since it promises to automate the building of end-to-end AI/ML pipelines that wo… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Journal ref: Proc. of 17th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM'23), Oct. 2023

  3. arXiv:2303.14702  [pdf, other

    cs.SE

    A Lot of Talk and a Badge: An Exploratory Analysis of Personal Achievements in GitHub

    Authors: Fabio Calefato, Luigi Quaranta, Filippo Lanubile

    Abstract: Context. GitHub has introduced a new gamification element through personal achievements, whereby badges are unlocked and displayed on developers' personal profile pages in recognition of their development activities. Objective. In this paper, we present an exploratory analysis using mixed methods to study the diffusion of personal badges in GitHub, in addition to the effects and reactions to their… ▽ More

    Submitted 2 February, 2024; v1 submitted 26 March, 2023; originally announced March 2023.

  4. Teaching MLOps in Higher Education through Project-Based Learning

    Authors: Filippo Lanubile, Silverio Martínez-Fernández, Luigi Quaranta

    Abstract: Building and maintaining production-grade ML-enabled components is a complex endeavor that goes beyond the current approach of academic education, focused on the optimization of ML model performance in the lab. In this paper, we present a project-based learning approach to teaching MLOps, focused on the demonstration and experience with emerging practices and tools to automatize the construction o… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: Accepted in 2023 IEEE/ACM 45th International Conference on Software Engineering: Software Engineering Education and Training (ICSE-SEET)

    ACM Class: D.2; I.2

    Journal ref: 2023 IEEE/ACM 45th International Conference on Software Engineering: Software Engineering Education and Training (ICSE-SEET)

  5. A Preliminary Investigation of MLOps Practices in GitHub

    Authors: Fabio Calefato, Filippo Lanubile, Luigi Quaranta

    Abstract: Background. The rapid and growing popularity of machine learning (ML) applications has led to an increasing interest in MLOps, that is, the practice of continuous integration and deployment (CI/CD) of ML-enabled systems. Aims. Since changes may affect not only the code but also the ML model parameters and the data themselves, the automation of traditional CI/CD needs to be extended to manage model… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: Presented at ESEM '22, the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement

    Journal ref: Proceedings of the 16th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM 2022), September 2022, Pages 283-288

  6. Assessing the Quality of Computational Notebooks for a Frictionless Transition from Exploration to Production

    Authors: Luigi Quaranta

    Abstract: The massive trend of integrating data-driven AI capabilities into traditional software systems is rising new intriguing challenges. One of such challenges is achieving a smooth transition from the explorative phase of Machine Learning projects - in which data scientists build prototypical models in the lab - to their production phase - in which software engineers translate prototypes into producti… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 5 pages

    Journal ref: Proc. of 2022 IEEE/ACM 44th International Conference on Software Engineering Companion (ICSE 22 Companion), May 21-29, 2022, Pittsburgh, PA, USA

  7. Pynblint: a Static Analyzer for Python Jupyter Notebooks

    Authors: Luigi Quaranta, Fabio Calefato, Filippo Lanubile

    Abstract: Jupyter Notebook is the tool of choice of many data scientists in the early stages of ML workflows. The notebook format, however, has been criticized for inducing bad programming practices; indeed, researchers have already shown that open-source repositories are inundated by poor-quality notebooks. Low-quality output from the prototypical stages of ML workflows constitutes a clear bottleneck towar… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 2 pages

    Journal ref: Proc. of 2022 IEEE/ACM 1st Conference on AI Engineering - Software Engineering for AI (CAIN), May 16-24, 2022, Pittsburgh, PA, USA

  8. arXiv:2202.07233  [pdf, other

    cs.HC cs.AI cs.LG

    Eliciting Best Practices for Collaboration with Computational Notebooks

    Authors: Luigi Quaranta, Fabio Calefato, Filippo Lanubile

    Abstract: Despite the widespread adoption of computational notebooks, little is known about best practices for their usage in collaborative contexts. In this paper, we fill this gap by eliciting a catalog of best practices for collaborative data science with computational notebooks. With this aim, we first look for best practices through a multivocal literature review. Then, we conduct interviews with profe… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Journal ref: Proc. ACM Hum.-Comput. Interact., Vol. 6, No. CSCW1, Article 87, April 2022

  9. KGTorrent: A Dataset of Python Jupyter Notebooks from Kaggle

    Authors: Luigi Quaranta, Fabio Calefato, Filippo Lanubile

    Abstract: Computational notebooks have become the tool of choice for many data scientists and practitioners for performing analyses and disseminating results. Despite their increasing popularity, the research community cannot yet count on a large, curated dataset of computational notebooks. In this paper, we fill this gap by introducing KGTorrent, a dataset of Python Jupyter notebooks with rich metadata ret… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

    Journal ref: Proc. of 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR), pp. 550-554

  10. Towards Productizing AI/ML Models: An Industry Perspective from Data Scientists

    Authors: Filippo Lanubile, Fabio Calefato, Luigi Quaranta, Maddalena Amoruso, Fabio Fumarola, Michele Filannino

    Abstract: The transition from AI/ML models to production-ready AI-based systems is a challenge for both data scientists and software engineers. In this paper, we report the results of a workshop conducted in a consulting company to understand how this transition is perceived by practitioners. Starting from the need for making AI experiments reproducible, the main themes that emerged are related to the use o… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: 4 pages

    Journal ref: Proc. of 2021 IEEE/ACM 1st Workshop on AI Engineering - Software Engineering for AI (WAIN), pp. 129-132

  11. EMTk -- The Emotion Mining Toolkit

    Authors: Fabio Calefato, Filippo Lanubile, Nicole Novielli, Luigi Quaranta

    Abstract: The Emotion Mining Toolkit (EMTk) is a suite of modules and datasets offering a comprehensive solution for mining sentiment and emotions from technical text contributed by developers on communication channels. The toolkit is written in Java, Python, and R, and is released under the MIT open source license. In this paper, we describe its architecture and the benchmark against the previous, standalo… ▽ More

    Submitted 12 April, 2021; v1 submitted 22 March, 2019; originally announced March 2019.

    Comments: Proceedings of the 4th International Workshop on Emotion Awareness in Software Engineering (SEmotion '19), May 2019, pp. 34-37

  12. arXiv:1903.03426  [pdf, other

    cs.SE

    A Replication Study on Code Comprehension and Expertise using Lightweight Biometric Sensors

    Authors: Davide Fucci, Daniela Girardi, Nicole Novielli, Luigi Quaranta, Filippo Lanubile

    Abstract: Code comprehension has been recently investigated from physiological and cognitive perspectives through the use of medical imaging. Floyd et al (i.e., the original study) used fMRI to classify the type of comprehension tasks performed by developers and relate such results to their expertise. We replicate the original study using lightweight biometrics sensors which participants (28 undergrads in c… ▽ More

    Submitted 2 April, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: Author version submitted to ICPC2019 (Replication track)