Skip to main content

Showing 1–26 of 26 results for author: Gerosa, M A

.
  1. arXiv:2407.04674  [pdf, other

    cs.SE

    Game Elements to Engage Students Learning the Open Source Software Contribution Process

    Authors: Italo Santos, Katia Romero Felizardo, Marco A. Gerosa, Igor Steinmacher

    Abstract: Contributing to OSS projects can help students to enhance their skills and expand their professional networks. However, novice contributors often feel discouraged due to various barriers. Gamification techniques hold the potential to foster engagement and facilitate the learning process. Nevertheless, it is unknown which game elements are effective in this context. This study explores students' pe… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2404.18677  [pdf, other

    cs.SE

    Towards the First Code Contribution: Processes and Information Needs

    Authors: Christoph Treude, Marco A. Gerosa, Igor Steinmacher

    Abstract: Newcomers to a software project must overcome many barriers before they can successfully place their first code contribution, and they often struggle to find information that is relevant to them. In this work, we argue that much of the information needed by newcomers already exists, albeit scattered among many different sources, and that many barriers can be addressed by automatically identifying,… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  3. arXiv:2401.04637  [pdf, other

    cs.SE cs.AI cs.CL cs.LG

    Applying Large Language Models API to Issue Classification Problem

    Authors: Gabriel Aracena, Kyle Luster, Fabio Santos, Igor Steinmacher, Marco A. Gerosa

    Abstract: Effective prioritization of issue reports is crucial in software engineering to optimize resource allocation and address critical problems promptly. However, the manual classification of issue reports for prioritization is laborious and lacks scalability. Alternatively, many open source software (OSS) projects employ automated processes for this task, albeit relying on substantial datasets for ade… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 4 pages, 1 figure, NLBSE and ICSE conference submission, ACM formatted, pre print

  4. arXiv:2311.11081  [pdf, other

    cs.SE

    Can AI Serve as a Substitute for Human Subjects in Software Engineering Research?

    Authors: Marco A. Gerosa, Bianca Trinkenreich, Igor Steinmacher, Anita Sarma

    Abstract: Research within sociotechnical domains, such as Software Engineering, fundamentally requires a thorough consideration of the human perspective. However, traditional qualitative data collection methods suffer from challenges related to scale, labor intensity, and the increasing difficulty of participant recruitment. This vision paper proposes a novel approach to qualitative data collection in softw… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  5. Anticipating User Needs: Insights from Design Fiction on Conversational Agents for Computational Thinking

    Authors: Jacob Penney, João Felipe Pimentel, Igor Steinmacher, Marco A. Gerosa

    Abstract: Computational thinking, and by extension, computer programming, is notoriously challenging to learn. Conversational agents and generative artificial intelligence (genAI) have the potential to facilitate this learning process by offering personalized guidance, interactive learning experiences, and code generation. However, current genAI-based chatbots focus on professional developers and may not ad… ▽ More

    Submitted 13 June, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: 17 pages, three figures, Conversations 2023. Chatbot research and design (2024). In Lecture notes in computer science

  6. Tag that issue: Applying API-domain labels in issue tracking systems

    Authors: Fabio Santos, Joseph Vargovich, Bianca Trinkenreich, Italo Santos, Jacob Penney, Ricardo Britto, João Felipe Pimentel, Igor Wiese, Igor Steinmacher, Anita Sarma, Marco A. Gerosa

    Abstract: Labeling issues with the skills required to complete them can help contributors to choose tasks in Open Source Software projects. However, manually labeling issues is time-consuming and error-prone, and current automated approaches are mostly limited to classifying issues as bugs/non-bugs. We investigate the feasibility and relevance of automatically labeling issues with what we call "API-domains,… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: text overlap with arXiv:2103.12653

    ACM Class: D.2.0; I.2.7; K.6.3

    Journal ref: Empirical Software Engineering (EMSE), 2023

  7. The State of Diversity and Inclusion in Apache: A Pulse Check

    Authors: Zixuan Feng, Mariam Guizani, Marco A. Gerosa, Anita Sarma

    Abstract: Diversity and inclusion in open source software (OSS) is a multifaceted concept that arises from differences in contributors' gender, seniority, language, region, and other characteristics. D&I has received growing attention in OSS ecosystems and projects, and various programs have been implemented to foster contributor diversity. However, we do not yet know how the state of D&I is evolving. By un… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 11 pages, 1 figure

    Journal ref: 16th International Conference on Cooperative and Human Aspects of Software Engineering (CHASE 2023)

  8. GiveMeLabeledIssues: An Open Source Issue Recommendation System

    Authors: Joseph Vargovich, Fabio Santos, Jacob Penney, Marco A. Gerosa, Igor Steinmacher

    Abstract: Developers often struggle to navigate an Open Source Software (OSS) project's issue-tracking system and find a suitable task. Proper issue labeling can aid task selection, but current tools are limited to classifying the issues according to their type (e.g., bug, question, good first issue, feature, etc.). In contrast, this paper presents a tool (GiveMeLabeledIssues) that mines project repositorie… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: MSR Data and Tool Showcase 2023

    Journal ref: IEEE/ACM 20th International Conference on Mining Software Repositories (MSR 2023), Data and Tool Showcase

  9. Designing for Cognitive Diversity: Improving the GitHub Experience for Newcomers

    Authors: Italo Santos, João Felipe Pimentel, Igor Wiese, Igor Steinmacher, Anita Sarma, Marco A. Gerosa

    Abstract: Social coding platforms such as GitHub have become defacto environments for collaborative programming and open source. When these platforms do not support specific cognitive styles, they create barriers to programming for some populations. Research shows that the cognitive styles typically favored by women are often unsupported, creating barriers to entry for woman newcomers. In this paper, we use… ▽ More

    Submitted 10 February, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Journal ref: 45th IEEE/ACM International Conference on Software Engineering, Software Engineering in Society (ICSE SEIS 2023)

  10. Do I Belong? Modeling Sense of Virtual Community Among Linux Kernel Contributors

    Authors: Bianca Trinkenreich, Klaas-Jan Stol, Anita Sarma, Daniel M. German, Marco A. Gerosa, Igor Steinmacher

    Abstract: The sense of belonging to a community is a basic human need that impacts an individuals behavior, long-term engagement, and job satisfaction, as revealed by research in disciplines such as psychology, healthcare, and education. Despite much research on how to retain developers in Open Source Software projects and other virtual, peer-production communities, there is a paucity of research investigat… ▽ More

    Submitted 22 February, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Journal ref: 45th IEEE/ACM International Conference on Software Engineering (ICSE 2023)

  11. I depended on you and you broke me: An empirical study of manifesting breaking changes in client packages

    Authors: Daniel Venturini, Filipe Roseiro Cogo, Ivanilton Polato, Marco A Gerosa, Igor Scaliante Wiese

    Abstract: Complex software systems have a network of dependencies. Developers often configure package managers (e.g., npm) to automatically update dependencies with each publication of new releases containing bug fixes and new features. When a dependency release introduces backward-incompatible changes, commonly known as breaking changes, dependent packages may not build anymore. This may indirectly impact… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Journal ref: ACM Transactions on Software Engineering and Methodology (TOSEM 2023)

  12. The Present and Future of Bots in Software Engineering

    Authors: Emad Shihab, Stefan Wagner, Marco A. Gerosa, Mairieli Wessel, Jordi Cabot

    Abstract: We are witnessing a massive adoption of software engineering bots, applications that react to events triggered by tools and messages posted by users and run automated tasks in response, in a variety of domains. This thematic issues describes experiences and challenges with these bots.

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 5 pages, to be published in IEEE Software

    ACM Class: D.2.0

    Journal ref: IEEE Software (2022)

  13. GitHub Actions: The Impact on the Pull Request Process

    Authors: Mairieli Wessel, Joseph Vargovich, Marco A. Gerosa, Christoph Treude

    Abstract: Software projects frequently use automation tools to perform repetitive activities in the distributed software development process. Recently, GitHub introduced GitHub Actions, a feature providing automated workflows for software projects. Understanding and anticipating the effects of adopting such technology is important for planning and management. Our research investigates how projects use GitHu… ▽ More

    Submitted 27 July, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2103.12224, arXiv:2103.13547

    Journal ref: Empirical Software Engineering (EMSE 2023)

  14. An Empirical Investigation on the Challenges Faced by Women in the Software Industry: A Case Study

    Authors: Bianca Trinkenreich, Ricardo Britto, Marco Aurelio Gerosa, Igor Steinmacher

    Abstract: Addressing women's under-representation in the software industry, a widely recognized concern, requires attracting as well as retaining more women. Hearing from women practitioners, particularly those positioned in multi-cultural settings, about their challenges and and adopting their lived experienced solutions can support the design of programs to resolve the under-representation issue. Goal:… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Journal ref: ACM/IEEE 44rd International Conference on Software Engineering, SEIS - Software Engineering in Society (ICSE SEIS 2022)

  15. arXiv:2104.14699  [pdf, other

    cs.HC

    Why should we care about register? Reflections on chatbot language design

    Authors: Ana Paula Chaves, Marco Aurelio Gerosa

    Abstract: This position paper discusses the relevance of register as a theoretical framework for chatbot language design. We present the concept of register and discuss how using register-specific language influence the user's perceptions of the interaction with chatbots. Additionally, we point several research opportunities that are important to pursue to establish register as a foundation for advancing ch… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2101.11089

  16. arXiv:2104.12542  [pdf, other

    cs.CY cs.PL

    Catalogs of C and Python Antipatterns by CS1 Students

    Authors: Yorah Bosse, Igor Scaliante Wiese, Marco Aurélio Graciotto Silva, Nelson Lago, Leônidas de Oliveira Brandão, David Redmiles, Fabio Kon, Marco A. Gerosa

    Abstract: Understanding students' programming misconceptions is critical. Doing so depends on identifying the reasons why students make errors when learning a new programming language. Knowing the misconceptions can help students to improve their reflection about their mistakes and also help instructors to design better teaching strategies. In this technical report, we propose catalogs of antipatterns for t… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: 114 pages, 3 figures, 1 table, 5 appendices

    Report number: RT-MAC-2021-01

  17. Don't Disturb Me: Challenges of Interacting with SoftwareBots on Open Source Software Projects

    Authors: Mairieli Wessel, Igor Wiese, Igor Steinmacher, Marco A. Gerosa

    Abstract: Software bots are used to streamline tasks in Open Source Software (OSS) projects' pull requests, saving development cost, time, and effort. However, their presence can be disruptive to the community. We identified several challenges caused by bots in pull request interactions by interviewing 21 practitioners, including project maintainers, contributors, and bot developers. In particular, our find… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Journal ref: 2021 24th ACM Conference on Computer-Supported Coperative Work and Social Computing

  18. Quality Gatekeepers: Investigating the Effects ofCode Review Bots on Pull Request Activities

    Authors: Mairieli Wessel, Alexander Serebrenik, Igor Wiese, Igor Steinmacher, Marco A. Gerosa

    Abstract: Software bots have been facilitating several development activities in Open Source Software (OSS) projects, including code review. However, these bots may bring unexpected impacts to group dynamics, as frequently occurs with new technology adoption. Understanding and anticipating such effects is important for planning and management. To analyze these effects, we investigate how several activity in… ▽ More

    Submitted 30 March, 2022; v1 submitted 24 March, 2021; originally announced March 2021.

    Journal ref: Empirical Software Engineering (EMSE 2020)

  19. How Do Software Developers Use GitHub Actions to Automate Their Workflows?

    Authors: Timothy Kinsman, Mairieli Wessel, Marco A. Gerosa, Christoph Treude

    Abstract: Automated tools are frequently used in social coding repositories to perform repetitive activities that are part of the distributed software development process. Recently, GitHub introduced GitHub Actions, a feature providing automated workflows for repository maintainers. Although several Actions have been built and used by practitioners, relatively little has been done to evaluate them. Understa… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Journal ref: Mining Software Repositories Conference (MSR 2021), May 2021

  20. Will You Come Back to Contribute? Investigating the Inactivity of OSS Core Developers in GitHub

    Authors: Fabio Calefato, Marco Aurelio Gerosa, Giuseppe Iaffaldano, Filippo Lanubile, Igor Steinmacher

    Abstract: Several Open Source Software (OSS) projects depend on the continuity of their development communities to remain sustainable. Understanding how developers become inactive or why they take breaks can help communities prevent abandonment and incentivize developers to come back. In this paper, we propose a novel method to identify developers' inactive periods by analyzing the individual rhythm of cont… ▽ More

    Submitted 30 June, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Empirical Software Engineering, to appear

    Journal ref: Empirical Software Engineering (2022) Vol 27, no 3 March 2022

  21. Chatbots language design: the influence of language variation on user experience

    Authors: Ana Paula Chaves, Jesse Egbert, Toby Hocking, Eck Doerry, Marco Aurelio Gerosa

    Abstract: Chatbots are often designed to mimic social roles attributed to humans. However, little is known about the impact on user's perceptions of using language that fails to conform to the associated social role. Our research draws on sociolinguistic theory to investigate how a chatbot's language choices can adhere to the expected social role the agent performs within a given context. In doing so, we se… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

    Journal ref: 7th International Conference on Human-Agent Interaction (HAI 2019), Kyoto, Japan

  22. arXiv:2009.01959  [pdf, other

    cs.LG cs.CL cs.SE stat.ML

    CoNCRA: A Convolutional Neural Network Code Retrieval Approach

    Authors: Marcelo de Rezende Martins, Marco A. Gerosa

    Abstract: Software developers routinely search for code using general-purpose search engines. However, these search engines cannot find code semantically unless it has an accompanying description. We propose a technique for semantic code search: A Convolutional Neural Network approach to code retrieval (CoNCRA). Our technique aims to find the code snippet that most closely matches the developer's intent, ex… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Journal ref: 34th Brazilian Symposium on Software Engineering (SBES 2020), Insightful Ideas and Emerging Results Track

  23. Challenges for Inclusion in Software Engineering: The Case of the Emerging Papua New Guinean Society

    Authors: Raula Gaikovina Kula, Christoph Treude, Hideaki Hata, Sebastian Baltes, Igor Steinmacher, Marco Aurelio Gerosa, Winifred Kula Amini

    Abstract: Software plays a central role in modern societies, with its high economic value and potential for advancing societal change. In this paper, we characterise challenges and opportunities for a country progressing towards entering the global software industry, focusing on Papua New Guinea (PNG). By hosting a Software Engineering workshop, we conducted a qualitative study by recording talks (n=3), emp… ▽ More

    Submitted 22 July, 2021; v1 submitted 31 October, 2019; originally announced November 2019.

    Comments: IEEE Software

    Journal ref: IEEE Software (2021)

  24. Google Summer of Code: Student Motivations and Contributions

    Authors: Jefferson O. Silva, Igor Wiese, Daniel M. German, Christoph Treude, Marco A. Gerosa, Igor Steinmacher

    Abstract: Several open source software (OSS) projects expect to foster newcomers' onboarding and to receive contributions by participating in engagement programs, like Summers of Code. However, there is little empirical evidence showing why students join such programs. In this paper, we study the well-established Google Summer of Code (GSoC), which is a 3-month OSS engagement program that offers stipends an… ▽ More

    Submitted 13 October, 2019; originally announced October 2019.

    Comments: 30 pages

    Journal ref: Journal of Systems and Software (JSS), V. 162, April 2020, 110487

  25. How should my chatbot interact? A survey on human-chatbot interaction design

    Authors: Ana Paula Chaves, Marco Aurelio Gerosa

    Abstract: Chatbots' growing popularity has brought new challenges to HCI, having changed the patterns of human interactions with computers. The increasing need to approximate conversational interaction styles raises expectations for chatbots to present social behaviors that are habitual in human-human communication. In this survey, we argue that chatbots should be enriched with social characteristics that c… ▽ More

    Submitted 22 October, 2020; v1 submitted 4 April, 2019; originally announced April 2019.

    Journal ref: International Journal of Human-Computer Interaction. Vol 37(2), pp 729-758, 2021

  26. arXiv:1609.08089  [pdf, other

    cs.CY cs.SE

    Software Platforms for Smart Cities: Concepts, Requirements, Challenges, and a Unified Reference Architecture

    Authors: Eduardo Felipe Zambom Santana, Ana Paula Chaves, Marco Aurelio Gerosa, Fabio Kon, Dejan Milojicic

    Abstract: Making cities smarter help improve city services and increase citizens' quality of life. Information and communication technologies (ICT) are fundamental for progressing towards smarter city environments. Smart City software platforms potentially support the development and integration of Smart City applications. However, the ICT community must overcome current significant technological and scient… ▽ More

    Submitted 23 July, 2017; v1 submitted 26 September, 2016; originally announced September 2016.

    Comments: Accepted for publication in ACM Computing Surveys

    Journal ref: ACM Computing Surveys (CSUR), Vol 50, Issue 6, 2018