
Showing 1–9 of 9 results for author: Henley, A Z

  1. arXiv:2405.01556

    cs.SE cs.AI cs.CL

    Semantically Aligned Question and Code Generation for Automated Insight Generation

    Authors: Ananya Singha, Bhavya Chopra, Anirudh Khatry, Sumit Gulwani, Austin Z. Henley, Vu Le, Chris Parnin, Mukul Singh, Gust Verbruggen

    Abstract: Automated insight generation is a common tactic for helping knowledge workers, such as data scientists, to quickly understand the potential value of new and unfamiliar data. Unfortunately, automated insights produced by large-language models can generate code that does not correctly correspond (or align) to the insight. In this paper, we leverage the semantic knowledge of large language models to…

    Submitted 21 March, 2024; originally announced May 2024.

  2. arXiv:2403.07762

    cs.HC

    Supporting Annotators with Affordances for Efficiently Labeling Conversational Data

    Authors: Austin Z. Henley, David Piorkowski

    Abstract: Without well-labeled ground truth data, machine learning-based systems would not be as ubiquitous as they are today, but these systems rely on substantial amounts of correctly labeled data. Unfortunately, crowdsourced labeling is time consuming and expensive. To address the concerns of effort and tedium, we designed CAL, a novel interface to aid in data labeling. We made several key design decisio…

    Submitted 12 March, 2024; originally announced March 2024.

  3. CodeAid: Evaluating a Classroom Deployment of an LLM-based Programming Assistant that Balances Student and Educator Needs

    Authors: Majeed Kazemitabaar, Runlong Ye, Xiaoning Wang, Austin Z. Henley, Paul Denny, Michelle Craig, Tovi Grossman

    Abstract: Timely, personalized feedback is essential for students learning programming. LLM-powered tools like ChatGPT offer instant support, but reveal direct answers with code, which may hinder deep conceptual engagement. We developed CodeAid, an LLM-powered programming assistant delivering helpful, technically correct responses, without revealing code solutions. CodeAid answers conceptual questions, gene…

    Submitted 25 February, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: CHI 2024 Paper - The paper includes 17 pages, 8 figures, 2 tables, along with a 2-page appendix

  4. arXiv:2312.14231

    cs.SE

    Building Your Own Product Copilot: Challenges, Opportunities, and Needs

    Authors: Chris Parnin, Gustavo Soares, Rahul Pandita, Sumit Gulwani, Jessica Rich, Austin Z. Henley

    Abstract: A race is underway to embed advanced AI capabilities into products. These product copilots enable users to ask questions in natural language and receive relevant responses that are specific to the user's context. In fact, virtually every large technology company is looking to add these capabilities to their software products. However, for most software engineers, this is often their first encounte…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 11 pages

  5. arXiv:2310.16164

    cs.HC

    Conversational Challenges in AI-Powered Data Science: Obstacles, Needs, and Design Opportunities

    Authors: Bhavya Chopra, Ananya Singha, Anna Fariha, Sumit Gulwani, Chris Parnin, Ashish Tiwari, Austin Z. Henley

    Abstract: Large Language Models (LLMs) are being increasingly employed in data science for tasks like data preprocessing and analytics. However, data scientists encounter substantial obstacles when conversing with LLM-powered chatbots and acting on their suggestions and answers. We conducted a mixed-methods study, including contextual observations, semi-structured interviews (n=14), and a survey (n=114), to…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 24 pages, 8 figures

  6. arXiv:2210.05506

    cs.SE cs.AI cs.HC cs.LG

    Extracting Meaningful Attention on Source Code: An Empirical Study of Developer and Neural Model Code Exploration

    Authors: Matteo Paltenghi, Rahul Pandita, Austin Z. Henley, Albert Ziegler

    Abstract: The high effectiveness of neural models of code, such as OpenAI Codex and AlphaCode, suggests coding capabilities of models that are at least comparable to those of humans. However, previous work has only used these models for their raw completion, ignoring how the model reasoning, in the form of attention weights, can be used for other downstream tasks. Disregarding the attention weights means di…

    Submitted 11 October, 2022; originally announced October 2022.

  7. arXiv:2204.08108

    cs.SE

    How are Software Repositories Mined? A Systematic Literature Review of Workflows, Methodologies, Reproducibility, and Tools

    Authors: Adam Tutko, Austin Z. Henley, Audris Mockus

    Abstract: With the advent of open source software, a veritable treasure trove of previously proprietary software development data was made available. This opened the field of empirical software engineering research to anyone in academia. Data that is mined from software projects, however, requires extensive processing and needs to be handled with utmost care to ensure valid conclusions. Since the software d…

    Submitted 17 April, 2022; originally announced April 2022.

    Comments: 11 Pages

    MSC Class: 68N99

  8. arXiv:2102.06098

    cs.SE

    An Inquisitive Code Editor for Addressing Novice Programmers' Misconceptions of Program Behavior

    Authors: Austin Z. Henley, Julian Ball, Benjamin Klein, Aiden Rutter, Dylan Lee

    Abstract: Novice programmers face numerous barriers while attempting to learn how to code that may deter them from pursuing a computer science degree or career in software development. In this work, we propose a tool concept to address the particularly challenging barrier of novice programmers holding misconceptions about how their code behaves. Specifically, the concept involves an inquisitive code editor…

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: Accepted to ICSE-JSEET'21

  9. arXiv:2011.06244

    cs.SE

    A Fine-grained Data Set and Analysis of Tangling in Bug Fixing Commits

    Authors: Steffen Herbold, Alexander Trautsch, Benjamin Ledel, Alireza Aghamohammadi, Taher Ahmed Ghaleb, Kuljit Kaur Chahal, Tim Bossenmaier, Bhaveet Nagaria, Philip Makedonski, Matin Nili Ahmadabadi, Kristof Szabados, Helge Spieker, Matej Madeja, Nathaniel Hoy, Valentina Lenarduzzi, Shangwen Wang, Gema Rodríguez-Pérez, Ricardo Colomo-Palacios, Roberto Verdecchia, Paramvir Singh, Yihao Qin, Debasish Chakroborti, Willard Davis, Vijay Walunj, Hongjun Wu, et al. (23 additional authors not shown)

    Abstract: Context: Tangled commits are changes to software that address multiple concerns at once. For researchers interested in bugs, tangled commits mean that they actually study not only bugs, but also other concerns irrelevant for the study of bugs. Objective: We want to improve our understanding of the prevalence of tangling and the types of changes that are tangled within bug fixing commits. Metho…

    Submitted 13 October, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: Status: Accepted at Empirical Software Engineering