Showing 1–17 of 17 results for author: Parnin, C

Searching in archive cs.

  1. arXiv:2405.01556  [pdf, other]

    cs.SE cs.AI cs.CL

    Semantically Aligned Question and Code Generation for Automated Insight Generation

    Authors: Ananya Singha, Bhavya Chopra, Anirudh Khatry, Sumit Gulwani, Austin Z. Henley, Vu Le, Chris Parnin, Mukul Singh, Gust Verbruggen

    Abstract: Automated insight generation is a common tactic for helping knowledge workers, such as data scientists, to quickly understand the potential value of new and unfamiliar data. Unfortunately, automated insights produced by large-language models can generate code that does not correctly correspond (or align) to the insight. In this paper, we leverage the semantic knowledge of large language models to…

    Submitted 21 March, 2024; originally announced May 2024.

  2. arXiv:2402.06229  [pdf, other]

    cs.HC cs.AI cs.SE

    Exploring Interaction Patterns for Debugging: Enhancing Conversational Capabilities of AI-assistants

    Authors: Bhavya Chopra, Yasharth Bajpai, Param Biyani, Gustavo Soares, Arjun Radhakrishna, Chris Parnin, Sumit Gulwani

    Abstract: The widespread availability of Large Language Models (LLMs) within Integrated Development Environments (IDEs) has led to their speedy adoption. Conversational interactions with LLMs enable programmers to obtain natural language explanations for various software development tasks. However, LLMs often leap to action without sufficient context, giving rise to implicit assumptions and inaccurate respo…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 7 pages, 4 figures, 2 tables

  3. arXiv:2312.14231  [pdf, other]

    cs.SE

    Building Your Own Product Copilot: Challenges, Opportunities, and Needs

    Authors: Chris Parnin, Gustavo Soares, Rahul Pandita, Sumit Gulwani, Jessica Rich, Austin Z. Henley

    Abstract: A race is underway to embed advanced AI capabilities into products. These product copilots enable users to ask questions in natural language and receive relevant responses that are specific to the user's context. In fact, virtually every large technology company is looking to add these capabilities to their software products. However, for most software engineers, this is often their first encounte…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 11 pages

  4. arXiv:2310.16164  [pdf, other]

    cs.HC

    Conversational Challenges in AI-Powered Data Science: Obstacles, Needs, and Design Opportunities

    Authors: Bhavya Chopra, Ananya Singha, Anna Fariha, Sumit Gulwani, Chris Parnin, Ashish Tiwari, Austin Z. Henley

    Abstract: Large Language Models (LLMs) are being increasingly employed in data science for tasks like data preprocessing and analytics. However, data scientists encounter substantial obstacles when conversing with LLM-powered chatbots and acting on their suggestions and answers. We conducted a mixed-methods study, including contextual observations, semi-structured interviews (n=14), and a survey (n=114), to…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 24 pages, 8 figures

  5. arXiv:2310.10358  [pdf, other]

    cs.CL cs.AI

    Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMs

    Authors: Ananya Singha, José Cambronero, Sumit Gulwani, Vu Le, Chris Parnin

    Abstract: Large language models (LLMs) are increasingly applied for tabular tasks using in-context learning. The prompt representation for a table may play a role in the LLM's ability to process the table. Inspired by prior work, we generate a collection of self-supervised structural tasks (e.g. navigate to a cell and row; transpose the table) and evaluate the performance differences when using 8 formats. In…

    Submitted 16 October, 2023; originally announced October 2023.

  6. Correlates of Programmer Efficacy and Their Link to Experience: A Combined EEG and Eye-Tracking Study

    Authors: Norman Peitek, Annabelle Bergum, Maurice Rekrut, Jonas Mucke, Matthias Nadig, Chris Parnin, Janet Siegmund, Sven Apel

    Abstract: Background: Despite similar education and background, programmers can exhibit vast differences in efficacy. While research has identified some potential factors, such as programming experience and domain knowledge, the effect of these factors on programmers' efficacy is not well understood. Aims: We aim at unraveling the relationship between efficacy (speed and correctness) and measures of progr…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 12 pages, 7 figures, To be published at Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE '22)

  7. arXiv:2208.01242  [pdf]

    cs.CR cs.SE

    Detecting and Characterizing Propagation of Security Weaknesses in Puppet-based Infrastructure Management

    Authors: Akond Rahman, Chris Parnin

    Abstract: Despite being beneficial for managing computing infrastructure automatically, Puppet manifests are susceptible to security weaknesses, e.g., hard-coded secrets and use of weak cryptography algorithms. Adequate mitigation of security weaknesses in Puppet manifests is thus necessary to secure computing infrastructure that is managed with Puppet manifests. A characterization of how security weakness…

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: 14 pages, currently under review

  8. arXiv:2203.12065  [pdf]

    cs.SE

    Dozer: Migrating Shell Commands to Ansible Modules via Execution Profiling and Synthesis

    Authors: Eric Horton, Chris Parnin

    Abstract: Software developers frequently use the system shell to perform configuration management tasks. Unfortunately, the shell does not scale well to large systems, and configuration management tools like Ansible are more difficult to learn. We address this problem with Dozer, a technique to help developers push their shell commands into Ansible task definitions. It operates by tracing and comparing syst…

    Submitted 22 March, 2022; originally announced March 2022.

  9. arXiv:2103.09685  [pdf, other]

    cs.SE

    Nudging Students Toward Better Software Engineering Behaviors

    Authors: Chris Brown, Chris Parnin

    Abstract: Student experiences in large undergraduate Computer Science courses are increasingly impacted by automated systems. Bots, or agents of software automation, are useful for efficiently grading and generating feedback. Current efforts at automation in CS education focus on supporting instructional tasks, but do not address student struggles due to poor behaviors, such as procrastination. In this pape…

    Submitted 17 March, 2021; originally announced March 2021.

    Comments: 5 pages, 1 figure, BotSE 2021

  10. SLACC: Simion-based Language Agnostic Code Clones

    Authors: George Mathew, Chris Parnin, Kathryn T Stolee

    Abstract: Successful cross-language clone detection could enable researchers and developers to create robust language migration tools, facilitate learning additional programming languages once one is mastered, and promote reuse of code snippets over a broader codebase. However, identifying cross-language clones presents special challenges to the clone detection problem. A lack of common underlying represent…

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: 11 Pages, 3 Figures, Accepted at ICSE 2020 technical track

  11. arXiv:1909.06251  [pdf, other]

    cs.SE

    V2: Fast Detection of Configuration Drift in Python

    Authors: Eric Horton, Chris Parnin

    Abstract: Code snippets are prevalent, but are hard to reuse because they often lack an accompanying environment configuration. Most are not actively maintained, allowing for drift between the most recent possible configuration and the code snippet as the snippet becomes out-of-date over time. Recent work has identified the problem of validating and detecting out-of-date code snippets as the most important…

    Submitted 13 September, 2019; originally announced September 2019.

  12. arXiv:1907.07159  [pdf, other]

    cs.CR cs.SE

    Security Smells in Ansible and Chef Scripts: A Replication Study

    Authors: Akond Rahman, Md. Rayhanur Rahman, Chris Parnin, Laurie Williams

    Abstract: Context: Security smells are recurring coding patterns that are indicative of security weakness, and require further inspection. As infrastructure as code (IaC) scripts, such as Ansible and Chef scripts, are used to provision cloud-based servers and systems at scale, security smells in IaC scripts could be used to enable malicious users to exploit vulnerabilities in the provisioned systems. Goal:…

    Submitted 20 June, 2020; v1 submitted 16 July, 2019; originally announced July 2019.

    Comments: 31 pages. This paper has been accepted at the journal ACM Transactions on Software Engineering and Methodology (TOSEM)

  13. arXiv:1905.11127  [pdf, other]

    cs.SE

    DockerizeMe: Automatic Inference of Environment Dependencies for Python Code Snippets

    Authors: Eric Horton, Chris Parnin

    Abstract: Platforms like Stack Overflow and GitHub's gist system promote the sharing of ideas and programming techniques via the distribution of code snippets designed to illustrate particular tasks. Python, a popular and fast-growing programming language, sees heavy use on both sites, with nearly one million questions asked on Stack Overflow and 400 thousand public gists on GitHub. Unfortunately, around 75…

    Submitted 27 May, 2019; originally announced May 2019.

  14. arXiv:1808.09008  [pdf, other]

    cs.SE

    It's Like Python But: Towards Supporting Transfer of Programming Language Knowledge

    Authors: Nischal Shrestha, Titus Barik, Chris Parnin

    Abstract: Expertise in programming traditionally assumes a binary novice-expert divide. Learning resources typically target programmers who are learning programming for the first time, or expert programmers for that language. An underrepresented, yet important group of programmers are those that are experienced in one programming language, but desire to author code in a different language. For this scenario…

    Submitted 27 August, 2018; originally announced August 2018.

    Comments: To appear in IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC) 2018

  15. arXiv:1808.04919  [pdf, ps, other]

    cs.SE

    Gistable: Evaluating the Executability of Python Code Snippets on GitHub

    Authors: Eric Horton, Chris Parnin

    Abstract: Software developers create and share code online to demonstrate programming language concepts and programming tasks. Code snippets can be a useful way to explain and demonstrate a programming concept, but may not always be directly executable. A code snippet can contain parse errors, or fail to execute if the environment contains unmet dependencies. This paper presents an empirical analysis of t…

    Submitted 14 August, 2018; originally announced August 2018.

  16. arXiv:1803.08612  [pdf, ps, other]

    cs.SE cs.IR

    Evaluating How Developers Use General-Purpose Web-Search for Code Retrieval

    Authors: Md Masudur Rahman, Jed Barson, Sydney Paul, Joshua Kayan, Federico Andres Lois, Sebastian Fernandez Quezada, Christopher Parnin, Kathryn T. Stolee, Baishakhi Ray

    Abstract: Search is an integral part of a software development process. Developers often use search engines to look for information during development, including reusable code snippets, API understanding, and reference examples. Developers tend to prefer general-purpose search engines like Google, which are often not optimized for code related documents and use search strategies and ranking techniques that…

    Submitted 22 March, 2018; originally announced March 2018.

    Comments: Accepted at MSR-2018

  17. arXiv:1411.6118  [pdf, ps, other]

    cs.SE

    Code Drones

    Authors: Mithun P. Acharya, Chris Parnin, Nicholas A. Kraft, Aldo Dagnino, Xiao Qu

    Abstract: We propose and explore a new paradigm called Code Drones in which every software artifact such as a class is an intelligent and socially active entity. In this paradigm, humanized artifacts take the lead and choreograph (socially, in collaboration with other intelligent software artifacts and humans) automated software engineering solutions to a myriad of development and maintenance challenges, in…

    Submitted 16 February, 2016; v1 submitted 22 November, 2014; originally announced November 2014.