Skip to main content

Showing 1–16 of 16 results for author: Brun, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.15551  [pdf, other

    cs.LG

    Thinking Forward: Memory-Efficient Federated Finetuning of Language Models

    Authors: Kunjal Panchal, Nisarg Parikh, Sunav Choudhary, Lijun Zhang, Yuriy Brun, Hui Guan

    Abstract: Finetuning large language models (LLMs) in federated learning (FL) settings has become important as it allows resource-constrained devices to finetune a model using private data. However, finetuning LLMs using backpropagation requires excessive memory (especially from intermediate activations) for resource-constrained devices. While Forward-mode Auto-Differentiation (AD) can reduce memory footprin… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2401.04247  [pdf, other

    cs.CV cs.AI

    Robust Image Watermarking using Stable Diffusion

    Authors: Lijun Zhang, Xiao Liu, Antoni Viros Martin, Cindy Xiong Bearfield, Yuriy Brun, Hui Guan

    Abstract: Watermarking images is critical for tracking image provenance and claiming ownership. With the advent of generative models, such as stable diffusion, able to create fake but realistic images, watermarking has become particularly important, e.g., to make generated images reliably identifiable. Unfortunately, the very same stable diffusion technology can remove watermarks injected using existing met… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 15 pages, 14 figures

  3. My Model is Unfair, Do People Even Care? Visual Design Affects Trust and Perceived Bias in Machine Learning

    Authors: Aimen Gaba, Zhanna Kaufman, Jason Chueng, Marie Shvakel, Kyle Wm. Hall, Yuriy Brun, Cindy Xiong Bearfield

    Abstract: Machine learning technology has become ubiquitous, but, unfortunately, often exhibits bias. As a consequence, disparate stakeholders need to interact with and make informed decisions about using machine learning models in everyday systems. Visualization technology can support stakeholders in understanding and evaluating trade-offs between, for example, accuracy and fairness of models. This paper a… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 11 pages, 6 figures, to appear in IEEE Transactions of Visualization and Computer Graphics (Also in proceedings of IEEE VIS 2023)

    ACM Class: H.5.0

    Journal ref: IEEE TVCG 30(1):327-337

  4. arXiv:2303.04910  [pdf, other

    cs.LG cs.LO cs.SE

    Baldur: Whole-Proof Generation and Repair with Large Language Models

    Authors: Emily First, Markus N. Rabe, Talia Ringer, Yuriy Brun

    Abstract: Formally verifying software properties is a highly desirable but labor-intensive task. Recent work has developed methods to automate formal verification using proof assistants, such as Coq and Isabelle/HOL, e.g., by training a model to predict one proof step at a time, and using that model to search through the space of possible proofs. This paper introduces a new method to automate formal verific… ▽ More

    Submitted 15 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

  5. Avgust: Automating Usage-Based Test Generation from Videos of App Executions

    Authors: Yixue Zhao, Saghar Talebipour, Kesina Baral, Hyojae Park, Leon Yee, Safwat Ali Khan, Yuriy Brun, Nenad Medvidovic, Kevin Moran

    Abstract: Writing and maintaining UI tests for mobile apps is a time-consuming and tedious task. While decades of research have produced automated approaches for UI test generation, these approaches typically focus on testing for crashes or maximizing code coverage. By contrast, recent research has shown that developers prefer usage-based tests, which center around specific uses of app features, to help sup… ▽ More

    Submitted 1 November, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

    Journal ref: ESEC/FSE 2022

  6. arXiv:2208.11744  [pdf, other

    cs.LG cs.AI cs.CY

    Enforcing Delayed-Impact Fairness Guarantees

    Authors: Aline Weber, Blossom Metevier, Yuriy Brun, Philip S. Thomas, Bruno Castro da Silva

    Abstract: Recent research has shown that seemingly fair machine learning models, when used to inform decisions that have an impact on peoples' lives or well-being (e.g., applications involving education, employment, and lending), can inadvertently increase social inequality in the long term. This is because prior fairness-aware algorithms only consider static fairness constraints, such as equal opportunity… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: 24 pages, 5 figures

  7. Passport: Improving Automated Formal Verification Using Identifiers

    Authors: Alex Sanchez-Stern, Emily First, Timothy Zhou, Zhanna Kaufman, Yuriy Brun, Talia Ringer

    Abstract: Formally verifying system properties is one of the most effective ways of improving system quality, but its high manual effort requirements often render it prohibitively expensive. Tools that automate formal verification, by learning from proof corpora to suggest proofs, have just begun to show their promise. These tools are effective because of the richness of the data the proof corpora contain.… ▽ More

    Submitted 2 August, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Journal ref: ACM Transactions on Programming Languages and Systems (TOPLAS), 45(2):12:1-12:30, June 2023

  8. arXiv:2103.06091  [pdf, other

    cs.SE cs.CR

    Blindspots in Python and Java APIs Result in Vulnerable Code

    Authors: Yuriy Brun, Tian Lin, Jessie Elise Somerville, Elisha Myers, Natalie C. Ebner

    Abstract: Blindspots in APIs can cause software engineers to introduce vulnerabilities, but such blindspots are, unfortunately, common. We study the effect APIs with blindspots have on developers in two languages by replicating an 109-developer, 24-Java-API controlled experiment. Our replication applies to Python and involves 129 new developers and 22 new APIs. We find that using APIs with blindspots statis… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

  9. arXiv:2012.09951  [pdf, other

    cs.LG cs.HC

    Fairkit, Fairkit, on the Wall, Who's the Fairest of Them All? Supporting Data Scientists in Training Fair Models

    Authors: Brittany Johnson, Jesse Bartola, Rico Angell, Katherine Keith, Sam Witty, Stephen J. Giguere, Yuriy Brun

    Abstract: Modern software relies heavily on data and machine learning, and affects decisions that shape our world. Unfortunately, recent studies have shown that because of biases in data, software systems frequently inject bias into their decisions, from producing better closed caption transcriptions of men's voices than of women's voices to overcharging people of color for financial loans. To address bias… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

  10. arXiv:2011.08340  [pdf, other

    cs.SE

    Better Automatic Program Repair by Using Bug Reports and Tests Together

    Authors: Manish Motwani, Yuriy Brun

    Abstract: Automated program repair is already deployed in industry, but concerns remain about repair quality. Recent research has shown that one of the main reasons repair tools produce incorrect (but seemingly correct) patches is imperfect fault localization (FL). This paper demonstrates that combining information from natural-language bug reports and test executions when localizing faults can have a signi… ▽ More

    Submitted 6 February, 2023; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: accepted in ICSE'23 technical track

  11. Wasm/k: Delimited Continuations for WebAssembly

    Authors: Donald Pinckney, Arjun Guha, Yuriy Brun

    Abstract: WebAssembly is designed to be an alternative to JavaScript that is a safe, portable, and efficient compilation target for a variety of languages. The performance of high-level languages depends not only on the underlying performance of WebAssembly, but also on the quality of the generated WebAssembly code. In this paper, we identify several features of high-level languages that current approaches… ▽ More

    Submitted 4 October, 2020; originally announced October 2020.

    Journal ref: Proceedings of the ACM SIGPLAN International Symposium on Dynamic Languages (DLS 2020)

  12. Formal Foundations of Serverless Computing

    Authors: Abhinav Jangda, Donald Pinckney, Yuriy Brun, Arjun Guha

    Abstract: Serverless computing (also known as functions as a service) is a new cloud computing abstraction that makes it easier to write robust, large-scale web services. In serverless computing, programmers write what are called serverless functions, and the cloud platform transparently manages the operating system, resource allocation, load-balancing, and fault tolerance. When demand for the service spike… ▽ More

    Submitted 4 October, 2020; v1 submitted 15 February, 2019; originally announced February 2019.

    Journal ref: PACMPL, OOPSLA issue, vol. 3, October 2019, pp. 149:1-149:26

  13. Causal Testing: Finding Defects' Root Causes

    Authors: Brittany Johnson, Yuriy Brun, Alexandra Meliou

    Abstract: Understanding the root cause of a defect is critical to isolating and repairing buggy behavior. We present Causal Testing, a new method of root-cause analysis that relies on the theory of counterfactual causality to identify a set of executions that likely hold key causal information necessary to understand and repair buggy behavior. Using the Defects4J benchmark, we find that Causal Testing could… ▽ More

    Submitted 18 February, 2020; v1 submitted 18 September, 2018; originally announced September 2018.

    Comments: in Proceedings of the 42nd International Conference on Software Engineering (ICSE), 2020

  14. Tortoise: Interactive System Configuration Repair

    Authors: Aaron Weiss, Arjun Guha, Yuriy Brun

    Abstract: System configuration languages provide powerful abstractions that simplify managing large-scale, networked systems. Thousands of organizations now use configuration languages, such as Puppet. However, specifications written in configuration languages can have bugs and the shell remains the simplest way to debug a misconfigured system. Unfortunately, it is unsafe to use the shell to fix problems wh… ▽ More

    Submitted 15 September, 2017; originally announced September 2017.

    Comments: Published version in proceedings of IEEE/ACM International Conference on Automated Software Engineering (ASE) 2017

    Journal ref: in Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE), 2017, pp. 625-636

  15. arXiv:1709.03221  [pdf, other

    cs.SE cs.AI cs.CY cs.DB cs.LG

    Fairness Testing: Testing Software for Discrimination

    Authors: Sainyam Galhotra, Yuriy Brun, Alexandra Meliou

    Abstract: This paper defines software fairness and discrimination and develops a testing-based method for measuring if and how much software discriminates, focusing on causality in discriminatory behavior. Evidence of software discrimination has been found in modern software systems that recommend criminal sentences, grant access to financial products, and determine who is allowed to participate in promotio… ▽ More

    Submitted 10 September, 2017; originally announced September 2017.

    Comments: Sainyam Galhotra, Yuriy Brun, and Alexandra Meliou. 2017. Fairness Testing: Testing Software for Discrimination. In Proceedings of 2017 11th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE), Paderborn, Germany, September 4-8, 2017 (ESEC/FSE'17). https://doi.org/10.1145/3106237.3106277, ESEC/FSE, 2017

  16. arXiv:1709.01609  [pdf, other

    cs.DL cs.GL cs.SE

    Effectiveness of Anonymization in Double-Blind Review

    Authors: Claire Le Goues, Yuriy Brun, Sven Apel, Emery Berger, Sarfraz Khurshid, Yannis Smaragdakis

    Abstract: Double-blind review relies on the authors' ability and willingness to effectively anonymize their submissions. We explore anonymization effectiveness at ASE 2016, OOPSLA 2016, and PLDI 2016 by asking reviewers if they can guess author identities. We find that 74%-90% of reviews contain no correct guess and that reviewers who self-identify as experts on a paper's topic are more likely to attempt to… ▽ More

    Submitted 5 September, 2017; originally announced September 2017.

    Comments: Effectiveness of Anonymization in Double-Blind Review. Communications of the ACM. 2017

    Journal ref: Communications of the ACM, vol. 61, no. 6, June 2018, pp. 34-37