Skip to main content

Showing 1–26 of 26 results for author: Terry, M

.
  1. arXiv:2406.09264  [pdf, other

    cs.HC cs.AI cs.CL

    Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions

    Authors: Hua Shen, Tiffany Knearem, Reshmi Ghosh, Kenan Alkiek, Kundan Krishna, Yachuan Liu, Ziqiao Ma, Savvas Petridis, Yi-Hao Peng, Li Qiwei, Sushrita Rakshit, Chenglei Si, Yutong Xie, Jeffrey P. Bigham, Frank Bentley, Joyce Chai, Zachary Lipton, Qiaozhu Mei, Rada Mihalcea, Michael Terry, Diyi Yang, Meredith Ringel Morris, Paul Resnick, David Jurgens

    Abstract: Recent advancements in general-purpose AI have highlighted the importance of guiding AI systems towards the intended goals, ethical principles, and values of individuals and groups, a concept broadly recognized as alignment. However, the lack of clarified definitions and scopes of human-AI alignment poses a significant obstacle, hampering collaborative efforts across research domains to achieve th… ▽ More

    Submitted 17 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 56 pages

  2. arXiv:2405.03806  [pdf, other

    cs.HC

    In Situ AI Prototy**: Infusing Multimodal Prompts into Mobile Settings with MobileMaker

    Authors: Savvas Petridis, Michael Xieyang Liu, Alexander J. Fiannaca, Vivian Tsai, Michael Terry, Carrie J. Cai

    Abstract: Recent advances in multimodal large language models (LLMs) have lowered the barriers to rapidly prototy** AI-powered features via prompting, especially for mobile-intended use cases. Despite the value of situated user feedback, the process of soliciting early, mobile-situated user feedback on AI prototypes remains challenging. The broad scope and flexibility of LLMs means that, for a given use-c… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  3. "We Need Structured Output": Towards User-centered Constraints on Large Language Model Output

    Authors: Michael Xieyang Liu, Frederick Liu, Alexander J. Fiannaca, Terry Koo, Lucas Dixon, Michael Terry, Carrie J. Cai

    Abstract: Large language models can produce creative and diverse responses. However, to integrate them into current developer workflows, it is essential to constrain their outputs to follow specific formats or standards. In this work, we surveyed 51 experienced industry professionals to understand the range of scenarios and motivations driving the need for output constraints from a user-centered perspective… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Journal ref: "We Need Structured Output": Towards User-centered Constraints on LLM Output. In Extended Abstracts of the CHI Conference on Human Factors in Computing Systems (CHI EA '24), May 11-16, 2024, Honolulu, HI, USA

  4. arXiv:2404.04289  [pdf, ps, other

    cs.AI cs.HC cs.LG

    Designing for Human-Agent Alignment: Understanding what humans want from their agents

    Authors: Nitesh Goyal, Minsuk Chang, Michael Terry

    Abstract: Our ability to build autonomous agents that leverage Generative AI continues to increase by the day. As builders and users of such agents it is unclear what parameters we need to align on before the agents start performing tasks on our behalf. To discover these parameters, we ran a qualitative empirical research study about designing agents that can negotiate during a fictional yet relatable task… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Human-AI Alignment, Human-Agent Alignment, Agents, Generative AI, Large Language Models

    ACM Class: I.2.0

  5. arXiv:2402.15350  [pdf, other

    cs.HC cs.AI cs.CY cs.LG

    Farsight: Fostering Responsible AI Awareness During AI Application Prototy**

    Authors: Zijie J. Wang, Chinmay Kulkarni, Lauren Wilcox, Michael Terry, Michael Madaio

    Abstract: Prompt-based interfaces for Large Language Models (LLMs) have made prototy** and building AI-powered applications easier than ever before. However, identifying potential harms that may arise from AI applications remains a challenge, particularly during prompt-based prototy**. To address this, we present Farsight, a novel in situ interactive tool that helps people identify potential harms from… ▽ More

    Submitted 2 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted to CHI 2024 (Best Paper, Honorable Mention). 40 pages, 19 figures, 5 tables. For a demo video, see https://youtu.be/BlSFbGkOlHk. For a live demo, visit https://PAIR-code.github.io/farsight. The source code is available at https://github.com/PAIR-code/farsight

  6. arXiv:2402.10524  [pdf, other

    cs.HC cs.AI cs.CL cs.LG

    LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models

    Authors: Minsuk Kahng, Ian Tenney, Mahima Pushkarna, Michael Xieyang Liu, James Wexler, Emily Reif, Krystal Kallarackal, Minsuk Chang, Michael Terry, Lucas Dixon

    Abstract: Automatic side-by-side evaluation has emerged as a promising approach to evaluating the quality of responses from large language models (LLMs). However, analyzing the results from this evaluation approach raises scalability and interpretability challenges. In this paper, we present LLM Comparator, a novel visual analytics tool for interactively analyzing results from automatic side-by-side evaluat… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  7. arXiv:2311.00710  [pdf, other

    cs.HC cs.AI

    AI Alignment in the Design of Interactive AI: Specification Alignment, Process Alignment, and Evaluation Support

    Authors: Michael Terry, Chinmay Kulkarni, Martin Wattenberg, Lucas Dixon, Meredith Ringel Morris

    Abstract: AI alignment considers the overall problem of ensuring an AI produces desired outcomes, without undesirable side effects. While often considered from the perspectives of safety and human values, AI alignment can also be considered in the context of designing and evaluating interfaces for interactive AI systems. This paper maps concepts from AI alignment onto a basic, three step interaction cycle,… ▽ More

    Submitted 23 October, 2023; originally announced November 2023.

  8. arXiv:2310.15435  [pdf, other

    cs.HC cs.AI

    PromptInfuser: How Tightly Coupling AI and UI Design Impacts Designers' Workflows

    Authors: Savvas Petridis, Michael Terry, Carrie J. Cai

    Abstract: Prototy** AI applications is notoriously difficult. While large language model (LLM) prompting has dramatically lowered the barriers to AI prototy**, designers are still prototy** AI functionality and UI separately. We investigate how coupling prompt and UI design affects designers' workflows. Grounding this research, we developed PromptInfuser, a Figma plugin that enables users to create se… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  9. arXiv:2310.15428  [pdf, other

    cs.HC cs.AI

    ConstitutionMaker: Interactively Critiquing Large Language Models by Converting Feedback into Principles

    Authors: Savvas Petridis, Ben Wedin, James Wexler, Aaron Donsbach, Mahima Pushkarna, Nitesh Goyal, Carrie J. Cai, Michael Terry

    Abstract: Large language model (LLM) prompting is a promising new approach for users to create and customize their own chatbots. However, current methods for steering a chatbot's outputs, such as prompt engineering and fine-tuning, do not support users in converting their natural feedback on the model's outputs to changes in the prompt or model. In this work, we explore how to enable users to interactively… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  10. Imaging a $^6$Li Atom In An Optical Tweezer 2000 Times with $Λ$-Enhanced Gray Molasses

    Authors: Karl N. Blodgett, David Peana, Saumitra Phatak, Lane M. Terry, Maria Paula Montes, Jonathan Hood

    Abstract: We have imaged lithium-6 thousands of times in an optical tweezer using $Λ$-enhanced gray molasses cooling light. Despite being the lightest alkali, with a recoil temperature of 3.5 $μ$K, we achieve an imaging survival of 0.99950(2), which sets the new benchmark for low-loss imaging of neutral atoms in optical tweezers. Lithium is loaded directly from a MOT into a tweezer with an enhanced loading… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  11. arXiv:2304.10547  [pdf, ps, other

    cs.AI cs.HC

    The Design Space of Generative Models

    Authors: Meredith Ringel Morris, Carrie J. Cai, Jess Holbrook, Chinmay Kulkarni, Michael Terry

    Abstract: Card et al.'s classic paper "The Design Space of Input Devices" established the value of design spaces as a tool for HCI analysis and invention. We posit that develo** design spaces for emerging pre-trained, generative AI models is necessary for supporting their integration into human-centered systems and practices. We explore what it means to develop an AI model design space by proposing two de… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Journal ref: NeurIps 2022 Human-Centered AI Workshop

  12. arXiv:2303.12647  [pdf, other

    cs.HC

    A Word is Worth a Thousand Pictures: Prompts as AI Design Material

    Authors: Chinmay Kulkarni, Stefania Druga, Minsuk Chang, Alex Fiannaca, Carrie Cai, Michael Terry

    Abstract: Recent advances in Machine-Learning have led to the development of models that generate images based on a text description.Such large prompt-based text to image models (TTIs), trained on a considerable amount of data, allow the creation of high-quality images by users with no graphics or design training. This paper examines the role such TTI models can playin collaborative, goal-oriented design. T… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: 22 pages, 5 figures

  13. arXiv:2303.12253  [pdf, other

    cs.HC

    The Prompt Artists

    Authors: Minsuk Chang, Stefania Druga, Alex Fiannaca, Pedro Vergani, Chinmay Kulkarni, Carrie Cai, Michael Terry

    Abstract: This paper examines the art practices, artwork, and motivations of prolific users of the latest generation of text-to-image models. Through interviews, observations, and a user survey, we present a sampling of the artistic styles and describe the developed community of practice around generative AI. We find that: 1) the text prompt and the resulting image can be considered collectively as an art p… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 20 pages, 7 figures

  14. arXiv:2203.06566  [pdf, other

    cs.HC

    PromptChainer: Chaining Large Language Model Prompts through Visual Programming

    Authors: Tongshuang Wu, Ellen Jiang, Aaron Donsbach, Jeff Gray, Alejandra Molina, Michael Terry, Carrie J Cai

    Abstract: While LLMs can effectively help prototype single ML functionalities, many real-world applications involve complex tasks that cannot be easily handled via a single run of an LLM. Recent work has found that chaining multiple LLM runs together (with the output of one step being the input to the next) can help users accomplish these more complex tasks, and in a way that is perceived to be more transpa… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

    Comments: CHI LBW 2022

  15. arXiv:2201.11196  [pdf, other

    cs.LG cs.HC

    IMACS: Image Model Attribution Comparison Summaries

    Authors: Eldon Schoop, Ben Wedin, Andrei Kapishnikov, Tolga Bolukbasi, Michael Terry

    Abstract: Develo** a suitable Deep Neural Network (DNN) often requires significant iteration, where different model versions are evaluated and compared. While metrics such as accuracy are a powerful means to succinctly describe a model's performance across a dataset or to directly compare model versions, practitioners often wish to gain a deeper understanding of the factors that influence a model's predic… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

  16. arXiv:2110.01691  [pdf, other

    cs.HC cs.CL

    AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts

    Authors: Tongshuang Wu, Michael Terry, Carrie J. Cai

    Abstract: Although large language models (LLMs) have demonstrated impressive potential on simple tasks, their breadth of scope, lack of transparency, and insufficient controllability can make them less effective when assisting humans on more complex tasks. In response, we introduce the concept of Chaining LLM steps together, where the output of one step becomes the input for the next, thus aggregating the g… ▽ More

    Submitted 17 March, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

  17. arXiv:2108.07732  [pdf, other

    cs.PL cs.LG

    Program Synthesis with Large Language Models

    Authors: Jacob Austin, Augustus Odena, Maxwell Nye, Maarten Bosma, Henryk Michalewski, David Dohan, Ellen Jiang, Carrie Cai, Michael Terry, Quoc Le, Charles Sutton

    Abstract: This paper explores the limits of the current generation of large language models for program synthesis in general purpose programming languages. We evaluate a collection of such models (with between 244M and 137B parameters) on two new benchmarks, MBPP and MathQA-Python, in both the few-shot and fine-tuning regimes. Our benchmarks are designed to measure the ability of these models to synthesize… ▽ More

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: Jacob and Augustus contributed equally

  18. arXiv:2106.09788  [pdf, other

    cs.CV cs.LG

    Guided Integrated Gradients: An Adaptive Path Method for Removing Noise

    Authors: Andrei Kapishnikov, Subhashini Venugopalan, Besim Avci, Ben Wedin, Michael Terry, Tolga Bolukbasi

    Abstract: Integrated Gradients (IG) is a commonly used feature attribution method for deep neural networks. While IG has many desirable properties, the method often produces spurious/noisy pixel attributions in regions that are not related to the predicted class when applied to visual models. While this has been previously noted, most existing solutions are aimed at addressing the symptoms by explicitly red… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 13 pages, 11 figures, for implementation sources see https://github.com/PAIR-code/saliency

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 5050-5058

  19. arXiv:2002.05249  [pdf, other

    stat.ME stat.AP

    A Competing Risks Model with Binary Time Varying Covariates for Estimation of Breast Cancer Risks in BRCA1 Families

    Authors: Yun-Hee Choi, Hae Jung, Saundra Buys, Mary Daly, Esther John, John Hopper, Irene Andrulis, Mary-Beth Terry, Laurent Briollais

    Abstract: Mammographic screening and prophylactic surgery such as risk-reducing sal**o oophorectomy (RRSO) can potentially reduce breast cancer risks among mutation carriers of BRCA families. The evaluation of these interventions is usually complicated by the fact that their effects on breast cancer may change over time and by the presence of competing risks. We introduce a correlated competing risks mode… ▽ More

    Submitted 5 November, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

  20. arXiv:1906.02825  [pdf, other

    cs.CV stat.ML

    XRAI: Better Attributions Through Regions

    Authors: Andrei Kapishnikov, Tolga Bolukbasi, Fernanda Viégas, Michael Terry

    Abstract: Saliency methods can aid understanding of deep neural networks. Recent years have witnessed many improvements to saliency methods, as well as new ways for evaluating them. In this paper, we 1) present a novel region-based attribution method, XRAI, that builds upon integrated gradients (Sundararajan et al. 2017), 2) introduce evaluation methods for empirically assessing the quality of image-based s… ▽ More

    Submitted 20 August, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

  21. arXiv:1902.02960  [pdf

    cs.HC cs.CY

    Human-Centered Tools for Co** with Imperfect Algorithms during Medical Decision-Making

    Authors: Carrie J. Cai, Emily Reif, Narayan Hegde, Jason Hipp, Been Kim, Daniel Smilkov, Martin Wattenberg, Fernanda Viegas, Greg S. Corrado, Martin C. Stumpe, Michael Terry

    Abstract: Machine learning (ML) is increasingly being used in image retrieval systems for medical decision making. One application of ML is to retrieve visually similar medical images from past patients (e.g. tissue from biopsies) to reference when making a medical decision with a new patient. However, no algorithm can perfectly capture an expert's ideal notion of similarity for every case: an image that is… ▽ More

    Submitted 8 February, 2019; originally announced February 2019.

  22. Similar Image Search for Histopathology: SMILY

    Authors: Narayan Hegde, Jason D. Hipp, Yun Liu, Michael E. Buck, Emily Reif, Daniel Smilkov, Michael Terry, Carrie J. Cai, Mahul B. Amin, Craig H. Mermel, Phil Q. Nelson, Lily H. Peng, Greg S. Corrado, Martin C. Stumpe

    Abstract: The increasing availability of large institutional and public histopathology image datasets is enabling the searching of these datasets for diagnosis, research, and education. Though these datasets typically have associated metadata such as diagnosis or clinical notes, even carefully curated datasets rarely contain annotations of the location of regions of interest on each image. Because pathology… ▽ More

    Submitted 5 February, 2019; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: 23 Pages with 6 figures and 3 tables. The file also has 6 pages of supplemental material. Improved figure resolution, edited metadata

    Journal ref: Nature Partner Journal Digital Medicine (2019)

  23. arXiv:1901.05350  [pdf, other

    cs.LG

    TensorFlow.js: Machine Learning for the Web and Beyond

    Authors: Daniel Smilkov, Nikhil Thorat, Yannick Assogba, Ann Yuan, Nick Kreeger, ** Yu, Kangyi Zhang, Shanqing Cai, Eric Nielsen, David Soergel, Stan Bileschi, Michael Terry, Charles Nicholson, Sandeep N. Gupta, Sarah Sirajuddin, D. Sculley, Rajat Monga, Greg Corrado, Fernanda B. Viégas, Martin Wattenberg

    Abstract: TensorFlow.js is a library for building and executing machine learning algorithms in JavaScript. TensorFlow.js models run in a web browser and in the Node.js environment. The library is part of the TensorFlow ecosystem, providing a set of APIs that are compatible with those in Python, allowing models to be ported between the Python and JavaScript ecosystems. TensorFlow.js has empowered a new set o… ▽ More

    Submitted 27 February, 2019; v1 submitted 16 January, 2019; originally announced January 2019.

    Comments: 10 pages, expanded performance section, fixed page breaks in code listings

  24. arXiv:1811.02708  [pdf, ps, other

    math.CO

    The Relationship Between Pascal's Triangle and Random Walks

    Authors: Tonia Bell, Shakuan Frankson, Nikita Sachdeva, Myka Terry

    Abstract: Random walks are a series of up, down, and level steps that enumerate distinct paths from $(0,0)$ to $(2n,0)$, where $n$ is the semi-length of the path. We used these paths to analyze Catalan, Schröder, and Motzkin number sequences through a combination of matrix operations, quadratic functions, and inductive reasoning. Our results revealed a number of distinct patterns, some unnamed, between thes… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: Comments welcome! 27 pages

  25. arXiv:1811.02707  [pdf, ps, other

    math.CO

    Investigating First Returns: The Effect of Multicolored Vectors

    Authors: Shakuan Frankson, Myka Terry

    Abstract: By definition, a first return is the immediate moment that a path, using vectors in the Cartesian plane, touches the $x$-axis after leaving it previously from a given point; the initial point is often the origin. In this case, using certain diagonal and horizontal vectors while restricting the movements to the first quadrant will cause almost every first return to end at the point $(2n,0)$, where… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: Comments welcome! 12 pages

  26. arXiv:1611.09207  [pdf, other

    cs.CL cs.LG stat.ML

    AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech

    Authors: Brian Patton, Yannis Agiomyrgiannakis, Michael Terry, Kevin Wilson, Rif A. Saurous, D. Sculley

    Abstract: Developers of text-to-speech synthesizers (TTS) often make use of human raters to assess the quality of synthesized speech. We demonstrate that we can model human raters' mean opinion scores (MOS) of synthesized speech using a deep recurrent neural network whose inputs consist solely of a raw waveform. Our best models provide utterance-level estimates of MOS only moderately inferior to sampled hum… ▽ More

    Submitted 28 November, 2016; originally announced November 2016.

    Comments: 4 pages, 2 figures, 2 tables, NIPS 2016 End-to-end Learning for Speech and Audio Processing Workshop