Skip to main content

Showing 1–5 of 5 results for author: Arawjo, I

.
  1. arXiv:2404.12272  [pdf, other

    cs.HC cs.AI

    Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

    Authors: Shreya Shankar, J. D. Zamfirescu-Pereira, Björn Hartmann, Aditya G. Parameswaran, Ian Arawjo

    Abstract: Due to the cumbersome nature of human evaluation and limitations of code-based evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in evaluating LLM outputs. Yet LLM-generated evaluators simply inherit all the problems of the LLMs they evaluate, requiring further human validation. We present a mixed-initiative approach to ``validate the validators'' -- aligning LL… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 16 pages, 4 figures, 2 tables

  2. arXiv:2402.07350  [pdf, other

    cs.AI cs.HC

    Antagonistic AI

    Authors: Alice Cai, Ian Arawjo, Elena L. Glassman

    Abstract: The vast majority of discourse around AI development assumes that subservient, "moral" models aligned with "human values" are universally beneficial -- in short, that good AI is sycophantic AI. We explore the shadow of the sycophantic paradigm, a design space we term antagonistic AI: AI systems that are disagreeable, rude, interrupting, confrontational, challenging, etc. -- embedding opposite beha… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: 17 pages, 1 figure, 5 tables

    ACM Class: I.2.0; J.0; K.4.0

  3. arXiv:2402.07342  [pdf, other

    cs.HC cs.AI

    Imagining a Future of Designing with AI: Dynamic Grounding, Constructive Negotiation, and Sustainable Motivation

    Authors: Priyan Vaithilingam, Ian Arawjo, Elena L. Glassman

    Abstract: We ideate a future design workflow that involves AI technology. Drawing from activity and communication theory, we attempt to isolate the new value large AI models can provide design compared to past technologies. We arrive at three affordances -- dynamic grounding, constructive negotiation, and sustainable motivation -- that summarize latent qualities of natural language-enabled foundation models… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: 12 pages, 4 figures

    ACM Class: J.6; I.2.0; H.5.2

  4. arXiv:2401.10873  [pdf, other

    cs.HC

    An AI-Resilient Text Rendering Technique for Reading and Skimming Documents

    Authors: Ziwei Gu, Ian Arawjo, Kenneth Li, Jonathan K. Kummerfeld, Elena L. Glassman

    Abstract: Readers find text difficult to consume for many reasons. Summarization can address some of these difficulties, but introduce others, such as omitting, misrepresenting, or hallucinating information, which can be hard for a reader to notice. One approach to addressing this problem is to instead modify how the original text is rendered to make important information more salient. We introduce Grammar-… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: Conditionally accepted to CHI 2024

  5. ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

    Authors: Ian Arawjo, Chelse Swoopes, Priyan Vaithilingam, Martin Wattenberg, Elena Glassman

    Abstract: Evaluating outputs of large language models (LLMs) is challenging, requiring making -- and making sense of -- many responses. Yet tools that go beyond basic prompting tend to require knowledge of programming APIs, focus on narrow domains, or are closed-source. We present ChainForge, an open-source visual toolkit for prompt engineering and on-demand hypothesis testing of text generation LLMs. Chain… ▽ More

    Submitted 3 May, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: 18 pages, 7 figures, published at CHI 2024

    ACM Class: H.5.2; I.2