Skip to main content

Showing 1–6 of 6 results for author: Rytting, C M

.
  1. arXiv:2402.12556  [pdf, other

    cs.HC cs.CL

    IMBUE: Improving Interpersonal Effectiveness through Simulation and Just-in-time Feedback with Human-Language Model Interaction

    Authors: Inna Wanyin Lin, Ashish Sharma, Christopher Michael Rytting, Adam S. Miner, **a Suh, Tim Althoff

    Abstract: Navigating certain communication situations can be challenging due to individuals' lack of skills and the interference of strong emotions. However, effective learning opportunities are rarely accessible. In this work, we conduct a human-centered study that uses language models to simulate bespoke communication training and provide just-in-time feedback to support the practice and learning of inter… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  2. arXiv:2402.05070  [pdf, other

    cs.AI cs.CL cs.IR

    A Roadmap to Pluralistic Alignment

    Authors: Taylor Sorensen, Jared Moore, Jillian Fisher, Mitchell Gordon, Niloofar Mireshghallah, Christopher Michael Rytting, Andre Ye, Liwei Jiang, Ximing Lu, Nouha Dziri, Tim Althoff, Ye** Choi

    Abstract: With increased power and prevalence of AI systems, it is ever more critical that AI systems are designed to serve all, i.e., people with diverse values and perspectives. However, aligning models to serve pluralistic human values remains an open research question. In this piece, we propose a roadmap to pluralistic alignment, specifically using language models as a test bed. We identify and formaliz… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  3. arXiv:2306.02177  [pdf, other

    cs.AI

    Towards Coding Social Science Datasets with Language Models

    Authors: Christopher Michael Rytting, Taylor Sorensen, Lisa Argyle, Ethan Busby, Nancy Fulda, Joshua Gubler, David Wingate

    Abstract: Researchers often rely on humans to code (label, annotate, etc.) large sets of texts. This kind of human coding forms an important part of social science research, yet the coding process is both resource intensive and highly variable from application to application. In some cases, efforts to automate this process have achieved human-level accuracies, but to achieve this, these attempts frequently… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

  4. arXiv:2210.12353  [pdf, other

    cs.CL cs.LG

    Leveraging Large Language Models for Multiple Choice Question Answering

    Authors: Joshua Robinson, Christopher Michael Rytting, David Wingate

    Abstract: While large language models (LLMs) like GPT-3 have achieved impressive results on multiple choice question answering (MCQA) tasks in the zero, one, and few-shot settings, they generally lag behind the MCQA state of the art (SOTA). MCQA tasks have traditionally been presented to LLMs like cloze tasks. An LLM is conditioned on a question (without the associated answer options) and its chosen option… ▽ More

    Submitted 16 March, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: Accepted for ICLR 2023

  5. An Information-theoretic Approach to Prompt Engineering Without Ground Truth Labels

    Authors: Taylor Sorensen, Joshua Robinson, Christopher Michael Rytting, Alexander Glenn Shaw, Kyle Jeffrey Rogers, Alexia Pauline Delorey, Mahmoud Khalil, Nancy Fulda, David Wingate

    Abstract: Pre-trained language models derive substantial linguistic and factual knowledge from the massive corpora on which they are trained, and prompt engineering seeks to align these models to specific tasks. Unfortunately, existing prompt engineering methods require significant amounts of labeled data, access to model parameters, or both. We introduce a new method for selecting prompt templates \textit{… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  6. arXiv:2110.02370  [pdf, other

    cs.CL cs.AI

    Leveraging the Inductive Bias of Large Language Models for Abstract Textual Reasoning

    Authors: Christopher Michael Rytting, David Wingate

    Abstract: Large natural language models (such as GPT-3 or T5) demonstrate impressive abilities across a range of general NLP tasks. Here, we show that the knowledge embedded in such models provides a useful inductive bias, not just on traditional NLP tasks, but also in the nontraditional task of training a symbolic reasoning engine. We observe that these engines learn quickly and generalize in a natural way… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.