Showing 1–8 of 8 results for author: Hallinan, S

  1. arXiv:2311.07167  [pdf, other]

    cs.CL cs.AI

    STEER: Unified Style Transfer with Expert Reinforcement

    Authors: Skyler Hallinan, Faeze Brahman, Ximing Lu, Jaehun Jung, Sean Welleck, Yejin Choi

    Abstract: While text style transfer has many applications across natural language processing, the core premise of transferring from a single source style is unrealistic in a real-world setting. In this work, we focus on arbitrary style transfer: rewriting a text from an arbitrary, unknown style to a target style. We propose STEER: Unified Style Transfer with Expert Reinforcement, a unified framework deve…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: for associated code, see https://github.com/shallinan1/STEERStyleTransfer

  2. arXiv:2311.02805  [pdf, other]

    cs.CL

    Tailoring Self-Rationalizers with Multi-Reward Distillation

    Authors: Sahana Ramnath, Brihi Joshi, Skyler Hallinan, Ximing Lu, Liunian Harold Li, Aaron Chan, Jack Hessel, Yejin Choi, Xiang Ren

    Abstract: Large language models (LMs) are capable of generating free-text rationales to aid question answering. However, prior work 1) suggests that useful self-rationalization is emergent only at significant scales (e.g., 175B parameter GPT-3); and 2) focuses largely on downstream performance, ignoring the semantics of the rationales themselves, e.g., are they faithful, true, and helpful for humans? In thi…

    Submitted 22 May, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Journal ref: The Twelfth International Conference on Learning Representations, 2024

  3. arXiv:2305.15065  [pdf, other]

    cs.CL

    Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning

    Authors: Ximing Lu, Faeze Brahman, Peter West, Jaehun Jang, Khyathi Chandu, Abhilasha Ravichander, Lianhui Qin, Prithviraj Ammanabrolu, Liwei Jiang, Sahana Ramnath, Nouha Dziri, Jillian Fisher, Bill Yuchen Lin, Skyler Hallinan, Xiang Ren, Sean Welleck, Yejin Choi

    Abstract: While extreme-scale language models have demonstrated exceptional performance on a variety of language tasks, the degree of control over these language models through pure prompting can often be limited. Directly fine-tuning such language models can be effective for tailoring them, but it can be either extremely costly (e.g., GPT-3) or not even feasible for the broader community (e.g., GPT-4). W…

    Submitted 6 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  4. arXiv:2303.17651  [pdf, other]

    cs.CL cs.AI cs.LG

    Self-Refine: Iterative Refinement with Self-Feedback

    Authors: Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Shashank Gupta, Bodhisattwa Prasad Majumder, Katherine Hermann, Sean Welleck, Amir Yazdanbakhsh, Peter Clark

    Abstract: Like humans, large language models (LLMs) do not always generate the best output on their first try. Motivated by how humans refine their written text, we introduce Self-Refine, an approach for improving initial outputs from LLMs through iterative feedback and refinement. The main idea is to generate an initial output using an LLM; then, the same LLM provides feedback for its output and uses it…

    Submitted 25 May, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Code, data, and demo at https://selfrefine.info/

  5. arXiv:2212.10543  [pdf, other]

    cs.CL cs.AI

    Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts

    Authors: Skyler Hallinan, Alisa Liu, Yejin Choi, Maarten Sap

    Abstract: Text detoxification has the potential to mitigate the harms of toxicity by rephrasing text to remove offensive meaning, but subtle toxicity remains challenging to tackle. We introduce MaRCo, a detoxification algorithm that combines controllable generation and text rewriting methods using a Product of Experts with autoencoder language models (LMs). MaRCo uses likelihoods under a non-toxic LM (exper…

    Submitted 26 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  6. arXiv:2210.03078  [pdf, other]

    cs.CL cs.AI

    Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering

    Authors: Jiacheng Liu, Skyler Hallinan, Ximing Lu, Pengfei He, Sean Welleck, Hannaneh Hajishirzi, Yejin Choi

    Abstract: Knowledge underpins reasoning. Recent research demonstrates that when relevant knowledge is provided as additional context to commonsense question answering (QA), it can substantially enhance the performance even on top of state-of-the-art. The fundamental challenge is where and how to find such knowledge that is high quality and on point with respect to the question; knowledge retrieved from know…

    Submitted 22 October, 2022; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022 main conference

  7. arXiv:2106.13753  [pdf]

    cs.HC

    Investigating behavior change indicators and cognitive measures in persuasive health games

    Authors: S. Durga, S. Hallinan, M. Seif El-Nasr, M. Shiyko, C. Sceppa

    Abstract: Outcome-driven studies designed to evaluate potential effects of games and apps designed to promote healthy eating and exercising remain limited, either targeting design or usability factors while omitting health-based outcomes altogether, or tending to focus too narrowly on behavioral outcomes within a short period of time, thereby being less likely to influence longitudinal factors that can help…

    Submitted 25 June, 2021; originally announced June 2021.

    Journal ref: Foundations of Digital Games, 2015

  8. arXiv:2104.08790  [pdf, other]

    cs.CL

    Misinfo Reaction Frames: Reasoning about Readers' Reactions to News Headlines

    Authors: Saadia Gabriel, Skyler Hallinan, Maarten Sap, Pemi Nguyen, Franziska Roesner, Eunsol Choi, Yejin Choi

    Abstract: Even to a simple and short news headline, readers react in a multitude of ways: cognitively (e.g. inferring the writer's intent), emotionally (e.g. feeling distrust), and behaviorally (e.g. sharing the news with their friends). Such reactions are instantaneous and yet complex, as they rely on factors that go beyond interpreting factual content of news. We propose Misinfo Reaction Frames (MRF), a p…

    Submitted 22 March, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: ACL 2022 camera-ready