Showing 1–1 of 1 results for author: Hollinsworth, O J

  1. arXiv:2310.15154  [pdf, other]

    cs.LG cs.AI cs.CL

    Linear Representations of Sentiment in Large Language Models

    Authors: Curt Tigges, Oskar John Hollinsworth, Atticus Geiger, Neel Nanda

    Abstract: Sentiment is a pervasive feature in natural language text, yet it is an open question how sentiment is represented within Large Language Models (LLMs). In this study, we reveal that across a range of models, sentiment is represented linearly: a single direction in activation space mostly captures the feature across a range of tasks with one extreme for positive and the other for negative. Through…

    Submitted 23 October, 2023; originally announced October 2023.