System 2 Attention (is something you might need too)

Weston, Jason; Sukhbaatar, Sainbayar

Computer Science > Computation and Language

arXiv:2311.11829 (cs)

[Submitted on 20 Nov 2023]

Title:System 2 Attention (is something you might need too)

Authors:Jason Weston, Sainbayar Sukhbaatar

View PDF

Abstract:Soft attention in Transformer-based Large Language Models (LLMs) is susceptible to incorporating irrelevant information from the context into its latent representations, which adversely affects next token generations. To help rectify these issues, we introduce System 2 Attention (S2A), which leverages the ability of LLMs to reason in natural language and follow instructions in order to decide what to attend to. S2A regenerates the input context to only include the relevant portions, before attending to the regenerated context to elicit the final response. In experiments, S2A outperforms standard attention-based LLMs on three tasks containing opinion or irrelevant information, QA, math word problems and longform generation, where S2A increases factuality and objectivity, and decreases sycophancy.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2311.11829 [cs.CL]
	(or arXiv:2311.11829v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.11829

Submission history

From: Jason Weston [view email]
[v1] Mon, 20 Nov 2023 15:04:50 UTC (97 KB)

Full-text links:

Access Paper:

view license

Current browse context:

< prev | next >

new | recent | 2023-11

Change to browse by:

cs.AI
cs.CL
cs.LG

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:System 2 Attention (is something you might need too)

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:System 2 Attention (is something you might need too)

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators