Memory-Augmented Generative Adversarial Transformers

Raaijmakers, Stephan; Bakker, Roos; Cremers, Anita; de Kleijn, Roy; Kouwenhoven, Tom; Verhoef, Tessa

arXiv:2402.19218 (cs)

[Submitted on 29 Feb 2024]

Abstract: Conversational AI systems that rely on Large Language Models, like Transformers, have difficulty interweaving external data (like facts) with the language they generate. Vanilla Transformer architectures are not designed for answering factual questions with high accuracy. This paper investigates a possible route for addressing this problem. We propose to extend the standard Transformer architecture with an additional memory bank holding extra information (such as facts drawn from a knowledge base), and an extra attention layer for addressing this memory. We add this augmented memory to a Generative Adversarial Network-inspired Transformer architecture. This setup allows for implementing arbitrary felicity conditions on the generated language of the Transformer. We first demonstrate how this machinery can be deployed for handling factual questions in goal-oriented dialogues. Secondly, we demonstrate that our approach can be useful for applications like style adaptation as well: the adaptation of utterances according to certain stylistic (external) constraints, like social properties of human interlocutors in dialogues.
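
The abstract mentions a memory bank addressed by an extra attention layer but gives no implementation details. The following is only a minimal sketch of what such a memory read could look like, assuming plain scaled dot-product attention from token states to memory slots and a residual update. The function names, the residual connection, and the toy vectors are all assumptions made for illustration and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def memory_attention(hidden, memory_keys, memory_values):
    """Attend from each token state to every memory slot (hypothetical helper).

    hidden:        list of token vectors, each of length d
    memory_keys:   list of slot key vectors, each of length d
    memory_values: list of slot value vectors, each of length d
    Returns (updated token states, attention weights per token).
    """
    d = len(hidden[0])
    updated, all_weights = [], []
    for h in hidden:
        # Scaled dot-product score between this token and each memory slot.
        scores = [sum(a * b for a, b in zip(h, k)) / math.sqrt(d)
                  for k in memory_keys]
        weights = softmax(scores)
        # Weighted sum of the memory values: the vector read from memory.
        read = [sum(w * v[i] for w, v in zip(weights, memory_values))
                for i in range(d)]
        # Residual update (an assumption): add the memory read to the token state.
        updated.append([x + r for x, r in zip(h, read)])
        all_weights.append(weights)
    return updated, all_weights


# Toy data: two tokens, two memory slots, dimension 2 (all values made up).
hidden = [[1.0, 0.0], [0.0, 1.0]]
memory_keys = [[4.0, 0.0], [0.0, 4.0]]
memory_values = [[0.5, 0.5], [-0.5, 0.5]]
updated, weights = memory_attention(hidden, memory_keys, memory_values)
```

In this toy setup each token aligns with one memory key, so most of its attention weight falls on that slot. In a trained model the keys and values would presumably be learned projections of knowledge-base entries, which this sketch leaves out.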

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2402.19218 [cs.CL]
	(or arXiv:2402.19218v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.19218

Submission history

From: Stephan Raaijmakers
[v1] Thu, 29 Feb 2024 14:47:24 UTC (544 KB)
