Language-Guided World Models: A Model-Based Approach to AI Control

Zhang, Alex; Nguyen, Khanh; Tuyls, Jens; Lin, Albert; Narasimhan, Karthik

Computer Science > Computation and Language

arXiv:2402.01695 (cs)

[Submitted on 24 Jan 2024 (v1), last revised 5 Jul 2024 (this version, v2)]

Title:Language-Guided World Models: A Model-Based Approach to AI Control

Authors:Alex Zhang, Khanh Nguyen, Jens Tuyls, Albert Lin, Karthik Narasimhan

View PDF HTML (experimental)

Abstract:This paper introduces the concept of Language-Guided World Models (LWMs) -- probabilistic models that can simulate environments by reading texts. Agents equipped with these models provide humans with more extensive and efficient control, allowing them to simultaneously alter agent behaviors in multiple tasks via natural verbal communication. In this work, we take initial steps in develo** robust LWMs that can generalize to compositionally novel language descriptions. We design a challenging world modeling benchmark based on the game of MESSENGER (Hanjie et al., 2021), featuring evaluation settings that require varying degrees of compositional generalization. Our experiments reveal the lack of generalizability of the state-of-the-art Transformer model, as it offers marginal improvements in simulation quality over a no-text baseline. We devise a more robust model by fusing the Transformer with the EMMA attention mechanism (Hanjie et al., 2021). Our model substantially outperforms the Transformer and approaches the performance of a model with an oracle semantic parsing and grounding capability. To demonstrate the practicality of this model in improving AI safety and transparency, we simulate a scenario in which the model enables an agent to present plans to a human before execution, and to revise plans based on their language feedback.

Comments:	SpLU-RoboNLP workshop at ACL 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2402.01695 [cs.CL]
	(or arXiv:2402.01695v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.01695

Submission history

From: Khanh Nguyen [view email]
[v1] Wed, 24 Jan 2024 03:11:36 UTC (1,569 KB)
[v2] Fri, 5 Jul 2024 02:49:47 UTC (1,560 KB)

Computer Science > Computation and Language

Title:Language-Guided World Models: A Model-Based Approach to AI Control

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Language-Guided World Models: A Model-Based Approach to AI Control

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators