A Prefrontal Cortex-inspired Architecture for Planning in Large Language Models

Webb, Taylor; Mondal, Shanka Subhra; Wang, Chi; Krabach, Brian; Momennejad, Ida

Computer Science > Artificial Intelligence

arXiv:2310.00194 (cs)

[Submitted on 30 Sep 2023 (v1), last revised 6 Mar 2024 (this version, v3)]

Title:A Prefrontal Cortex-inspired Architecture for Planning in Large Language Models

Authors:Taylor Webb, Shanka Subhra Mondal, Chi Wang, Brian Krabach, Ida Momennejad

View PDF HTML (experimental)

Abstract:Large language models (LLMs) demonstrate impressive performance on a wide variety of tasks, but they often struggle with tasks that require multi-step reasoning or goal-directed planning. To address this, we take inspiration from the human brain, in which planning is accomplished via the recurrent interaction of specialized modules in the prefrontal cortex (PFC). These modules perform functions such as conflict monitoring, state prediction, state evaluation, task decomposition, and task coordination. We find that LLMs are sometimes capable of carrying out these functions in isolation, but struggle to autonomously coordinate them in the service of a goal. Therefore, we propose a black box architecture with multiple LLM-based (GPT-4) modules. The architecture improves planning through the interaction of specialized PFC-inspired modules that break down a larger problem into multiple brief automated calls to the LLM. We evaluate the combined architecture on three challenging planning tasks -- graph traversal, Tower of Hanoi, and logistics -- finding that it yields significant improvements over standard LLM methods (e.g., zero-shot prompting, in-context learning, and chain-of-thought). These results demonstrate the benefit of utilizing knowledge from cognitive neuroscience to improve planning in LLMs.

Subjects:	Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2310.00194 [cs.AI]
	(or arXiv:2310.00194v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2310.00194

Submission history

From: Taylor Webb [view email]
[v1] Sat, 30 Sep 2023 00:10:14 UTC (1,570 KB)
[v2] Tue, 5 Mar 2024 18:12:06 UTC (505 KB)
[v3] Wed, 6 Mar 2024 03:24:45 UTC (505 KB)

Computer Science > Artificial Intelligence

Title:A Prefrontal Cortex-inspired Architecture for Planning in Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:A Prefrontal Cortex-inspired Architecture for Planning in Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators