A Prefrontal Cortex-inspired Architecture for Planning in Large Language Models

AI-generated keywords: Large Language Models (LLMs)

AI-generated Key Points

  • Large Language Models (LLMs) struggle with tasks requiring multi-step reasoning or goal-directed planning
  • Authors propose a black box architecture inspired by the human brain's prefrontal cortex (PFC)
  • PFC-inspired modules perform functions like conflict monitoring, state prediction, state evaluation, task decomposition, and task coordination
  • LLMs can perform these functions individually but struggle to coordinate them towards a goal
  • Proposed architecture combines LLM-based modules (GPT-4) with specialized PFC-inspired modules to break down complex problems into automated calls to the LLM
  • Evaluation on planning tasks shows significant improvements compared to standard LLM methods like zero-shot prompting or in-context learning
  • Incorporating knowledge from cognitive neuroscience enhances planning capabilities of LLMs
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Taylor Webb, Shanka Subhra Mondal, Chi Wang, Brian Krabach, Ida Momennejad

License: CC BY 4.0

Abstract: Large language models (LLMs) demonstrate impressive performance on a wide variety of tasks, but they often struggle with tasks that require multi-step reasoning or goal-directed planning. To address this, we take inspiration from the human brain, in which planning is accomplished via the recurrent interaction of specialized modules in the prefrontal cortex (PFC). These modules perform functions such as conflict monitoring, state prediction, state evaluation, task decomposition, and task coordination. We find that LLMs are sometimes capable of carrying out these functions in isolation, but struggle to autonomously coordinate them in the service of a goal. Therefore, we propose a black box architecture with multiple LLM-based (GPT-4) modules. The architecture improves planning through the interaction of specialized PFC-inspired modules that break down a larger problem into multiple brief automated calls to the LLM. We evaluate the combined architecture on two challenging planning tasks -- graph traversal and Tower of Hanoi -- finding that it yields significant improvements over standard LLM methods (e.g., zero-shot prompting or in-context learning). These results demonstrate the benefit of utilizing knowledge from cognitive neuroscience to improve planning in LLMs.

Submitted to arXiv on 30 Sep. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2310.00194v1

Large Language Models (LLMs) have shown impressive performance on various tasks, but they often struggle with tasks that require multi-step reasoning or goal-directed planning. To address this limitation, the authors propose a black box architecture inspired by the human brain's prefrontal cortex (PFC), which is responsible for planning through recurrent interaction of specialized modules. These PFC-inspired modules perform functions such as conflict monitoring, state prediction, state evaluation, task decomposition, and task coordination. While LLMs can perform these functions in isolation, they struggle to coordinate them autonomously towards a goal. The proposed architecture utilizes multiple LLM-based modules (GPT-4) that interact with specialized PFC-inspired modules to break down complex problems into brief automated calls to the LLM. The authors evaluate this combined architecture on challenging planning tasks like graph traversal and Tower of Hanoi and find significant improvements compared to standard LLM methods such as zero-shot prompting or in-context learning. This research demonstrates the potential benefits of incorporating knowledge from cognitive neuroscience into LLMs to enhance their planning capabilities.
Created on 06 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.