Agents Thinking Fast and Slow: A Talker-Reasoner Architecture

AI-generated keywords: Language Models, Conversational Agents, Dual-System Architecture, Natural Language Feedback, Human Cognition

AI-generated Key Points

  • Large language models (LLMs) have transformed agent-user interactions through natural conversation
  • Agents now balance conversing with users and engaging in multi-step reasoning and planning to achieve goals
  • Dual-system architecture introduced: "Talker" agent for quick, intuitive responses and "Reasoner" agent for deliberate reasoning, planning, and action execution
  • Advantages of Talker-Reasoner architecture include modularity and decreased latency
  • Model integrates talking while reasoning/planning and explicit belief modeling for a sophisticated understanding of user behavior
  • Incorporates natural language feedback into decision-making process to continuously update beliefs about user goals, plans, motivations, and barriers
  • Theoretical underpinnings of the model related to human cognition systems are discussed alongside a practical example - a sleep coaching agent - to demonstrate real-world relevance
  • Integration of fast intuitive responses with slower logical reasoning enhances conversational agents' capabilities across various domains

Authors: Konstantina Christakopoulou, Shibl Mourad, Maja Matarić

License: CC BY 4.0

Abstract: Large language models have enabled agents of all kinds to interact with users through natural conversation. Consequently, agents now have two jobs: conversing and planning/reasoning. Their conversational responses must be informed by all available information, and their actions must help to achieve goals. This dichotomy between conversing with the user and doing multi-step reasoning and planning can be seen as analogous to the human systems of "thinking fast and slow" as introduced by Kahneman. Our approach is comprised of a "Talker" agent (System 1) that is fast and intuitive, and tasked with synthesizing the conversational response; and a "Reasoner" agent (System 2) that is slower, more deliberative, and more logical, and is tasked with multi-step reasoning and planning, calling tools, performing actions in the world, and thereby producing the new agent state. We describe the new Talker-Reasoner architecture and discuss its advantages, including modularity and decreased latency. We ground the discussion in the context of a sleep coaching agent, in order to demonstrate real-world relevance.

Submitted to arXiv on 10 Oct. 2024

Results of the summarizing process for the arXiv paper: 2410.08328v1

In recent years, large language models (LLMs) have revolutionized the way agents interact with users through natural conversation. This has led to a shift in the roles of agents, which now must balance conversing with users and engaging in multi-step reasoning and planning to achieve goals. This dichotomy is akin to the concept of "thinking fast and slow" introduced by Kahneman, where quick, intuitive responses (System 1) are complemented by slower, more logical reasoning and planning (System 2).

Building upon this framework, our approach introduces a dual-system architecture consisting of a "Talker" agent responsible for synthesizing conversational responses quickly and intuitively, and a "Reasoner" agent tasked with more deliberate reasoning, planning, and executing actions to drive the agent towards its goals. This Talker-Reasoner architecture offers advantages such as modularity and decreased latency.

Drawing inspiration from related work on LLM-driven agents that focus on text-based interactions as well as embodied agents capable of multimodal interactions, our model integrates both talking while reasoning/planning and explicit belief modeling. By incorporating natural language feedback into the agent's decision-making process and continuously updating its beliefs about user goals, plans, motivations, and barriers, we aim to create a more sophisticated understanding of user behavior.

In addition to discussing the theoretical underpinnings of our Talker-Reasoner model in relation to human cognition systems, we also ground our discussion in a practical example - a sleep coaching agent - to demonstrate the real-world relevance of our approach. Through this detailed exploration of our novel agent architecture, we aim to showcase how integrating fast intuitive responses with slower logical reasoning can enhance the capabilities of conversational agents in various domains.
Created on 12 Nov. 2024
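
The division of labor described in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation: every class, field, and function name here (Belief, Memory, Talker, Reasoner, run_turn) is hypothetical, simple keyword rules stand in for LLM calls and tool use, and the choice to let the Talker answer from the most recently stored belief before the Reasoner finishes updating it is an assumption made to show how such a split might reduce response latency.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Belief:
    """Hypothetical belief state; the four fields mirror the summary's list."""
    goals: List[str] = field(default_factory=list)
    plans: List[str] = field(default_factory=list)
    motivations: List[str] = field(default_factory=list)
    barriers: List[str] = field(default_factory=list)


@dataclass
class Memory:
    """Shared store (assumed): the Reasoner writes beliefs, the Talker reads them."""
    belief: Belief = field(default_factory=Belief)
    history: List[Tuple[str, str]] = field(default_factory=list)


class Reasoner:
    """System 2 stand-in: slower, deliberate belief updating."""

    def update(self, memory: Memory, user_utterance: str) -> None:
        text = user_utterance.lower()
        # Keyword rules are a placeholder for multi-step LLM reasoning.
        if "want to" in text:
            goal = text.split("want to", 1)[1].split(" but ", 1)[0]
            memory.belief.goals.append(goal.strip(" .!"))
        if " but " in text:
            barrier = text.split(" but ", 1)[1]
            memory.belief.barriers.append(barrier.strip(" .!"))


class Talker:
    """System 1 stand-in: fast reply based on whatever belief is stored."""

    def respond(self, memory: Memory, user_utterance: str) -> str:
        belief = memory.belief
        if belief.barriers:
            reply = (f"It sounds like '{belief.barriers[-1]}' is getting "
                     "in the way. Shall we work on that?")
        elif belief.goals:
            reply = f"Let's work toward your goal to {belief.goals[-1]}."
        else:
            reply = "Tell me more about what you would like to achieve."
        memory.history.append((user_utterance, reply))
        return reply


def run_turn(talker: Talker, reasoner: Reasoner,
             memory: Memory, user_utterance: str) -> str:
    """One turn: reply first from stored beliefs, then update them."""
    reply = talker.respond(memory, user_utterance)  # fast path
    reasoner.update(memory, user_utterance)         # slow path, used next turn
    return reply


if __name__ == "__main__":
    memory = Memory()
    talker, reasoner = Talker(), Reasoner()
    for utterance in ["I want to sleep earlier but my phone keeps me up.",
                      "What should I do?"]:
        print("User: ", utterance)
        print("Agent:", run_turn(talker, reasoner, memory, utterance))
```

In this sketch the first reply is generic because no belief has been stored yet, while the second reply draws on the barrier the Reasoner extracted after the first turn. Keeping the two components behind a shared memory object also reflects the modularity the summary mentions: either one could be swapped out without changing the other.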
