Building Cooperative Embodied Agents Modularly with Large Language Models

AI-generated keywords: Large Language Models (LLMs)

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors explore the capabilities of Large Language Models (LLMs) in multi-agent cooperation tasks
  • Present a novel framework that utilizes LLMs for multi-agent cooperation in different embodied environments
  • Framework enables embodied agents to plan, communicate, and cooperate efficiently for long-horizon tasks
  • Recent LLMs like GPT-4 surpass strong planning-based methods and exhibit effective communication without fine-tuning or few-shot prompting
  • LLM-based agents that communicate in natural language earn more trust and cooperate effectively with humans
  • Highlights potential of LLMs for embodied AI and future research in multi-agent cooperation
  • Emphasizes importance of natural language communication for collaboration between human users and AI agents
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Hongxin Zhang, Weihua Du, Jiaming Shan, Qinhong Zhou, Yilun Du, Joshua B. Tenenbaum, Tianmin Shu, Chuang Gan

Project page: https://vis-www.cs.umass.edu/Co-LLM-Agents/

Abstract: Large Language Models (LLMs) have demonstrated impressive planning abilities in single-agent embodied tasks across various domains. However, their capacity for planning and communication in multi-agent cooperation remains unclear, even though these are crucial skills for intelligent embodied agents. In this paper, we present a novel framework that utilizes LLMs for multi-agent cooperation and tests it in various embodied environments. Our framework enables embodied agents to plan, communicate, and cooperate with other embodied agents or humans to accomplish long-horizon tasks efficiently. We demonstrate that recent LLMs, such as GPT-4, can surpass strong planning-based methods and exhibit emergent effective communication using our framework without requiring fine-tuning or few-shot prompting. We also discover that LLM-based agents that communicate in natural language can earn more trust and cooperate more effectively with humans. Our research underscores the potential of LLMs for embodied AI and lays the foundation for future research in multi-agent cooperation. Videos can be found on the project website https://vis-www.cs.umass.edu/Co-LLM-Agents/.

Submitted to arXiv on 05 Jul. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2307.02485v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In this paper titled "Building Cooperative Embodied Agents Modularly with Large Language Models," authors Hongxin Zhang, Weihua Du, Jiaming Shan, Qinhong Zhou, Yilun Du, Joshua B. Tenenbaum, Tianmin Shu, and Chuang Gan explore the capabilities of Large Language Models (LLMs) in multi-agent cooperation tasks. While LLMs have shown impressive planning abilities in single-agent embodied tasks across various domains, their capacity for planning and communication in multi-agent settings remains unclear. To address this gap, the authors present a novel framework that utilizes LLMs for multi-agent cooperation and test it in different embodied environments. Their framework enables embodied agents to plan, communicate, and cooperate with other agents or humans efficiently to accomplish long-horizon tasks. The researchers demonstrate that recent LLMs like GPT-4 can surpass strong planning-based methods and exhibit emergent effective communication using their framework without requiring fine-tuning or few-shot prompting. Additionally, the study reveals that LLM-based agents that communicate in natural language can earn more trust and cooperate more effectively with humans. This finding highlights the potential of LLMs for embodied AI and lays the foundation for future research in multi-agent cooperation. The paper provides valuable insights into how LLMs can be leveraged to enhance planning and communication abilities in cooperative multi-agent scenarios. It also emphasizes the importance of natural language communication for effective collaboration between human users and AI agents. Videos related to the project can be found on the project website at https://vis-www.cs.umass.edu/Co-LLM-Agents/.
Created on 07 Oct. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.