Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents

AI-generated keywords: Simulated Hospital Environment Autonomous Agents Large Language Models MedAgent-Zero AI-driven Healthcare Professionals

AI-generated Key Points

  • Introduction of Agent Hospital, a simulacrum populated by autonomous agents powered by large language models (LLMs)
  • Central goal: enable doctor agents to learn how to effectively treat illnesses within the simulated environment
  • Proposed method: MedAgent-Zero, utilizing knowledge bases and LLMs to simulate disease onset and progression
  • Results show consistent improvement in treatment performance of doctor agents on various tasks
  • Evolved doctor agent achieves state-of-the-art accuracy of 93.06% on a subset of the MedQA dataset after treating around ten thousand patients
  • Aim to train proficient doctor agents for critical medical tasks through simulation environment named Agent Hospital
  • Training strategy involves simulated interactions with patient agents within Agent Hospital, leading to rapid evolution through experience gained from successful and failed cases
  • Offers cost-effective and efficient means of training doctor agents compared to real-world methods
  • Demonstrates potential benefits of simulating hospital environments for training AI-driven healthcare professionals
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Junkai Li, Siyu Wang, Meng Zhang, Weitao Li, Yunghwei Lai, Xinhui Kang, Weizhi Ma, Yang Liu

License: CC BY 4.0

Abstract: In this paper, we introduce a simulacrum of hospital called Agent Hospital that simulates the entire process of treating illness. All patients, nurses, and doctors are autonomous agents powered by large language models (LLMs). Our central goal is to enable a doctor agent to learn how to treat illness within the simulacrum. To do so, we propose a method called MedAgent-Zero. As the simulacrum can simulate disease onset and progression based on knowledge bases and LLMs, doctor agents can keep accumulating experience from both successful and unsuccessful cases. Simulation experiments show that the treatment performance of doctor agents consistently improves on various tasks. More interestingly, the knowledge the doctor agents have acquired in Agent Hospital is applicable to real-world medicare benchmarks. After treating around ten thousand patients (real-world doctors may take over two years), the evolved doctor agent achieves a state-of-the-art accuracy of 93.06% on a subset of the MedQA dataset that covers major respiratory diseases. This work paves the way for advancing the applications of LLM-powered agent techniques in medical scenarios.

Submitted to arXiv on 05 May. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2405.02957v1

In this paper, the authors introduce a simulacrum of a hospital called Agent Hospital. The hospital is populated by autonomous agents powered by large language models (LLMs), including patients, nurses, and doctors. The central goal of this simulation is to enable doctor agents to learn how to effectively treat illnesses within the simulated environment. To achieve this goal, the authors propose a method called MedAgent-Zero. The simulacrum utilizes knowledge bases and LLMs to simulate disease onset and progression, allowing doctor agents to accumulate experience from both successful and unsuccessful cases. Through simulation experiments, the authors demonstrate that the treatment performance of doctor agents consistently improves on various tasks. Impressively, the knowledge acquired by doctor agents in Agent Hospital proves applicable to real-world medicare benchmarks. After treating around ten thousand patients (a task that would take real-world doctors over two years), the evolved doctor agent achieves a state-of-the-art accuracy of 93.06% on a subset of the MedQA dataset covering major respiratory diseases. This study builds upon previous successes with LLM agents in various tasks and explores combining social simulation with specific task-solving capabilities. By designing a comprehensive simulation environment named Agent Hospital that mimics medical processes in a hospital setting, the authors aim to train proficient doctor agents for critical medical tasks such as diagnosis and treatment recommendation. The proposed strategy of training doctor agents through simulated interactions with patient agents within Agent Hospital - referred to as MedAgent-Zero - allows for rapid evolution of these agents through experience gained from successful and failed cases. This approach offers a cost-effective and efficient means of training doctor agents to handle tens of thousands of cases within days compared to several years it would take real-world doctors. Overall, this work paves the way for advancing LLM-powered agent techniques in medical scenarios by demonstrating the potential benefits of simulating hospital environments for training AI-driven healthcare professionals.
Created on 02 Jul. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.