MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems
AI-generated Key Points
- Large language models (LLMs) have shown promise as zero-shot planners in embodied AI systems.
- LLMs lack the ability to learn from experience and construct persistent mental models, limiting their effectiveness in complex open-world environments like Minecraft.
- MINDSTORES is a novel approach that addresses this limitation by introducing an experience-augmented planning framework for embodied agents.
- MINDSTORES enables agents to build and utilize mental models through natural interaction with their environment, extending zero-shot LLM planning by maintaining a database of past experiences.
- The key innovation of MINDSTORES lies in representing accumulated experiences as natural language embeddings of (state, task, plan, outcome) tuples for efficient retrieval and reasoning by LLM planners.
- Extensive experiments in the MineDojo environment have demonstrated that MINDSTORES outperforms existing memory-based LLM planners while retaining flexibility and generalization benefits.
- Zero-shot LLM planning with the DEPS framework involves components such as descriptor summarizing current state and outcomes, explainer analyzing plan failures, planner generating action sequences, and selector ranking candidate sub-goals based on completion estimates.
- MINDSTORES represents a significant advancement towards more capable embodied AI systems that can continuously learn through natural experiences by combining memory-informed decision synthesis with task-oriented reinforcement learning principles.
Authors: Anirudh Chari, Suraj Reddy, Aditya Tiwari, Richard Lian, Brian Zhou
Abstract: While large language models (LLMs) have shown promising capabilities as zero-shot planners for embodied agents, their inability to learn from experience and build persistent mental models limits their robustness in complex open-world environments like Minecraft. We introduce MINDSTORES, an experience-augmented planning framework that enables embodied agents to build and leverage mental models through natural interaction with their environment. Drawing inspiration from how humans construct and refine cognitive mental models, our approach extends existing zero-shot LLM planning by maintaining a database of past experiences that informs future planning iterations. The key innovation is representing accumulated experiences as natural language embeddings of (state, task, plan, outcome) tuples, which can then be efficiently retrieved and reasoned over by an LLM planner to generate insights and guide plan refinement for novel states and tasks. Through extensive experiments in the MineDojo environment, a simulation environment for agents in Minecraft that provides low-level controls for Minecraft, we find that MINDSTORES learns and applies its knowledge significantly better than existing memory-based LLM planners while maintaining the flexibility and generalization benefits of zero-shot approaches, representing an important step toward more capable embodied AI systems that can learn continuously through natural experience.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.