MemOS: A Memory OS for AI System

AI-generated keywords: Large Language Models Artificial General Intelligence Memory Management Systems MemOS Continual Learning

AI-generated Key Points

  • Large Language Models (LLMs) are critical for Artificial General Intelligence (AGI) and excel in natural language processing tasks
  • LLMs have evolved to handle structured code generation, cross-modal reasoning, multi-turn dialogue, and complex planning
  • Efficient memory management systems are crucial due to the increasing size and complexity of models
  • MemOS proposes a memory operating system that treats memory as a manageable resource through MemCubes
  • LLMs are expected to become persistent agents embedded in workflows, accumulating interaction histories and adapting over time
  • The evolution of memory systems in LLMs is categorized based on object type, form, temporal aspects, and retention duration
  • MemOS establishes a memory-centric framework for controllability, plasticity, and evolvability in LLMs towards achieving AGI capabilities
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Zhiyu Li, Shichao Song, Chenyang Xi, Hanyu Wang, Chen Tang, Simin Niu, Ding Chen, Jiawei Yang, Chunyu Li, Qingchen Yu, Jihao Zhao, Yezhaohui Wang, Peng Liu, Zehao Lin, Pengyuan Wang, Jiahao Huo, Tianyi Chen, Kai Chen, Kehang Li, Zhen Tao, Junpeng Ren, Huayi Lai, Hao Wu, Bo Tang, Zhenren Wang, Zhaoxin Fan, Ningyu Zhang, Linfeng Zhang, Junchi Yan, Mingchuan Yang, Tong Xu, Wei Xu, Huajun Chen, Haofeng Wang, Hongkang Yang, Wentao Zhang, Zhi-Qin John Xu, Siheng Chen, Feiyu Xiong

36 pages, 10 figures, 5 tables
License: CC BY 4.0

Abstract: Large Language Models (LLMs) have become an essential infrastructure for Artificial General Intelligence (AGI), yet their lack of well-defined memory management systems hinders the development of long-context reasoning, continual personalization, and knowledge consistency.Existing models mainly rely on static parameters and short-lived contextual states, limiting their ability to track user preferences or update knowledge over extended periods.While Retrieval-Augmented Generation (RAG) introduces external knowledge in plain text, it remains a stateless workaround without lifecycle control or integration with persistent representations.Recent work has modeled the training and inference cost of LLMs from a memory hierarchy perspective, showing that introducing an explicit memory layer between parameter memory and external retrieval can substantially reduce these costs by externalizing specific knowledge. Beyond computational efficiency, LLMs face broader challenges arising from how information is distributed over time and context, requiring systems capable of managing heterogeneous knowledge spanning different temporal scales and sources. To address this challenge, we propose MemOS, a memory operating system that treats memory as a manageable system resource. It unifies the representation, scheduling, and evolution of plaintext, activation-based, and parameter-level memories, enabling cost-efficient storage and retrieval. As the basic unit, a MemCube encapsulates both memory content and metadata such as provenance and versioning. MemCubes can be composed, migrated, and fused over time, enabling flexible transitions between memory types and bridging retrieval with parameter-based learning. MemOS establishes a memory-centric system framework that brings controllability, plasticity, and evolvability to LLMs, laying the foundation for continual learning and personalized modeling.

Submitted to arXiv on 04 Jul. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2507.03724v1

Large Language Models (LLMs) have emerged as a critical component of Artificial General Intelligence (AGI), showcasing near-human performance in various natural language processing tasks. With the advancement of Transformer architecture and self-supervised pretraining, LLMs have expanded their capabilities to structured code generation, cross-modal reasoning, multi-turn dialogue, and complex planning. As models continue to grow in size and complexity, they are positioned as a key pathway towards AGI. In response to the evolving landscape of LLMs, the need for efficient memory management systems has become increasingly apparent. Current models lack well-defined memory structures, hindering long-context reasoning, continual personalization, and knowledge consistency. While existing approaches like Retrieval-Augmented Generation (RAG) introduce external knowledge in plain text, they lack lifecycle control and integration with persistent representations. To address these challenges, MemOS proposes a memory operating system that treats memory as a manageable resource. By unifying plaintext, activation-based, and parameter-level memories within MemCubes - encapsulating both content and metadata - MemOS enables cost-efficient storage and retrieval. This framework allows for flexible transitions between different memory types and bridges retrieval with parameter-based learning. Looking ahead, the presence of LLMs is expected to expand temporally and spatially. Models will transition from stateless tools to persistent agents embedded in long-running workflows, accumulating interaction histories and adapting internal states over time. Spatially, LLMs are becoming foundational intelligence layers across users, platforms, and ecosystems - necessitating efficient organization, storage, and retrieval of knowledge. The evolution of memory systems in large language models is highlighted through systematic classifications based on parameters such as object type (personal vs. system), form (parametric vs. non-parametric), temporal aspects (short-term vs. long-term), retention duration distinguishing sensory memory from short-term to long-term memory. Overall, MemOS establishes a memory-centric framework that brings controllability, plasticity, and evolvability to LLMs - laying the foundation for continual learning and personalized modeling towards achieving AGI capabilities.
Created on 13 Jul. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.