$\text{Memory}^3$: Language Modeling with Explicit Memory

AI-generated keywords: Language modeling Explicit memory Large language models Knowledge externalization Computational efficiency

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors introduce a groundbreaking approach to enhancing large language models (LLMs) by incorporating explicit memory
  • Named $\text{Memory}^3$ to signify explicit memory as the third form of memory in LLMs after implicit memory and working memory
  • Equipping LLMs with explicit memory is proposed as a cost-effective alternative to model parameters and text retrieval-augmented generation (RAG)
  • Introduction of pioneering techniques such as a memory sparsification mechanism and a two-stage pretraining scheme for effective memory formation
  • Research represents a significant advancement in language modeling by reducing computational costs associated with large-scale models
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Hongkang Yang, Zehao Lin, Wenjin Wang, Hao Wu, Zhiyu Li, Bo Tang, Wenqiang Wei, Jinbo Wang, Zeyun Tang, Shichao Song, Chenyang Xi, Yu Yu, Kai Chen, Feiyu Xiong, Linpeng Tang, Weinan E

Abstract: The training and inference of large language models (LLMs) are together a costly process that transports knowledge from raw data to meaningful computation. Inspired by the memory hierarchy of the human brain, we reduce this cost by equipping LLMs with explicit memory, a memory format cheaper than model parameters and text retrieval-augmented generation (RAG). Conceptually, with most of its knowledge externalized to explicit memories, the LLM can enjoy a smaller parameter size, training cost, and inference cost, all proportional to the amount of remaining "abstract knowledge". As a preliminary proof of concept, we train from scratch a 2.4B LLM, which achieves better performance than much larger LLMs as well as RAG models, and maintains higher decoding speed than RAG. The model is named $\text{Memory}^3$, since explicit memory is the third form of memory in LLMs after implicit memory (model parameters) and working memory (context key-values). We introduce a memory circuitry theory to support the externalization of knowledge, and present novel techniques including a memory sparsification mechanism that makes storage tractable and a two-stage pretraining scheme that facilitates memory formation.

Submitted to arXiv on 01 Jul. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2407.01178v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "$\text{Memory}^3$: Language Modeling with Explicit Memory," authors Hongkang Yang, Zehao Lin, Wenjin Wang, Hao Wu, Zhiyu Li, Bo Tang, Wenqiang Wei, Jinbo Wang, Zeyun Tang, Shichao Song, Chenyang Xi, Yu Yu, Kai Chen, Feiyu Xiong,Linpeng Tang,and Weinan E introduce a groundbreaking approach to enhancing large language models (LLMs) by incorporating explicit memory. The concept behind this innovation lies in externalizing a significant portion of the LLM's knowledge into explicit memories. To demonstrate the efficacy of their approach,the authors undertake the task of training a 2.4B LLM from scratch. Named $\text{Memory}^3$ to signify explicit memory as the third form of memory in LLMs after implicit memory (model parameters) and working memory (context key-values), this innovative model is supported by a novel memory circuitry theory that facilitates knowledge externalization. has long been recognized as a crucial component in natural language processing tasks. However, has not been widely explored as an avenue for improving . In their research, propose equipping LLMs with explicit memory as a more cost-effective alternative to model parameters and text retrieval-augmented generation (RAG). This not only reduces computational costs but also improves performance metrics. The training and inference processes of LLMs are traditionally resource-intensive as they involve transferring knowledge from raw data to meaningful computation. Drawing inspiration from the of the human brain,the authors introduce pioneering techniques such as a memory sparsification mechanism for manageable storage and a two-stage pretraining scheme that aids in effective memory formation. These advancements pave the way for future developments in optimizing LLM architectures for enhanced efficiency and effectiveness. Overall, this research represents a significant advancement in language modeling by introducing explicit memory as an efficient means of reducing computational costs associated with large-scale models. The findings not only showcase improved performance metrics but also open doors for further exploration of and its impact on .
Created on 07 Jul. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.