OneRec Technical Report

AI-generated keywords: Recommender Systems Multi-stage Cascaded Architectures Artificial Intelligence OneRec End-to-end Generative Framework

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Longstanding reliance on multi-stage cascaded architectures in recommender systems
  • Limitations due to computational fragmentation and optimization inconsistencies
  • Development of OneRec by Guorui Zhou, Jiaxin Deng, and team as a groundbreaking approach
  • OneRec reshapes recommendation systems through an end-to-end generative framework
  • Enhances computational FLOPs of existing models by 10 times and establishes scaling laws for recommendations
  • Leverages reinforcement learning techniques to optimize recommendations effectively
  • Achieves impressive Model FLOPs Utilization rates on flagship GPUs during training and inference stages
  • Deployment in Kuaishou/Kuaishou Lite APP results in handling a quarter of total queries per second and enhancing App Stay Time significantly
  • Substantial increases in metrics like 7-day Lifetime post-OneRec implementation, improving user engagement and satisfaction
  • Drastic reduction in operational expenses associated with traditional recommendation pipelines to just 10.6%
  • Technical report authored by Zhou et al. provides insights into the development and optimization process behind OneRec, with real-world implications for production-scale recommendation systems
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Guorui Zhou, Jiaxin Deng, Jinghao Zhang, Kuo Cai, Lejian Ren, Qiang Luo, Qianqian Wang, Qigen Hu, Rui Huang, Shiyao Wang, Weifeng Ding, Wuchao Li, Xinchen Luo, Xingmei Wang, Zexuan Cheng, Zixing Zhang, Bin Zhang, Boxuan Wang, Chaoyi Ma, Chengru Song, Chenhui Wang, Di Wang, Dongxue Meng, Fan Yang, Fangyu Zhang, Feng Jiang, Fuxing Zhang, Gang Wang, Guowang Zhang, Han Li, Hengrui Hu, Hezheng Lin, Hongtao Cheng, Hongyang Cao, Huanjie Wang, Jiaming Huang, Jiapeng Chen, Jiaqiang Liu, Jinghui Jia, Kun Gai, Lantao Hu, Liang Zeng, Liao Yu, Qiang Wang, Qidong Zhou, Shengzhe Wang, Shihui He, Shuang Yang, Shujie Yang, Sui Huang, Tao Wu, Tiantian He, Tingting Gao, Wei Yuan, Xiao Liang, Xiaoxiao Xu, Xugang Liu, Yan Wang, Yi Wang, Yiwu Liu, Yue Song, Yufei Zhang, Yunfan Wu, Yunfeng Zhao, Zhanyu Liu

Authors are listed alphabetically by their first name

Abstract: Recommender systems have been widely used in various large-scale user-oriented platforms for many years. However, compared to the rapid developments in the AI community, recommendation systems have not achieved a breakthrough in recent years. For instance, they still rely on a multi-stage cascaded architecture rather than an end-to-end approach, leading to computational fragmentation and optimization inconsistencies, and hindering the effective application of key breakthrough technologies from the AI community in recommendation scenarios. To address these issues, we propose OneRec, which reshapes the recommendation system through an end-to-end generative approach and achieves promising results. Firstly, we have enhanced the computational FLOPs of the current recommendation model by 10 $\times$ and have identified the scaling laws for recommendations within certain boundaries. Secondly, reinforcement learning techniques, previously difficult to apply for optimizing recommendations, show significant potential in this framework. Lastly, through infrastructure optimizations, we have achieved 23.7% and 28.8% Model FLOPs Utilization (MFU) on flagship GPUs during training and inference, respectively, aligning closely with the LLM community. This architecture significantly reduces communication and storage overhead, resulting in operating expense that is only 10.6% of traditional recommendation pipelines. Deployed in Kuaishou/Kuaishou Lite APP, it handles 25% of total queries per second, enhancing overall App Stay Time by 0.54% and 1.24%, respectively. Additionally, we have observed significant increases in metrics such as 7-day Lifetime, which is a crucial indicator of recommendation experience. We also provide practical lessons and insights derived from developing, optimizing, and maintaining a production-scale recommendation system with significant real-world impact.

Submitted to arXiv on 16 Jun. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2506.13695v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In the field of recommender systems, there has been a longstanding reliance on multi-stage cascaded architectures that have not kept pace with the rapid advancements in artificial intelligence. This has led to computational fragmentation and optimization inconsistencies, limiting the effective integration of key AI breakthroughs into recommendation scenarios. To address these challenges, a team of researchers led by Guorui Zhou, Jiaxin Deng, and their colleagues have developed OneRec - a groundbreaking approach that reshapes recommendation systems through an end-to-end generative framework. OneRec represents a significant leap forward in recommendation technology by enhancing the computational FLOPs of existing models by 10 times and establishing scaling laws for recommendations within specific boundaries. By leveraging reinforcement learning techniques previously deemed challenging for optimizing recommendations, OneRec demonstrates substantial potential in improving recommendation accuracy and efficiency. Furthermore, through infrastructure optimizations, the team has achieved impressive Model FLOPs Utilization rates on flagship GPUs during both training and inference stages - aligning closely with leading-edge practices in the AI community. The deployment of OneRec in the Kuaishou/Kuaishou Lite APP has yielded remarkable results - handling a quarter of total queries per second while enhancing overall App Stay Time by significant margins. Notably, metrics such as 7-day Lifetime have shown substantial increases following the implementation of OneRec - underscoring its positive impact on user engagement and satisfaction. Additionally, operational expenses associated with traditional recommendation pipelines have been drastically reduced to just 10.6% through the adoption of this innovative architecture. The technical report authored by Zhou et al. not only presents the development and optimization process behind OneRec but also offers valuable insights derived from maintaining a production-scale recommendation system with tangible real-world implications. The comprehensive approach taken by the research team showcases how cutting-edge technologies can be effectively harnessed to revolutionize recommender systems and enhance user experiences across diverse platforms.
Created on 25 Jun. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.