Monolith: Real Time Recommendation System With Collisionless Embedding Table

AI-generated keywords: Monolith Recommendation System Deep Learning Real-Time Learning Parameter Synchronization

AI-generated Key Points

  • Importance of building a scalable and real-time recommendation system for time-sensitive customer feedback
  • Deep learning frameworks like TensorFlow or PyTorch are not suitable for recommendation scenarios due to static parameters and dense computations
  • Introduction of Monolith, a system specifically designed for online training in recommendation systems
  • Monolith's design is driven by observations of application workloads and production environments, setting it apart from other recommendation systems
  • Contributions of Monolith include collisionless embedding table with optimizations like expirable embeddings and frequency filtering to reduce memory footprint
  • Production-ready online training architecture with high fault-tolerance
  • Trade-off between system reliability and real-time learning explored
  • Successful implementation of Monolith in the BytePlus Recommend product demonstrated
  • Incremental on-the-fly periodic parameter synchronization mechanism incorporated to scale up online training based on business needs
  • Optimization of parameter updates based on model characteristics considering the dominance of sparse parameters in recommendation models
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Zhuoran Liu, Leqi Zou, Xuan Zou, Caihua Wang, Biao Zhang, Da Tang, Bolin Zhu, Yijie Zhu, Peng Wu, Ke Wang, Youlong Cheng

ORSUM@ACM RecSys 2022
License: CC BY 4.0

Abstract: Building a scalable and real-time recommendation system is vital for many businesses driven by time-sensitive customer feedback, such as short-videos ranking or online ads. Despite the ubiquitous adoption of production-scale deep learning frameworks like TensorFlow or PyTorch, these general-purpose frameworks fall short of business demands in recommendation scenarios for various reasons: on one hand, tweaking systems based on static parameters and dense computations for recommendation with dynamic and sparse features is detrimental to model quality; on the other hand, such frameworks are designed with batch-training stage and serving stage completely separated, preventing the model from interacting with customer feedback in real-time. These issues led us to reexamine traditional approaches and explore radically different design choices. In this paper, we present Monolith, a system tailored for online training. Our design has been driven by observations of our application workloads and production environment that reflects a marked departure from other recommendations systems. Our contributions are manifold: first, we crafted a collisionless embedding table with optimizations such as expirable embeddings and frequency filtering to reduce its memory footprint; second, we provide an production-ready online training architecture with high fault-tolerance; finally, we proved that system reliability could be traded-off for real-time learning. Monolith has successfully landed in the BytePlus Recommend product.

Submitted to arXiv on 16 Sep. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2209.07663v1

The paper discusses the importance of building a scalable and real-time recommendation system for businesses that rely on time-sensitive customer feedback, such as short-videos ranking or online ads. While deep learning frameworks like TensorFlow or PyTorch are widely used, they fall short in recommendation scenarios due to static parameters and dense computations that don't work well with dynamic and sparse features. To address these issues, the authors present Monolith, a system specifically designed for online training. The design of Monolith is driven by observations of application workloads and production environments which sets it apart from other recommendation systems. The contributions of Monolith include a collisionless embedding table with optimizations like expirable embeddings and frequency filtering to reduce memory footprint. It also provides a production-ready online training architecture with high fault-tolerance. The paper further explores how system reliability can be traded-off for real-time learning. The authors demonstrate the successful implementation of Monolith in the BytePlus Recommend product. In order to scale up online training to match business needs, Monolith incorporates an incremental on-the-fly periodic parameter synchronization mechanism. This mechanism takes into account the dominance of sparse parameters in recommendation models and optimizes parameter updates based on model characteristics. Overall, the paper provides insights into the challenges faced by traditional deep learning frameworks in recommendation scenarios and presents a novel approach with Monolith that addresses these challenges effectively.
Created on 12 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.