Mastering Atari with Discrete World Models

AI-generated keywords: DreamerV2, Atari games, world models, reinforcement learning, behavior learning

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • DreamerV2 achieves human-level performance on the Atari benchmark of 55 tasks
  • World models help agents generalize from past experience and let them learn behaviors from imagined outcomes, which improves sample efficiency
  • DreamerV2 learns behaviors purely from predictions within the compact latent space of a powerful world model
  • The world model in DreamerV2 uses discrete representations and is trained separately from the policy
  • With the same computational budget and wall-clock time, DreamerV2 reaches 200 million frames and exceeds the final performance of the top single-GPU agents IQN and Rainbow
  • The success of DreamerV2 demonstrates the efficacy of leveraging world models for behavior learning and showcases potential for advancing reinforcement learning techniques

Authors: Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba

8 pages, 4 figures, 4 tables

Abstract: Intelligent agents need to generalize from past experience to achieve goals in complex environments. World models facilitate such generalization and allow learning behaviors from imagined outcomes to increase sample-efficiency. While learning world models from image inputs has recently become feasible for some tasks, modeling Atari games accurately enough to derive successful behaviors has remained an open challenge for many years. We introduce DreamerV2, a reinforcement learning agent that learns behaviors purely from predictions in the compact latent space of a powerful world model. The world model uses discrete representations and is trained separately from the policy. DreamerV2 constitutes the first agent that achieves human-level performance on the Atari benchmark of 55 tasks by learning behaviors inside a separately trained world model. With the same computational budget and wall-clock time, DreamerV2 reaches 200M frames and exceeds the final performance of the top single-GPU agents IQN and Rainbow.
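The abstract does not say how "human-level performance" is measured. A common convention in the Atari literature is the human-normalized score, where 0 corresponds to random play and 1 to a human reference score, aggregated across games (for example by the median). The sketch below assumes that convention; the game names and all score values are hypothetical placeholders, not results from the paper.

```python
from statistics import median


def human_normalized(agent, random_score, human):
    """Return 0.0 for random-level play and 1.0 for human-level play."""
    return (agent - random_score) / (human - random_score)


# Hypothetical per-game raw scores as (agent, random, human).
# These numbers are made up for illustration only.
scores = {
    "game_a": (300.0, 100.0, 500.0),
    "game_b": (90.0, 10.0, 50.0),
    "game_c": (20.0, 0.0, 20.0),
}

# Normalize each game, then aggregate across games with the median.
normalized = {name: human_normalized(*s) for name, s in scores.items()}
median_score = median(normalized.values())
```

Under this convention, an aggregate of 1.0 or above would be read as human-level on that aggregate. Which aggregate is used (median, mean, or a clipped variant) changes the conclusion, so it should be stated alongside any such claim.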

Submitted to arXiv on 05 Oct. 2020

Results of the summarizing process for the arXiv paper: 2010.02193v1

In "Mastering Atari with Discrete World Models," Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, and Jimmy Ba introduce DreamerV2, a reinforcement learning agent that achieves human-level performance on the Atari benchmark of 55 tasks. The authors start from the observation that intelligent agents need to generalize from past experience to achieve goals in complex environments. World models facilitate this generalization and let agents learn behaviors from imagined outcomes, which increases sample efficiency. Although learning world models from image inputs has recently become feasible for some tasks, modeling Atari games accurately enough to derive successful behaviors has remained an open challenge for many years.

DreamerV2 addresses this challenge by learning behaviors purely from predictions in the compact latent space of a powerful world model. The world model uses discrete representations and is trained separately from the policy. According to the abstract, DreamerV2 is the first agent to reach human-level performance on the Atari benchmark by learning behaviors inside a separately trained world model. With the same computational budget and wall-clock time, it reaches 200 million frames and exceeds the final performance of the top single-GPU agents IQN and Rainbow. The result shows that world models can be effective for behavior learning in domains as demanding as Atari games.
Created on 30 Oct. 2025
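
The summary above describes two ideas only at a high level: a latent state built from discrete representations, and behavior learning from predictions made inside the model rather than from the environment. The toy sketch below illustrates what an imagined rollout over a discrete latent can look like. It is not the paper's architecture: `toy_dynamics`, `toy_reward`, the latent sizes, and the fixed policy are all hypothetical stand-ins for components that would be learned in a real agent.

```python
import random

NUM_VARS = 4     # hypothetical: categorical variables in the latent state
NUM_CLASSES = 3  # hypothetical: classes per categorical variable


def sample_latent(probs, rng):
    """Sample one class index per categorical variable."""
    return tuple(rng.choices(range(NUM_CLASSES), weights=p)[0] for p in probs)


def toy_dynamics(latent, action):
    """Hypothetical stand-in for a learned transition model.

    Returns class probabilities for each variable of the next latent.
    """
    probs = []
    for value in latent:
        likely = (value + action) % NUM_CLASSES
        probs.append([0.8 if c == likely else 0.1 for c in range(NUM_CLASSES)])
    return probs


def toy_reward(latent):
    """Hypothetical reward predictor: counts variables in class 0."""
    return float(sum(1 for v in latent if v == 0))


def imagine(start, policy, horizon, rng):
    """Roll the model forward in latent space; no environment is queried."""
    latent, total = start, 0.0
    trajectory = [latent]
    for _ in range(horizon):
        action = policy(latent)
        latent = sample_latent(toy_dynamics(latent, action), rng)
        total += toy_reward(latent)
        trajectory.append(latent)
    return trajectory, total


rng = random.Random(0)
trajectory, imagined_return = imagine(
    (0, 1, 2, 0), lambda z: 1, horizon=5, rng=rng
)
```

In a real agent the imagined return would serve as a training signal for the policy, and the dynamics and reward functions would be fitted to recorded experience. Here they are fixed by hand so that the rollout loop itself stays easy to read.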