When do neural networks learn world models?

AI-generated keywords: Neural Networks

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Neural networks' ability to develop world models that capture data generation process
  • Theoretical results in a multi-task setting
  • Models with low-degree bias can recover latent data-generating variables under mild assumptions
  • Sensitivity of recovery process to model architecture
  • Leveraging Boolean models of task solutions through Fourier-Walsh transform
  • Novel techniques for analyzing invertible Boolean transforms introduced
  • Algorithmic implications and connections to related research areas discussed
  • Contribution of valuable theoretical insights into neural networks learning world models
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Tianren Zhang, Guanyu Chen, Feng Chen

28 pages, 9 figures

Abstract: Humans develop world models that capture the underlying generation process of data. Whether neural networks can learn similar world models remains an open problem. In this work, we provide the first theoretical results for this problem, showing that in a multi-task setting, models with a low-degree bias provably recover latent data-generating variables under mild assumptions -- even if proxy tasks involve complex, non-linear functions of the latents. However, such recovery is also sensitive to model architecture. Our analysis leverages Boolean models of task solutions via the Fourier-Walsh transform and introduces new techniques for analyzing invertible Boolean transforms, which may be of independent interest. We illustrate the algorithmic implications of our results and connect them to related research areas, including self-supervised learning, out-of-distribution generalization, and the linear representation hypothesis in large language models.

Submitted to arXiv on 13 Feb. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2502.09297v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "When do neural networks learn world models? ", authors Tianren Zhang, Guanyu Chen, and Feng Chen explore the ability of neural networks to develop world models that capture the underlying generation process of data. They address the open problem of whether neural networks can learn similar world models and present theoretical results in a multi-task setting. The study demonstrates that models with a low-degree bias can recover latent data-generating variables under mild assumptions, even when proxy tasks involve complex, non-linear functions of the latents. However, the recovery process is shown to be sensitive to model architecture. The analysis conducted by Zhang, Chen, and Chen leverages Boolean models of task solutions through the Fourier-Walsh transform. They introduce novel techniques for analyzing invertible Boolean transforms, which hold potential for independent interest in future research endeavors. The authors also discuss the algorithmic implications of their findings and establish connections to various related research areas such as self-supervised learning, out-of-distribution generalization, and the linear representation hypothesis in large language models. Overall, this work sheds light on the intricate relationship between neural networks and world modeling processes. By providing theoretical insights into how neural networks can potentially learn world models in a multi-task environment, Zhang et al. 's study contributes valuable knowledge to the field of machine learning and artificial intelligence.
Created on 13 Feb. 2026

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.