Directions of Curvature as an Explanation for Loss of Plasticity

AI-generated keywords: Plasticity Neural Networks Curvature Directions Continual Learning Regularizers

AI-generated Key Points

  • Loss of plasticity in neural networks is a phenomenon where networks struggle to learn from new experiences.
  • The authors suggest that loss of plasticity occurs due to the loss of directions of curvature during training.
  • Their research involved investigating continual learning tasks using datasets like MNIST, CIFAR-10, and ImageNet to support their claim.
  • The study found that the loss of curvature directions correlates with the loss of plasticity in neural networks.
  • The authors challenge previous explanations for loss of plasticity by providing counterexamples using a linearly separable subset of the MNIST dataset with periodically shuffled labels.
  • They explore how regularizers can help mitigate loss of plasticity by preserving curvature in neural networks.
  • A distributional regularizer introduced by the authors was effective in maintaining plasticity across different problem settings.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Alex Lewandowski, Haruto Tanaka, Dale Schuurmans, Marlos C. Machado

License: CC BY 4.0

Abstract: Loss of plasticity is a phenomenon in which neural networks lose their ability to learn from new experience. Despite being empirically observed in several problem settings, little is understood about the mechanisms that lead to loss of plasticity. In this paper, we offer a consistent explanation for loss of plasticity: Neural networks lose directions of curvature during training and that loss of plasticity can be attributed to this reduction in curvature. To support such a claim, we provide a systematic investigation of loss of plasticity across continual learning tasks using MNIST, CIFAR-10 and ImageNet. Our findings illustrate that loss of curvature directions coincides with loss of plasticity, while also showing that previous explanations are insufficient to explain loss of plasticity in all settings. Lastly, we show that regularizers which mitigate loss of plasticity also preserve curvature, motivating a simple distributional regularizer that proves to be effective across the problem settings we considered.

Submitted to arXiv on 30 Nov. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2312.00246v4

In their paper "Directions of Curvature as an Explanation for Loss of Plasticity," authors Alex Lewandowski, Haruto Tanaka, Dale Schuurmans, and Marlos C. Machado delve into the phenomenon of loss of plasticity in neural networks. This refers to the inability of neural networks to learn from new experiences, a problem that has been observed in various settings without a clear understanding of its underlying mechanisms. The authors propose a consistent explanation for this issue: during training, neural networks lose directions of curvature which leads to a reduction in plasticity. To support their claim, they conduct a systematic investigation across continual learning tasks using popular datasets such as MNIST, CIFAR-10, and ImageNet. Their findings demonstrate that the loss of curvature directions aligns with the loss of plasticity, highlighting the significance of this factor in neural network behavior. Moreover, the authors challenge previous explanations for loss of plasticity by providing counterexamples using a linearly separable subset of the MNIST dataset with periodically shuffled labels. This analysis reveals inconsistencies in existing explanations and emphasizes the complexities associated with preserving plasticity even in simple classification problems. Furthermore, the paper explores how regularizers can mitigate loss of plasticity by preserving curvature in neural networks. The authors introduce a distributional regularizer that proves effective across different problem settings considered in their study.
Created on 08 May. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.