Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss

AI-generated keywords: Neural Networks

AI-generated Key Points

  • The paper addresses the critical challenge of plasticity loss in neural network training, which hinders a model's ability to adapt to new tasks or shifts in data distribution.
  • The proposed method, AID (Activation by Interval-wise Dropout), applies different dropout probabilities on each preactivation interval to generate subnetworks, effectively regularizing the network and preventing plasticity loss.
  • Evaluation on continual learning tasks built from standard image classification datasets (CIFAR10, CIFAR100, and TinyImageNet) shows that AID maintains plasticity across benchmarks; separately, AID enhances reinforcement learning performance in the Arcade Learning Environment benchmark.
  • Comparison with Dropout in a warm-start learning experiment reveals that while Dropout improves generalizability, AID effectively mitigates plasticity loss by retaining a higher degree of plasticity in warm-start models.
  • Overall findings suggest that AID is an effective method for preventing plasticity loss in neural networks and improving their adaptability to new tasks or changes in data distribution.

Authors: Sangyeon Park, Isaac Han, Seungwon Oh, Kyung-Joong Kim

License: CC BY 4.0

Abstract: Plasticity loss, a critical challenge in neural network training, limits a model's ability to adapt to new tasks or shifts in data distribution. This paper introduces AID (Activation by Interval-wise Dropout), a novel method inspired by Dropout, designed to address plasticity loss. Unlike Dropout, AID generates subnetworks by applying Dropout with different probabilities on each preactivation interval. Theoretical analysis reveals that AID regularizes the network, promoting behavior analogous to that of deep linear networks, which do not suffer from plasticity loss. We validate the effectiveness of AID in maintaining plasticity across various benchmarks, including continual learning tasks on standard image classification datasets such as CIFAR10, CIFAR100, and TinyImageNet. Furthermore, we show that AID enhances reinforcement learning performance in the Arcade Learning Environment benchmark.
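
The abstract describes AID as applying Dropout with a different probability on each preactivation interval. The following is a minimal sketch of that idea, not the authors' implementation. It assumes only two intervals split at zero, and the function name `aid_activation`, the parameters `keep_neg` and `keep_pos`, and the choice to scale by the keep probability at evaluation time are all illustrative assumptions that do not come from the source.

```python
import random


def aid_activation(preacts, keep_neg=0.1, keep_pos=0.9, training=True, rng=None):
    """Interval-wise dropout over two preactivation intervals (assumed split at 0).

    keep_neg / keep_pos are hypothetical keep probabilities for x < 0 and x >= 0.
    """
    if rng is None:
        rng = random.Random()
    out = []
    for x in preacts:
        # Pick the keep probability from the interval this preactivation falls in.
        keep_prob = keep_neg if x < 0.0 else keep_pos
        if training:
            # Stochastic pass: keep the value with probability keep_prob, else zero it.
            out.append(x if rng.random() < keep_prob else 0.0)
        else:
            # Deterministic pass (assumption): use the expected value of the masked output.
            out.append(keep_prob * x)
    return out


if __name__ == "__main__":
    xs = [-2.0, -1.0, 0.0, 1.0, 2.0]
    print(aid_activation(xs, training=True, rng=random.Random(0)))
    print(aid_activation(xs, training=False))
```

In this sketch, setting `keep_neg=0.0` and `keep_pos=1.0` reduces the function to ReLU, while setting both keep probabilities to the same value makes the evaluation-time pass a plain linear scaling of the input. Those two extremes give some intuition for how interval-wise probabilities can move a network between a standard nonlinear activation and more linear behavior.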

Submitted to arXiv on 03 Feb. 2025

Results of the summarizing process for the arXiv paper: 2502.01342v1

The paper "Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss" addresses the critical challenge of plasticity loss in neural network training, which hinders a model's ability to adapt to new tasks or shifts in data distribution. The proposed method, AID (Activation by Interval-wise Dropout), is inspired by Dropout but generates subnetworks by applying a different dropout probability on each preactivation interval. Theoretical analysis shows that AID regularizes the network, promoting behavior similar to that of deep linear networks, which do not suffer from plasticity loss.

To evaluate AID, the authors run continual learning tasks on standard image classification datasets such as CIFAR10, CIFAR100, and TinyImageNet. The results show that AID maintains plasticity across these benchmarks. AID also enhances reinforcement learning performance in the Arcade Learning Environment benchmark.

In a warm-start learning experiment inspired by previous research, models trained with vanilla settings, Dropout, and AID were compared. A ResNet-18 model was pre-trained on 10% of the training data for 1,000 epochs and then trained further on the full dataset. Dropout appeared to improve generalizability in both warm-start and cold-start models, but the authors argue that this improvement stems from enhanced model generalization rather than from mitigating plasticity loss. In contrast, AID showed a smaller performance improvement relative to the vanilla model but effectively mitigated plasticity loss: warm-start models trained with AID retained a higher degree of plasticity than those trained with Dropout. Overall, the findings suggest that AID is an effective method for preventing plasticity loss in neural networks and improving their adaptability to new tasks or changes in data distribution.
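
The warm-start comparison summarized above can be sketched as a two-phase schedule. This is only an illustration of the protocol's shape under stated assumptions: `warm_start_split`, `run_experiment`, `init_fn`, and `train_fn` are hypothetical names, the random choice of the 10% subset is an assumption, and epoch counts are left to whatever `train_fn` does.

```python
import random


def warm_start_split(num_examples, pretrain_fraction=0.1, seed=0):
    """Return (subset_indices, full_indices) for a warm-start run.

    The subset is a random sample of the data (assumed; the source only says 10%).
    """
    indices = list(range(num_examples))
    random.Random(seed).shuffle(indices)
    cutoff = round(num_examples * pretrain_fraction)
    return sorted(indices[:cutoff]), list(range(num_examples))


def run_experiment(init_fn, train_fn, num_examples, warm=True):
    """Train a model with (warm) or without (cold) pre-training on the subset.

    init_fn() -> model and train_fn(model, indices) -> model are hypothetical callables.
    """
    subset, full = warm_start_split(num_examples)
    model = init_fn()
    if warm:
        # Phase 1 (warm start only): pre-train on the small subset.
        model = train_fn(model, subset)
    # Phase 2: train on the full dataset, from pre-trained or fresh weights.
    return train_fn(model, full)


if __name__ == "__main__":
    # Stand-in "model" that just records how many examples each phase saw.
    log_phase = lambda model, idx: model + [len(idx)]
    print(run_experiment(list, log_phase, 100, warm=True))
    print(run_experiment(list, log_phase, 100, warm=False))
```

Comparing the final test performance of the `warm=True` and `warm=False` runs for the same training recipe is what separates a loss of plasticity from a general change in generalization: a method that helps both runs equally has not closed the gap between them.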
Created on 30 Mar. 2025
