Plastic Learning with Deep Fourier Features

AI-generated keywords: Continual learning Deep neural networks Plasticity Deep Fourier features Adaptive-linearity

AI-generated Key Points

  • Authors address the challenge of continual learning in deep neural networks
  • Loss of plasticity is a key issue identified
  • Theoretical results show that linear function approximation and deep linear networks do not suffer from loss of plasticity
  • Proposal of deep Fourier features involving sine and cosine concatenation in every layer
  • Deep Fourier features strike a balance between trainability and effectiveness in neural networks
  • Networks with deep Fourier features exhibit high trainability throughout learning process
  • Empirical results show significant improvements in continual learning performance when replacing ReLU activations with deep Fourier features
  • Improvements observed across various continual learning scenarios on datasets like CIFAR10, CIFAR100, and tiny-ImageNet
  • Deep Fourier features are effective in diminishing label noise settings
  • Networks using deep Fourier features consistently achieve high test accuracy despite initial challenges with corrupted labels on early tasks
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Alex Lewandowski, Dale Schuurmans, Marlos C. Machado

License: CC BY 4.0

Abstract: Deep neural networks can struggle to learn continually in the face of non-stationarity. This phenomenon is known as loss of plasticity. In this paper, we identify underlying principles that lead to plastic algorithms. In particular, we provide theoretical results showing that linear function approximation, as well as a special case of deep linear networks, do not suffer from loss of plasticity. We then propose deep Fourier features, which are the concatenation of a sine and cosine in every layer, and we show that this combination provides a dynamic balance between the trainability obtained through linearity and the effectiveness obtained through the nonlinearity of neural networks. Deep networks composed entirely of deep Fourier features are highly trainable and sustain their trainability over the course of learning. Our empirical results show that continual learning performance can be drastically improved by replacing ReLU activations with deep Fourier features. These results hold for different continual learning scenarios (e.g., label noise, class incremental learning, pixel permutations) on all major supervised learning datasets used for continual learning research, such as CIFAR10, CIFAR100, and tiny-ImageNet.

Submitted to arXiv on 27 Oct. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2410.20634v1

In the paper "Plastic Learning with Deep Fourier Features," authors Alex Lewandowski, Dale Schuurmans, and Marlos C. Machado address the challenge of continual learning in deep neural networks. They specifically focus on the phenomenon known as loss of plasticity and identify underlying principles that lead to plastic algorithms. The authors provide theoretical results showing that linear function approximation and a special case of deep linear networks do not suffer from loss of plasticity. To overcome this challenge, the authors propose deep Fourier features which involve the concatenation of a sine and cosine in every layer of a neural network. This combination strikes a dynamic balance between trainability achieved through linearity and effectiveness obtained through nonlinearity in neural networks. Networks composed entirely of deep Fourier features exhibit high trainability throughout the learning process. Empirical results presented in the paper demonstrate significant improvements in continual learning performance when ReLU activations are replaced with deep Fourier features. These improvements are observed across various continual learning scenarios such as label noise, class incremental learning, and pixel permutations on popular supervised learning datasets including CIFAR10, CIFAR100, and tiny-ImageNet. Further experiments conducted by the authors highlight the benefits of deep Fourier features in diminishing label noise settings. Despite initial challenges with corrupted labels on early tasks, networks utilizing deep Fourier features consistently achieve high test accuracy on uncorrupted test sets as label noise diminishes over subsequent tasks. Overall, this study underscores the effectiveness of adaptive-linearity as an inductive bias for continual learning in deep neural networks. The incorporation of deep Fourier features offers a promising approach to enhancing trainability and generalization capabilities in evolving environments.
Created on 30 Mar. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.