Structured Pruning Adapters

AI-generated keywords: Structured Pruning Adapters Compressing Task-Switching Accelerate Specialize

AI-generated Key Points

  • Introduction of Structured Pruning Adapters (SPAs), a family of compressing, task-switching network adapters
  • SPAs accelerate and specialize networks using tiny parameter sets and structured pruning
  • Evaluation of channel-based SPA with various pruning methods on multiple computer vision benchmarks
  • Channel-SPAs improve accuracy by an average of 6.9% while using only half the parameters at 90% pruned weights
  • Channel-SPAs can learn adaptations with 17 times fewer parameters at 70% pruning with a slight decrease in accuracy of 1.6%
  • Block-SPA requires significantly fewer parameters than pruning with fine-tuning
  • Knowledge distillation using unpruned model as teacher helps retain accuracy better in pruning methods
  • Other approaches for accelerating neural networks mentioned: Continual Inference Networks, quantization approaches, and pruning methods that remove unnecessary network weights from pre-trained models
  • Structured Pruning Adapters offer an efficient alternative to fine-tuning by achieving higher accuracy with fewer parameters
  • Experimental code and Python library of adapters available for further exploration
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Lukas Hedegaard, Aman Alok, Juby Jose, Alexandros Iosifidis

11 pages, 6 figures, 2 tables
License: CC BY-NC-SA 4.0

Abstract: Adapters are a parameter-efficient alternative to fine-tuning, which augment a frozen base network to learn new tasks. Yet, the inference of the adapted model is often slower than the corresponding fine-tuned model. To improve on this, we propose Structured Pruning Adapters (SPAs), a family of compressing, task-switching network adapters, that accelerate and specialize networks using tiny parameter sets and structured pruning. Specifically, we propose a channel-based SPA and evaluate it with a suite of pruning methods on multiple computer vision benchmarks. Compared to regular structured pruning with fine-tuning, our channel-SPAs improve accuracy by 6.9% on average while using half the parameters at 90% pruned weights. Alternatively, they can learn adaptations with 17x fewer parameters at 70% pruning with 1.6% lower accuracy. Similarly, our block-SPA requires far fewer parameters than pruning with fine-tuning. Our experimental code and Python library of adapters are available at github.com/lukashedegaard/structured-pruning-adapters.

Submitted to arXiv on 17 Nov. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2211.10155v3

The paper introduces Structured Pruning Adapters (SPAs), a family of compressing, task-switching network adapters that accelerate and specialize networks using tiny parameter sets and structured pruning. The authors propose a channel-based SPA and evaluate it with various pruning methods on multiple computer vision benchmarks. Compared to regular structured pruning with fine-tuning, the channel-SPAs improve accuracy by an average of 6.9% while using only half the parameters at 90% pruned weights. Alternatively, they can learn adaptations with 17 times fewer parameters at 70% pruning with a slight decrease in accuracy of 1.6%. Similarly, the block-SPA requires significantly fewer parameters than pruning with fine-tuning. The authors also mention that knowledge distillation using the unpruned model as the teacher has been found to help pruning methods retain accuracy better. The paper highlights other approaches for accelerating neural networks such as Continual Inference Networks which optimize computational sequences and intra-layer caching for online stream processing; quantization approaches that reduce model size and run-time costs through low-resolution numerical representations of network weights; and pruning methods that entirely remove unnecessary network weights from pre-trained models. Overall, the proposed Structured Pruning Adapters offer an efficient alternative to fine-tuning by achieving higher accuracy with fewer parameters. The experimental code and Python library of adapters are available for further exploration.
Created on 05 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.