FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition

AI-generated keywords: FeDeRA Efficient Fine-tuning Language Models Federated Learning Weight Decomposition

AI-generated Key Points

  • FeDeRA is a groundbreaking study that addresses user privacy concerns in centralized training of Pre-trained Language Models (PLMs)
  • Introduces FeDeRA as an improvement over existing methods like LoRA with exceptional performance in federated learning settings
  • Utilizes Parameter-Efficient Fine-Tuning (PEFT) to tackle the burden on client computational resources in Federated Learning (FL)
  • Proposes FeDeRA, which uses Singular Value Decomposition (SVD) on pre-trained matrices to select principal components for fine-tuning language models
  • Outperformed all other PEFT methods and demonstrated comparable or superior performance to full parameter fine-tuning (FT) methods
  • Significantly reduced training time by 95.9% to 97.9% compared to FT across different tasks using RoBERTa and DeBERTaV3
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yuxuan Yan, Shunpu Tang, Zhiguo Shi, Qianqian Yang

License: CC BY 4.0

Abstract: Pre-trained Language Models (PLMs) have shown excellent performance on various downstream tasks after fine-tuning. Nevertheless, the escalating concerns surrounding user privacy have posed significant challenges to centralized training reliant on extensive data collection. Federated learning(FL), which only requires training on the clients and aggregates weights on the server without sharing data, has emerged as a solution. However, the substantial parameter size of PLMs places a significant burden on the computational resources of client devices, while also leading to costly communication expenses. Introducing Parameter-Efficient Fine-Tuning(PEFT) into FL can effectively address this problem. However, we observe that the non-IID data in federated learning leads to a gap in performance between the PEFT method and full parameter fine-tuning(FT). To overcome this, we propose FeDeRA, an improvement over the LoRA method in FL. FeDeRA uses the same adapter module as LoRA. However, the difference lies in FeDeRA's initialization of the adapter module by performing Singular Value Decomposition (SVD) on the pre-trained matrix and selecting its principal components. We conducted extensive experiments, using RoBERTa and DeBERTaV3, on three tasks and six datasets, comparing the methods including FT and the other three different PEFT methods. FeDeRA outperforms all other PEFT methods and is comparable to or even surpasses the performance of FT methods. We also deployed federated learning on Jetson AGX Orin and compared the time required by different methods to achieve the target accuracy on specific tasks. Compared to FT, FeDeRA reduces the training time by 95.9%, 97.9%, 96.9%, and 97.3%, 96.5%, and 96.5% respectively on three tasks using RoBERTa and DeBERTaV3. The overall experiments indicate that FeDeRA achieves good performance while also maintaining efficiency.

Submitted to arXiv on 29 Apr. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2404.18848v1

FeDeRA: Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition is a groundbreaking study that addresses the challenges posed by user privacy concerns in centralized training of Pre-trained Language Models (PLMs). The study introduces FeDeRA as an improvement over existing methods like LoRA and showcases its exceptional performance while maintaining efficiency in federated learning settings. The escalating concerns surrounding extensive data collection have prompted the adoption of Federated Learning (FL) as a solution, where training occurs on client devices without sharing data. However, the substantial parameter size of PLMs places a burden on client computational resources and leads to costly communication expenses. To tackle this issue, Parameter-Efficient Fine-Tuning (PEFT) has been introduced into FL. Despite its effectiveness, non-IID data in federated learning results in performance gaps between PEFT and full parameter fine-tuning (FT). In response, the authors propose FeDeRA which utilizes an adapter module initialized through Singular Value Decomposition (SVD) on pre-trained matrices to select principal components. This innovative approach sets FeDeRA apart as a promising method for enhancing language model fine-tuning in FL scenarios. Extensive experiments were conducted using RoBERTa and DeBERTaV3 on three tasks across six datasets, comparing various methods including FT and other PEFT approaches. FeDeRA outperformed all other PEFT methods and demonstrated comparable or superior performance to FT methods. Additionally, federated learning was deployed on Jetson AGX Orin to compare training times for specific tasks. Compared to FT, FeDeRA significantly reduced training time by 95.9% to 97.9% across different tasks using RoBERTa and DeBERTaV3. In conclusion, FeDeRA offers an efficient solution for fine-tuning language models in federated learning settings. Its use of weight decomposition through SVD allows for improved performance while maintaining efficiency, making it a promising method for future FL applications.
Created on 03 Jun. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.