LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

AI-generated keywords: LoraHub LoRA LLMs Big-Bench Hard Cross-Task Transfer

AI-generated Key Points

  • LoraHub framework enables cross-task generalization and adaptability in large language models (LLMs)
  • Low-rank adaptations (LoRA) modules are strategically assembled in LoraHub
  • LoraHub learning performs comparably or better than gradient-dependent methods in few-shot scenarios
  • Investigation of different LoRA modules for tasks in the Big-Bench Hard (BBH) benchmark
  • Five tasks identified with substantial influence and effective for cross-task transfer
  • These tasks require higher-level skills such as reading comprehension and reasoning
  • Contribution to the development of a community for LoRA, where users can share trained modules and advance general intelligence and LLMs in production.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Chengsong Huang, Qian Liu, Bill Yuchen Lin, Tianyu Pang, Chao Du, Min Lin

Work in progress. The first three authors contributed equally to this work
License: CC BY-SA 4.0

Abstract: Low-rank adaptations (LoRA) are often employed to fine-tune large language models (LLMs) for new tasks. This paper investigates LoRA composability for cross-task generalization and introduces LoraHub, a strategic framework devised for the purposive assembly of LoRA modules trained on diverse given tasks, with the objective of achieving adaptable performance on unseen tasks. With just a few examples from a novel task, LoraHub enables the fluid combination of multiple LoRA modules, eradicating the need for human expertise. Notably, the composition requires neither additional model parameters nor gradients. Our empirical results, derived from the Big-Bench Hard (BBH) benchmark, suggest that LoraHub can effectively mimic the performance of in-context learning in few-shot scenarios, excluding the necessity of in-context examples alongside each inference input. A significant contribution of our research is the fostering of a community for LoRA, where users can share their trained LoRA modules, thereby facilitating their application to new tasks. We anticipate this resource will widen access to and spur advancements in general intelligence as well as LLMs in production. Code will be available at https://github.com/sail-sg/lorahub.

Submitted to arXiv on 25 Jul. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2307.13269v1

This paper presents an analysis of the LoraHub framework which enables cross-task generalization and adaptability in large language models (LLMs) through the strategic assembly of Low-rank adaptations (LoRA) modules. The authors compare LoraHub learning with LoRA tuning and conventional fine-tuning methods and find that it performs comparably or even better than gradient-dependent methods in few-shot scenarios. They also investigate the effectiveness of different LoRA modules for tasks in the Big-Bench Hard (BBH) benchmark, identifying five tasks that have substantial influence and are particularly effective for cross-task transfer. These tasks require higher-level skills such as reading comprehension and reasoning. This research contributes to the development of a community for LoRA where users can share their trained modules and advance general intelligence and LLMs in production.
Created on 27 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.