Vector-ICL: In-context Learning with Continuous Vector Representations

AI-generated keywords: Large Language Models In-Context Learning Vector-ICL Pretraining Performance Evaluation

AI-generated Key Points

  • Large Language Models (LLMs) can extend in-context learning (ICL) capabilities to continuous vectors from diverse domains
  • Vector-ICL aligns input data with LLM's embedding space through lightweight projectors for effective processing and learning
  • Pretraining projectors with general language modeling objectives facilitates Vector-ICL, while task-specific finetuning enhances performance
  • Vector-ICL consistently outperforms few-shot ICL and domain-specific models or tuning in experiments and case studies
  • LLMs show potential in text summarization and molecule captioning tasks with evaluation metrics like RougeL and BLEU score
  • LLMs are investigated for text reconstruction and arithmetic/function regression tasks using synthetic datasets
  • Research highlights the versatility of LLMs in processing diverse data types through Vector-ICL methodology for applications beyond traditional token-based paradigms
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yufan Zhuang, Chandan Singh, Liyuan Liu, Jingbo Shang, Jianfeng Gao

License: CC BY 4.0

Abstract: Large language models (LLMs) have shown remarkable in-context learning (ICL) capabilities on textual data. We explore whether these capabilities can be extended to continuous vectors from diverse domains, obtained from black-box pretrained encoders. By aligning input data with an LLM's embedding space through lightweight projectors, we observe that LLMs can effectively process and learn from these projected vectors, which we term Vector-ICL. In particular, we find that pretraining projectors with general language modeling objectives enables Vector-ICL, while task-specific finetuning further enhances performance. In our experiments across various tasks and modalities, including text reconstruction, numerical function regression, text classification, summarization, molecule captioning, time-series classification, graph classification, and fMRI decoding, Vector-ICL often surpasses both few-shot ICL and domain-specific model or tuning. We further conduct analyses and case studies, indicating the potential of LLMs to process vector representations beyond traditional token-based paradigms.

Submitted to arXiv on 08 Oct. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2410.05629v1

This study explores the potential of Large Language Models (LLMs) to extend their remarkable in-context learning (ICL) capabilities beyond textual data to continuous vectors from diverse domains. The researchers introduce Vector-ICL, a method that aligns input data with an LLM's embedding space through lightweight projectors, enabling effective processing and learning from these projected vectors. They find that pretraining projectors with general language modeling objectives facilitates Vector-ICL and task-specific finetuning further enhances performance. Through various experiments and case studies, Vector-ICL consistently outperforms few-shot ICL and domain-specific models or tuning. Additionally, the study showcases LLMs' potential in text summarization and molecule captioning tasks using performance evaluation metrics like RougeL and BLEU score. The researchers also investigate LLMs' ability in text reconstruction and arithmetic/function regression tasks using synthetic datasets. Overall, this comprehensive research highlights the versatility of LLMs in processing diverse data types through Vector-ICL methodology and their potential in various applications beyond traditional token-based paradigms.
Created on 16 Dec. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.