Knowledge Refinement via Interaction Between Search Engines and Large Language Models
AI-generated Key Points
- Information retrieval (IR) is crucial for finding relevant resources from vast amounts of data
- Applications of IR have evolved from traditional knowledge bases to modern search engines (SEs)
- Large language models (LLMs) have revolutionized the field by enabling natural language interaction with search systems
- The authors explore the advantages and disadvantages of LLMs and SEs in understanding user queries and retrieving up-to-date information
- They propose a novel framework called InteR that facilitates knowledge refinement through interaction between SEs and LLMs
- InteR allows SEs to expand knowledge in queries using LLM-generated knowledge collections and enables LLMs to enhance prompt formulation using SE-retrieved documents
- Experiments on large-scale retrieval benchmarks show that InteR achieves superior zero-shot retrieval performance compared to state-of-the-art methods
- The proposed framework can benefit various domains beyond traditional keyword-based searches, such as research on jazz music
- LLMs excel in understanding contextual queries and generating specific answers, while SEs are efficient at indexing vast amounts of data and delivering results based on precise keywords
- By combining the strengths of both LLMs and SEs through iterative refinement, InteR offers an enhanced retrieval experience with improved accuracy
Authors: Jiazhan Feng, Chongyang Tao, Xiubo Geng, Tao Shen, Can Xu, Guodong Long, Dongyan Zhao, Daxin Jiang
Abstract: Information retrieval (IR) plays a crucial role in locating relevant resources from vast amounts of data, and its applications have evolved from traditional knowledge bases to modern search engines (SEs). The emergence of large language models (LLMs) has further revolutionized the IR field by enabling users to interact with search systems in natural language. In this paper, we explore the advantages and disadvantages of LLMs and SEs, highlighting their respective strengths in understanding user-issued queries and retrieving up-to-date information. To leverage the benefits of both paradigms while circumventing their limitations, we propose InteR, a novel framework that facilitates knowledge refinement through interaction between SEs and LLMs. InteR allows SEs to expand knowledge in queries using LLM-generated knowledge collections and enables LLMs to enhance prompt formulation using SE-retrieved documents. This iterative refinement process augments the inputs of SEs and LLMs, leading to more accurate retrieval. Experiments on large-scale retrieval benchmarks involving web search and low-resource retrieval tasks demonstrate that InteR achieves overall superior zero-shot retrieval performance compared to state-of-the-art methods, even those using relevance judgment. Source code is available at https://github.com/Cyril-JZ/InteR
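The iterative refinement described in the abstract can be illustrated with a small sketch. This is not the authors' implementation (see the linked repository for that); it is a toy loop under stated assumptions: the search engine is a simple term-overlap ranker, the LLM is a stub that returns a canned passage, and the names `search`, `build_prompt`, `toy_llm`, and `inter_sketch` are all hypothetical. It only shows the data flow: retrieved documents are folded into the LLM prompt, and the LLM's generated knowledge is appended to the query for the next retrieval round.

```python
from collections import Counter
from typing import Callable, List

# Hypothetical three-document corpus, for illustration only.
CORPUS = [
    "bebop emerged in the 1940s with rapid tempos and complex improvisation",
    "the stock market closed higher on friday",
    "swing was a popular jazz style for dancing in the 1930s",
]


def tokenize(text: str) -> List[str]:
    return text.lower().split()


def search(query: str, corpus: List[str], k: int = 2) -> List[str]:
    """Toy search engine: rank documents by term overlap with the query."""
    q = Counter(tokenize(query))
    scored = []
    for i, doc in enumerate(corpus):
        d = Counter(tokenize(doc))
        score = sum(min(q[t], d[t]) for t in q)
        # -i breaks ties in favour of earlier documents.
        scored.append((score, -i, doc))
    scored.sort(reverse=True)
    return [doc for score, _, doc in scored[:k] if score > 0]


def build_prompt(query: str, docs: List[str]) -> str:
    """Fold SE-retrieved documents into the LLM prompt (assumed format)."""
    context = "\n".join(f"- {d}" for d in docs)
    return f"Context:\n{context}\n\nQuestion: {query}\nWrite a passage that answers it."


def toy_llm(prompt: str) -> str:
    """Stand-in for a real LLM call; returns a canned knowledge passage."""
    return "bebop is known for rapid tempos and improvisation"


def inter_sketch(
    query: str,
    corpus: List[str],
    llm: Callable[[str], str],
    rounds: int = 2,
    k: int = 2,
) -> List[str]:
    docs = search(query, corpus, k)          # initial retrieval
    for _ in range(rounds):
        knowledge = llm(build_prompt(query, docs))   # LLM refined by SE output
        expanded_query = f"{query} {knowledge}"      # SE refined by LLM output
        docs = search(expanded_query, corpus, k)
    return docs


if __name__ == "__main__":
    print(search("fast jazz style", CORPUS))
    print(inter_sketch("fast jazz style", CORPUS, toy_llm))
```

In this toy setup the plain query shares no terms with the bebop document, so the initial search misses it; once the stub's passage is appended to the query, that document ranks first. A real system would replace `search` with an actual retriever and `toy_llm` with a model call.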