Knowledge Refinement via Interaction Between Search Engines and Large Language Models

AI-generated keywords: Information Retrieval

AI-generated Key Points

  • Information retrieval (IR) is crucial for finding relevant resources from vast amounts of data
  • Applications of IR have evolved from traditional knowledge bases to modern search engines (SEs)
  • Large language models (LLMs) have revolutionized the field by enabling natural language interaction with search systems
  • The authors explore the advantages and disadvantages of LLMs and SEs in understanding user queries and retrieving up-to-date information
  • They propose a novel framework called InteR that facilitates knowledge refinement through interaction between SEs and LLMs
  • InteR allows SEs to expand knowledge in queries using LLM-generated knowledge collections and enables LLMs to enhance prompt formulation using SE-retrieved documents
  • Experiments on large-scale retrieval benchmarks show that InteR achieves superior zero-shot retrieval performance compared to state-of-the-art methods
  • The proposed framework can benefit various domains beyond traditional keyword-based searches, such as research on jazz music
  • LLMs excel in understanding contextual queries and generating specific answers, while SEs are efficient at indexing vast amounts of data and delivering results based on precise keywords
  • By combining the strengths of both LLMs and SEs through iterative refinement, InteR offers an enhanced retrieval experience with improved accuracy.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Jiazhan Feng, Chongyang Tao, Xiubo Geng, Tao Shen, Can Xu, Guodong Long, Dongyan Zhao, Daxin Jiang

Work in progress. We added BEIR results and released source code
License: CC BY 4.0

Abstract: Information retrieval (IR) plays a crucial role in locating relevant resources from vast amounts of data, and its applications have evolved from traditional knowledge bases to modern search engines (SEs). The emergence of large language models (LLMs) has further revolutionized the IR field by enabling users to interact with search systems in natural language. In this paper, we explore the advantages and disadvantages of LLMs and SEs, highlighting their respective strengths in understanding user-issued queries and retrieving up-to-date information. To leverage the benefits of both paradigms while circumventing their limitations, we propose InteR, a novel framework that facilitates knowledge refinement through interaction between SEs and LLMs. InteR allows SEs to expand knowledge in queries using LLM-generated knowledge collections and enables LLMs to enhance prompt formulation using SE-retrieved documents. This iterative refinement process augments the inputs of SEs and LLMs, leading to more accurate retrieval. Experiments on large-scale retrieval benchmarks involving web search and low-resource retrieval tasks demonstrate that InteR achieves overall superior zero-shot retrieval performance compared to state-of-the-art methods, even those using relevance judgment. Source code is available at https://github.com/Cyril-JZ/InteR

Submitted to arXiv on 12 May. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2305.07402v2

Information retrieval (IR) is a crucial process in finding relevant resources from vast amounts of data. Over time, its applications have evolved from traditional knowledge bases to modern search engines (SEs). The emergence of large language models (LLMs) has further revolutionized the field by enabling users to interact with search systems using natural language. In this paper, the authors explore the advantages and disadvantages of LLMs and SEs, highlighting their respective strengths in understanding user-issued queries and retrieving up-to-date information. To leverage the benefits of both paradigms while overcoming their limitations, the authors propose a novel framework called InteR. This framework facilitates knowledge refinement through interaction between SEs and LLMs. InteR allows SEs to expand knowledge in queries using LLM-generated knowledge collections and enables LLMs to enhance prompt formulation using SE-retrieved documents. This iterative refinement process enhances the inputs of both SEs and LLMs, leading to more accurate retrieval results. The authors conduct experiments on large-scale retrieval benchmarks involving web search and low-resource retrieval tasks. The results demonstrate that InteR achieves superior zero-shot retrieval performance compared to state-of-the-art methods, even those using relevance judgment. In addition to the existing summary, it is highlighted that the proposed framework can benefit various domains beyond traditional keyword-based searches. For example, students researching jazz music can use LLMs to pose complex questions about key pioneers and their influence on the genre, allowing for more precise retrieval of relevant information. While LLMs excel in understanding contextual queries and generating specific answers, SEs still have significant advantages over LLMs. SEs can index a vast amount of data efficiently and deliver relevant results based on well-designed precise keywords. However, by combining the strengths of both LLMs and SEs through iterative refinement, InteR offers an enhanced retrieval experience with improved accuracy. Overall, this paper presents a novel framework that bridges the gap between LLMs and SEs, offering a promising approach to enhance information retrieval in various domains.
Created on 05 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.