Guiding Retrieval using LLM-based Listwise Rankers

AI-generated keywords: Large Language Models (LLMs)

AI-generated Key Points

  • Large Language Models (LLMs) are powerful tools for reranking search results in listwise settings.
  • The "cascading" retrieve-and-rerank approach faces the bounded recall problem, leading to relevant documents being permanently excluded from the final ranking.
  • Adaptive retrieval techniques have been proposed to address this limitation but do not seamlessly integrate with listwise rerankers.
  • The proposed algorithm merges initial ranking results and feedback documents to effectively overcome the bounded recall problem for LLM rerankers.
  • Extensive experimentation shows that the method enhances nDCG@10 by up to 13.23% and recall by 28.02% while maintaining a constant number of LLM inferences and minimal overheads.
  • Integrating adaptive retrieval methods with LLM-based listwise rerankers can enhance search performance and overcome recall limitations in traditional cascading approaches.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Mandeep Rathee, Sean MacAvaney, Avishek Anand

16 pages, 2 figures, 3 tables
License: CC BY 4.0

Abstract: Large Language Models (LLMs) have shown strong promise as rerankers, especially in ``listwise'' settings where an LLM is prompted to rerank several search results at once. However, this ``cascading'' retrieve-and-rerank approach is limited by the bounded recall problem: relevant documents not retrieved initially are permanently excluded from the final ranking. Adaptive retrieval techniques address this problem, but do not work with listwise rerankers because they assume a document's score is computed independently from other documents. In this paper, we propose an adaptation of an existing adaptive retrieval method that supports the listwise setting and helps guide the retrieval process itself (thereby overcoming the bounded recall problem for LLM rerankers). Specifically, our proposed algorithm merges results both from the initial ranking and feedback documents provided by the most relevant documents seen up to that point. Through extensive experiments across diverse LLM rerankers, first stage retrievers, and feedback sources, we demonstrate that our method can improve nDCG@10 by up to 13.23% and recall by 28.02%--all while keeping the total number of LLM inferences constant and overheads due to the adaptive process minimal. The work opens the door to leveraging LLM-based search in settings where the initial pool of results is limited, e.g., by legacy systems, or by the cost of deploying a semantic first-stage.

Submitted to arXiv on 15 Jan. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2501.09186v1

, , , , In the realm of information retrieval, Large Language Models (LLMs) have emerged as powerful tools for reranking search results, particularly in listwise settings where multiple results are reranked simultaneously. However, a common challenge faced by this "cascading" retrieve-and-rerank approach is the bounded recall problem, which leads to relevant documents that were not initially retrieved being permanently excluded from the final ranking. To address this limitation, adaptive retrieval techniques have been proposed, but they do not seamlessly integrate with listwise rerankers due to their assumption that a document's score is computed independently from other documents. To overcome this challenge and improve search performance, our paper introduces an adaptation of an existing adaptive retrieval method tailored to support listwise rerankers and guide the retrieval process itself. By merging results from both the initial ranking and feedback documents provided by the most relevant seen up to that point, our proposed algorithm effectively overcomes the bounded recall problem for LLM rerankers. Through extensive experimentation across various LLM rerankers, first stage retrievers, and feedback sources, we demonstrate that our method significantly enhances nDCG@10 by up to 13.23% and recall by 28.02%, all while maintaining a constant number of LLM inferences and minimal overheads associated with the adaptive process. Furthermore, while previous advancements in this space have made progress in addressing representation bottlenecks and operational challenges related to deploying new core engines for adaptive retrieval techniques, there are still limitations in handling certain queries effectively. Our work contributes to bridging these gaps and opens up opportunities for leveraging LLM-based search in scenarios where the initial pool of results is constrained by legacy systems or cost considerations surrounding semantic first-stage deployment. Overall, our research showcases the potential of integrating adaptive retrieval methods with LLM-based listwise rerankers to enhance search performance and overcome recall limitations inherent in traditional cascading approaches.
Created on 30 Jun. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.