Adaptive Re-Ranking with a Corpus Graph

AI-generated keywords: Adaptive Re-Ranking Corpus Graph Search Systems Clustering Hypothesis Graph-based Adaptive Re-ranking

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The paper introduces Graph-based Adaptive Re-ranking (GAR) as a novel approach to improving re-ranking pipelines in search systems.
  • GAR has shown significant improvements in precision- and recall-oriented measures compared to traditional re-ranking methods.
  • The method is based on the clustering hypothesis and involves continuously adding similar documents to the candidate pool during re-ranking.
  • GAR is compatible with existing techniques like dense retrieval, robust in terms of hyperparameters, and adds minimal computational and storage costs.
  • Experiments on the MS MARCO passage ranking dataset showed promising results, with GAR enhancing nDCG of a BM25 candidate pool by up to 8% when combined with a monoT5 ranker.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Sean MacAvaney, Nicola Tonellotto, Craig Macdonald

CIKM 2022

Abstract: Search systems often employ a re-ranking pipeline, wherein documents (or passages) from an initial pool of candidates are assigned new ranking scores. The process enables the use of highly-effective but expensive scoring functions that are not suitable for use directly in structures like inverted indices or approximate nearest neighbour indices. However, re-ranking pipelines are inherently limited by the recall of the initial candidate pool; documents that are not identified as candidates for re-ranking by the initial retrieval function cannot be identified. We propose a novel approach for overcoming the recall limitation based on the well-established clustering hypothesis. Throughout the re-ranking process, our approach adds documents to the pool that are most similar to the highest-scoring documents up to that point. This feedback process adapts the pool of candidates to those that may also yield high ranking scores, even if they were not present in the initial pool. It can also increase the score of documents that appear deeper in the pool that would have otherwise been skipped due to a limited re-ranking budget. We find that our Graph-based Adaptive Re-ranking (GAR) approach significantly improves the performance of re-ranking pipelines in terms of precision- and recall-oriented measures, is complementary to a variety of existing techniques (e.g., dense retrieval), is robust to its hyperparameters, and contributes minimally to computational and storage costs. For instance, on the MS MARCO passage ranking dataset, GAR can improve the nDCG of a BM25 candidate pool by up to 8% when applying a monoT5 ranker.

Submitted to arXiv on 18 Aug. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2208.08942v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper "Adaptive Re-Ranking with a Corpus Graph" by Sean MacAvaney, Nicola Tonellotto, and Craig Macdonald introduces a novel approach to improving the performance of re-ranking pipelines in search systems. The proposed method is known as Graph-based Adaptive Re-ranking (GAR) and has shown significant improvements in precision- and recall-oriented measures compared to traditional re-ranking methods. <br><br> Re-ranking pipelines typically involve assigning new ranking scores to documents or passages from an initial pool of candidates. These pipelines are limited by the recall of the initial candidate pool, as documents not identified initially cannot be re-ranked. To overcome this limitation, the authors propose a method based on the clustering hypothesis. Their approach involves continuously adding documents to the candidate pool that are most similar to the highest-scoring documents at each stage of re-ranking. This feedback process adapts the pool to include potentially high-ranking documents that were not present in the initial set and boosts the scores of deeper-lying documents that may have been overlooked due to budget constraints.<br><br> GAR is also compatible with various existing techniques such as dense retrieval and is robust in terms of hyperparameters. It adds minimal computational and storage costs while showing promising results in experiments on the MS MARCO passage ranking dataset. When combined with a monoT5 ranker, GAR was able to enhance the nDCG of a BM25 candidate pool by up to 8%. Overall, this innovative approach presents a promising solution for enhancing search system performance through adaptive re-ranking strategies based on corpus graphs.
Created on 30 Jun. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.