LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach

AI-generated keywords: KDD CUP 2024 Source Tracing Competition Closed-Source LLMs Ensemble Learning GPU-Free Approach

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Team consisting of Kunlong Chen, Junjun Wang, Zhaoqun Chen, Kunjin Chen, and Yitian Chen secured 3rd place in KDD CUP 2024 paper source tracing competition
  • Innovative approach using closed-source large language models (LLMs) for identifying reference sources of academic papers
  • Methodology independent from GPUs for model training, instead leveraging LLMs for generating predicted reference sources directly from papers
  • Utilized ensemble learning techniques to enhance prediction accuracy
  • Research detailed in the paper titled "LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach"
  • Code available at https://github.com/Cklwanfifa/KDDCUP2024-PST
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Kunlong Chen, Junjun Wang, Zhaoqun Chen, Kunjin Chen, Yitian Chen

Abstract: We participated in the KDD CUP 2024 paper source tracing competition and achieved the 3rd place. This competition tasked participants with identifying the reference sources (i.e., ref-sources, as referred to by the organizers of the competition) of given academic papers. Unlike most teams that addressed this challenge by fine-tuning pre-trained neural language models such as BERT or ChatGLM, our primary approach utilized closed-source large language models (LLMs). With recent advancements in LLM technology, closed-source LLMs have demonstrated the capability to tackle complex reasoning tasks in zero-shot or few-shot scenarios. Consequently, in the absence of GPUs, we employed closed-source LLMs to directly generate predicted reference sources from the provided papers. We further refined these predictions through ensemble learning. Notably, our method was the only one among the award-winning approaches that did not require the use of GPUs for model training. Code available at https://github.com/Cklwanfifa/KDDCUP2024-PST.

Submitted to arXiv on 14 Sep. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2409.09383v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

Our team, consisting of Kunlong Chen, Junjun Wang, Zhaoqun Chen, Kunjin Chen, and Yitian Chen, secured the 3rd place in the KDD CUP 2024 paper source tracing competition with our innovative approach. The challenge was to identify the reference sources (ref-sources) of academic papers. While many teams relied on fine-tuning pre-trained neural language models like BERT or ChatGLM, we took a different route by leveraging closed-source large language models (LLMs). These LLMs have shown remarkable capabilities in handling complex reasoning tasks in zero-shot or few-shot scenarios. One notable aspect of our methodology was its independence from GPUs for model training. Instead of traditional GPU-intensive methods, we utilized closed-source LLMs to directly generate predicted reference sources from the provided papers. To enhance the accuracy of these predictions, we implemented ensemble learning techniques. Our approach stood out among the award-winning strategies as the only one that did not require GPU usage during model training. Our work has been detailed in the paper titled "LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach," authored by Kunlong Chen, Junjun Wang, Zhaoqun Chen, Kunjin Chen, and Yitian Chen. This research showcases how advancements in LLM technology can revolutionize source tracing tasks and highlights the effectiveness of ensemble learning in refining predictions. For those interested in exploring our methodology further, the code is available at https://github.com/Cklwanfifa/KDDCUP2024-PST. Overall, our success in this competition underscores the potential of closed-source LLMs and novel approaches like ensemble learning in addressing challenging tasks within the realm of academic paper source tracing.
Created on 20 Oct. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.