WebCPM: Interactive Web Search for Chinese Long-form Question Answering

AI-generated keywords: Long-form Question Answering WebCPM Interactive Web Search Language Models Fine-Tuning

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Introduction of WebCPM, the first Chinese Long-form Question Answering (LFQA) dataset
  • LFQA aims to answer complex, open-ended questions with detailed responses
  • LFQA involves two procedures: information retrieval and information synthesis
  • WebCPM stands out because it uses interactive web search for information retrieval
  • Development of a web search interface called WebGPT for annotators to search for relevant information and answer questions
  • Dataset includes 5,500 question-answer pairs, 14,315 supporting facts, and 121,330 web search actions
  • Pre-trained language models are fine-tuned to generate answers based on collected facts and imitate human behaviors in web search
  • LFQA pipeline using fine-tuned models generates answers comparable to human-written ones in 32.5% and 47.5% of cases on their dataset and DuReader respectively
  • WebCPM is a valuable resource for Chinese LFQA research by incorporating interactive web search and demonstrating the effectiveness of fine-tuned language models
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yujia Qin, Zihan Cai, Dian Jin, Lan Yan, Shihao Liang, Kunlun Zhu, Yankai Lin, Xu Han, Ning Ding, Huadong Wang, Ruobing Xie, Fanchao Qi, Zhiyuan Liu, Maosong Sun, Jie Zhou

ACL 2023, main conference

Abstract: Long-form question answering (LFQA) aims at answering complex, open-ended questions with detailed, paragraph-length responses. The de facto paradigm of LFQA necessitates two procedures: information retrieval, which searches for relevant supporting facts, and information synthesis, which integrates these facts into a coherent answer. In this paper, we introduce WebCPM, the first Chinese LFQA dataset. One unique feature of WebCPM is that its information retrieval is based on interactive web search, which engages with a search engine in real time. Following WebGPT, we develop a web search interface. We recruit annotators to search for relevant information using our interface and then answer questions. Meanwhile, the web search behaviors of our annotators would be recorded. In total, we collect 5,500 high-quality question-answer pairs, together with 14,315 supporting facts and 121,330 web search actions. We fine-tune pre-trained language models to imitate human behaviors for web search and to generate answers based on the collected facts. Our LFQA pipeline, built on these fine-tuned models, generates answers that are no worse than human-written ones in 32.5% and 47.5% of the cases on our dataset and DuReader, respectively.

Submitted to arXiv on 11 May. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2305.06849v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper introduces WebCPM, the first Chinese Long-form Question Answering (LFQA) dataset. LFQA aims to answer complex, open-ended questions with detailed responses. The process of LFQA involves two procedures: information retrieval and information synthesis. Information retrieval searches for relevant supporting facts, while information synthesis integrates these facts into a coherent answer. WebCPM stands out because its information retrieval is based on interactive web search, engaging with a search engine in real time. The authors develop a web search interface called WebGPT and recruit annotators to use this interface to search for relevant information and answer questions. The annotators' web search behaviors are recorded during the process. The dataset collected includes 5,500 high-quality question-answer pairs, along with 14,315 supporting facts and 121,330 web search actions. To generate answers based on the collected facts and imitate human behaviors for web search, pre-trained language models are fine-tuned. The authors build an LFQA pipeline using these fine-tuned models. The pipeline generates answers that are no worse than human-written ones in 32.5% and 47.5% of the cases on their dataset and DuReader respectively. Overall, this paper presents WebCPM as a valuable resource for Chinese LFQA research by providing a dataset that incorporates interactive web search for information retrieval and demonstrates the effectiveness of fine-tuned language models in generating accurate answers.
Created on 04 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.