Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

AI-generated keywords: Proactive Agents

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors propose a novel data-driven approach to develop proactive agents capable of anticipating and initiating tasks without explicit human instructions
  • Real-world human activities are collected to generate proactive task predictions, which human annotators then label as accepted or rejected
  • A reward model is trained to evaluate the proactiveness of Large Language Model (LLM) agents automatically
  • Comprehensive data generation pipeline creates a diverse dataset called ProactiveBench containing 6,790 events
  • Fine-tuning models with this dataset significantly enhances the proactiveness of LLM agents, achieving an F1-Score of 66.47% in proactively offering assistance
  • Research paves the way for future advancements in human-agent collaboration by enabling agents to anticipate and initiate tasks autonomously with foresight

Authors: Yaxi Lu, Shenzhi Yang, Cheng Qian, Guirong Chen, Qinyu Luo, Yesai Wu, Huadong Wang, Xin Cong, Zhong Zhang, Yankai Lin, Weiwen Liu, Yasheng Wang, Zhiyuan Liu, Fangming Liu, Maosong Sun

9 pages, 4 figures

Abstract: Agents powered by large language models have shown remarkable abilities in solving complex tasks. However, most agent systems remain reactive, limiting their effectiveness in scenarios requiring foresight and autonomous decision-making. In this paper, we tackle the challenge of developing proactive agents capable of anticipating and initiating tasks without explicit human instructions. We propose a novel data-driven approach for this problem. Firstly, we collect real-world human activities to generate proactive task predictions. These predictions are then labeled by human annotators as either accepted or rejected. The labeled data is used to train a reward model that simulates human judgment and serves as an automatic evaluator of the proactiveness of LLM agents. Building on this, we develop a comprehensive data generation pipeline to create a diverse dataset, ProactiveBench, containing 6,790 events. Finally, we demonstrate that fine-tuning models with the proposed ProactiveBench can significantly elicit the proactiveness of LLM agents. Experimental results show that our fine-tuned model achieves an F1-Score of 66.47% in proactively offering assistance, outperforming all open-source and closed-source models. These results highlight the potential of our method in creating more proactive and effective agent systems, paving the way for future advancements in human-agent collaboration.

Submitted to arXiv on 16 Oct. 2024

Results of the summarizing process for the arXiv paper: 2410.12361v3

In their paper "Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance," Yaxi Lu, Shenzhi Yang, Cheng Qian, Guirong Chen, Qinyu Luo, Yesai Wu, Huadong Wang, Xin Cong, Zhong Zhang, Yankai Lin, Weiwen Liu, Yasheng Wang, Zhiyuan Liu, Fangming Liu and Maosong Sun address the limitations of reactive agent systems with a data-driven approach to building proactive agents that anticipate and initiate tasks without explicit human instructions. The authors collect real-world human activities to generate proactive task predictions, which human annotators label as accepted or rejected. The labeled data is used to train a reward model that simulates human judgment and automatically evaluates the proactiveness of Large Language Model (LLM) agents. Building on this, the authors develop a data generation pipeline that produces ProactiveBench, a diverse dataset containing 6,790 events. Fine-tuning models on ProactiveBench significantly improves their proactiveness: the fine-tuned model achieves an F1-Score of 66.47% in proactively offering assistance, outperforming all open-source and closed-source models. These results highlight the method's potential for creating more proactive and effective agent systems and for advancing human-agent collaboration in scenarios that require foresight and autonomous decision-making.
Created on 13 Apr. 2025
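
For readers unfamiliar with the headline metric, the following is a minimal sketch of how an F1-Score is computed for a binary "offer help or stay silent" decision. It uses the standard definition of F1 (the harmonic mean of precision and recall). The function name, the variable names and the toy data are assumptions made for illustration; the paper's metadata does not describe the exact protocol used to decide whether a prediction counts as correct.

```python
def f1_score(offered, wanted):
    """Standard F1 for a binary decision.

    offered[i] -- True if the agent proactively offered help on event i
    wanted[i]  -- True if help on event i was judged acceptable
    Both lists are hypothetical inputs for this illustration.
    """
    pairs = list(zip(offered, wanted))
    tp = sum(1 for o, w in pairs if o and w)        # help offered and wanted
    fp = sum(1 for o, w in pairs if o and not w)    # help offered but unwanted
    fn = sum(1 for o, w in pairs if not o and w)    # help wanted but not offered

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy data (not from the paper): five events.
offered = [True, True, False, True, False]
wanted = [True, False, False, True, True]

score = f1_score(offered, wanted)
print(f"F1 = {score:.4f}")
```

In this toy case there are two true positives, one false positive and one false negative, so precision and recall are both 2/3 and F1 is 2/3. An agent that offers help on every event maximizes recall but loses precision, which is why F1 is a common choice for judging proactive behavior.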
