Large Language Models As Evolution Strategies

AI-generated keywords: Large Language Models Evolution Strategies Black-box Optimization Prompting Strategy EvoLLM

AI-generated Key Points

  • Authors: Robert Tjarko Lange, Yingtao Tian, and Yujin Tang
  • Study focus: Large Language Models (LLMs) in implementing evolutionary optimization algorithms
  • Novel prompting strategy for zero-shot application of LLMs to black-box optimization
  • Resulting approach 'EvoLLM' outperforms traditional algorithms like random search and Gaussian Hill Climbing
  • Potential of LLMs as 'plug-ins' for recombination operations
  • Comparative analyses on model size, prompt strategy, and context construction
  • Enhancement of EvoLLM's performance through instruction fine-tuning based on teacher algorithm information
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Robert Tjarko Lange, Yingtao Tian, Yujin Tang

11 pages, 14 figures
License: CC BY 4.0

Abstract: Large Transformer models are capable of implementing a plethora of so-called in-context learning algorithms. These include gradient descent, classification, sequence completion, transformation, and improvement. In this work, we investigate whether large language models (LLMs), which never explicitly encountered the task of black-box optimization, are in principle capable of implementing evolutionary optimization algorithms. While previous works have solely focused on language-based task specification, we move forward and focus on the zero-shot application of LLMs to black-box optimization. We introduce a novel prompting strategy, consisting of least-to-most sorting of discretized population members and querying the LLM to propose an improvement to the mean statistic, i.e. perform a type of black-box recombination operation. Empirically, we find that our setup allows the user to obtain an LLM-based evolution strategy, which we call `EvoLLM', that robustly outperforms baseline algorithms such as random search and Gaussian Hill Climbing on synthetic BBOB functions as well as small neuroevolution tasks. Hence, LLMs can act as `plug-in' in-context recombination operators. We provide several comparative studies of the LLM's model size, prompt strategy, and context construction. Finally, we show that one can flexibly improve EvoLLM's performance by providing teacher algorithm information via instruction fine-tuning on previously collected teacher optimization trajectories.

Submitted to arXiv on 28 Feb. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2402.18381v1

In their work titled "Large Language Models As Evolution Strategies," authors Robert Tjarko Lange, Yingtao Tian, and Yujin Tang explore the capabilities of large language models (LLMs) in implementing evolutionary optimization algorithms. This study delves into the zero-shot application of LLMs to black-box optimization by introducing a novel prompting strategy that involves sorting discretized population members from least to most and querying the LLM for improvements. The resulting approach, dubbed 'EvoLLM,' outperforms traditional algorithms like random search and Gaussian Hill Climbing on synthetic BBOB functions and small neuroevolution tasks. This highlights the potential of LLMs as 'plug-ins' for in-context recombination operations. The paper also includes comparative analyses on factors such as model size, prompt strategy, and context construction. Additionally, the authors demonstrate that EvoLLM's performance can be enhanced by providing teacher algorithm information through instruction fine-tuning based on previously collected teacher optimization trajectories. Overall, this study showcases the adaptability and effectiveness of LLMs in improving optimization strategies.
Created on 01 Mar. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.