Language Models as Agent Models

AI-generated keywords: Language Models, Agent Models, Goal-Directed, Intentional Communication, Representations

AI-generated Key Points

The license of the paper does not allow us to build upon its content, so the key points are generated from the paper metadata rather than the full article.

  • Language models (LMs) can serve as models of intentional communication in a narrow sense
  • LMs can infer and represent properties of the agent likely to have produced a given textual context
  • LMs can infer and use representations of fine-grained communicative intentions and of more abstract beliefs and goals
  • LMs have the potential to act as building blocks for systems that communicate and act intentionally
  • The paper challenges the argument that LMs are incapable of modeling goal-directed aspects of human language production and comprehension

Authors: Jacob Andreas

Abstract: Language models (LMs) are trained on collections of documents, written by individual human agents to achieve specific goals in an outside world. During training, LMs have access only to text of these documents, with no direct evidence of the internal states of the agents that produced them -- a fact often used to argue that LMs are incapable of modeling goal-directed aspects of human language production and comprehension. Can LMs trained on text learn anything at all about the relationship between language and use? I argue that LMs are models of intentional communication in a specific, narrow sense. When performing next word prediction given a textual context, an LM can infer and represent properties of an agent likely to have produced that context. These representations can in turn influence subsequent LM generation in the same way that agents' communicative intentions influence their language. I survey findings from the recent literature showing that -- even in today's non-robust and error-prone models -- LMs infer and use representations of fine-grained communicative intentions and more abstract beliefs and goals. Despite the limited nature of their training data, they can thus serve as building blocks for systems that communicate and act intentionally.

Submitted to arXiv on 03 Dec. 2022

Results of the summarizing process for the arXiv paper: 2212.01681v1

This paper's license does not allow us to build upon its content, so the summary below is generated from the paper's metadata rather than the full article.

In the paper "Language Models as Agent Models", Jacob Andreas asks whether language models (LMs) trained only on text can learn anything about the relationship between language and use. LMs are trained on collections of documents written by human agents to achieve specific goals in the outside world, but they see only the text of those documents, with no direct evidence of the internal states of the agents that produced them. This fact is often used to argue that LMs cannot model goal-directed aspects of human language production and comprehension. The author argues instead that LMs are models of intentional communication in a specific, narrow sense. When predicting the next word from a given textual context, an LM can infer and represent properties of the agent likely to have produced that context. These representations can then influence subsequent LM generation in the same way that agents' communicative intentions influence their language. The author surveys recent findings showing that, even in today's non-robust and error-prone models, LMs infer and use representations of fine-grained communicative intentions as well as more abstract beliefs and goals. Despite the limited nature of their training data, LMs can therefore serve as building blocks for systems that communicate and act intentionally.
Created on 22 Nov. 2023
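
The central claim of the summary, that next-word prediction can involve inferring properties of the agent behind a context, can be illustrated with a toy latent-variable sketch. This is not taken from the paper (only its metadata is available here). The agent names, vocabulary, and probabilities are hypothetical, and nothing here implies that real LMs perform this computation explicitly.

```python
from math import prod

# Hypothetical agents: each is a unigram distribution over a tiny vocabulary.
AGENTS = {
    "optimist": {"good": 0.7, "bad": 0.1, "day": 0.2},
    "pessimist": {"good": 0.1, "bad": 0.7, "day": 0.2},
}
# Assumed uniform prior over which agent wrote the text.
PRIOR = {"optimist": 0.5, "pessimist": 0.5}


def agent_posterior(context):
    """Return p(agent | context) by Bayes' rule under the toy likelihoods."""
    joint = {
        agent: PRIOR[agent] * prod(dist[word] for word in context)
        for agent, dist in AGENTS.items()
    }
    total = sum(joint.values())
    return {agent: value / total for agent, value in joint.items()}


def next_word_distribution(context):
    """Return p(word | context) as a posterior-weighted mixture over agents."""
    posterior = agent_posterior(context)
    vocab = next(iter(AGENTS.values())).keys()
    return {
        word: sum(posterior[agent] * AGENTS[agent][word] for agent in AGENTS)
        for word in vocab
    }


if __name__ == "__main__":
    context = ["good", "good"]
    print(agent_posterior(context))
    print(next_word_distribution(context))
```

With the context `["good", "good"]`, the posterior shifts strongly toward the hypothetical "optimist" agent (about 0.98), which in turn raises the predicted probability of "good" as the next word (about 0.69, versus 0.4 with an empty context). The inferred agent property influences generation only through the prediction itself, which is the narrow sense of agent modeling the summary describes.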
