Artificial Impressions: Evaluating Large Language Model Behavior Through the Lens of Trait Impressions

AI-generated keywords: Artificial Impressions Large Language Models Trait Impressions Stereotype Content Model Linear Probes

AI-generated Key Points

⚠The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

Authors Nicholas Deas and Kathleen McKeown introduce the concept of artificial impressions within Large Language Models (LLMs)
Artificial impressions are patterns in LLMs' internal representations of prompts that mirror human impressions and stereotypes based on language
Linear probes are used to predict these impressions according to the Stereotype Content Model (SCM)
The study explores the connection between artificial impressions and downstream model behavior
Inconsistency in LLMs' reporting of impressions when prompted, contrasting with reliable linear decodability from hidden representations
Artificial impressions derived from prompts can forecast quality and utilization of hedging in model responses
Analysis of how specific content, stylistic elements, and dialectal characteristics in prompts impact LLM impressions

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Nicholas Deas, Kathleen McKeown

arXiv: 2510.08915v1 - DOI (cs.CL)

EMNLP 2025 Camera Ready

License: NONEXCLUSIVE-DISTRIB 1.0

Abstract: We introduce and study artificial impressions--patterns in LLMs' internal representations of prompts that resemble human impressions and stereotypes based on language. We fit linear probes on generated prompts to predict impressions according to the two-dimensional Stereotype Content Model (SCM). Using these probes, we study the relationship between impressions and downstream model behavior as well as prompt features that may inform such impressions. We find that LLMs inconsistently report impressions when prompted, but also that impressions are more consistently linearly decodable from their hidden representations. Additionally, we show that artificial impressions of prompts are predictive of the quality and use of hedging in model responses. We also investigate how particular content, stylistic, and dialectal features in prompts impact LLM impressions.

Submitted to arXiv on 10 Oct. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

⚠The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2510.08915v1

⚠This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

Comprehensive Summary
Key points
Layman's Summary
Blog article

In their paper titled "Artificial Impressions: Evaluating Large Language Model Behavior Through the Lens of Trait Impressions," authors Nicholas Deas and Kathleen McKeown introduce and explore the concept of artificial impressions within Large Language Models (LLMs). These artificial impressions are patterns in LLMs' internal representations of prompts that mirror human impressions and stereotypes based on language. The authors employ linear probes on generated prompts to predict these impressions according to the two-dimensional Stereotype Content Model (SCM). Through their research, they delve into the connection between these impressions and downstream model behavior, as well as investigate prompt features that may influence such impressions. The study reveals intriguing findings, including the inconsistency in LLMs' reporting of impressions when prompted, contrasting with the more reliable linear decodability of impressions from their hidden representations. Moreover, the authors demonstrate that artificial impressions derived from prompts can forecast the quality and utilization of hedging in model responses. Additionally, they analyze how specific content, stylistic elements, and dialectal characteristics in prompts impact LLM impressions. This comprehensive exploration sheds light on the intricate interplay between language models, human perceptions, stereotypes, and linguistic nuances. By delving into artificial impressions within LLMs, Deas and McKeown provide valuable insights into understanding and evaluating large language model behavior through a unique lens of trait impressions.

- Authors Nicholas Deas and Kathleen McKeown introduce the concept of artificial impressions within Large Language Models (LLMs)
- Artificial impressions are patterns in LLMs' internal representations of prompts that mirror human impressions and stereotypes based on language
- Linear probes are used to predict these impressions according to the Stereotype Content Model (SCM)
- The study explores the connection between artificial impressions and downstream model behavior
- Inconsistency in LLMs' reporting of impressions when prompted, contrasting with reliable linear decodability from hidden representations
- Artificial impressions derived from prompts can forecast quality and utilization of hedging in model responses
- Analysis of how specific content, stylistic elements, and dialectal characteristics in prompts impact LLM impressions

SummaryAuthors Nicholas Deas and Kathleen McKeown talk about artificial impressions in Big Language Models (LLMs). Artificial impressions are patterns in LLMs that reflect human impressions and stereotypes based on language. Linear probes are used to predict these impressions following the Stereotype Content Model (SCM). The study looks at how artificial impressions affect the behavior of models. Inconsistencies exist in LLMs' reporting of impressions when prompted, unlike the reliable linear decodability from hidden representations. Definitions- Artificial Impressions: Patterns or ideas created by machines that resemble human thoughts and stereotypes. - Large Language Models (LLMs): Complex computer programs that understand and generate human language. - Linear Probes: Methods used to predict certain characteristics or patterns within models. - Stereotype Content Model (SCM): A framework for understanding stereotypes based on their content and meaning. - Downstream Model Behavior: How a model behaves or performs based on its internal processes and inputs.

Introduction

Artificial intelligence (AI) has made significant strides in recent years, particularly in the field of natural language processing (NLP). Large Language Models (LLMs) have become increasingly popular due to their ability to generate human-like text and perform a wide range of NLP tasks. However, as these models continue to advance, it is essential to understand and evaluate their behavior through various lenses. In their paper titled "Artificial Impressions: Evaluating Large Language Model Behavior Through the Lens of Trait Impressions," authors Nicholas Deas and Kathleen McKeown introduce a new perspective for evaluating LLMs - artificial impressions.

The Concept of Artificial Impressions

The concept of artificial impressions refers to patterns within LLMs' internal representations that mirror human impressions and stereotypes based on language. These impressions are derived from prompts given to the model, which influence its output. The authors use linear probes on generated prompts to predict these impressions according to the Stereotype Content Model (SCM), which categorizes traits into two dimensions - warmth and competence.

Methodology

To explore this concept further, Deas and McKeown conducted experiments using three large-scale LLMs - GPT-3, BERT-Large, and RoBERTa-Large. They used linear probes trained on top layers of each model's hidden representations to predict trait impressions from generated prompts. The study also included an analysis of prompt features such as content, style elements, and dialectal characteristics that may influence these artificial impressions.

Predicting Impressions with Linear Probes

The authors found that linear decodability was more reliable than direct reporting when predicting trait impressions from hidden representations in all three models. This suggests that while LLMs may not always report consistent or accurate trait impressions when prompted directly, they still encode these impressions in their internal representations.

Impact of Artificial Impressions on Downstream Model Behavior

Deas and McKeown also investigated the connection between artificial impressions and downstream model behavior. They found that prompts with higher warmth scores tended to generate more hedging in model responses, while those with higher competence scores generated less hedging. This suggests that LLMs' artificial impressions can influence their output and may have implications for tasks such as text generation and sentiment analysis.

Analysis of Prompt Features

The authors also analyzed how specific prompt features impact LLM impressions. They found that content, stylistic elements, and dialectal characteristics all play a role in shaping these artificial impressions. For example, prompts containing words related to emotions or social relationships were more likely to elicit warmth-related traits from the models.

Content Analysis

Deas and McKeown conducted a content analysis of prompts using topic modeling techniques. They found that certain topics were associated with different trait impressions across all three models. For instance, prompts related to sports tended to elicit competence-related traits, while those related to family elicited warmth-related traits.

Stylistic Elements

The study also looked at how different stylistic elements in prompts influenced LLM impressions. The authors found that certain linguistic cues such as negation or adjectives had a significant impact on trait predictions. Additionally, they observed differences in impression predictions based on sentence structure - declarative vs interrogative sentences.

Dialectal Characteristics

Finally, Deas and McKeown examined how dialectal characteristics in prompts affected LLM impression predictions. They found that models trained on data from specific regions or countries tended to predict traits associated with those regions when prompted with dialect-specific language.

Conclusion

In conclusion, Deas and McKeown's research on artificial impressions within LLMs provides valuable insights into understanding and evaluating large language model behavior. By exploring the connection between these impressions, downstream model behavior, and prompt features, the authors shed light on the intricate interplay between LLMs, human perceptions, stereotypes, and linguistic nuances. This study highlights the need for a more nuanced approach to evaluating LLMs and emphasizes the importance of considering various factors that may influence their output. As AI continues to advance, it is crucial to continue examining its impact on society through different lenses such as trait impressions.

Created on 28 Feb. 2026

Assess the quality of the AI-generated content by voting

Score: 0

Similar papers summarized with our AI tools

79.3%

AI-Augmented Surveys: Leveraging Large Language Models and Surveys for Opinio…

cs.CL

79.3%

Several categories of Large Language Models (LLMs): A Short Survey

cs.CL

78.6%

Large language models effectively leverage document-level context for literar…

cs.CL

78.0%

Challenges and Responses in the Practice of Large Language Models

cs.CL

78.0%

Large Language Models for Information Retrieval: A Survey

cs.CL

78.0%

Large Language Models for Generative Information Extraction: A Survey

cs.CL

77.7%

Augmented Language Models: a Survey

cs.CL

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.