Context Matters: Evaluating Context Strategies for Automated ADR Generation Using LLMs

AI-generated keywords: Architecture Decision Records (ADRs)

AI-generated Key Points

  • Architecture Decision Records (ADRs) are crucial for preserving system design rationale
  • Large Language Models (LLMs) can help alleviate the burden of creating and maintaining ADRs
  • Context-aware prompting enhances ADR generation fidelity
  • Recency-based context selection is recommended for automated ADR generation
  • Effective ADR automation relies more on context engineering than model scale alone

Authors: Aviral Gupta, Rudra Dhar, Daniel Feitosa, Karthik Vaidhyanathan

11 pages, 5 diagrams, Accepted at EASE Conference 2026 Research Track
License: CC BY 4.0

Abstract: Architecture Decision Records (ADRs) play a critical role in preserving the rationale behind system design, yet their creation and maintenance are often neglected due to the associated authoring overhead. This paper investigates whether Large Language Models (LLMs) can mitigate this burden and, more importantly, how different strategies for presenting historical ADRs as context influence generation quality. We curate and validate a large corpus of sequential ADRs drawn from 750 open-source repositories and systematically evaluate five context selection strategies (no context, All-history, First-K, Last-K, and RAFG) across multiple model families. Our results show that context-aware prompting substantially improves ADR generation fidelity, with a small recency window (typically 3-5 prior records) providing the best balance between quality and efficiency. Retrieval-based context selection yields marginal gains primarily in non-sequential or cross-cutting decision scenarios, while offering no statistically significant advantage in typical linear ADR workflows. Overall, our findings demonstrate that context engineering, rather than model scale alone, is the dominant factor in effective ADR automation, and we outline practical defaults for tool builders along with targeted retrieval fallbacks for complex architectural settings.
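The four position-based strategies named in the abstract (no context, All-history, First-K, Last-K) can be illustrated with a short sketch. This is not the authors' implementation: the function names, the strategy labels, the prompt wording, and the default `k = 3` (taken from the low end of the reported 3-5 window) are all assumptions made for illustration. RAFG is left out because the abstract does not describe how it works.

```python
from typing import List


def select_context(history: List[str], strategy: str, k: int = 3) -> List[str]:
    """Pick prior ADRs to show the model; `history` is ordered oldest first.

    Strategy labels are hypothetical names for the options in the abstract.
    """
    if strategy == "none":
        return []
    if strategy == "all":
        return list(history)
    if strategy == "first_k":
        return history[:k]
    if strategy == "last_k":
        # Guard k == 0: history[-0:] would return the whole list.
        return history[-k:] if k > 0 else []
    raise ValueError(f"unknown strategy: {strategy}")


def build_prompt(decision_summary: str, context: List[str]) -> str:
    """Assemble a plain-text prompt; the wording is an assumption."""
    parts = ["Prior ADRs:"]
    parts.extend(context if context else ["(none)"])
    parts.append("Write the next ADR for this decision:")
    parts.append(decision_summary)
    return "\n".join(parts)


if __name__ == "__main__":
    history = ["ADR-1", "ADR-2", "ADR-3", "ADR-4", "ADR-5"]
    recent = select_context(history, "last_k", k=3)
    print(build_prompt("Adopt a message queue for order events", recent))
```

Under these assumptions, Last-K is a single slice over the ordered history, which matches the summary's point that recency-based selection has a low implementation barrier.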

Submitted to arXiv on 04 Apr. 2026

Results of the summarizing process for the arXiv paper: 2604.03826v2

Architecture Decision Records (ADRs) are crucial for preserving the rationale behind system design, but their creation and maintenance often face neglect due to authoring overhead. This study explores how Large Language Models (LLMs) can alleviate this burden and examines different strategies for presenting historical ADRs as context to enhance generation quality. By analyzing a vast corpus of sequential ADRs from open-source repositories, five context selection strategies were evaluated across various model families. The results indicate that context-aware prompting significantly enhances ADR generation fidelity, with a small recency window (typically 3-5 prior records) striking the best balance between quality and efficiency.

Retrieval-based context selection offers marginal gains in non-sequential or cross-cutting decision scenarios but does not show significant advantages in linear ADR workflows. The study emphasizes that effective ADR automation relies more on context engineering than model scale alone. Furthermore, the longitudinal analysis reveals that foundational decisions shape system structure, while subsequent decisions evolve based on their immediate predecessors. The RAFG strategy excels in addressing cross-cutting concerns that span multiple components or reactivate dormant architectural patterns, emphasizing the importance of considering architectural scope in context selection.

The study also identifies common documentation issues such as external content dependency and knowledge vaporization affecting ADR quality. Practitioners are advised to prioritize recency-based context selection as a default strategy for automated ADR generation, leveraging simpler approaches like Last-K to reduce implementation barriers. Model scale is found to be less critical than previously assumed, with compact models demonstrating comparable quality when provided with appropriate context.

Organizations are encouraged to maintain self-contained architectural documentation to enhance both automated tool performance and long-term utility. Addressing incomplete documentation through automated generation can help recover undocumented architectural decisions and mitigate documentation debt effectively. In conclusion, this research provides valuable insights for practitioners implementing automated ADR generation, highlighting the significance of strategic factors like context selection, model scale considerations, and comprehensive documentation practices in optimizing the effectiveness of automated tools for architectural knowledge management.
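The recommendation to default to recency and fall back to retrieval for cross-cutting decisions could look roughly like the sketch below. It is a hypothetical illustration, not the paper's RAFG method: the word-overlap (Jaccard) ranking is a simple stand-in for whatever retrieval the authors use, and the `cross_cutting` flag, the function names, and the sample ADR titles are all invented for this example.

```python
import re
from typing import List, Set


def tokens(text: str) -> Set[str]:
    """Lowercased alphanumeric word set; a deliberately crude tokenizer."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def retrieve_top_k(history: List[str], query: str, k: int = 3) -> List[str]:
    """Return the k ADRs most similar to the query, in chronological order."""
    q = tokens(query)
    ranked = sorted(
        range(len(history)),
        key=lambda i: jaccard(tokens(history[i]), q),
        reverse=True,
    )
    return [history[i] for i in sorted(ranked[:k])]


def choose_context(history: List[str], query: str, k: int = 3,
                   cross_cutting: bool = False) -> List[str]:
    """Last-K by default; similarity retrieval only when flagged cross-cutting.

    How a decision gets flagged is left to the caller (an assumption here).
    """
    if cross_cutting:
        return retrieve_top_k(history, query, k)
    return history[-k:] if k > 0 else []


if __name__ == "__main__":
    history = [
        "Use PostgreSQL for the orders database",
        "Adopt React for the web frontend",
        "Add Redis cache in front of the orders database",
        "Use GitHub Actions for CI",
        "Switch logging to structured JSON",
    ]
    query = "Migrate the orders database to a managed PostgreSQL service"
    print(choose_context(history, query, k=2))
    print(choose_context(history, query, k=2, cross_cutting=True))
```

In this toy history, the recency window returns the two newest records regardless of topic, while the fallback surfaces the older database-related records, which is the kind of dormant-pattern case the summary says retrieval helps with.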
Created on 22 Apr. 2026
