ATLANTIS: AI-driven Threat Localization, Analysis, and Triage Intelligence System

AI-generated keywords: DARPA's AI Cyber Challenge

AI-generated Key Points

  • Team Atlanta won the DARPA AI Cyber Challenge at DEF CON 33 with their cyber reasoning system, ATLANTIS
  • ATLANTIS autonomously discovers and patches vulnerabilities in software at high speed and scale
  • The system integrates large language models with advanced program analysis techniques like symbolic execution, directed fuzzing, and static analysis
  • Developed collaboratively by researchers from Georgia Tech, Samsung Research, KAIST, and POSTECH
  • ATLANTIS excels in scaling across diverse codebases while producing semantically correct patches
  • Notable components include BULLSEYE for directed fuzzing and DEEPGENERATOR for deep learning model generation
  • Infrastructure includes sinkpoint-aware fuzzing loops, ensemble techniques, distributed design elements, and comprehensive component overview
  • ATLANTIS-Java extends capabilities with LibAFL-based Jazzer tools
  • ExpKit module provides essential functionalities like Sink Finder, Path Finder, PoV Generator to enhance vulnerability detection and patch generation processes within ATLANTIS
  • Multilang aspect introduces SymState for concolic execution purposes along with shared utilities & libraries like FUNCTION TRACER and CODE RETRIEVER
  • Core components PRISM and MARTIAN agents play crucial roles in enhancing performance through innovative workflows
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Taesoo Kim, HyungSeok Han, Soyeon Park, Dae R. Jeong, Dohyeok Kim, Dongkwan Kim, Eunsoo Kim, Jiho Kim, Joshua Wang, Kangsu Kim, Sangwoo Ji, Woosun Song, Hanqing Zhao, Andrew Chin, Gyejin Lee, Kevin Stevens, Mansour Alharthi, Yizhuo Zhai, Cen Zhang, Joonun Jang, Yeongjin Jang, Ammar Askar, Dongju Kim, Fabian Fleischer, Jeongin Cho, Junsik Kim, Kyungjoon Ko, Insu Yun, Sangdon Park, Dowoo Baik, Haein Lee, Hyeon Heo, Minjae Gwon, Minjae Lee, Minwoo Baek, Seunggi Min, Wonyoung Kim, Yonghwi Jin, Younggi Park, Yunjae Choi, Jinho Jung, Gwanhyun Lee, Junyoung Jang, Kyuheon Kim, Yeonghyeon Cha, Youngjoon Kim

Version 1.0 (September 17, 2025). Technical Report. Team Atlanta -- 1st place in DARPA AIxCC Final Competition. Project page: https://team-atlanta.github.io/
License: CC BY 4.0

Abstract: We present ATLANTIS, the cyber reasoning system developed by Team Atlanta that won 1st place in the Final Competition of DARPA's AI Cyber Challenge (AIxCC) at DEF CON 33 (August 2025). AIxCC (2023-2025) challenged teams to build autonomous cyber reasoning systems capable of discovering and patching vulnerabilities at the speed and scale of modern software. ATLANTIS integrates large language models (LLMs) with program analysis -- combining symbolic execution, directed fuzzing, and static analysis -- to address limitations in automated vulnerability discovery and program repair. Developed by researchers at Georgia Institute of Technology, Samsung Research, KAIST, and POSTECH, the system addresses core challenges: scaling across diverse codebases from C to Java, achieving high precision while maintaining broad coverage, and producing semantically correct patches that preserve intended behavior. We detail the design philosophy, architectural decisions, and implementation strategies behind ATLANTIS, share lessons learned from pushing the boundaries of automated security when program analysis meets modern AI, and release artifacts to support reproducibility and future research.

Submitted to arXiv on 18 Sep. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2509.14589v1

In the Final Competition of DARPA's AI Cyber Challenge (AIxCC) at DEF CON 33 in August 2025, Team Atlanta emerged victorious with their cyber reasoning system, ATLANTIS. This system was designed to autonomously discover and patch vulnerabilities in software at the speed and scale demanded by modern technology. By integrating large language models (LLMs) with advanced program analysis techniques such as symbolic execution, directed fuzzing, and static analysis, ATLANTIS overcame existing limitations in automated vulnerability discovery and program repair. Developed collaboratively by researchers from Georgia Institute of Technology, Samsung Research, KAIST, and POSTECH, ATLANTIS tackled key challenges in cybersecurity. It successfully scaled across diverse codebases ranging from C to Java while maintaining high precision and broad coverage. The system excelled in producing semantically correct patches that preserved the intended behavior of the software. One notable component of ATLANTIS is BULLSEYE, a directed fuzzing tool that enhances vulnerability discovery. The system also includes infrastructure for sinkpoint-aware fuzzing loops, ensemble fuzzing techniques, sinkpoint identification and management strategies, distributed design elements, and a comprehensive component overview. ATLANTIS-Java further extends the capabilities of the system by incorporating LibAFL-based Jazzer tools for enhanced fuzzing capabilities. DEEPGENERATOR is utilized for generating deep learning models to improve sinkpoint-focused directed fuzzing efforts. Additionally, CodeQL-enhanced sink detection methods and static analysis techniques are employed for distance computation. The ExpKit module provides essential functionalities such as Sink Finder, Sink Manager, Path Finder, PoV Generator among others to enhance vulnerability detection and patch generation processes within ATLANTIS. Furthermore, the Multilang aspect of ATLANTIS introduces SymState for concolic execution purposes along with discussions on its implementation details. The Multilang-LLM-Agent (MLLA) component offers standalone functionality while shared utilities & libraries like FUNCTION TRACER and CODE RETRIEVER contribute to overall system efficiency. Lastly, core components like PRISM and MARTIAN agents play crucial roles in enhancing the performance of ATLANTIS through innovative workflows and key architectural concepts. The detailed architecture of these agents showcases how they interact within the broader framework of ATLANTIS to achieve optimal results in automated security practices.
Created on 19 Sep. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.