All notes

AI

May 16, 2026

Frontier AI Models Have Effectively Broken the Open CTF Competition Format

Large language models now solve capture-the-flag challenges at a level that undermines open CTF competition integrity, forcing the security community to rethink how these contests are structured.

The open CTF format assumes a level playing field where human skill is the primary differentiator. Frontier AI models break that assumption.

Modern LLMs can reason through cryptography puzzles, reverse engineering challenges, and web exploitation tasks fast enough to invalidate the traditional scoring dynamic. When a team can offload substantial portions of a CTF to an AI assistant, the competition stops measuring individual or team competence and starts measuring access to capable models and the skill to prompt them effectively.

The argument laid out is not that AI-assisted CTF solving is new — players have used scripts and automation forever. The shift is one of degree: frontier models handle the ambiguous, creative reasoning steps that previously required deep human expertise. Automation historically handled the mechanical parts. Now the hard parts are automated too.

This creates a structural problem for organizers. Open CTFs publish challenges publicly during the competition window. Any player can feed a challenge directly into a frontier model. There is no practical enforcement boundary short of banning internet access entirely, which is unenforceable in remote formats and alienating in on-site ones.

For the security industry, CTF performance has functioned as a credentialing signal. Recruiters and hiring managers treat podium finishes and high rankings as evidence of technical depth. That signal degrades when results reflect model access as much as human skill.

Possible responses include closed, in-person formats with strict device controls, AI-native divisions that treat model use as a first-class tool rather than a prohibited one, or challenge designs that resist LLM solving by requiring physical hardware interaction or real-time adversarial defense.

None of those options preserve the current format intact. The open CTF as it exists today is a casualty of the same capability jump that is reshaping software development, security research, and engineering hiring broadly.