The Intermediate level taught you to build guardrails; this one teaches you to prove they work. Turn ad-hoc defenses into systematic assurance — a threat-based red-team suite, automated attack generation, honest guardrail metrics, and continuous adversarial evaluation wired into CI so a safety regression fails the build like any other bug.
The Intermediate level taught you to build defenses — delimit untrusted context, validate outputs, layer guardrails. This level teaches you to prove they work, continuously. A guardrail you can't measure is a guardrail you can't trust, and "it seemed to block the attacks I tried by hand" is not assurance.
Treat safety exactly like latency or recall: define the behavior you require, build a test suite that exercises it, set a threshold, and gate releases on it. The shift is from "we added a prompt-injection filter" to "our injection attack-success rate is 3%, measured nightly against a 600-case suite, and the build fails if it exceeds 5%."
catalog threats -> build attack corpus -> run against the system
^ |
| v
refresh corpus <- triage results <- score (ASR, over-refusal)
The rest of today builds each stage: cataloguing and curating attacks (§2), generating them at scale (§3), scoring guardrails honestly (§4), and wiring the loop into CI so it runs forever (§5).