Anthropic’s Responsible Scaling Policy v1.0 committed the company to writing its ASL-4 measures before any model reached ASL-3 capabilities.
- Committed 2023-09-19
- Due before any model reaches ASL-3
- Evaluated —
- Ruling —
Why this ruling
The original v1.0 trigger is documented. That trigger has since been reached: Anthropic activated ASL-3 with Claude Opus 4 on 2025-05-22, and v3.0 (Feb 2026) restructured the policy away from the ASL-4 framing. Whether the loosely specified ASL-4 commitment was satisfied is genuinely disputed.
Cite this commitment
Overdue. "Define ASL-4 safeguards before reaching ASL-3." Overdue, 2023. https://overduetracker.org/c/anthropic-asl4-before-asl3 (retrieved 2026-06-19).