Anthropic’s Responsible Scaling Policy commits to applying ASL-3 Security and Deployment Standards before deploying a model that may have crossed the corresponding capability threshold. On 2025-05-22 Anthropic activated ASL-3 protections with the launch of Claude Opus 4.
- Committed 2023-09-19
- Due when a model may reach the ASL-3 capability threshold
- Evaluated 2025-05-22
- Ruling met
Why this ruling
Claude Opus 4 was the first Anthropic model deployed under ASL-3; Anthropic applied the standard as a precautionary measure without definitively determining the threshold had been crossed.
Cite this commitment
Overdue. "Apply ASL-3 safeguards when a model may reach the ASL-3 threshold." Overdue, 2025. https://overduetracker.org/c/anthropic-asl3-opus4 (retrieved 2026-06-19).