Apply ASL-3 safeguards when a model may reach the ASL-3 threshold

Anthropic’s Responsible Scaling Policy commits to applying ASL-3 Security and Deployment Standards before deploying a model that may have crossed the corresponding capability threshold. On 2025-05-22 Anthropic activated ASL-3 protections with the launch of Claude Opus 4.

Committed 2023-09-19
Due when a model may reach the ASL-3 capability threshold
Evaluated 2025-05-22
Ruling met

Why this ruling

Claude Opus 4 was the first Anthropic model deployed under ASL-3; Anthropic applied the standard as a precautionary measure without definitively determining the threshold had been crossed.

Source: Anthropic ↗
Committed: 2023-09-19
As of: 2026-06-19

Cite this commitment

Overdue. "Apply ASL-3 safeguards when a model may reach the ASL-3 threshold." Overdue, 2025. https://overduetracker.org/c/anthropic-asl3-opus4 (retrieved 2026-06-19).