What we include (and what we don't)
- A specific, dated public promise — a calendar deadline or a falsifiable trigger. Vague or aspirational pledges are excluded.
- For the main board, a promise the lab itself made or signed (RSP/Preparedness/Frontier-Safety milestones, self-imposed deadlines, the Seoul and White House voluntary commitments). Government laws (e.g. the EU AI Act) are not promises a lab made; they appear separately as regulatory milestones, shown as countdowns and never scored as kept or broken.
- One rock-solid public source per row (primary preferred); a "missed" ruling requires especially strong sourcing.
- Neutral, factual phrasing; genuinely debatable rulings are flagged ⚠ contested.
How rulings are made
Met / Missed / Partial are human judgments, each backed by one public source. Overdue / Upcoming are computed automatically from the deadline versus today, so those timers stay current on their own. Genuinely disputable rulings are flagged ⚠ contested and phrased as open questions.
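The split between curated rulings and computed timers can be sketched as follows. This is a minimal illustration, not the tracker's actual code: the function names, the lowercase status strings, the rule that a human ruling takes precedence over the timer, and the choice to treat the deadline day itself as still upcoming are all assumptions made for the example.

```python
from datetime import date
from typing import Optional

# Statuses assigned by a human editor (assumed spelling for this sketch).
HUMAN_RULINGS = {"met", "missed", "partial"}


def display_status(deadline: date, ruling: Optional[str], today: date) -> str:
    """Return the status to show for one commitment row.

    Assumption: a curated ruling, when present, is shown as-is;
    otherwise the status is derived from the deadline versus today.
    """
    if ruling is not None:
        if ruling not in HUMAN_RULINGS:
            raise ValueError(f"unknown ruling: {ruling!r}")
        return ruling
    # Assumption: the deadline day itself still counts as upcoming.
    return "overdue" if today > deadline else "upcoming"


def days_remaining(deadline: date, today: date) -> int:
    """Days until the deadline; negative once the deadline has passed."""
    return (deadline - today).days
```

Passing `today` explicitly rather than calling `date.today()` inside the function keeps the sketch deterministic; a live page would supply the current date on each render, which is what lets the timers stay current without anyone editing the curated rulings.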
Editorial principles
Phrasing is strictly factual and neutral: we state the deadline and what shipped by it, and avoid editorializing verbs. A ruling ships only with a rock-solid source. Statuses are curated as of the "updated" date on the board; only the timers update live.
Data, license & how to cite
The dataset (src/data/*, /commitments.json, /commitments.csv) is licensed CC BY 4.0 — republish freely with attribution. The code is MIT. Nonprofits, journalists, and researchers are welcome to reuse it.
Suggested citation:
Overdue: Frontier AI Safety Commitment Tracker. https://overduetracker.org (retrieved 2026-06-19).
Download: JSON · CSV. Explore: table view.
When a ruling changes or an error is fixed, it is logged on Corrections.
Spotted an error? Open a GitHub issue.
How Overdue differs (related trackers)
Overdue complements existing work rather than replacing it. The Midas Project's Seoul Tracker grades one collective deadline, and its Watchtower flags quiet policy changes; METR's index catalogs policy documents; the FLI AI Safety Index and SaferAI grade overall posture. AI Lab Watch compiled the broadest commitment list, but has been unmaintained since September 2025. Overdue's contribution is to bring several commitment regimes (RSPs, frontier safety frameworks, the Seoul and White House commitments) together under a live, per-promise deadline clock: each individual dated commitment gets a status, an explicit upcoming / overdue countdown, and one source. It is not the first accountability project.