OpenAI safety capability thresholds

AI R&D (High): OpenAI publicly reports (in a System Card, blog post, or Preparedness Framework update) that one of its models meets or exceeds the 'High' capability threshold for 'Autonomous AI Research & Development' (previously 'AI Self-Improvement'). The 'High' threshold is defined in the Preparedness Framework as equivalent to a 'performant mid-career ML research engineer', evaluated via benchmarks including Monorepo-Bench, OpenAI-Proof Q&A, and MLE-Bench.

Critical Bio/Chem: OpenAI publicly classifies one of its models as 'Critical' in the Biological/Chemical capability category in a System Card, safety report, or Preparedness Framework assessment.

Critical Cyber: OpenAI publicly classifies one of its models as 'Critical' in the Cybersecurity capability category in a System Card, safety report, or Preparedness Framework assessment.

OpenAI safety capability thresholds

0 comments