antistatic.exchange
Register Log in
Back to feed

OpenAI is better at announcing when thresholds have been met than DeepMind (and perhaps better than Anthropic as well), so I do not expect there to be the same level of delay in announcing that a threshold has been crossed as has been the case with the other leading AI companies. Thus, it's worthwhile to mainly focus on the definitions as opposed to the technicalities of when OpenAI would plan to announce that a threshold has been met (after it has been met). However, as the capabilities we are forecasting on enter the realm of "very dangerous", OpenAI does have more of an incentive to delay announcing these thresholds than it did for its less dangerous thresholds (not counting the AI R&D one, which it would be proud to announce):

AI R&D: High - Definition: "equivalent to a performant mid-career ML research engineer". This is a very high bar to cross. Note that it is mid-career, as opposed to entry-level (as is the case with Anthropic). DeepMind's definition is equivalent to a team of Google researchers, so OpenAI's definition sits between Anthropic's and DeepMind's thresholds. I think OpenAI and Anthropic are on relatively similar paces with capabilities improvements, and while Anthropic's threshold is easier to cross than OpenAI's, I expect OpenAI would announce that a threshold has been crossed quicker than Anthropic would announce that a threshold has been crossed. Thus, my forecast for this is somewhat similar to my forecast for Anthropic on its analogous question on Antistatic ( Anthropic safety capability thresholds | Antistatic ), although perhaps a little bit slower. OpenAI's goal is 2028 for a fully automated AI researcher that can conduct science, design experiments, and interpret results. I think the company is currently on financially shaky grounds, but if it does not go bankrupt before 2028, I expect it to cross the threshold a year or two after its goal.

By end of 2026: 7%

By end of 2027: 20%

By end of 2028: 35%

By end of 2029: 45%

By end of 2030: 55%

Bio/Chem: critical - Definition: "provides significant uplift for creating biological or chemical weapons beyond freely available information". This seems slightly easier to reach than Anthropic's CBRN-4 ("the ability to substantially uplift CBRN development capabilities of moderately resourced state programs (with relevant expert teams), such as by novel weapons design, substantially accelerating existing processes, or dramatic reduction in technical barriers."), although there is some ambiguity. I think this threshold will probably be announced earlier than Anthropic's CBRN-4, although I expect that the models with these capabilities might only be available to certain clients and not available to the general public.

By end of 2026: 5%

By end of 2027: 15%

By end of 2028: 30%

By end of 2029: 45%

By end of 2030: 55%

Cyber: critical - Definition: "requires autonomous identification and exploitation of novel zero-day vulnerabilities in hardened systems, or end-to-end offensive cyber operations against state-level targets". I am under the impression that GPT-5.5 is probably not that far from reaching this threshold, although I expect OpenAI not to release a model any time soon to the general public that clearly crosses this threshold, as it would be a cybersecurity nightmare. Thus, the company might release certain models to the U.S. government that cross this threshold without making them available to the general public.

By end of 2026: 8%

By end of 2027: 22%

By end of 2028: 40%

By end of 2029: 55%

By end of 2030: 65%

0 replies

No replies yet.