AI models can barely control their own reasoning, and OpenAI says that’s a good sign

March 6, 2026

2026-03-06 03:08 GMT · 4 months ago aimagpro.com

With GPT-5.4 Thinking, OpenAI is reporting on “CoT controllability” for the first time – a measure of whether AI models can deliberately manipulate their own reasoning. An accompanying study finds that reasoning models almost universally fail at this task, which OpenAI says is encouraging for AI safety.
The article AI models can barely control their own reasoning, and OpenAI says that's a good sign appeared first on The Decoder.