AI benchmarks are broken and the industry keeps using them anyway, study finds

2026-01-09 23:00 GMT · aimagpro.com

Benchmarks are supposed to measure AI model performance objectively. But according to an analysis by Epoch AI, results depend heavily on how the test is run. The research organization identifies numerous variables that are rarely disclosed but significantly affect outcomes.
The article "AI benchmarks are broken and the industry keeps using them anyway, study finds" first appeared on The Decoder.