GDPval sets a new standard for benchmarking AI on real-world knowledge work, with 1,320 tasks spanning 44 professions, all reviewed by industry experts.
The article "OpenAI says top AI models are reaching expert territory on real-world knowledge work" appeared first on THE DECODER.
