Meta’s ARE + Gaia2 Set a New Bar for AI Agent Evaluation under Asynchronous, Event-Driven Conditions
Meta AI has introduced Agents Research Environments (ARE), a modular simulation stack for creating and running agent tasks, and Gaia2, a follow-up benchmark to GAIA that evaluates agents in dynamic, write-enabled settings. ARE provides abstractions for apps, environments, events, notifications,…
