New ways to balance cost and reliability in the Gemini API

2026-04-02 07:00 GMT · 2 months ago aimagpro.com

Google is introducing two new inference tiers to the Gemini API, Flex and Priority,
to balance cost and latency.