Google's Gemini 3.5 Flash model, announced at Google I/O, offers performance comparable to the previous Gemini 3.1 Pro but at a significantly lower cost. Despite being branded as "Flash," its pricing is now much closer to Pro, with input costs at $1.50/1M tokens and output costs at $9.00/1M tokens, a substantial increase from earlier Flash versions. This pricing adjustment effectively makes Pro-level inference 25% cheaper, offering economic benefits for large-scale agentic workloads, though the author cautions against relying solely on benchmark performance for production decisions. AI
Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →
IMPACT Makes Pro-level inference 25% cheaper, potentially accelerating adoption of agentic AI workloads at scale.
RANK_REASON Product announcement with significant pricing and performance implications for a major AI model.