Google's Gemini 3.5 Flash follows Anthropic and OpenAI in making newer AI models significantly pricier

Google's Gemini 3.5 Flash is a big step up from its predecessor, but in benchmark testing, it costs 5.5 times as much to run. On agent tasks, total costs even exceed the pricier Gemini 3.1 Pro by 75 percent because the model needs more interaction steps than any rival tested. Google isn't alone here: AI is getting more expensive across the board as the massive investments need to pay off.

mercoledì 20 maggio 2026 New tab

Google's new Gemini 3.5 Flash is a step up from its predecessor, but it costs more than five times as much to run. High token consumption on agent tasks pushes total costs past the pricier Pro model in benchmark testing.

Google Deepmind has released Gemini 3.5 Flash, the latest version of its Flash model family. Flash was long positioned as the cheaper, faster alternative to Google's more powerful Pro models. An analysis by Artificial Analysis, which got early access, found that Gemini 3.5 Flash costs 5.5 times more to run in benchmark testing than Gemini 3 Flash and nearly twice as much as the Pro model Gemini 3.1. The context window stays at one million tokens.

Gemini 3.5 Flash has gotten much more expensive than its predecessor, both in token price and token consumption. | Image: Artificial Analysis

Token prices alone have tripled: Google now charges $1.50 per million input tokens and $9.00 per million output tokens, up from $0.50 and $3.00 for Gemini 3 Flash. Per token, that's still cheaper than Gemini 3.1 Pro at $2.00 and $12.00.

In practice, though, the math flips. Gemini 3.5 Flash burns through so many more tokens on agent-based tasks that total costs end up 75 percent higher than Gemini 3.1 Pro, according to Artificial Analysis.

Gemini 3.5 Flash has gotten much more expensive than its predecessor, both in token price and token consumption. | Image: Artificial Analysis

Google's Gemini 3.5 Flash follows Anthropic and OpenAI in making newer AI models significantly pricier

Google's Gemini 3.5 Flash follows Anthropic and OpenAI in making newer AI models significantly pricier

Other newsrooms on this story

Related reading

Google Introduces Gemini 3.5 Flash at I/O 2026: A Faster and Cheaper Model for…

Google says Gemini 3.5 Flash can slash enterprise AI costs by more than $1…

Gemini 3.5 Flash lands on Google's Android coding rankings, but it's 3x the…

Google focuses on autonomous AI agents in Gemini 3.5 Flash

Google announces Gemini 3.5 Flash, its "strongest" coding model yet

Google launches Gemini 3.5 Flash, its fastest agentic AI model for coding | TNW

Other newsrooms on this story

Related reading

Google Introduces Gemini 3.5 Flash at I/O 2026: A Faster and Cheaper Model for…

Google says Gemini 3.5 Flash can slash enterprise AI costs by more than $1…

Gemini 3.5 Flash lands on Google's Android coding rankings, but it's 3x the…

Google focuses on autonomous AI agents in Gemini 3.5 Flash

Google announces Gemini 3.5 Flash, its "strongest" coding model yet

Google launches Gemini 3.5 Flash, its fastest agentic AI model for coding | TNW