Google is capping how much quota a single prompt can use and making Flash-Lite prompts free after users burned through limits rapidly

At I/O 2026, the Gemini app switched to compute-based usage limits. In response to “feedback about hitting limits too quickly,” Google...

Google has modified Gemini's compute-based usage limits following user complaints, aiming to address issues with prompt complexity and refresh intervals introduced at I/O 2026.

Nach Kritik an zu schnellen Nutzungslimits für Gemini-Modelle bessert Google nach. Komplexe Anfragen verbrauchen nun weniger Kontingent.

Google is capping how much quota a single prompt can use and making Flash-Lite prompts free after users burned through limits rapidly

A bug in Google's Gemini app caused just one or two Omni videos to eat up the entire usage quota. Google has fixed the bug, Ultra members now get twice as many video generations,…