Google just tested a bunch of new AI models for Android app coding – here are the rankings

Google has once again updated its “Android Bench” rankings for the best AI models for Android app development, with a bunch of new “open-weight” models as well as more details on the tokens used and cost of using these models.

One thing that large language models have gotten really good at is coding, with their ability to aid in the development of apps and other software projects also leading to the rise of “vibe coding.” Earlier this year, Google published a new benchmark ranking that showcased the “best” AI models for Android app development, taking into account common Android development tasks as well as how these models handle best practices.

When the “Android Bench” first debuted, Gemini 3.1 Pro led the pack, and OpenAI’s GPT 5.4 later tied for the top slot.

As of the May 18, 2026 update, there’s a new king in town. According to Google, GPT 5.5 is currently the best AI model for Android app development, beating out GPT 5.4 and Gemini 3.1 Pro by a little under 2%.

But this latest update also puts things into perspective much better, as Google now shows the average latency, total tokens used, and the average cost of using each AI model. Google details how it arrived at each metric in documentation around the benchmark.

When the “Android Bench” first debuted, Gemini 3.1 Pro led the pack, and OpenAI’s GPT 5.4 later tied for the top slot.

Google just tested a bunch of new AI models for Android app coding – here are the rankings

Google just tested a bunch of new AI models for Android app coding – here are the rankings

Other newsrooms on this story

Related reading

Google is changing how it judges AI models for Android coding, updates list…

Google is reportedly buying Android app code from Play Store devs to train AI…

Gemini 3.5 Flash lands on Google's Android coding rankings, but it's 3x the…

Google offers Android app developers cash in exchange for code to train AI

Google updates Android Bench with new LLMs, but Gemini still lags behind

Agentic app coding gets an upgrade with Google's release of Android CLI |…

Other newsrooms on this story

Related reading

Google is changing how it judges AI models for Android coding, updates list…

Google is reportedly buying Android app code from Play Store devs to train AI…

Gemini 3.5 Flash lands on Google's Android coding rankings, but it's 3x the…

Google offers Android app developers cash in exchange for code to train AI

Google updates Android Bench with new LLMs, but Gemini still lags behind

Agentic app coding gets an upgrade with Google's release of Android CLI |…