These LLMs are the best at resisting Russian propaganda

Open-weight models, including Nvidia’s Nemotron and Alibaba’s Qwen, showed strong results comparable to Anthropic’s best models. GPT-5.4—the best-performing model from OpenAI—also performed relatively well on the benchmark, providing “Exemplary” responses on 54 percent of questions and achieving an 88.9 mean score.

Unsurprisingly, recent frontier models showed a much stronger tendency to resist Russian propaganda than models from just a few years ago. Claude 3.5 Haiku—the highest-rated model released in 2024—received a mean rating of just 73.1 on the benchmark. That mark would put it in the bottom third of models released in 2026 on this metric.

Detailed benchmarks for Google’s Gemini 2.5 Pro model show particularly sensitivity to malicious prompts and prompts in Russian.

Estonian Language Institute

But that improvement over time was not uniform across all LLM makers. Google’s most propaganda-resistant LLM, Gemini 2.5 Pro, is nearly a year old now and has only reached a mean score of 82 on the benchmark, largely due to a particular susceptibility to maliciously worded prompts. The most recent tested Google model, Gemini 3.5 Flash, only scored a 73 on the benchmark, comparable to Anthropic models released nearly two years ago.

These LLMs are the best at resisting Russian propaganda

Other newsrooms on this story

Related reading

Mistral AI models flagged for potential Russian propaganda influence in new…

Mistral AI models score below 40% in detecting Russian propaganda, new…

How easily can Russian propaganda fool AI models? A new benchmark finds out

Europe's AI champion Mistral found to be spreading Russian propaganda

Z.AI's GLM-5.2 outperforms GPT-5.5 on coding benchmarks at one-sixth the cost

OpenAI releases lower-cost models to rival Meta, Mistral and DeepSeek