Anthropic says Claude 4 beats Gemini on coding benchmarks; works autonomously for hours.

When Anthropic’s older Claude model played Pokémon Red, it spent “dozens of hours” stuck in one city and had trouble identifying nonplayer characters. With Claude 4 Opus, the team…
