TL;DRAI

DeepSeek V4 and Qwen3-32B cut costs 40-65% vs GPT-4o with 84.6% quality and 1.2s latency on production workloads. Open-weight models with Apache/MIT licenses eliminate vendor lock-in and enable self-hosting—critical factors for CTO AI infrastructure choices.

I Spent Two Weeks Pitting Qwen 3 Max Against DeepSeek V4

I want to tell you about a rabbit hole I fell into recently. It started the way most of my projects do — someone on a Discord server I frequent asked a simple question: "Should I use Qwen 3 Max or DeepSeek V4 for my internal_compare workflow?" I had opinions, sure, but I wanted real numbers. So I cleared my calendar, fired up a couple of GPU instances, and started benchmarking.

What I found surprised me, and it also reinforced something I've been saying for years: the open source ecosystem is winning, and the walled gardens of the proprietary AI world are starting to look pretty silly.

Let me walk you through what I learned, the actual numbers I got, and why I keep coming back to open weight models with permissive licenses (looking at you, Apache 2.0 and MIT).

Why I Care About This in the First Place

dev.to

I Spent Two Weeks Pitting Qwen 3 Max Against DeepSeek V4

I Spent Two Weeks Pitting Qwen 3 Max Against DeepSeek V4 I want to tell you about a rabbit hole I...

lunedì 15 giugno 2026 New tab

TL;DRAI

1,721 words~8 min read

Let me walk you through what I learned, the actual numbers I got, and why I keep coming back to open weight models with permissive licenses (looking at you, Apache 2.0 and MIT).

Why I Care About This in the First Place

I Spent Two Weeks Pitting Qwen 3 Max Against DeepSeek V4

I Spent Two Weeks Pitting Qwen 3 Max Against DeepSeek V4

Related reading

I Tested DeepSeek V4 and V4 Flash Side by Side — Here's the Truth

A Month with DeepSeek: What Happened When I Replaced Claude Opus for Real Work

Qwen Is Not Yet Ready to Power Local OpenClaw Deployments

DeepSeek V4 vs DeepSeek V4 Flash: What I Learned as a Junior Dev

I Tested DeepSeek V4 Flash and GPT-4o Side by Side — Here's the p99 Latency…

Stop Guessing: Real p99 Latency Data Comparing DeepSeek, Qwen, Kimi, and GLM

Related reading

I Tested DeepSeek V4 and V4 Flash Side by Side — Here's the Truth

A Month with DeepSeek: What Happened When I Replaced Claude Opus for Real Work

Qwen Is Not Yet Ready to Power Local OpenClaw Deployments

DeepSeek V4 vs DeepSeek V4 Flash: What I Learned as a Junior Dev

I Tested DeepSeek V4 Flash and GPT-4o Side by Side — Here's the p99 Latency…

Stop Guessing: Real p99 Latency Data Comparing DeepSeek, Qwen, Kimi, and GLM