TL;DR

A backend engineer cut GPT-4o costs by 60× by switching to DeepSeek, with equivalent MMLU and HumanEval scores. Tech takeaway: LLM capability has reached parity, so advantage now depends on operational moats such as SDK compatibility and billing convenience, and that is reshaping procurement.

Why I Migrated From GPT-4o to DeepSeek — A Backend Engineer's Notes

Six months ago, my monthly OpenAI bill crossed four figures and I finally snapped. Not because the cost was unbearable in absolute terms, but because I had a sneaking suspicion I was overpaying for marginal quality gains. So I did what any sane backend engineer would do: I instrumented my service to log token usage by endpoint, spun up parallel calls to every major Chinese model, and started comparing numbers like my paycheck depended on it. Spoiler — it kind of did.
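For readers who want to try the same kind of instrumentation, here is a minimal sketch of per-endpoint token logging. It is not the code from the service described in this post. The names (`UsageLog`, `record`, `cost`), the endpoint and model labels, and the prices in the usage example are hypothetical placeholders. It assumes your provider's response already reports prompt and completion token counts for each call.

```python
from dataclasses import dataclass


@dataclass
class Usage:
    """Running token totals for one (endpoint, model) pair."""
    prompt_tokens: int = 0
    completion_tokens: int = 0
    calls: int = 0


class UsageLog:
    """In-memory tally of token usage, keyed by (endpoint, model)."""

    def __init__(self) -> None:
        self._totals: dict[tuple[str, str], Usage] = {}

    def record(self, endpoint: str, model: str,
               prompt_tokens: int, completion_tokens: int) -> None:
        # Call this once per LLM request, with the counts the API reported.
        usage = self._totals.setdefault((endpoint, model), Usage())
        usage.prompt_tokens += prompt_tokens
        usage.completion_tokens += completion_tokens
        usage.calls += 1

    def totals(self, endpoint: str, model: str) -> Usage:
        # Returns an empty Usage if nothing was recorded for this pair.
        return self._totals.get((endpoint, model), Usage())

    def cost(self, endpoint: str, model: str,
             input_price_per_m: float, output_price_per_m: float) -> float:
        # Prices are expressed per million tokens, so divide by 1,000,000.
        usage = self.totals(endpoint, model)
        return (usage.prompt_tokens * input_price_per_m
                + usage.completion_tokens * output_price_per_m) / 1_000_000


if __name__ == "__main__":
    log = UsageLog()
    # Hypothetical endpoint and model names; token counts are made up.
    log.record("/summarize", "model-a", prompt_tokens=1000, completion_tokens=500)
    log.record("/summarize", "model-a", prompt_tokens=2000, completion_tokens=500)
    # Placeholder prices, NOT real list prices for any provider.
    print(log.cost("/summarize", "model-a",
                   input_price_per_m=2.0, output_price_per_m=10.0))
```

Keying the tally by both endpoint and model makes the head-to-head comparison straightforward: replay the same endpoint's traffic against a second model, then call `cost` for each pair with that model's per-million prices.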

This is the story of what I found when I actually ran Chinese AI models (DeepSeek, Qwen, Kimi, GLM) head-to-head against the US incumbents (GPT-4o, Claude 3.5 Sonnet, Gemini 1.5 Pro) on a real production workload. Not a synthetic benchmark, not a vibes-based Twitter thread: actual requests flowing through my service. FWIW, the results were not what I expected.

The Pricing Problem Nobody Wants to Talk About

Let's start with the part CFOs care about. The price gap between US and Chinese models in 2026 isn't a rounding error — it's a yawning chasm. Here's what I'm currently paying (or would pay) per million tokens:

dev.to




