DeepSeek’s V4 series, expected to fully launch in mid-July, will implement a tiered pricing structure that doubles token rates during designated peak periods.
How the pricing works
The peak windows are set from 9:00 to 12:00 and 14:00 to 18:00 Beijing Time. During those hours, both input and output token costs will double across the two V4 models: deepseek-v4-pro and deepseek-v4-flash.
V4-Flash currently sits at roughly $0.14 per million input tokens (on cache misses) and $0.28 per million output tokens. For V4-Pro, peak output pricing lands around 12 yuan per million tokens, approximately $1.76. V4-Flash peak output comes in at about 4 yuan per million tokens, or roughly $0.59.
DeepSeek says users will receive 24-hour advance notification before any price changes take effect.










