DeepSeek V4 Flash

The cheap workhorse with a 1M-token window. Built for high-volume pipelines where the bill matters as much as the answer.


Slug	`deepseek-v4-flash`
Parameters	Mid-tier MoE
Context	1,000,000 tokens (1M)
Throughput	70 tokens/sec
TTFT	220ms
License	DeepSeek License
Pricing	cogito.decart.ai/models/deepseek-v4-flash

Best for

client.chat.completions.create(
    model="deepseek-v4-flash",
    messages=[{"role": "user", "content": "Summarize: ..."}],
)