| Slug | deepseek-v4-flash |
| Parameters | Mid-tier MoE |
| Context | 1,000,000 tokens (1M) |
| Throughput | 70 tokens/sec |
| TTFT | 220 ms |
| License | DeepSeek License |
| Pricing | cogito.decart.ai/models/deepseek-v4-flash |

Best for

- High-volume RAG with long retrieved contexts
- Document and codebase summarization
- Synthetic data generation
- Anywhere $/token matters more than peak intelligence
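
For sizing the high-volume workloads listed above, the TTFT and throughput figures in the table can be combined into a rough per-response latency estimate. The sketch below assumes the 70 tokens/sec figure is a steady per-request decode rate and that the 220 ms TTFT holds regardless of prompt length; neither assumption is confirmed by the table, and TTFT in particular may grow with very long contexts. The function and constant names are hypothetical, chosen only for illustration.

```python
# Rough latency estimate from the published figures.
# Assumptions (not stated in the table): throughput is a steady
# per-request decode rate, and TTFT does not depend on prompt length.

TTFT_SECONDS = 0.220               # table: TTFT 220 ms
THROUGHPUT_TOKENS_PER_SEC = 70.0   # table: 70 tokens/sec


def estimate_latency_seconds(
    output_tokens: int,
    ttft: float = TTFT_SECONDS,
    throughput: float = THROUGHPUT_TOKENS_PER_SEC,
) -> float:
    """Return TTFT plus the time to decode `output_tokens` at `throughput`."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if throughput <= 0:
        raise ValueError("throughput must be positive")
    return ttft + output_tokens / throughput


if __name__ == "__main__":
    for n in (100, 500, 2000):
        print(f"{n:>5} output tokens: ~{estimate_latency_seconds(n):.1f} s")
```

Under these assumptions, a 500-token summary takes roughly 7.4 seconds end to end, so decode time dominates TTFT for anything beyond very short outputs. Measured numbers from a real workload should replace this estimate wherever they are available.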