The cheap workhorse with a 1M-token window. Built for high-volume pipelines where the bill matters as much as the answer.Documentation Index
Fetch the complete documentation index at: https://docs.cogito.decart.ai/llms.txt
Use this file to discover all available pages before exploring further.
| Slug | deepseek-v4-flash |
| Parameters | Mid-tier MoE |
| Context | 1,000,000 tokens (1M) |
| Throughput | 70 tokens/sec |
| TTFT | 220ms |
| License | DeepSeek License |
| Hardware | AWS Trainium |
| Input price | $0.14 / 1M tokens |
| Output price | $0.28 / 1M tokens |
Best for
- High-volume RAG with long retrieved contexts
- Document and codebase summarization
- Synthetic data generation
- Anywhere $/token matters more than peak intelligence