# DeepSeek V4 Flash

> Cheap, fast, 1M context. The high-volume workhorse.

A low-cost model with a 1M-token context window, built for high-volume pipelines where the bill matters as much as the answer.

|                |                                                                                                |
| -------------- | ---------------------------------------------------------------------------------------------- |
| **Slug**       | `deepseek-v4-flash`                                                                            |
| **Parameters** | Mid-tier MoE                                                                                   |
| **Context**    | 1,000,000 tokens (1M)                                                                          |
| **Throughput** | 70 tokens/sec                                                                                  |
| **TTFT**       | 220 ms                                                                                         |
| **License**    | DeepSeek License                                                                               |
| **Pricing**    | [cogito.decart.ai/models/deepseek-v4-flash](https://cogito.decart.ai/models/deepseek-v4-flash) |
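
The throughput and TTFT figures in the table can be combined into a rough per-request latency estimate. The sketch below is only back-of-the-envelope arithmetic: it assumes the 70 tokens/sec figure is per-request decode speed and that TTFT stays near 220 ms regardless of prompt length, neither of which the table states. The function name `estimate_latency_seconds` is illustrative.

```python
# Figures copied from the table above.
TTFT_SECONDS = 0.220        # time to first token
TOKENS_PER_SECOND = 70.0    # decode throughput


def estimate_latency_seconds(output_tokens: int) -> float:
    """Rough wall-clock time for one streamed response.

    Assumes throughput is per-request decode speed and that TTFT does not
    grow with prompt length, which may not hold for very long prompts.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


print(f"{estimate_latency_seconds(500):.2f} s")  # prints "7.36 s" (0.22 + 500/70)
```

Under these assumptions a 500-token summary takes a little over seven seconds per request, so high-volume jobs will likely depend on running many requests in parallel rather than on single-stream speed.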

## Best for

* High-volume RAG with long retrieved contexts
* Document and codebase summarization
* Synthetic data generation
* Anywhere \$/token matters more than peak intelligence

```python
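import os
from openai import OpenAI  # assumption: the endpoint is OpenAI-compatible
# COGITO_BASE_URL / COGITO_API_KEY are placeholder names, not documented settings.
client = OpenAI(base_url=os.environ["COGITO_BASE_URL"], api_key=os.environ["COGITO_API_KEY"])
response = client.chat.completions.create(
    model="deepseek-v4-flash",
    messages=[{"role": "user", "content": "Summarize: ..."}],
)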
```
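
For the long-context RAG and summarization uses listed under "Best for", it can help to check that retrieved chunks fit the 1M-token window before sending a request. The following is a minimal sketch, not a documented utility: `pack_chunks`, `rough_token_count`, the 4-characters-per-token heuristic, and the 8,000-token output reserve are all assumptions made for illustration, and a real tokenizer would give more accurate counts.

```python
CONTEXT_WINDOW = 1_000_000   # from the table above
CHARS_PER_TOKEN = 4          # assumption: rough heuristic, not the real tokenizer


def rough_token_count(text: str) -> int:
    """Approximate token count; swap in a real tokenizer if one is available."""
    return max(1, len(text) // CHARS_PER_TOKEN)


def pack_chunks(chunks: list[str], reserve_for_output: int = 8_000) -> list[str]:
    """Keep chunks in their given (ranked) order until the estimated budget is used up."""
    budget = CONTEXT_WINDOW - reserve_for_output
    packed, used = [], 0
    for chunk in chunks:
        cost = rough_token_count(chunk)
        if used + cost > budget:
            break  # stop at the first chunk that would overflow the budget
        packed.append(chunk)
        used += cost
    return packed


# Hypothetical retrieved documents, already sorted by relevance.
docs = ["alpha " * 1000, "beta " * 1000]
prompt = "Summarize:\n\n" + "\n\n".join(pack_chunks(docs))
```

The resulting `prompt` string can be passed as the user message content. Stopping at the first overflowing chunk keeps the ranking intact; skipping it and trying smaller chunks is an equally reasonable alternative.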
