OpenAI’s first open-weight release since GPT-2. A 120B mixture-of-experts model with around 5B active parameters per token, strong general reasoning, and reliable function calling.Documentation Index
Fetch the complete documentation index at: https://docs.cogito.decart.ai/llms.txt
Use this file to discover all available pages before exploring further.
| Slug | gpt-oss-120b |
| Parameters | 120B (MoE, ~5B active) |
| Context | 131,072 tokens (128k) |
| Throughput | 70 tokens/sec |
| TTFT | 240ms |
| License | Apache 2.0 |
| Hardware | AWS Trainium |
| Input price | $0.039 / 1M tokens |
| Output price | $0.18 / 1M tokens |
Best for
- Default chat / agent
- Coding assistance
- Tool calling and structured outputs
- Workloads where GPT-4-class output quality at open-weight pricing matters