# GPT-OSS 120B

> OpenAI's open-weight 120B MoE. Strong general reasoning at near-commodity pricing.

OpenAI's first open-weight language model release since GPT-2. It is a 120B mixture-of-experts model with around 5B active parameters per token, offering strong general reasoning and reliable function calling.

|                |                                                                                      |
| -------------- | ------------------------------------------------------------------------------------ |
| **Slug**       | `gpt-oss-120b`                                                                       |
| **Parameters** | 120B (MoE, \~5B active)                                                              |
| **Context**    | 131,072 tokens (128K)                                                                |
| **Throughput** | 70 tokens/sec                                                                        |
| **TTFT**       | 240ms                                                                                |
| **License**    | Apache 2.0                                                                           |
| **Pricing**    | [cogito.decart.ai/models/gpt-oss-120b](https://cogito.decart.ai/models/gpt-oss-120b) |
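
The throughput and TTFT figures in the table above can be combined into a rough latency estimate. The sketch below assumes the listed values hold for a given request and that decode time grows linearly with output length; `estimate_latency_seconds` is a hypothetical helper, not part of any SDK, and real latency will vary with load and prompt size.

```python
# Figures from the table above; treated here as typical values, not guarantees.
TTFT_SECONDS = 0.240
THROUGHPUT_TOKENS_PER_SEC = 70.0


def estimate_latency_seconds(output_tokens: int) -> float:
    """Rough end-to-end time: time to first token plus linear decode time."""
    return TTFT_SECONDS + output_tokens / THROUGHPUT_TOKENS_PER_SEC


# A 500-token reply: 0.24 s + 500 / 70 s, a little over 7 seconds.
print(f"{estimate_latency_seconds(500):.1f} s")
```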

## Best for

* Default chat / agent
* Coding assistance
* Tool calling and structured outputs
* Workloads where GPT-4-class output quality at open-weight pricing matters

## Sample request

```python
import os

from openai import OpenAI

# Point the OpenAI SDK at the Cogito endpoint.
client = OpenAI(
    base_url="https://api.cogito.decart.ai/v1",
    api_key=os.environ["COGITO_API_KEY"],
)

response = client.chat.completions.create(
    model="gpt-oss-120b",
    messages=[{"role": "user", "content": "Hello!"}],
)

# Print the assistant's reply.
print(response.choices[0].message.content)
```
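
For the tool-calling use case listed under "Best for", a request might look like the sketch below. It assumes the endpoint accepts the OpenAI SDK's standard `tools` parameter and returns `tool_calls` in the usual shape, which this page does not confirm. The `get_weather` tool, its schema, and the helpers `run_tool_call` and `ask_with_tools` are hypothetical names chosen for illustration.

```python
import json
import os

# Hypothetical tool definition; the name and schema are illustrative only.
WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Return the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def get_weather(city: str) -> dict:
    """Stub implementation; a real tool would query a weather service."""
    return {"city": city, "forecast": "sunny"}


def run_tool_call(name: str, arguments_json: str) -> str:
    """Dispatch one tool call requested by the model and serialize its result."""
    handlers = {"get_weather": get_weather}
    args = json.loads(arguments_json)
    return json.dumps(handlers[name](**args))


def ask_with_tools(prompt: str) -> str:
    """Send a prompt, run any requested tool calls, and return the final text."""
    from openai import OpenAI  # imported lazily so the helpers above work offline

    client = OpenAI(
        base_url="https://api.cogito.decart.ai/v1",
        api_key=os.environ["COGITO_API_KEY"],
    )
    messages = [{"role": "user", "content": prompt}]
    first = client.chat.completions.create(
        model="gpt-oss-120b", messages=messages, tools=[WEATHER_TOOL]
    )
    reply = first.choices[0].message
    if not reply.tool_calls:
        return reply.content or ""

    # Echo the assistant turn back, then attach one tool result per call.
    messages.append(reply)
    for call in reply.tool_calls:
        messages.append({
            "role": "tool",
            "tool_call_id": call.id,
            "content": run_tool_call(call.function.name, call.function.arguments),
        })
    final = client.chat.completions.create(
        model="gpt-oss-120b", messages=messages, tools=[WEATHER_TOOL]
    )
    return final.choices[0].message.content or ""
```

Calling `ask_with_tools("What's the weather in Paris?")` would make up to two requests: one that lets the model ask for the tool, and one that returns the tool's result so the model can write its answer.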

## License

Apache 2.0. The license permits commercial use, modification, and redistribution, subject to its attribution and notice requirements. Cogito's serving doesn't change the license terms.
