GPT-OSS 120B - Cogito

OpenAI’s first open-weight release since GPT-2. A 120B mixture-of-experts model with around 5B active parameters per token, strong general reasoning, and reliable function calling.


Slug	`gpt-oss-120b`
Parameters	120B (MoE, ~5B active)
Context	131,072 tokens (128K)
Throughput	70 tokens/sec
TTFT	240ms
License	Apache 2.0
Pricing	cogito.decart.ai/models/gpt-oss-120b

Best for

Default chat / agent
Coding assistance
Tool calling and structured outputs
Workloads where GPT-4-class output quality at open-weight pricing matters

Sample request

from openai import OpenAI

client = OpenAI(
    base_url="https://api.cogito.decart.ai/v1",
    api_key=os.environ["COGITO_API_KEY"],
)

response = client.chat.completions.create(
    model="gpt-oss-120b",
    messages=[{"role": "user", "content": "Hello!"}],
)

License

Apache 2.0. Use it however you want, including commercial deployments. Cogito’s serving doesn’t change the license terms.

​Best for

​Sample request

​License

Best for

Sample request

License