OpenAI’s first open-weight language model release since GPT-2. A 120B mixture-of-experts model with around 5B active parameters per token, strong general reasoning, and reliable function calling.
Slug: gpt-oss-120b
Parameters: 120B (MoE, ~5B active)
Context: 131,072 tokens (128k)
Throughput: 70 tokens/sec
TTFT: 240 ms
License: Apache 2.0
Hardware: AWS Trainium
Input price: $0.039 / 1M tokens
Output price: $0.18 / 1M tokens
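
To make the pricing and speed figures concrete, here is a minimal sketch that turns them into a per-request estimate. It assumes the listed prices apply linearly per token, that throughput is the per-request decode speed, and that TTFT does not grow with prompt length; none of these assumptions are confirmed by this page, and the function names are illustrative.

```python
# Figures copied from the spec list above; treat them as list values, not guarantees.
INPUT_USD_PER_MTOK = 0.039   # input price per 1M tokens
OUTPUT_USD_PER_MTOK = 0.18   # output price per 1M tokens
TTFT_SECONDS = 0.240         # time to first token
TOKENS_PER_SECOND = 70       # decode throughput (assumed per request)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost of one request in US dollars."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


def estimate_latency_seconds(output_tokens: int) -> float:
    """Rough end-to-end time: time to first token plus steady decoding."""
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Hypothetical request: 2,000 prompt tokens and 500 completion tokens.
    print(f"cost:    ${estimate_cost_usd(2_000, 500):.6f}")
    print(f"latency: {estimate_latency_seconds(500):.2f} s")
```

Real latency depends on load and prompt size, so use the estimate for budgeting rather than as a service-level figure.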

Best for

  • Default chat / agent
  • Coding assistance
  • Tool calling and structured outputs
  • Workloads where GPT-4-class output quality at open-weight pricing matters
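
For the tool-calling use case, the following is a minimal sketch of one round trip. It assumes the Cogito endpoint accepts the OpenAI Chat Completions `tools` parameter in the same shape as the OpenAI API, which this page does not state explicitly. The `get_weather` tool, the `run_tool` dispatcher, and `ask_with_tools` are hypothetical names used only for illustration.

```python
import json
import os


def get_weather(city: str) -> str:
    """Hypothetical tool: returns a canned forecast as a JSON string."""
    return json.dumps({"city": city, "forecast": "sunny"})


# JSON Schema description of the tool, in the OpenAI function-calling format.
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Return a short weather forecast for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

LOCAL_TOOLS = {"get_weather": get_weather}


def run_tool(name: str, arguments_json: str) -> str:
    """Dispatch one tool call; the model sends arguments as a JSON string."""
    func = LOCAL_TOOLS.get(name)
    if func is None:
        return json.dumps({"error": f"unknown tool: {name}"})
    return func(**json.loads(arguments_json))


def ask_with_tools(prompt: str) -> str:
    """Send a prompt, execute any requested tools, and return the final text."""
    from openai import OpenAI  # imported here so the helpers above work without the SDK

    client = OpenAI(
        base_url="https://api.cogito.decart.ai/v1",
        api_key=os.environ["COGITO_API_KEY"],
    )
    messages = [{"role": "user", "content": prompt}]
    first = client.chat.completions.create(
        model="gpt-oss-120b", messages=messages, tools=TOOLS
    )
    reply = first.choices[0].message
    if not reply.tool_calls:
        return reply.content or ""

    # Echo the assistant turn, then append one "tool" message per call.
    messages.append(reply)
    for call in reply.tool_calls:
        result = run_tool(call.function.name, call.function.arguments)
        messages.append({"role": "tool", "tool_call_id": call.id, "content": result})

    final = client.chat.completions.create(
        model="gpt-oss-120b", messages=messages, tools=TOOLS
    )
    return final.choices[0].message.content or ""
```

Calling `ask_with_tools("What's the weather in Oslo?")` would run the full loop. Keeping the dispatcher separate from the network call makes the tool logic easy to check on its own.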

Sample request

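import os

from openai import OpenAI

# Point the OpenAI SDK at Cogito's endpoint.
client = OpenAI(
    base_url="https://api.cogito.decart.ai/v1",
    api_key=os.environ["COGITO_API_KEY"],  # read the key from the environment
)

response = client.chat.completions.create(
    model="gpt-oss-120b",
    messages=[{"role": "user", "content": "Hello!"}],
)

# Print the assistant's reply text.
print(response.choices[0].message.content)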
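
Because time to first token is part of the spec, a streaming variant of the request may be useful. The sketch below assumes the endpoint supports `stream=True` with the standard OpenAI chunk format, which this page does not confirm; `collect_text` and `stream_reply` are illustrative names.

```python
import os
from typing import Iterable


def collect_text(chunks: Iterable) -> str:
    """Print streamed text as it arrives and return the full reply."""
    parts = []
    for chunk in chunks:
        # Some chunks (for example usage-only chunks) carry no choices.
        if not chunk.choices:
            continue
        delta = chunk.choices[0].delta.content
        if delta:  # role-only and final chunks have no content
            print(delta, end="", flush=True)
            parts.append(delta)
    return "".join(parts)


def stream_reply(prompt: str) -> str:
    """Request a streamed completion and return the assembled text."""
    from openai import OpenAI  # imported here so collect_text works without the SDK

    client = OpenAI(
        base_url="https://api.cogito.decart.ai/v1",
        api_key=os.environ["COGITO_API_KEY"],
    )
    stream = client.chat.completions.create(
        model="gpt-oss-120b",
        messages=[{"role": "user", "content": prompt}],
        stream=True,  # assumption: supported by the Cogito endpoint
    )
    return collect_text(stream)
```

`collect_text` accepts any iterable of chunk-like objects, so it can be exercised without a network call.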

License

Apache 2.0. You can use, modify, and redistribute the model, including in commercial deployments, subject to the license’s attribution and notice requirements. Cogito’s serving doesn’t change the license terms.