Skip to main content
Moonshot’s flagship MoE. Trained for long-horizon agentic workflows; the model engineering teams reach for when the cheap models stop being enough.
Model IDmoonshotai/kimi-k2.6
Fast aliasmoonshotai/kimi-k2.6:fast
Route slugkimi-k2.6
Parameters1T MoE (~32B active)
Context262,144 tokens (256K)
Throughput70 tokens/sec
TTFT280ms
LicenseModified MIT
Pricingcogito.decart.ai/models/kimi-k2.6

Best for

  • Coding agents (multi-file refactors, multi-step PRs)
  • Long-horizon agentic reasoning
  • Complex tool use with parallel function calls
  • Workloads that benefit from long, reliable reasoning chains
client.chat.completions.create(
    model="moonshotai/kimi-k2.6",
    messages=[...],
)
The default moonshotai/kimi-k2.6 route is served from the AWS B300 high-capacity Moonshot deployment. Use moonshotai/kimi-k2.6:fast to opt into the CoreWeave B200 low-latency route (lower per-route capacity).

License

Kimi K2.6 ships under a Modified MIT license. Permissive enough for commercial deployments; check the upstream license for redistribution specifics.