Model Library

Browse our supported open source models and deploy in dedicated endpoints

LLM
New
qwen3-next-80b-a3b-instructQwen3 Next 80B A3B Instruct$0.15/M In$1.5/M Out65536 ContextLLMServerless
New
qwen3-next-80b-a3b-thinkingQwen3 Next 80B A3B Thinking$0.15/M In$1.5/M Out65536 ContextLLMServerless
New
MoonshotAI
Kimi K2 0905$0.6/M In$2.5/M Out262144 ContextLLMServerless
New
deepseek-v3.1DeepSeek V3.1$0.27/M In$1/M Out163840 ContextLLMServerless
qwen3-coder-480b-a35b-instructQwen3 Coder 480B A35B Instruct$0.29/M In$1.2/M Out262144 ContextLLMServerless
OpenAI
OpenAI GPT OSS 120B$0.1/M In$0.5/M Out131072 ContextLLMServerless
MoonshotAI
Kimi K2 Instruct$0.57/M In$2.3/M Out131072 ContextLLMServerless
Hot
deepseek-v3-0324DeepSeek V3 0324$0.28/M In$1.14/M Out163840 ContextLLMServerless
qwen3-235b-a22b-thinking-2507Qwen3 235B A22b Thinking 2507$0.3/M In$3/M Out131072 ContextLLMServerless
OpenAI
OpenAI: GPT OSS 20B$0.05/M In$0.2/M Out131072 ContextLLMServerless
qwen3-235b-a22b-instruct-2507Qwen3 235B A22B Instruct 2507$0.15/M In$0.8/M Out131072 ContextLLMServerless

Dedicated Endpoint

Enterprise-Grade Infrastructure for AI

For enterprises that require higher performance, tailored SLAs, or private hosting for custom models
  • Custom pricing
  • Guaranteed uptime & latency
  • Unlimited scale
  • Dedicated clusters
Get Enterprise-Grade Endpoint
de-banner