Model Library

Browse our supported open source models and deploy in dedicated endpoints

LLM
New
deepseek-v3.2-expDeepseek V3.2 Exp$0.27/M In$0.41/M Out163840 ContextLLMServerless
New
glm-4.6GLM 4.6$0.6/M In$2.2/M Out204800 ContextLLMServerless
New
deepseek-v3.1-terminusDeepseek V3.1 Terminus$0.27/M In$1/M Out98304 ContextLLMServerless
New
qwen3-vl-235b-a22b-thinkingQwen3 VL 235B A22B Thinking$0.98/M In$3.95/M Out131072 ContextLLMServerless
New
qwen3-next-80b-a3b-instructQwen3 Next 80B A3B Instruct$0.15/M In$1.5/M Out131072 ContextLLMServerless
New
qwen3-next-80b-a3b-thinkingQwen3 Next 80B A3B Thinking$0.15/M In$1.5/M Out131072 ContextLLMServerless
New
qwen3-vl-235b-a22b-instructQwen3 VL 235B A22B Instruct$0.3/M In$1.5/M Out131072 ContextLLMServerless
New
MoonshotAI
Kimi K2 0905$0.6/M In$2.5/M Out262144 ContextLLMServerless
New
deepseek-v3.1DeepSeek V3.1$0.27/M In$1/M Out131072 ContextLLMServerless
qwen3-coder-480b-a35b-instructQwen3 Coder 480B A35B Instruct$0.29/M In$1.2/M Out262144 ContextLLMServerless
OpenAI
OpenAI GPT OSS 120B$0.1/M In$0.5/M Out131072 ContextLLMServerless
MoonshotAI
Kimi K2 Instruct$0.57/M In$2.3/M Out131072 ContextLLMServerless
Hot
deepseek-v3-0324DeepSeek V3 0324$0.27/M In$1.12/M Out163840 ContextLLMServerless
qwen3-235b-a22b-thinking-2507Qwen3 235B A22b Thinking 2507$0.3/M In$3/M Out131072 ContextLLMServerless
OpenAI
OpenAI: GPT OSS 20B$0.05/M In$0.2/M Out131072 ContextLLMServerless
qwen3-235b-a22b-instruct-2507Qwen3 235B A22B Instruct 2507$0.15/M In$0.8/M Out131072 ContextLLMServerless

Dedicated Endpoint

Enterprise-Grade Infrastructure for AI

For enterprises that require higher performance, tailored SLAs, or private hosting for custom models
  • Custom pricing
  • Guaranteed uptime & latency
  • Unlimited scale
  • Dedicated clusters
Get Enterprise-Grade Endpoint
de-banner