Model Library

Explore models ready for production — deploy in seconds.

New
MoonshotAI
Kimi K2 Thinking
$0.48/MtInput
$2/MtOutput
262144Context
262144Max Output
LLMServerless
New
minimax-m2MiniMax-M2
$0.24/MtInput
$0.96/MtOutput
204800Context
131072Max Output
LLMServerless
New
deepseek-v3.2-expDeepseek V3.2 Exp
$0.216/MtInput
$0.328/MtOutput
163840Context
65536Max Output
LLMServerless
New
qwen3-vl-235b-a22b-thinkingQwen3 VL 235B A22B Thinking
$0.784/MtInput
$3.16/MtOutput
131072Context
32768Max Output
LLMServerless
New
glm-4.6GLM 4.6
$0.48/MtInput
$0.088/MtCache Read
$1.76/MtOutput
204800Context
131072Max Output
LLMServerless
Free
kat-coderKAT-Coder-Pro V1
$0/MtInput
$0/MtOutput
256000Context
32000Max Output
LLMServerless
New
deepseek-ocrDeepSeek-OCR
$0.024/MtInput
$0.024/MtOutput
8192Context
8192Max Output
LLMServerless
New
deepseek-v3.1-terminusDeepseek V3.1 Terminus
$0.216/MtInput
$0.8/MtOutput
131072Context
65536Max Output
LLMServerless
New
qwen3-vl-235b-a22b-instructQwen3 VL 235B A22B Instruct
$0.24/MtInput
$1.2/MtOutput
131072Context
32768Max Output
LLMServerless
Partner
qwen3-maxQwen3 Max
$1.688/MtInput
$6.76/MtOutput
262144Context
65536Max Output
LLMServerless
New
RSkywork R1V4-Lite
$0/MtInput
$0/MtOutput
262144Context
65536Max Output
LLMServerless
New
deepseek-v3.1DeepSeek V3.1
$0.216/MtInput
$0.8/MtOutput
131072Context
32768Max Output
LLMServerless
New
MoonshotAI
Kimi K2 0905
$0.48/MtInput
$2/MtOutput
262144Context
262144Max Output
LLMServerless
qwen3-coder-480b-a35b-instructQwen3 Coder 480B A35B Instruct
$0.232/MtInput
$0.96/MtOutput
262144Context
65536Max Output
LLMServerless
OpenAI
OpenAI GPT OSS 120B
$0.04/MtInput
$0.2/MtOutput
131072Context
32768Max Output
LLMServerless
MoonshotAI
Kimi K2 Instruct
$0.456/MtInput
$1.84/MtOutput
131072Context
131072Max Output
LLMServerless
Hot
deepseek-v3-0324DeepSeek V3 0324
$0.216/MtInput
$0.108/MtCache Read
$0.896/MtOutput
163840Context
163840Max Output
LLMServerless
qwen3-235b-a22b-thinking-2507Qwen3 235B A22b Thinking 2507
$0.24/MtInput
$2.4/MtOutput
131072Context
32768Max Output
LLMServerless
qwen3-235b-a22b-instruct-2507Qwen3 235B A22B Instruct 2507
$0.072/MtInput
$0.464/MtOutput
131072Context
16384Max Output
LLMServerless

Dedicated Endpoint

Enterprise-Grade Infrastructure for AI

For enterprises that require higher performance, tailored SLAs, or private hosting for custom models
  • Custom pricing
  • Guaranteed uptime & latency
  • Unlimited scale
  • Dedicated clusters
Get Enterprise-Grade Endpoint
de-banner