Model Library

Explore models ready for production — deploy in seconds.

New
MoonshotAI
Kimi K2 Thinking
$0.6/Mt Input
$2.5/Mt Output
262144 Context
262144 Max Output
LLM · Serverless
PaddleOCR-VL
$0.02/Mt Input
$0.02/Mt Output
16384 Context
16384 Max Output
LLM · Serverless
New
deepseek-v3.2-exp
Deepseek V3.2 Exp
$0.27/Mt Input
$0.41/Mt Output
163840 Context
65536 Max Output
LLM · Serverless
Partner
qwen3-max
Qwen3 Max
$2.11/Mt Input
$8.45/Mt Output
262144 Context
65536 Max Output
LLM · Serverless
New
glm-4.6
GLM 4.6
$0.6/Mt Input
$0.11/Mt Cache Read
$2.2/Mt Output
204800 Context
131072 Max Output
LLM · Serverless
New
qwen3-vl-235b-a22b-thinking
Qwen3 VL 235B A22B Thinking
$0.98/Mt Input
$3.95/Mt Output
131072 Context
32768 Max Output
LLM · Serverless
New
qwen3-vl-235b-a22b-instruct
Qwen3 VL 235B A22B Instruct
$0.3/Mt Input
$1.5/Mt Output
131072 Context
32768 Max Output
LLM · Serverless
New
deepseek-ocr
DeepSeek-OCR
$0.03/Mt Input
$0.03/Mt Output
8192 Context
8192 Max Output
LLM · Serverless
New
deepseek-v3.1-terminus
Deepseek V3.1 Terminus
$0.27/Mt Input
$1/Mt Output
131072 Context
65536 Max Output
LLM · Serverless
New
MoonshotAI
Kimi K2 0905
$0.6/Mt Input
$2.5/Mt Output
262144 Context
262144 Max Output
LLM · Serverless
New
deepseek-v3.1
DeepSeek V3.1
$0.27/Mt Input
$1/Mt Output
131072 Context
32768 Max Output
LLM · Serverless
qwen3-coder-480b-a35b-instruct
Qwen3 Coder 480B A35B Instruct
$0.29/Mt Input
$1.2/Mt Output
262144 Context
65536 Max Output
LLM · Serverless
OpenAI
OpenAI GPT OSS 120B
$0.05/Mt Input
$0.25/Mt Output
131072 Context
32768 Max Output
LLM · Serverless
MoonshotAI
Kimi K2 Instruct
$0.57/Mt Input
$2.3/Mt Output
131072 Context
131072 Max Output
LLM · Serverless
Hot
deepseek-v3-0324
DeepSeek V3 0324
$0.27/Mt Input
$0.135/Mt Cache Read
$1.12/Mt Output
163840 Context
163840 Max Output
LLM · Serverless
qwen3-235b-a22b-thinking-2507
Qwen3 235B A22b Thinking 2507
$0.3/Mt Input
$3/Mt Output
131072 Context
32768 Max Output
LLM · Serverless
qwen3-235b-a22b-instruct-2507
Qwen3 235B A22B Instruct 2507
$0.09/Mt Input
$0.58/Mt Output
131072 Context
16384 Max Output
LLM · Serverless
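
To compare the listed rates, it can help to turn them into a per-request cost. The sketch below is illustrative only and rests on two assumptions not stated in the listing: that "Mt" means one million tokens, and that tokens served from cache are billed at the Cache Read rate instead of the Input rate. The `Pricing` class and `estimate_cost` function are hypothetical helpers, not part of any provider SDK; the rates are copied from the GLM 4.6 and DeepSeek V3.1 cards above.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: "Mt" in the listing means one million tokens.
TOKENS_PER_MT = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """Per-million-token rates in USD, as shown on a model card."""
    input_per_mt: float
    output_per_mt: float
    cache_read_per_mt: Optional[float] = None  # only some cards list this


def estimate_cost(pricing: Pricing, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: cached input tokens are billed at the cache-read rate in
    place of the input rate; models without a cache-read rate bill all
    input tokens at the input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    cache_rate = (pricing.cache_read_per_mt
                  if pricing.cache_read_per_mt is not None
                  else pricing.input_per_mt)
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * pricing.input_per_mt
             + cached_tokens * cache_rate
             + output_tokens * pricing.output_per_mt)
    return total / TOKENS_PER_MT


# Rates taken from the cards in the listing.
GLM_4_6 = Pricing(input_per_mt=0.6, output_per_mt=2.2, cache_read_per_mt=0.11)
DEEPSEEK_V3_1 = Pricing(input_per_mt=0.27, output_per_mt=1.0)

if __name__ == "__main__":
    # 100k input tokens (80k of them cached) and 10k output tokens on GLM 4.6.
    print(f"GLM 4.6: ${estimate_cost(GLM_4_6, 100_000, 10_000, 80_000):.4f}")
    # The same request on DeepSeek V3.1, which lists no cache-read rate.
    print(f"DeepSeek V3.1: ${estimate_cost(DEEPSEEK_V3_1, 100_000, 10_000):.4f}")
```

Actual billing may differ (rounding, minimum charges, or how cached tokens are counted), so treat the result as a rough comparison rather than an invoice figure.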

Dedicated Endpoint

Enterprise-Grade Infrastructure for AI

For enterprises that require higher performance, tailored SLAs, or private hosting for custom models:
  • Custom pricing
  • Guaranteed uptime & latency
  • Unlimited scale
  • Dedicated clusters