Large Language Models
Browse our supported open source models and deploy in dedicated endpoints
New
DeepSeek V3.1Chat$0.55/1.66 in/out MTokens | 163840 ContextNew
Qwen3 Coder 480B A35B InstructChat$0.64/2.5 in/out MTokens | 262144 ContextNew
OOpenAI GPT OSS 120BChat$0.1/0.5 in/out MTokens | 131072 ContextNew
ZGLM-4.5Chat$0.6/2.2 in/out MTokens | 131072 ContextMKimi K2 InstructChat$0.57/2.3 in/out MTokens | 131072 Context Hot
DeepSeek V3 0324Chat$0.28/1.14 in/out MTokens | 163840 ContextNew
Qwen3 235B A22B Instruct 2507Chat$0.15/0.8 in/out MTokens | 262144 ContextNew
Qwen3 235B A22b Thinking 2507Chat$0.3/3 in/out MTokens | 131072 ContextDedicated
DeepSeek R1 Distill Llama 8BChat$0.04/0.04 in/out MTokens | 32000 Context
Llama 3.1 8B InstructChat$0.02/0.05 in/out MTokens | 16384 ContextNew
ZGLM 4.5VChat$0.6/1.8 in/out MTokens | 65536 ContextNew
OOpenAI: GPT OSS 20BChat$0.05/0.2 in/out MTokens | 131072 Context
DeepSeek R1 Distill Qwen 14BChat$0.15/0.15 in/out MTokens | 64000 ContextHot
Llama 3.3 70B InstructChat$0.13/0.39 in/out MTokens | 131072 Context See All