Large Language Models

Browse our supported open-source models and deploy them on dedicated endpoints.

DeepSeek V3 0324 | Chat | $0.28/1.14 in/out MTokens | 163840 Context

Kimi K2 Instruct | Chat | $0.57/2.3 in/out MTokens | 131072 Context

Qwen3 Coder 480B A35B Instruct | Chat | $2/2 in/out MTokens | 262144 Context

DeepSeek R1 0528 | Chat | $0.7/2.5 in/out MTokens | 163840 Context

Qwen3 235B A22B Instruct 2507 | Chat | $0.15/0.8 in/out MTokens | 262144 Context

ERNIE 4.5 VL 424B A47B | Chat | $0.42/1.25 in/out MTokens | 123000 Context

ERNIE 4.5 300B A47B | Chat | $0.28/1.1 in/out MTokens | 123000 Context

Qwen3 30B A3B | Chat | $0.1/0.45 in/out MTokens | 40960 Context

MiniMax M1 | Chat | $0.55/2.2 in/out MTokens | 1000000 Context

Dedicated

DeepSeek R1 0528 Qwen3 8B | Chat | $0.06/0.09 in/out MTokens | 128000 Context

Qwen3 32B | Chat | $0.1/0.45 in/out MTokens | 40960 Context

Qwen2.5 VL 72B Instruct | Chat | $0.8/0.8 in/out MTokens | 32768 Context

Qwen3 235B A22B | Chat | $0.2/0.8 in/out MTokens | 40960 Context

DeepSeek V3 (Turbo) | Chat | $0.4/1.3 in/out MTokens | 64000 Context

GLM 4.1V 9B Thinking | Chat | $0.035/0.138 in/out MTokens | 65536 Context

Llama 4 Maverick Instruct | Chat | $0.17/0.85 in/out MTokens | 1048576 Context

Gemma 3 27B | Chat | $0.119/0.2 in/out MTokens | 32000 Context

DeepSeek R1 (Turbo) | Chat | $0.7/2.5 in/out MTokens | 64000 Context

L3 8B Stheno V3.2 | Chat | $0.05/0.05 in/out MTokens | 8192 Context

Mythomax L2 13B | Chat | $0.09/0.09 in/out MTokens | 4096 Context

DeepSeek Prover V2 671B | Chat | $0.7/2.5 in/out MTokens | 160000 Context

Dedicated

Llama 4 Scout Instruct | Chat | $0.1/0.5 in/out MTokens | 131072 Context

Dedicated

DeepSeek R1 Distill Llama 8B | Chat | $0.04/0.04 in/out MTokens | 32000 Context

Llama 3.1 8B Instruct | Chat | $0.02/0.05 in/out MTokens | 16384 Context

DeepSeek R1 Distill Qwen 14B | Chat | $0.15/0.15 in/out MTokens | 64000 Context

Llama 3.3 70B Instruct | Chat | $0.13/0.39 in/out MTokens | 131072 Context

Qwen 2.5 72B Instruct | Chat | $0.38/0.4 in/out MTokens | 32000 Context

Mistral Nemo | Chat | $0.04/0.17 in/out MTokens | 60288 Context

DeepSeek R1 Distill Qwen 32B | Chat | $0.3/0.3 in/out MTokens | 64000 Context

Llama 3 8B Instruct | Chat | $0.04/0.04 in/out MTokens | 8192 Context

Wizardlm 2 8x22B | Chat | $0.62/0.62 in/out MTokens | 65535 Context

DeepSeek R1 Distill Llama 70B | Chat | $0.8/0.8 in/out MTokens | 32000 Context

Mistral 7B Instruct | Chat | $0.029/0.059 in/out MTokens | 32768 Context

Llama3 70B Instruct | Chat | $0.51/0.74 in/out MTokens | 8192 Context

Hermes 2 Pro Llama 3 8B | Chat | $0.14/0.14 in/out MTokens | 8192 Context

L3 70B Euryale V2.1 | Chat | $1.48/1.48 in/out MTokens | 8192 Context

Dolphin Mixtral 8x22B | Chat | $0.9/0.9 in/out MTokens | 16000 Context

Midnight Rose 70B | Chat | $0.8/0.8 in/out MTokens | 4096 Context

Sao10k L3 8B Lunaris | Chat | $0.05/0.05 in/out MTokens | 8192 Context

FREE

ERNIE 4.5 VL 28B A3B | Chat | $0/0 in/out MTokens | 30000 Context

FREE

ERNIE 4.5 21B A3B | Chat | $0/0 in/out MTokens | 120000 Context

ERNIE 4.5 0.3B | Chat | $0/0 in/out MTokens | 120000 Context

Gemma3 1B IT | Chat | $0/0 in/out MTokens | 32768 Context

Dedicated

Qwen3 8B | Chat | $0.035/0.138 in/out MTokens | 128000 Context

Free | Dedicated

Qwen3 4B | Chat | $0/0 in/out MTokens | 128000 Context

GLM-4-32B-0414 | Chat | $0.24/0.24 in/out MTokens | 32000 Context

Free

Qwen2.5 7B Instruct | Chat | $0/0 in/out MTokens | 32000 Context

Free

Llama 3.2 1B Instruct | Chat | $0/0 in/out MTokens | 131000 Context

Llama 3.2 3B Instruct | Chat | $0.03/0.05 in/out MTokens | 32768 Context

L31 70B Euryale V2.2 | Chat | $1.48/1.48 in/out MTokens | 8192 Context
