MODEL APIS

Browse our supported open source models

Developer-first infrastructure that scales from zero to production.

Models (133)

New

Kimi K3

$3/MtInput

$0.3/MtCache Read

$15/MtOutput

1048576Context

1048576Max Output

LLMServerless

New

Hy3

$0.14/MtInput

$0.035/MtCache Read

$0.58/MtOutput

262144Context

262144Max Output

LLMServerless

New

GLM 5.2

$1.4/MtInput

$0.26/MtCache Read

$4.4/MtOutput

1048576Context

131072Max Output

LLMServerless

New

Kimi K2.7 Code

$0.95/MtInput

$0.19/MtCache Read

$4/MtOutput

262144Context

262144Max Output

LLMServerless

New

DeepSeek V4 Flash 0731

$0.14/MtInput

$0.028/MtCache Read

$0.28/MtOutput

1048576Context

393216Max Output

LLMServerless

New

TIME LIMITED FREE

Macaron V1 Venti

$0/MtInput

$0/MtOutput

1048576Context

131072Max Output

LLMServerless

New

MiniMax M3

$0.3/MtInput

$0.06/MtCache Read

$1.2/MtOutput

1000000Context

131072Max Output

LLMServerless

New

DeepSeek V4 Flash

$0.14/MtInput

$0.028/MtCache Read

$0.28/MtOutput

1048576Context

393216Max Output

LLMServerless

New

DeepSeek V4 Pro

$1.6/MtInput

$0.135/MtCache Read

$3.2/MtOutput

1048576Context

393216Max Output

LLMServerless

Hot

DeepSeek V3.2

$0.269/MtInput

$0.1345/MtCache Read

$0.4/MtOutput

163840Context

65536Max Output

LLMServerless

New

TIME LIMITED FREE

Ling-3.0-flash

$0/MtInput

$0/MtOutput

262144Context

32768Max Output

LLMServerless

New

TIME LIMITED FREE

Macaron V1 Tall

$0/MtInput

$0/MtOutput

262144Context

32768Max Output

LLMServerless

New

Step 3.7 Flash

$0.2/MtInput

$0.04/MtCache Read

$1.15/MtOutput

262144Context

256000Max Output

LLMServerless

New

Nemotron 3 Nano 30B A3B

$0.05/MtInput

$0.2/MtOutput

262144Context

32768Max Output

LLMServerless

CoBuddy

$0.28/MtInput

$0.07/MtCache Read

$1.13/MtOutput

131072Context

65536Max Output

LLMServerless

New

XiaomiMiMo/MiMo-V2.5

$0.168/MtInput

$0.0034/MtCache Read

$0.336/MtOutput

1048576Context

131072Max Output

LLMServerless

LIMITED TIME 50% OFF

Qwen3.7-Max

$1.25/MtInput

$0.25/MtCache Read

$3.75/MtOutput

1000000Context

65536Max Output

LLMServerless

New

XiaomiMiMo/MiMo-V2.5-Pro

$0.522/MtInput

$0.0043/MtCache Read

$1.044/MtOutput

1048576Context

131072Max Output

LLMServerless

New

Qwen3.6-27B

$0.6/MtInput

$3.6/MtOutput

262144Context

65536Max Output

LLMServerless

Kimi K2.6

$0.8/MtInput

$0.16/MtCache Read

$3.4/MtOutput

262144Context

262144Max Output

LLMServerless

GLM-5.1

$1.38/MtInput

$0.26/MtCache Read

$4.4/MtOutput

204800Context

131072Max Output

LLMServerless

Gemma 4 26B A4B

$0.13/MtInput

$0.4/MtOutput

262144Context

131072Max Output

LLMServerless

Gemma 4 31B

$0.14/MtInput

$0.4/MtOutput

262144Context

131072Max Output

LLMServerless

MiniMax M2.7

$0.3/MtInput

$0.06/MtCache Read

$1.2/MtOutput

204800Context

131072Max Output

LLMServerless

MiniMax M2.5-highspeed

$0.6/MtInput

$0.03/MtCache Read

$2.4/MtOutput

204800Context

131100Max Output

LLMServerless

Qwen3.5-27B

$0.3/MtInput

$2.4/MtOutput

262144Context

65536Max Output

LLMServerless

Qwen3.5-122B-A10B

$0.4/MtInput

$3.2/MtOutput

262144Context

65536Max Output

LLMServerless

Qwen3.5-35B-A3B

$0.25/MtInput

$2/MtOutput

262144Context

65536Max Output

LLMServerless

Qwen3.5-397B-A17B

$0.6/MtInput

$3.6/MtOutput

262144Context

65536Max Output

LLMServerless

MiniMax M2.5

$0.3/MtInput

$0.03/MtCache Read

$1.2/MtOutput

204800Context

131100Max Output

LLMServerless

GLM-5

$1/MtInput

$0.2/MtCache Read

$3.2/MtOutput

202800Context

131072Max Output

LLMServerless

Qwen3 Coder Next

$0.2/MtInput

$1.5/MtOutput

262144Context

65536Max Output

LLMServerless

DeepSeek-OCR 2

$0.03/MtInput

$0.03/MtOutput

8192Context

8192Max Output

LLMServerless

Kimi K2.5

$0.6/MtInput

$0.1/MtCache Read

$3/MtOutput

262144Context

262144Max Output

LLMServerless

GLM-4.7-Flash

$0.07/MtInput

$0.01/MtCache Read

$0.4/MtOutput

200000Context

128000Max Output

LLMServerless

MiniMax M2.1

$0.3/MtInput

$0.03/MtCache Read

$1.2/MtOutput

204800Context

131072Max Output

LLMServerless

GLM-4.7

$0.6/MtInput

$0.11/MtCache Read

$2.2/MtOutput

204800Context

131072Max Output

LLMServerless

AutoGLM-Phone-9B-Multilingual

$0.035/MtInput

$0.138/MtOutput

65536Context

65536Max Output

LLMServerless

Kimi K2 Thinking

$0.6/MtInput

$0.15/MtCache Read

$2.5/MtOutput

262144Context

100352Max Output

LLMServerless

MiniMax-M2

$0.3/MtInput

$0.03/MtCache Read

$1.2/MtOutput

204800Context

131072Max Output

LLMServerless

PaddleOCR-VL

$0.02/MtInput

$0.02/MtOutput

16384Context

16384Max Output

LLM

DeepSeek V3.2 Exp

$0.27/MtInput

$0.41/MtOutput

163840Context

65536Max Output

LLMServerless

Qwen3 VL 235B A22B Thinking

$0.98/MtInput

$3.95/MtOutput

131072Context

32768Max Output

LLMServerless

GLM 4.6V

$0.3/MtInput

$0.055/MtCache Read

$0.9/MtOutput

131072Context

32768Max Output

LLMServerless

GLM 4.6

$0.55/MtInput

$0.11/MtCache Read

$2.2/MtOutput

204800Context

131072Max Output

LLMServerless

New

Qwen3.6-35B-A3B

$0.248/MtInput

$1.485/MtOutput

262144Context

65536Max Output

LLMServerless

Kat Coder Pro

$0.3/MtInput

$0.06/MtCache Read

$1.2/MtOutput

256000Context

128000Max Output

LLMServerless

Qwen3 Next 80B A3B Instruct

$0.15/MtInput

$1.5/MtOutput

131072Context

32768Max Output

LLMServerless

DeepSeek-OCR

$0.03/MtInput

$0.03/MtOutput

8192Context

8192Max Output

LLM

DeepSeek V3.1 Terminus

$0.27/MtInput

$0.135/MtCache Read

$1/MtOutput

131072Context

32768Max Output

LLMServerless

Qwen3 VL 235B A22B Instruct

$0.3/MtInput

$1.5/MtOutput

131072Context

32768Max Output

LLMServerless

Qwen3 Max

$2.11/MtInput

$8.45/MtOutput

262144Context

65536Max Output

Partner

LLMServerless

DeepSeek V3.1

$0.27/MtInput

$0.135/MtCache Read

$1/MtOutput

131072Context

32768Max Output

LLMServerless

Kimi K2 0905

$0.6/MtInput

$2.5/MtOutput

262144Context

100352Max Output

LLMServerless

Qwen3 Coder 480B A35B Instruct

$0.38/MtInput

$1.55/MtOutput

262144Context

65536Max Output

LLMServerless

Qwen3 Coder 30B A3B Instruct

$0.07/MtInput

$0.27/MtOutput

160000Context

32768Max Output

LLMServerless

OpenAI GPT OSS 120B

$0.05/MtInput

$0.25/MtOutput

131072Context

32768Max Output

LLMServerless

Kimi K2 Instruct

$0.57/MtInput

$2.3/MtOutput

131072Context

100352Max Output

LLMServerless

Hot

DeepSeek V3 0324

$0.27/MtInput

$0.135/MtCache Read

$1.12/MtOutput

163840Context

65536Max Output

LLMServerless

Qwen3 235B A22B Thinking 2507

$0.3/MtInput

$3/MtOutput

131072Context

32768Max Output

LLMServerless

Llama 3.1 8B Instruct

$0.02/MtInput

$0.05/MtOutput

16384Context

16384Max Output

LLMServerless

Gemma3 12B

$0.05/MtInput

$0.1/MtOutput

131072Context

8192Max Output

LLM

GLM 4.5V

$0.6/MtInput

$0.11/MtCache Read

$1.8/MtOutput

65536Context

16384Max Output

LLMServerless

OpenAI GPT OSS 20B

$0.04/MtInput

$0.15/MtOutput

131072Context

32768Max Output

LLMServerless

Qwen3 235B A22B Instruct 2507

$0.09/MtInput

$0.58/MtOutput

131072Context

16384Max Output

LLMServerless

Llama 3.3 70B Instruct

$0.135/MtInput

$0.4/MtOutput

6000Context

120000Max Output

LLMServerless

Qwen 2.5 72B Instruct

$0.38/MtInput

$0.4/MtOutput

32000Context

8192Max Output

LLMServerless

Mistral Nemo

$0.04/MtInput

$0.17/MtOutput

60288Context

16000Max Output

LLMServerless

MiniMax M1

$0.55/MtInput

$2.2/MtOutput

1000000Context

40000Max Output

LLMServerless

DeepSeek R1 0528

$0.7/MtInput

$0.35/MtCache Read

$2.5/MtOutput

163840Context

32768Max Output

LLMServerless

Wizardlm 2 8x22B

$0.62/MtInput

$0.62/MtOutput

65535Context

8000Max Output

LLMServerless

Dedicated

DeepSeek R1 0528 Qwen3 8B

$0.06/MtInput

$0.09/MtOutput

128000Context

32000Max Output

LLM

DeepSeek R1 Distill Llama 70B

$0.8/MtInput

$0.8/MtOutput

8192Context

8192Max Output

LLMServerless

Qwen3 235B A22B

$0.2/MtInput

$0.8/MtOutput

40960Context

20000Max Output

LLMServerless

Llama 4 Maverick Instruct

$0.27/MtInput

$0.85/MtOutput

1048576Context

8192Max Output

LLMServerless

Dedicated

Llama 4 Scout Instruct

$0.18/MtInput

$0.59/MtOutput

131072Context

131072Max Output

LLMServerless

Hermes 2 Pro Llama 3 8B

$0.14/MtInput

$0.14/MtOutput

8192Context

8192Max Output

LLM

L3 70B Euryale V2.1

$1.48/MtInput

$1.48/MtOutput

8192Context

8192Max Output

LLM

Sao10k L3 8B Lunaris

$0.05/MtInput

$0.05/MtOutput

8192Context

8192Max Output

LLMServerless

BaiChuan M2 32B

$0.07/MtInput

$0.07/MtOutput

131072Context

131072Max Output

LLM

ERNIE 4.5 VL 424B A47B

$0.42/MtInput

$1.25/MtOutput

123000Context

16000Max Output

LLMServerless

Gemma 3 27B

$0.119/MtInput

$0.2/MtOutput

98304Context

16384Max Output

LLMServerless

DeepSeek V3 (Turbo)

$0.4/MtInput

$1.3/MtOutput

64000Context

16000Max Output

LLMServerless

DeepSeek R1 (Turbo)

$0.7/MtInput

$2.5/MtOutput

64000Context

16000Max Output

LLMServerless

L3 8B Stheno V3.2

$0.05/MtInput

$0.05/MtOutput

8192Context

32000Max Output

LLMServerless

Mythomax L2 13B

$0.09/MtInput

$0.09/MtOutput

4096Context

3200Max Output

LLM

Ring-2.6-1T

$0.3/MtInput

$0.06/MtCache Read

$2.5/MtOutput

262144Context

65536Max Output

LLMServerless

Ling-2.6-flash

$0.1/MtInput

$0.02/MtCache Read

$0.3/MtOutput

262144Context

32768Max Output

LLMServerless

Ling-2.6-1T

$0.3/MtInput

$0.06/MtCache Read

$2.5/MtOutput

262144Context

32768Max Output

LLMServerless

zai-org/glm-4.5-air

$0.13/MtInput

$0.025/MtCache Read

$0.85/MtOutput

131072Context

98304Max Output

LLMServerless

qwen/qwen3-vl-30b-a3b-instruct

$0.2/MtInput

$0.7/MtOutput

131072Context

32768Max Output

LLMServerless

Qwen3 Omni 30B A3B Thinking

$0.25/MtInput

$0.97/MtOutput

65536Context

16384Max Output

LLMServerless

Qwen3 Omni 30B A3B Instruct

$0.25/MtInput

$0.97/MtOutput

65536Context

16384Max Output

LLMServerless

Qwen MT Plus

$0.25/MtInput

$0.75/MtOutput

16384Context

8192Max Output

LLMServerless

ERNIE 4.5 21B A3B

$0.07/MtInput

$0.28/MtOutput

120000Context

8000Max Output

LLMServerless

Llama 3.2 3B Instruct

$0.03/MtInput

$0.05/MtOutput

32768Context

32000Max Output

LLM

L31 70B Euryale V2.2

$1.48/MtInput

$1.48/MtOutput

8192Context

8192Max Output

LLMServerless

qwen/qwen3-embedding-0.6b

$0.07/MtInput

$0/MtOutput

32768Context

32768Max Output

Embedding

Qwen3 Embedding 8B

$0.07/MtInput

$0/MtOutput

32768Context

4096Max Output

Embedding

BAAI:BGE-M3

$0.01/MtInput

$0.01/MtOutput

8192Context

96000Max Output

Embedding

Qwen3 Reranker 8B

$0.05/MtInput

$0.05/MtOutput

32768Context

4096Max Output

Reranker

baai/bge-reranker-v2-m3

$0.01/MtInput

$0.01/MtOutput

8000Context

8000Max Output

Reranker

Seedream 4.0

$0.03/image

Text to Image

Image to Image

Qwen-Image Text to Image

$0.02/image

Text to Image

New

Qwen-Image Edit

$0.02/image

Image Edit

Flux.1 Kontext Dev

$0.0225/image

Image to Image

Flux.1 Kontext Pro

$0.36/image

Image to Image

Flux.1 Kontext Max

$0.072/image

Image to Image

New

MiniMax speech-2.6-turbo

$60 / 1M characters

Text to Speech

New

MiniMax speech-2.6-hd

$100 / 1M characters

Text to Speech

MiniMax Voice-Cloning

$1.5 / voice

Voice Cloning

Fish Audio Text to Speech

$15 / 1M characters

Text to Speech

Fish Audio Voice Cloning

$0.1 / voice

Voice Cloning

Text to Speech

$15 / 1M characters

Text to Speech

New

MiniMax Hailuo 2.3 Text to Video

$0.49/video1080P 6s

Text to Video

New

MiniMax Hailuo 2.3 Image to Video

$0.49/video1080P 6s

Image to Video

New

MiniMax Hailuo 2.3 Fast Image to Video

$0.33/video1080P 6s

Image to Video

Kling V1.6 Text to Video

$0.27/video720P 5s

Text to Video

Kling V1.6 Image to Video

$0.46/video1080P Pro 5s

Image to Video

New

Kling V2.5 Turbo Text to Video

$0.35/video1080P 5s

Text to Video

New

Kling V2.5 Turbo Image to Video

$0.35/video1080P 5s

Image to Video

New

Vidu Q2

$0.2677/video1080P 5s

Text to Video

Reference to Video

New

Vidu Q2 Pro

$0.5135/video1080P 5s

Image to Video

New

Vidu Q2 Pro Fast

$0.143/video1080P 5s

Image to Video

New

Vidu Q2 Turbo

$0.3347/video1080P 5s

Image to Video

New

Vidu Q2 Template

-Template

Template to Video

New

Vidu Q3 Pro Start-End-to-Video

$0.0714/s1080P Off-Peak

Start-End Frame to Video

New

Vidu Q3 Turbo Text-to-Video

$0.0357/s1080P Off-Peak

Text to Video

New

Vidu Q3 Turbo Image-to-Video

$0.0357/s1080P Off-Peak

Image to Video

New

Vidu Q3 Turbo Start-End-to-Video

$0.0357/s1080P Off-Peak

Start-End Frame to Video

Video Merge Face

-SVD 5 steps

Video Edit

EXA

$0.007/request

Tavily

$0.008/request

Everything you need to build production AI.

200+ models, on-demand GPUs, and secure agent runtimes — unified under one API. Free to start, scales as you grow.
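
The per-token prices in the listing can be turned into a rough per-request estimate. The sketch below is illustrative only and rests on two assumptions that the listing does not state: that "Mt" means one million tokens, and that cached input tokens are billed at the Cache Read rate in place of the Input rate. The names `Price` and `estimate_cost` are hypothetical helpers, not part of any official SDK; the price figures are copied from the Kimi K3 and Hy3 entries.

```python
from dataclasses import dataclass
from typing import Optional

TOKENS_PER_MT = 1_000_000  # assumption: "Mt" in the listing means one million tokens


@dataclass(frozen=True)
class Price:
    """Per-million-token prices in USD, copied from the listing."""
    input: float
    output: float
    cache_read: Optional[float] = None  # None when no Cache Read price is listed


def estimate_cost(price: Price, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: cached_tokens is the part of input_tokens served from cache
    and is billed at the Cache Read rate; the rest is billed at the Input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    # Fall back to the Input rate when the listing shows no Cache Read price.
    cache_rate = price.input if price.cache_read is None else price.cache_read
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * price.input
             + cached_tokens * cache_rate
             + output_tokens * price.output)
    return total / TOKENS_PER_MT


# Prices taken from the listing entries for Kimi K3 and Hy3.
KIMI_K3 = Price(input=3.0, output=15.0, cache_read=0.3)
HY3 = Price(input=0.14, output=0.58, cache_read=0.035)

if __name__ == "__main__":
    for name, price in [("Kimi K3", KIMI_K3), ("Hy3", HY3)]:
        cost = estimate_cost(price, input_tokens=200_000,
                             output_tokens=10_000, cached_tokens=150_000)
        print(f"{name}: ${cost:.4f}")
```

Actual billing may round or meter tokens differently, so treat the result as an order-of-magnitude comparison between models, not as an invoice figure.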