Pricing

Explore pricing for our Model APIs and GPU resources. Find the right plan to match your needs with transparent rates and flexible options.

Batch inference is available at an introductory 50% discount on input and output tokens for supported models.Learn more

deepseekDeepseek

Advanced AI models from DeepSeek, offering cutting-edge reasoning capabilities and competitive pricing for enterprise and research applications.

Model NameContextInputOutputActions
deepseek/deepseek-v3.2-exp163,840$0.27 /Mt$0.41 /MtMore
deepseek/deepseek-ocr8,192$0.03 /Mt$0.03 /MtMore
deepseek/deepseek-v3.1-terminus131,072$0.27 /Mt$1 /MtMore
deepseek/deepseek-v3.1131,072$0.27 /Mt$1 /MtMore
deepseek/deepseek-v3-0324163,840$0.27 /Mt · Cache Read $0.135 /Mt$1.12 /MtMore
deepseek/deepseek-r1-distill-qwen-14b32,768$0.15 /Mt$0.15 /MtMore
deepseek/deepseek-r1-0528163,840$0.7 /Mt · Cache Read $0.35 /Mt$2.5 /MtMore
deepseek/deepseek-r1-distill-qwen-32b64,000$0.3 /Mt$0.3 /MtMore
deepseek/deepseek-r1-0528-qwen3-8b128,000$0.06 /Mt$0.09 /MtMore
deepseek/deepseek-r1-distill-llama-70b8,192$0.8 /Mt$0.8 /MtMore
deepseek/deepseek-prover-v2-671b160,000$0.7 /Mt$2.5 /MtMore
deepseek/deepseek-v3-turbo64,000$0.4 /Mt$1.3 /MtMore
deepseek/deepseek-r1-turbo64,000$0.7 /Mt$2.5 /MtMore

meta-llamaLlama

Meta's Llama models providing state-of-the-art language understanding with open architecture designed for diverse applications.

Model NameContextInputOutputActions
meta-llama/llama-3.1-8b-instruct16,384$0.02 /Mt$0.05 /MtMore
meta-llama/llama-3.3-70b-instruct131,072$0.13 /Mt$0.39 /MtMore
meta-llama/llama-3-8b-instruct8,192$0.04 /Mt$0.04 /MtMore
meta-llama/llama-3-70b-instruct8,192$0.51 /Mt$0.74 /MtMore
meta-llama/llama-4-maverick-17b-128e-instruct-fp81,048,576$0.17 /Mt$0.85 /MtMore
meta-llama/llama-4-scout-17b-16e-instruct131,072$0.1 /Mt$0.5 /MtMore
meta-llama/llama-3.2-3b-instruct32,768$0.03 /Mt$0.05 /MtMore

qwenQwen

Qwen series models offering efficient language processing with various parameter sizes, from lightweight to enterprise-grade solutions.

Model NameContextInputOutputActions
qwen/qwen3-max262,144-
Tiered pricing
More
qwen/qwen3-vl-235b-a22b-thinking131,072$0.98 /Mt$3.95 /MtMore
qwen/qwen3-next-80b-a3b-instruct131,072$0.15 /Mt$1.5 /MtMore
qwen/qwen3-next-80b-a3b-thinking131,072$0.15 /Mt$1.5 /MtMore
qwen/qwen3-vl-235b-a22b-instruct131,072$0.3 /Mt$1.5 /MtMore
qwen/qwen3-coder-480b-a35b-instruct262,144$0.29 /Mt$1.2 /MtMore
qwen/qwen3-coder-30b-a3b-instruct262,144$0.07 /Mt$0.27 /MtMore
qwen/qwen3-235b-a22b-thinking-2507131,072$0.3 /Mt$3 /MtMore
qwen/qwen3-235b-a22b-instruct-2507131,072$0.09 /Mt$0.58 /MtMore
qwen/qwen-2.5-72b-instruct32,000$0.38 /Mt$0.4 /MtMore
qwen/qwen3-235b-a22b-fp840,960$0.2 /Mt$0.8 /MtMore
qwen/qwen2.5-vl-72b-instruct32,768$0.8 /Mt$0.8 /MtMore
qwen/qwen3-32b-fp840,960$0.1 /Mt$0.45 /MtMore
qwen/qwen3-30b-a3b-fp840,960$0.09 /Mt$0.45 /MtMore
qwen/qwen3-vl-8b-instruct131,072$0.08 /Mt$0.5 /MtMore
qwen/qwen3-vl-30b-a3b-instruct131,072$0.2 /Mt$0.7 /MtMore
qwen/qwen3-vl-30b-a3b-thinking131,072$0.2 /Mt$1 /MtMore
qwen/qwen-mt-plus4,096$0.25 /Mt$0.75 /MtMore
qwen/qwen3-8b-fp8128,000$0.035 /Mt$0.138 /MtMore
qwen/qwen3-4b-fp8128,000$0.03 /Mt$0.03 /MtMore
qwen/qwen2.5-7b-instruct32,000$0.07 /Mt$0.07 /MtMore

Wenxin
Baidu

Baidu's ERNIE models providing advanced Chinese language understanding and multimodal capabilities, optimized for Chinese applications with competitive pricing.

Model NameContextInputOutputActions
baidu/ernie-4.5-21B-a3b-thinking131,072$0.07 /Mt$0.28 /MtMore
baidu/ernie-4.5-vl-424b-a47b123,000$0.42 /Mt$1.25 /MtMore
baidu/ernie-4.5-300b-a47b-paddle123,000$0.28 /Mt$1.1 /MtMore
baidu/ernie-4.5-vl-28b-a3b30,000$0.14 /Mt$0.56 /MtMore
baidu/ernie-4.5-21B-a3b120,000$0.07 /Mt$0.28 /MtMore

googleGemma

Google's Gemma models offering high-quality language processing with excellent performance for various NLP tasks.

Model NameContextInputOutputActions
google/gemma-3-12b-it131,072$0.05 /Mt$0.1 /MtMore
google/gemma-3-27b-it98,304$0.119 /Mt$0.2 /MtMore

TTHUDM

GLM series models from Tsinghua University, featuring advanced Chinese language understanding and generation capabilities.

Model NameContextInputOutputActions
zai-org/glm-4.6204,800$0.6 /Mt · Cache Read $0.11 /Mt$2.2 /MtMore
zai-org/glm-4.5131,072$0.6 /Mt · Cache Read $0.11 /Mt$2.2 /MtMore
zai-org/glm-4.5v65,536$0.6 /Mt · Cache Read $0.11 /Mt$1.8 /MtMore
thudm/glm-4.1v-9b-thinking65,536$0.035 /Mt$0.138 /MtMore
zai-org/glm-4.5-air131,072$0.13 /Mt$0.85 /MtMore

Sao10KSao10K

Specialized fine-tuned models optimized for creative and roleplay applications with enhanced storytelling capabilities.

Model NameContextInputOutputActions
sao10k/l3-70b-euryale-v2.18,192$1.48 /Mt$1.48 /MtMore
sao10k/l3-8b-lunaris8,192$0.05 /Mt$0.05 /MtMore
Sao10K/L3-8B-Stheno-v3.28,192$0.05 /Mt$0.05 /MtMore
sao10k/l31-70b-euryale-v2.28,192$1.48 /Mt$1.48 /MtMore

mistralaiMistralai

Efficient and powerful language models from Mistral AI, designed for both commercial and open-source applications.

Model NameContextInputOutputActions
mistralai/mistral-nemo60,288$0.04 /Mt$0.17 /MtMore

nousresearchNousresearch

Research-focused AI models designed for advanced reasoning and enhanced instruction following capabilities.

Model NameContextInputOutputActions
nousresearch/hermes-2-pro-llama-3-8b8,192$0.14 /Mt$0.14 /MtMore

grypheGryphe

Innovative AI models from Gryphe, delivering specialized language understanding with a focus on efficiency and adaptability for niche applications.

Model NameContextInputOutputActions
gryphe/mythomax-l2-13b4,096$0.09 /Mt$0.09 /MtMore

minimaxaiMiniMax

Minimax AI's advanced language models delivering robust conversational AI capabilities with optimized performance for customer service, content generation, and creative applications, featuring strong multilingual support and enterprise-ready scalability.

Model NameContextInputOutputActions
minimaxai/minimax-m1-80k1,000,000$0.55 /Mt$2.2 /MtMore

microsoftMicrosoft

Microsoft's models deliver state-of-the-art multilingual capabilities with enterprise-grade security, seamlessly integrated into the Azure cloud ecosystem. Optimized for cross-platform collaboration and business intelligence, these models excel in document understanding and generation—key strengths for Microsoft 365 workflows.

Model NameContextInputOutputActions
microsoft/wizardlm-2-8x22b65,535$0.62 /Mt$0.62 /MtMore

OOther

--

Model NameContextInputOutputActions
moonshotai/kimi-k2-thinking262,144$0.6 /Mt$2.5 /MtMore
paddlepaddle/paddleocr-vl16,384$0.02 /Mt$0.02 /MtMore
kat-coder256,000-
Tiered pricing
More
moonshotai/kimi-k2-0905262,144$0.6 /Mt$2.5 /MtMore
baichuan/baichuan-m2-32b131,072$0.07 /Mt$0.07 /MtMore

MMixture of Expert

Premium collection of state-of-the-art AI models featuring advanced reasoning, mathematical proof capabilities, and cutting-edge language understanding across multiple domains.

Model NameContextInputOutputActions
minimax/minimax-m2204,800$0.3 /Mt$1.2 /MtMore
deepseek/deepseek-v3.2-exp163,840$0.27 /Mt$0.41 /MtMore
zai-org/glm-4.6204,800$0.6 /Mt · Cache Read $0.11 /Mt$2.2 /MtMore
deepseek/deepseek-v3.1-terminus131,072$0.27 /Mt$1 /MtMore
deepseek/deepseek-v3.1131,072$0.27 /Mt$1 /MtMore
qwen/qwen3-coder-30b-a3b-instruct262,144$0.07 /Mt$0.27 /MtMore
openai/gpt-oss-120b131,072$0.05 /Mt$0.25 /MtMore
moonshotai/kimi-k2-instruct131,072$0.57 /Mt$2.3 /MtMore
deepseek/deepseek-v3-0324163,840$0.27 /Mt · Cache Read $0.135 /Mt$1.12 /MtMore
zai-org/glm-4.5131,072$0.6 /Mt · Cache Read $0.11 /Mt$2.2 /MtMore
qwen/qwen3-235b-a22b-thinking-2507131,072$0.3 /Mt$3 /MtMore
zai-org/glm-4.5v65,536$0.6 /Mt · Cache Read $0.11 /Mt$1.8 /MtMore
openai/gpt-oss-20b131,072$0.04 /Mt$0.15 /MtMore
minimaxai/minimax-m1-80k1,000,000$0.55 /Mt$2.2 /MtMore
deepseek/deepseek-r1-0528163,840$0.7 /Mt · Cache Read $0.35 /Mt$2.5 /MtMore
qwen/qwen3-235b-a22b-fp840,960$0.2 /Mt$0.8 /MtMore
meta-llama/llama-4-maverick-17b-128e-instruct-fp81,048,576$0.17 /Mt$0.85 /MtMore
meta-llama/llama-4-scout-17b-16e-instruct131,072$0.1 /Mt$0.5 /MtMore
baidu/ernie-4.5-21B-a3b-thinking131,072$0.07 /Mt$0.28 /MtMore
baidu/ernie-4.5-vl-424b-a47b123,000$0.42 /Mt$1.25 /MtMore
baidu/ernie-4.5-300b-a47b-paddle123,000$0.28 /Mt$1.1 /MtMore
deepseek/deepseek-prover-v2-671b160,000$0.7 /Mt$2.5 /MtMore
qwen/qwen3-32b-fp840,960$0.1 /Mt$0.45 /MtMore
qwen/qwen3-30b-a3b-fp840,960$0.09 /Mt$0.45 /MtMore
deepseek/deepseek-v3-turbo64,000$0.4 /Mt$1.3 /MtMore
deepseek/deepseek-r1-turbo64,000$0.7 /Mt$2.5 /MtMore
baidu/ernie-4.5-vl-28b-a3b30,000$0.14 /Mt$0.56 /MtMore
baidu/ernie-4.5-21B-a3b120,000$0.07 /Mt$0.28 /MtMore
Embeddings

Image

Pricing may vary based on image dimensions, inference steps, and upscaling factors. Use thePricing Calculatorfor an estimate.

API NameWidth&HeightSteps/ScalePricing
Text to Image512*5125$0.001 /image
Image to Image512*5125$0.001 /image
Remove Background--$0.017 /image
Replace Background--$0.0255 /image
Inpainting512*5125$0.0015 /image
Remove Text--$0.017 /image
Cleanup--$0.017 /image
Merge Face--$0.0255 /image
API NameModeWidth&HeightPricing
Seedream 3.0 Text to Image--$0.03 /image
Seedream 4.0--$0.03 /image
Hunyuan Image 3--$0.1 /image
Qwen-Image Text to Image--$0.02 /image
Qwen-Image Edit--$0.02 /image
Flux.1 Kontext Dev--$0.0225 /image
fast_mode-$0.018 /image
Flux.1 Kontext Pro--$0.036 /image
Flux.1 Kontext Max--$0.072 /image

Video

Pricing may vary based on the number of frames, chosen model, and inference steps. Use thePricing Calculatorfor an estimate.

API NameTotal FramesStepsPricing
Text to Video3220$0.0307 /video
API NameModeDurationResolutionPricing
Kling V1.6 Text to VideoStandard5s720P$0.27 /video
Standard10s720P$0.54 /video
Kling V1.6 Image to VideoStandard5s720P$0.27 /video
Standard10s720P$0.54 /video
Professional5s1080P$0.46 /video
Professional10s1080P$0.92 /video
Kling V2.1 Master Text to VideoMaster5s1080P$1.17 /video
Master10s1080P$2.34 /video
Kling V2.1 Image to VideoStandard5s720P$0.23 /video
Standard10s720P$0.45 /video
Professional5s1080P$0.43 /video
Professional10s1080P$0.81 /video
Kling V2.1 Master Image to VideoMaster5s1080P$1.17 /video
Master10s1080P$2.34 /video
Kling V2.5 Turbo Text to Video-5s1080P$0.35 /video
-10s1080P$0.70 /video
Kling V2.5 Turbo Image to Video-5s1080P$0.35 /video
-10s1080P$0.70 /video
MiniMax Video 01-6s720P$0.40 /video
MiniMax Video 02-6s768P$0.25 /video
-10s768P$0.50 /video
-6s1080P$0.44 /video
Minimax Hailuo 2.3 Text to Video-6s768P$0.28 /video
-10s768P$0.56 /video
-6s1080P$0.49 /video
Minimax Hailuo 2.3 Image to Video-6s768P$0.28 /video
-10s768P$0.56 /video
-6s1080P$0.49 /video
Minimax Hailuo 2.3 Fast Image to Video-6s768P$0.19 /video
-10s768P$0.32 /video
-6s1080P$0.33 /video
Hunyuan Video Fast-5s1280*720 | 720*1280$0.30 /video
Wan 2.1 Text to Video-5s1280*720 | 720*1280$0.30 /video
-5s832*480 | 480*832$0.20 /video
fast_mode5s1280*720 | 720*1280$0.225 /video
fast_mode5s832*480 | 480*832$0.125 /video
Wan 2.1 Image to Video-5s1280*720 | 720*1280$0.30 /video
-5s832*480 | 480*832$0.20 /video
fast_mode5s1280*720 | 720*1280$0.225 /video
fast_mode5s832*480 | 480*832$0.125 /video
Wan 2.2 Text to Video-5s480P$0.09 /video
LoRA5s480P$0.18 /video
-8s480P$0.288 /video
LoRA8s480P$0.288 /video
-5s720P$0.27 /video
LoRA5s720P$0.315 /video
-8s720P$0.432 /video
LoRA8s720P$0.504 /video
-5s1080P$0.40 /video
Wan 2.2 Image to Video-5s480P$0.09 /video
LoRA5s480P$0.18 /video
-8s480P$0.288 /video
LoRA8s480P$0.288 /video
-5s720P$0.27 /video
LoRA5s720P$0.315 /video
-8s720P$0.432 /video
LoRA8s720P$0.504 /video
-5s1080P$0.40 /video
Wan 2.5 Text to Video-5s480P$0.25 /video
-10s480P$0.50 /video
-5s720P$0.50 /video
-10s720P$1.00 /video
-5s1080P$0.75 /video
-10s1080P$1.50 /video
Wan 2.5 Image to Video-5s480P$0.25 /video
-10s480P$0.50 /video
-5s720P$0.50 /video
-10s720P$1.00 /video
-5s1080P$0.75 /video
-10s1080P$1.50 /video
Vidu Q1 Text to Videogeneral style5s1080P$0.36 /video
anime style5s1080P$0.36 /video
Vidu Q1 Image to Video-5s1080P$0.36 /video
Vidu Q1 Start End to Video-5s1080P$0.36 /video
Vidu Q1 Reference to Video-5s1080P$0.36 /video
Vidu 2.0 Image to Video-4s360P$0.09 /video
-4s720P$0.18 /video
-4s1080P$0.27 /video
-8s720P$0.27 /video
Vidu 2.0 Reference to Video-4s360P$0.09 /video
-4s720P$0.18 /video
Vidu 2.0 Start End to Video-4s360P$0.09 /video
-4s720P$0.18 /video
-4s1080P$0.27 /video
-8s720P$0.27 /video
PixVerse V4.5 Text to Video-5s360P$0.25 /video
-5s540P$0.25 /video
-5s720P$0.35 /video
-5s1080P$0.70 /video
fast_mode5s360P$0.50 /video
fast_mode5s540P$0.50 /video
fast_mode5s720P$0.70 /video
PixVerse V4.5 Image to Video-5s360P$0.25 /video
-5s540P$0.25 /video
-5s720P$0.35 /video
-5s1080P$0.70 /video
fast_mode5s360P$0.50 /video
fast_mode5s540P$0.50 /video
fast_mode5s720P$0.70 /video
Seedance V1 Lite Text to Video-5s480P( 21:9 & 9:21 )$0.08 /video
-5s480P( 16:9 & 9:16 )$0.09 /video
-5s480P( 4:3 & 3:4 )$0.08 /video
-5s480P( 1:1 )$0.09 /video
-5s720P( 21:9 & 9:21 )$0.20 /video
-5s720P( 16:9 & 9:16 )$0.19 /video
-5s720P( 4:3 & 3:4 )$0.20 /video
-5s720P( 1:1 )$0.19 /video
-5s1080P( 21:9 & 9:21 )$0.43 /video
-5s1080P( 16:9 & 9:16 )$0.44 /video
-5s1080P( 4:3 & 3:4 )$0.44 /video
-5s1080P( 1:1 )$0.44 /video
-10s480P( 21:9 & 9:21 )$0.17 /video
-10s480P( 16:9 & 9:16 )$0.17 /video
-10s480P( 4:3 & 3:4 )$0.17 /video
-10s480P( 1:1 )$0.17 /video
-10s720P( 21:9 & 9:21 )$0.41 /video
-10s720P( 16:9 & 9:16 )$0.37 /video
-10s720P( 4:3 & 3:4 )$0.39 /video
-10s720P( 1:1 )$0.39 /video
-10s1080P( 21:9 & 9:21 )$0.85 /video
-10s1080P( 16:9 & 9:16 )$0.88 /video
-10s1080P( 4:3 & 3:4 )$0.88 /video
-10s1080P( 1:1 )$0.87 /video
Seedance V1 Lite Image to Video-5s480P( 21:9 & 9:21 )$0.08 /video
-5s480P( 16:9 & 9:16 )$0.09 /video
-5s480P( 4:3 & 3:4 )$0.08 /video
-5s480P( 1:1 )$0.09 /video
-5s720P( 21:9 & 9:21 )$0.20 /video
-5s720P( 16:9 & 9:16 )$0.19 /video
-5s720P( 4:3 & 3:4 )$0.20 /video
-5s720P( 1:1 )$0.19 /video
-5s1080P( 21:9 & 9:21 )$0.43 /video
-5s1080P( 16:9 & 9:16 )$0.44 /video
-5s1080P( 4:3 & 3:4 )$0.44 /video
-5s1080P( 1:1 )$0.44 /video
-10s480P( 21:9 & 9:21 )$0.17 /video
-10s480P( 16:9 & 9:16 )$0.17 /video
-10s480P( 4:3 & 3:4 )$0.17 /video
-10s480P( 1:1 )$0.17 /video
-10s720P( 21:9 & 9:21 )$0.41 /video
-10s720P( 16:9 & 9:16 )$0.37 /video
-10s720P( 4:3 & 3:4 )$0.39 /video
-10s720P( 1:1 )$0.39 /video
-10s1080P( 21:9 & 9:21 )$0.85 /video
-10s1080P( 16:9 & 9:16 )$0.88 /video
-10s1080P( 4:3 & 3:4 )$0.88 /video
-10s1080P( 1:1 )$0.87 /video
Seedance V1 Pro Text to Video-5s480P( 21:9 & 9:21 )$0.12 /video
-5s480P( 16:9 & 9:16 )$0.12 /video
-5s480P( 4:3 & 3:4 )$0.12 /video
-5s480P( 1:1 )$0.12 /video
-5s720P( 21:9 & 9:21 )$0.28 /video
-5s720P( 16:9 & 9:16 )$0.26 /video
-5s720P( 4:3 & 3:4 )$0.27 /video
-5s720P( 1:1 )$0.27 /video
-5s1080P( 21:9 & 9:21 )$0.59 /video
-5s1080P( 16:9 & 9:16 )$0.61 /video
-5s1080P( 4:3 & 3:4 )$0.61 /video
-5s1080P( 1:1 )$0.61 /video
-10s480P( 21:9 & 9:21 )$0.23 /video
-10s480P( 16:9 & 9:16 )$0.24 /video
-10s480P( 4:3 & 3:4 )$0.23 /video
-10s480P( 1:1 )$0.24 /video
-10s720P( 21:9 & 9:21 )$0.56 /video
-10s720P( 16:9 & 9:16 )$0.51 /video
-10s720P( 4:3 & 3:4 )$0.55 /video
-10s720P( 1:1 )$0.54 /video
-10s1080P( 21:9 & 9:21 )$1.18 /video
-10s1080P( 16:9 & 9:16 )$1.22 /video
-10s1080P( 4:3 & 3:4 )$1.22 /video
-10s1080P( 1:1 )$1.22 /video
Seedance V1 Pro Image to Video-5s480P( 21:9 & 9:21 )$0.12 /video
-5s480P( 16:9 & 9:16 )$0.12 /video
-5s480P( 4:3 & 3:4 )$0.12 /video
-5s480P( 1:1 )$0.12 /video
-5s720P( 21:9 & 9:21 )$0.28 /video
-5s720P( 16:9 & 9:16 )$0.26 /video
-5s720P( 4:3 & 3:4 )$0.27 /video
-5s720P( 1:1 )$0.27 /video
-5s1080P( 21:9 & 9:21 )$0.59 /video
-5s1080P( 16:9 & 9:16 )$0.61 /video
-5s1080P( 4:3 & 3:4 )$0.61 /video
-5s1080P( 1:1 )$0.61 /video
-10s480P( 21:9 & 9:21 )$0.23 /video
-10s480P( 16:9 & 9:16 )$0.24 /video
-10s480P( 4:3 & 3:4 )$0.23 /video
-10s480P( 1:1 )$0.24 /video
-10s720P( 21:9 & 9:21 )$0.56 /video
-10s720P( 16:9 & 9:16 )$0.51 /video
-10s720P( 4:3 & 3:4 )$0.55 /video
-10s720P( 1:1 )$0.54 /video
-10s1080P( 21:9 & 9:21 )$1.18 /video
-10s1080P( 16:9 & 9:16 )$1.22 /video
-10s1080P( 4:3 & 3:4 )$1.22 /video
-10s1080P( 1:1 )$1.22 /video
API NameModelStepsPricing
Image to VideoSVD-XT20$0.024 /video
SVD20$0.0134 /video

Audio

API NameModePricing
Text to Speech-$15 /1M characters
MiniMax speech-02-hdT2A / T2A Async$80 /1M characters
MiniMax speech-02-turboT2A / T2A Async$48 /1M characters
MiniMax speech-2.5-hd-previewT2A / T2A Async$80 /1M characters
MiniMax speech-2.5-turbo-previewT2A / T2A Async$48 /1M characters
MiniMax speech-2.6-turboT2A / T2A Async$60 /1M characters
MiniMax speech-2.6-hdT2A / T2A Async$100 /1M characters
MiniMax Voice-Cloning-$2.4 /voice
Ready to build smarter? Start today.
Get started with Novita AI and unlock the power of affordable, reliable, and scalable AI inference for your applications.