Pricing

Explore pricing for our Model APIs and GPU resources. Find the right plan to match your needs with transparent rates and flexible options.

Batch inference is available at an introductory 50% discount on input and output tokens for supported models.Learn more

Deepseek

Advanced AI models from DeepSeek, offering cutting-edge reasoning capabilities and competitive pricing for enterprise and research applications.

Model NameContextInputOutputActions
deepseek/deepseek-v3-0324163,840$0.28 /M Tokens$1.14 /M TokensMore
deepseek/deepseek-r1-0528163,840$0.7 /M Tokens$2.5 /M TokensMore
deepseek/deepseek-r1-0528-qwen3-8b128,000$0.06 /M Tokens$0.09 /M TokensMore
deepseek/deepseek-v3-turbo64,000$0.4 /M Tokens$1.3 /M TokensMore
deepseek/deepseek-r1-turbo64,000$0.7 /M Tokens$2.5 /M TokensMore
deepseek/deepseek-prover-v2-671b160,000$0.7 /M Tokens$2.5 /M TokensMore
deepseek/deepseek-r1-distill-llama-8b32,000$0.04 /M Tokens$0.04 /M TokensMore
deepseek/deepseek-r1-distill-qwen-14b64,000$0.15 /M Tokens$0.15 /M TokensMore
deepseek/deepseek-r1-distill-qwen-32b64,000$0.3 /M Tokens$0.3 /M TokensMore
deepseek/deepseek-r1-distill-llama-70b32,000$0.8 /M Tokens$0.8 /M TokensMore

Llama

Meta's Llama models providing state-of-the-art language understanding with open architecture designed for diverse applications.

Model NameContextInputOutputActions
meta-llama/llama-4-maverick-17b-128e-instruct-fp81,048,576$0.17 /M Tokens$0.85 /M TokensMore
meta-llama/llama-4-scout-17b-16e-instruct131,072$0.1 /M Tokens$0.5 /M TokensMore
meta-llama/llama-3.1-8b-instruct16,384$0.02 /M Tokens$0.05 /M TokensMore
meta-llama/llama-3.3-70b-instruct131,072$0.13 /M Tokens$0.39 /M TokensMore
meta-llama/llama-3-8b-instruct8,192$0.04 /M Tokens$0.04 /M TokensMore
meta-llama/llama-3-70b-instruct8,192$0.51 /M Tokens$0.74 /M TokensMore
meta-llama/llama-3.2-1b-instruct131,000FreeFreeMore
meta-llama/llama-3.2-3b-instruct32,768$0.03 /M Tokens$0.05 /M TokensMore

Qwen

Qwen series models offering efficient language processing with various parameter sizes, from lightweight to enterprise-grade solutions.

Model NameContextInputOutputActions
qwen/qwen3-coder-480b-a35b-instruct262,144$0.64 /M Tokens$2.5 /M TokensMore
qwen/qwen3-235b-a22b-thinking-2507131,072$0.3 /M Tokens$3 /M TokensMore
qwen/qwen3-235b-a22b-instruct-2507262,144$0.15 /M Tokens$0.8 /M TokensMore
qwen/qwen3-30b-a3b-fp840,960$0.1 /M Tokens$0.45 /M TokensMore
qwen/qwen3-32b-fp840,960$0.1 /M Tokens$0.45 /M TokensMore
qwen/qwen2.5-vl-72b-instruct32,768$0.8 /M Tokens$0.8 /M TokensMore
qwen/qwen3-235b-a22b-fp840,960$0.2 /M Tokens$0.8 /M TokensMore
qwen/qwen-2.5-72b-instruct32,000$0.38 /M Tokens$0.4 /M TokensMore
qwen/qwen3-8b-fp8128,000$0.035 /M Tokens$0.138 /M TokensMore
qwen/qwen3-4b-fp8128,000FreeFreeMore
qwen/qwen2.5-7b-instruct32,000$0.07 /M Tokens$0.07 /M TokensMore

Baidu

Baidu's ERNIE models providing advanced Chinese language understanding and multimodal capabilities, optimized for Chinese applications with competitive pricing.

Model NameContextInputOutputActions
baidu/ernie-4.5-vl-424b-a47b123,000$0.42 /M Tokens$1.25 /M TokensMore
baidu/ernie-4.5-300b-a47b-paddle123,000$0.28 /M Tokens$1.1 /M TokensMore
baidu/ernie-4.5-vl-28b-a3b30,000$0.14 /M Tokens$0.56 /M TokensMore
baidu/ernie-4.5-21B-a3b120,000$0.07 /M Tokens$0.28 /M TokensMore
baidu/ernie-4.5-0.3b120,000FreeFreeMore

Gemma

Google's Gemma models offering high-quality language processing with excellent performance for various NLP tasks.

Model NameContextInputOutputActions
google/gemma-3-27b-it32,000$0.119 /M Tokens$0.2 /M TokensMore
google/gemma-3-1b-it32,768FreeFreeMore

THUDM

GLM series models from Tsinghua University, featuring advanced Chinese language understanding and generation capabilities.

Model NameContextInputOutputActions
zai-org/glm-4.5v65,536$0.6 /M Tokens$1.8 /M TokensMore
zai-org/glm-4.5131,072$0.6 /M Tokens$2.2 /M TokensMore
thudm/glm-4.1v-9b-thinking65,536$0.035 /M Tokens$0.138 /M TokensMore
thudm/glm-4-32b-041432,000$0.24 /M Tokens$0.24 /M TokensMore

Sao10K

Specialized fine-tuned models optimized for creative and roleplay applications with enhanced storytelling capabilities.

Model NameContextInputOutputActions
Sao10K/L3-8B-Stheno-v3.28,192$0.05 /M Tokens$0.05 /M TokensMore
sao10k/l3-70b-euryale-v2.18,192$1.48 /M Tokens$1.48 /M TokensMore
sao10k/l3-8b-lunaris8,192$0.05 /M Tokens$0.05 /M TokensMore
sao10k/l31-70b-euryale-v2.28,192$1.48 /M Tokens$1.48 /M TokensMore

Mistralai

Efficient and powerful language models from Mistral AI, designed for both commercial and open-source applications.

Model NameContextInputOutputActions
mistralai/mistral-nemo60,288$0.04 /M Tokens$0.17 /M TokensMore
mistralai/mistral-7b-instruct32,768$0.029 /M Tokens$0.059 /M TokensMore

Nousresearch

Research-focused AI models designed for advanced reasoning and enhanced instruction following capabilities.

Model NameContextInputOutputActions
nousresearch/hermes-2-pro-llama-3-8b8,192$0.14 /M Tokens$0.14 /M TokensMore

CognitiveComputations

Specialized AI models focused on advanced cognitive tasks and complex reasoning applications.

Model NameContextInputOutputActions
cognitivecomputations/dolphin-mixtral-8x22b16,000$0.9 /M Tokens$0.9 /M TokensMore

Sophosympatheia

Fine-tuned models designed for enhanced emotional intelligence and nuanced conversational capabilities.

Model NameContextInputOutputActions
sophosympatheia/midnight-rose-70b4,096$0.8 /M Tokens$0.8 /M TokensMore

Gryphe

Innovative AI models from Gryphe, delivering specialized language understanding with a focus on efficiency and adaptability for niche applications.

Model NameContextInputOutputActions
gryphe/mythomax-l2-13b4,096$0.09 /M Tokens$0.09 /M TokensMore

MiniMax

Minimax AI's advanced language models delivering robust conversational AI capabilities with optimized performance for customer service, content generation, and creative applications, featuring strong multilingual support and enterprise-ready scalability.

Model NameContextInputOutputActions
minimaxai/minimax-m1-80k1,000,000$0.55 /M Tokens$2.2 /M TokensMore

Microsoft

Microsoft’s models deliver state-of-the-art multilingual capabilities with enterprise-grade security, seamlessly integrated into the Azure cloud ecosystem. Optimized for cross-platform collaboration and business intelligence, these models excel in document understanding and generation—key strengths for Microsoft 365 workflows.

Model NameContextInputOutputActions
microsoft/wizardlm-2-8x22b65,535$0.62 /M Tokens$0.62 /M TokensMore

Mixture of Expert

Premium collection of state-of-the-art AI models featuring advanced reasoning, mathematical proof capabilities, and cutting-edge language understanding across multiple domains.

Model NameContextInputOutputActions
deepseek/deepseek-v3-0324163,840$0.28 /M Tokens$1.14 /M TokensMore
zai-org/glm-4.5v65,536$0.6 /M Tokens$1.8 /M TokensMore
openai/gpt-oss-120b131,072$0.1 /M Tokens$0.5 /M TokensMore
openai/gpt-oss-20b131,072$0.05 /M Tokens$0.2 /M TokensMore
zai-org/glm-4.5131,072$0.6 /M Tokens$2.2 /M TokensMore
qwen/qwen3-235b-a22b-thinking-2507131,072$0.3 /M Tokens$3 /M TokensMore
moonshotai/kimi-k2-instruct131,072$0.57 /M Tokens$2.3 /M TokensMore
deepseek/deepseek-r1-0528163,840$0.7 /M Tokens$2.5 /M TokensMore
baidu/ernie-4.5-vl-424b-a47b123,000$0.42 /M Tokens$1.25 /M TokensMore
baidu/ernie-4.5-300b-a47b-paddle123,000$0.28 /M Tokens$1.1 /M TokensMore
qwen/qwen3-30b-a3b-fp840,960$0.1 /M Tokens$0.45 /M TokensMore
minimaxai/minimax-m1-80k1,000,000$0.55 /M Tokens$2.2 /M TokensMore
qwen/qwen3-32b-fp840,960$0.1 /M Tokens$0.45 /M TokensMore
qwen/qwen3-235b-a22b-fp840,960$0.2 /M Tokens$0.8 /M TokensMore
deepseek/deepseek-v3-turbo64,000$0.4 /M Tokens$1.3 /M TokensMore
meta-llama/llama-4-maverick-17b-128e-instruct-fp81,048,576$0.17 /M Tokens$0.85 /M TokensMore
deepseek/deepseek-r1-turbo64,000$0.7 /M Tokens$2.5 /M TokensMore
deepseek/deepseek-prover-v2-671b160,000$0.7 /M Tokens$2.5 /M TokensMore
meta-llama/llama-4-scout-17b-16e-instruct131,072$0.1 /M Tokens$0.5 /M TokensMore
baidu/ernie-4.5-vl-28b-a3b30,000$0.14 /M Tokens$0.56 /M TokensMore
baidu/ernie-4.5-21B-a3b120,000$0.07 /M Tokens$0.28 /M TokensMore
baidu/ernie-4.5-0.3b120,000FreeFreeMore
Embeddings
Image

Pricing may vary based on image dimensions, inference steps, and upscaling factors. Use thePricing Calculatorfor an estimate.

API NameWidth&HeightSteps/ScalePricing
Text to Image512*5125$0.001 /image
Image to Image512*5125$0.001 /image
Remove Background--$0.017 /image
Replace Background--$0.0255 /image
Inpainting512*5125$0.0015 /image
Remove Text--$0.017 /image
Cleanup--$0.017 /image
Merge Face--$0.0255 /image
API NameWidth&HeightPricing
Seedream 3.0 Text to Image-$0.03 /image
Qwen-Image Text to Image-$0.02 /image
Video

Pricing may vary based on the number of frames, chosen model, and inference steps. Use thePricing Calculatorfor an estimate.

API NameTotal FramesStepsPricing
Text to Video3220$0.0307 /video
API NameModeDurationResolutionPricing
Kling V1.6 Text to VideoStandard5s720P$0.27 /video
Standard10s720P$0.54 /video
Kling V1.6 Image to VideoStandard5s720P$0.27 /video
Standard10s720P$0.54 /video
Professional5s1080P$0.46 /video
Professional10s1080P$0.92 /video
MiniMax Video 01-6s720P$0.40 /video
MiniMax Video 02-6s768P$0.25 /video
-10s768P$0.50 /video
-6s1080P$0.44 /video
Hunyuan Video Fast-5s1280*720 | 720*1280$0.30 /video
Wan 2.1 Text to Video-5s1280*720 | 720*1280$0.30 /video
-5s832*480 | 480*832$0.20 /video
fast_mode5s1280*720 | 720*1280$0.225 /video
fast_mode5s832*480 | 480*832$0.125 /video
Wan 2.1 Image to Video-5s1280*720 | 720*1280$0.30 /video
-5s832*480 | 480*832$0.20 /video
fast_mode5s1280*720 | 720*1280$0.225 /video
fast_mode5s832*480 | 480*832$0.125 /video
Wan 2.2 Text to Video-5s480P$0.09 /video
-5s1080p$0.40 /video
Wan 2.2 Image to Video-5s480P$0.09 /video
-5s1080P$0.40 /video
Vidu Q1 Text to Videogeneral style5s1080P$0.36 /video
anime style5s1080P$0.36 /video
Vidu Q1 Image to Video-5s1080P$0.36 /video
Vidu Q1 Start End to Video-5s1080P$0.36 /video
Vidu Q1 Reference to Video-5s1080P$0.36 /video
Vidu 2.0 Image to Video-4s360P$0.09 /video
-4s720P$0.18 /video
-4s1080P$0.27 /video
-8s720P$0.27 /video
Vidu 2.0 Reference to Video-4s360P$0.09 /video
-4s720P$0.18 /video
Vidu 2.0 Start End to Video-4s360P$0.09 /video
-4s720P$0.18 /video
-4s1080P$0.27 /video
-8s720P$0.27 /video
API NameModelStepsPricing
Image to VideoSVD-XT20$0.024 /video
SVD20$0.0134 /video
Audio
API NameModePricing
Text to Speech-$15 /1M characters
MiniMax speech-02-hdT2A / T2A Async$80 /1M characters
MiniMax speech-02-turboT2A / T2A Async$48 /1M characters
MiniMax Voice-Cloning-$2.4 /voice
Ready to build smarter? Start today.
Get started with Novita AI and unlock the power of affordable, reliable, and scalable AI inference for your applications.