The website requires JavaScript to function properly.
Model Library
GPUs
Pricing
Docs
Log In
Get Started
Get Started
Log In
Model Library
GPUs
Pricing
Docs
qwen/qwen3-235b-a22b-fp8
128000 Context
$0.2 / 1M input tokens
$0.8 / 1M output tokens
Learn about API rate limits
128000 Context
$0.2/1M input tokens
$0.8/1M output tokens
Demo
API
Get up to $500 in LLM API
Code
Join Discord
Loading...
Model Configuration
Response format
System Prompt
Be a helpful assistant
max_tokens
temperature
top_p
min_p
top_k
presence_penalty
frequency_penalty
repetition_penalty