The website requires JavaScript to function properly.
Model Library
GPUs
Agent Sandbox
Pricing
Docs
Log In
Get Started
qwen/qwen3-235b-a22b-fp8
40960 Context
$0.2 / 1M input tokens
$0.8 / 1M output tokens
Learn about API rate limits
40960 Context
$0.2/1M input tokens
$0.8/1M output tokens
Demo
API
Dedicated Endpoints
Join Discord
Model Configuration
Response format
text
System Prompt
Be a helpful assistant
max_tokens
temperature
top_p
min_p
top_k
presence_penalty
frequency_penalty
repetition_penalty