Model Library/Qwen3 235B A22b Thinking 2507
Qwen

Qwen3 235B A22b Thinking 2507

qwen/qwen3-235b-a22b-thinking-2507
The Qwen3-235B-A22B-Thinking-2507 represents the newest thinking-enabled model in the Qwen3 series, delivering groundbreaking improvements in reasoning capabilities. This advanced AI demonstrates significantly enhanced performance across logical reasoning, mathematics, scientific analysis, coding tasks, and academic benchmarks - matching or even surpassing human-expert level performance to achieve state-of-the-art results among open-source thinking models. Beyond its exceptional reasoning skills, the model shows markedly better general capabilities including more precise instruction following, sophisticated tool usage, highly natural text generation, and improved alignment with human preferences. It also features enhanced 256K long-context understanding, allowing it to maintain coherence and depth across extended documents and complex discussions.

Recursos

API serverless

Documentação

qwen/qwen3-235b-a22b-thinking-2507 is available via Novita's serverless API, where you pay per token. There are several ways to call the API, including OpenAI-compatible endpoints with exceptional reasoning performance.

Implantações sob demanda

Documentação

On-demand deployments allow you to use qwen/qwen3-235b-a22b-thinking-2507 on dedicated GPUs with high-performance serving stack with high reliability and no rate limits.

Serverless disponível

Execute consultas imediatamente, pague apenas pelo uso

Entrada$0.3 / M Tokens
Saída$3 / M Tokens

Use os exemplos de código a seguir para integrar com nossa API:

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.novita.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="qwen/qwen3-235b-a22b-thinking-2507",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=32768,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

Informações

Provedor
Qwen
Quantização
fp8

Funcionalidades compatíveis

Comprimento do contexto
131072
Saída máxima
32768
Serverless
Compatível
Function Calling
Compatível
Reasoning
Compatível
API da Anthropic
Compatível
Capacidades de entrada
text
Capacidades de saída
text

Tudo o que você precisa para criar IA de produção.

Mais de 200 modelos, GPUs sob demanda e ambientes de execução de agentes seguros — unificados em uma única API. Grátis para começar, escala conforme você cresce.