Model Library/MiniMax M2.5-highspeed
minimax/minimax-m2.5-highspeed

MiniMax M2.5-highspeed

minimax/minimax-m2.5-highspeed
MiniMax M2.5-highspeed is an accelerated SOTA model engineered for scenarios demanding extreme efficiency. It perfectly inherits the core intelligence and robust digital workspace capabilities of the standard M2.5—including its 80.2% score on SWE-Bench Verified, seamless manipulation of Office documents, and versatility in cross-software collaboration. With zero compromise on reasoning precision or logical depth, the Highspeed version delivers ultra-low latency inference through rigorous engineering optimization. This means you get more than just an intelligent assistant capable of planning and self-optimization; you gain a "high-velocity engine" that responds to high-frequency calls and processes complex document streams in near real-time, making it ideal for latency-sensitive interactive applications and large-scale automated pipelines.

Recursos

API serverless

Documentação

minimax/minimax-m2.5-highspeed is available via Novita's serverless API, where you pay per token. There are several ways to call the API, including OpenAI-compatible endpoints with exceptional reasoning performance.

Serverless disponível

Execute consultas imediatamente, pague apenas pelo uso

Entrada$0.6 / M Tokens
Leitura de cache$0.03 / M Tokens
Saída$2.4 / M Tokens

Use os exemplos de código a seguir para integrar com nossa API:

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.novita.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="minimax/minimax-m2.5-highspeed",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=131100,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

Informações

Provedor
MiniMax
Quantização
fp8

Funcionalidades compatíveis

Comprimento do contexto
204800
Saída máxima
131100
Serverless
Compatível
Function Calling
Compatível
Structured Output
Compatível
Reasoning
Compatível
API da Anthropic
Compatível
Capacidades de entrada
text
Capacidades de saída
text

Tudo o que você precisa para criar IA de produção.

Mais de 200 modelos, GPUs sob demanda e ambientes de execução de agentes seguros — unificados em uma única API. Grátis para começar, escala conforme você cresce.