Model Library/ERNIE 4.5 VL 28B A3B
Wenxin

ERNIE 4.5 VL 28B A3B

baidu/ernie-4.5-vl-28b-a3b
The ERNIE 4.5 series of open-source models adopts a Mixture-of-Experts (MoE) architecture, representing an innovative multimodal heterogeneous model structure. It achieves cross-modal knowledge fusion through a parameter-sharing mechanism while retaining dedicated parameter spaces for individual modalities. This architecture is particularly well-suited for the continuous pre-training paradigm from large language models to multimodal models, significantly enhancing multimodal understanding capabilities while maintaining or even improving performance in text-based tasks. The models are efficiently trained, inferred, and deployed using the PaddlePaddle deep learning framework. During the pre-training of large language models, the Model FLOPs Utilization (MFU) reaches 47%. Experimental results demonstrate that this series of models achieves state-of-the-art (SOTA) performance across multiple text and multimodal benchmarks, with particularly outstanding results in instruction following, world knowledge memorizatio

Recursos

API serverless

Documentação

baidu/ernie-4.5-vl-28b-a3b is available via Novita's serverless API, where you pay per token. There are several ways to call the API, including OpenAI-compatible endpoints with exceptional reasoning performance.

Serverless disponível

Execute consultas imediatamente, pague apenas pelo uso

Entrada$0.14 / M Tokens
Saída$0.56 / M Tokens

Use os exemplos de código a seguir para integrar com nossa API:

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.novita.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="baidu/ernie-4.5-vl-28b-a3b",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=8000,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

Informações

Provedor
BAIDU
Quantização
fp16

Funcionalidades compatíveis

Comprimento do contexto
30000
Saída máxima
8000
Serverless
Compatível
Function Calling
Compatível
Reasoning
Compatível
Capacidades de entrada
text, image
Capacidades de saída
text

Tudo o que você precisa para criar IA de produção.

Mais de 200 modelos, GPUs sob demanda e ambientes de execução de agentes seguros — unificados em uma única API. Grátis para começar, escala conforme você cresce.