Mistral Nemo

mistralai/mistral-nemo

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, and Hindi. It supports function calling and is released under the Apache 2.0 license.

Funktionen

Serverless API

Dokumentation

mistralai/mistral-nemo is available via Novita's serverless API, where you pay per token. There are several ways to call the API, including OpenAI-compatible endpoints with exceptional reasoning performance.

On-Demand-Bereitstellungen

Dokumentation

On-demand deployments allow you to use mistralai/mistral-nemo on dedicated GPUs with high-performance serving stack with high reliability and no rate limits.

Verfügbare Serverless

Abfragen sofort ausführen, nur für die Nutzung bezahlen

Eingabe$0.04 / M Tokens

Ausgabe$0.17 / M Tokens

Verwenden Sie die folgenden Codebeispiele, um unsere API zu integrieren:

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.novita.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="mistralai/mistral-nemo",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=16000,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

Info

Anbieter

Mistral

Quantisierung

fp8

Unterstützte Funktionalität

Kontextlänge

60288

Maximale Ausgabe

16000

Serverless

Unterstützt

Structured Output

Unterstützt

Eingabefähigkeiten

text

Ausgabefähigkeiten

text