Model Library/Mistral Nemo
mistralai/mistral-nemo

Mistral Nemo

mistralai/mistral-nemo
A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, and Hindi. It supports function calling and is released under the Apache 2.0 license.

Funktionen

Serverless API

Dokumentation

mistralai/mistral-nemo is available via Novita's serverless API, where you pay per token. There are several ways to call the API, including OpenAI-compatible endpoints with exceptional reasoning performance.

On-Demand-Bereitstellungen

Dokumentation

On-demand deployments allow you to use mistralai/mistral-nemo on dedicated GPUs with high-performance serving stack with high reliability and no rate limits.

Verfügbare Serverless

Abfragen sofort ausführen, nur für die Nutzung bezahlen

Eingabe$0.04 / M Tokens
Ausgabe$0.17 / M Tokens

Verwenden Sie die folgenden Codebeispiele, um unsere API zu integrieren:

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.novita.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="mistralai/mistral-nemo",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=16000,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

Info

Anbieter
Mistral
Quantisierung
fp8

Unterstützte Funktionalität

Kontextlänge
60288
Maximale Ausgabe
16000
Serverless
Unterstützt
Structured Output
Unterstützt
Eingabefähigkeiten
text
Ausgabefähigkeiten
text