Model Library/Llama 3.2 3B Instruct
meta-llama/llama-3.2-3b-instruct

Llama 3.2 3B Instruct

meta-llama/llama-3.2-3b-instruct
The Meta Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out)

Features

On-demand Deployments

Docs

On-demand deployments allow you to use meta-llama/llama-3.2-3b-instruct on dedicated GPUs with high-performance serving stack with high reliability and no rate limits.

Info

Provider
Llama
Quantization
bf16

Supported Functionality

Context Length
32768
Max Output
32000
Serverless
Not supported
Input Capabilities
text
Output Capabilities
text