The Meta Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out)
Features
On-demand Deployments
Docs
On-demand deployments allow you to use meta-llama/llama-3.2-3b-instruct on dedicated GPUs with high-performance serving stack with high reliability and no rate limits.