Gemma3 12B

google/gemma-3-12b-it

Features

On-demand Deployments

Docs

On-demand deployments allow you to use google/gemma-3-12b-it on dedicated GPUs with high-performance serving stack with high reliability and no rate limits.

Info

Provider

Gemma

Quantization

bf16

Supported Functionality

Context Length

131072

Max Output

8192

Serverless

Not supported

Input Capabilities

text, image

Output Capabilities

text

Everything you need to build production AI.

200+ models, on-demand GPUs, and secure agent runtimes — unified under one API. Free to start, scales as you grow.