Gemma3 12B

google/gemma-3-12b-it

Features

On-demand Deployments

On-demand deployments let you run google/gemma-3-12b-it on dedicated GPUs with a high-performance serving stack, high reliability, and no rate limits.

Info

Provider: Gemma
Quantization: bf16

Supported Functionality

Context Length: 131072 tokens
Max Output: 8192 tokens
Serverless: Not supported
Input Capabilities: text, image
Output Capabilities: text
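
The context length and max output figures above jointly limit how many tokens a single request can generate. The following is a minimal sketch of that arithmetic, assuming prompt and output tokens share one context window (a common convention, but not stated on this page). The function name `output_budget` and its parameters are hypothetical and are not part of any provider API.

```python
CONTEXT_LENGTH = 131072  # tokens, from the listing above
MAX_OUTPUT = 8192        # tokens, from the listing above


def output_budget(prompt_tokens: int, requested: int = MAX_OUTPUT) -> int:
    """Return the largest output-token limit that fits both limits.

    Assumption: prompt and output tokens count against the same
    context window, so output can use only what the prompt leaves free.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt alone fills or exceeds the context window")
    # The result is capped by the caller's request, the model's
    # max output, and the space left in the context window.
    return min(requested, MAX_OUTPUT, remaining)


print(output_budget(1_000))       # 8192
print(output_budget(130_000))     # 1072
print(output_budget(1_000, 512))  # 512
```

Under that assumption, the max output cap applies to any prompt up to 122880 tokens (131072 minus 8192); longer prompts leave less room for output.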