Skip to main content

Deploy Gemma 3

Text & Chat

Gemma 3 is Google's efficient open model family with the best quality-to-size ratio in its class. Available in 4B, 12B, and 27B sizes, these models punch above their weight on reasoning and instruction following.

Deploy Gemma 3 in minutes

Starting at $0.51/hr on dedicated GPU

Available Variants (3)

ModelGPUVRAMPriceAction
Gemma 3 4B
Small (4B)
L424 GB$0.51/hrDeploy
Gemma 3 12B
Medium (12B, Recommended)
L424 GB$0.51/hrDeploy
Gemma 3 27B
Large (27B)
RTX A600048 GB$0.64/hrDeploy

Prices include 30% service fee. Billed per minute while running.

Includes OpenWebUI chat interface and OpenAI-compatible API endpoint.

Use Cases

  • Cost-effective AI inference
  • Edge deployment and mobile
  • Instruction following
  • Research and fine-tuning

Related Models

Ready to deploy Gemma 3?

Pick your GPU and have it running in minutes. No infrastructure setup required.