Skip to main content

Deploy GPT-OSS

Text & Chat

GPT-OSS is OpenAI's open-weight model family. The 20B model offers native function calling, while the 120B flagship provides visible chain-of-thought reasoning comparable to GPT-4.

Deploy GPT-OSS in minutes

Starting at $0.51/hr on dedicated GPU

Available Variants (2)

ModelGPUVRAMPriceAction
GPT-OSS 20B
Medium (20B)
L424 GB$0.51/hrDeploy
GPT-OSS 120B
Large (120B)
A100 80GB PCIe80 GB$1.81/hrDeploy

Prices include 30% service fee. Billed per minute while running.

Includes OpenWebUI chat interface and OpenAI-compatible API endpoint.

Use Cases

  • Function calling and tool use
  • Chain-of-thought reasoning
  • AI agent development
  • Enterprise deployments

Related Models

Ready to deploy GPT-OSS?

Pick your GPU and have it running in minutes. No infrastructure setup required.