Skip to main content

Deploy Qwen3

Text & Chat

Qwen3 is Alibaba Cloud's latest language model family supporting 119 languages with 128K context. Features dual thinking/non-thinking modes for flexible reasoning depth. The 8B variant has over 18 million Ollama pulls.

Deploy Qwen3 in minutes

Starting at $0.51/hr on dedicated GPU

Available Variants (5)

ModelGPUVRAMPriceAction
Qwen3 4B
Small (4B)
L424 GB$0.51/hrDeploy
Qwen3 8B
8B (Recommended)
L424 GB$0.51/hrDeploy
Qwen3 14B
Medium (14B, Recommended)
L424 GB$0.51/hrDeploy
Qwen3 32B
Large (32B)
RTX A600048 GB$0.64/hrDeploy
Qwen3 30B-A3B MoE
MoE (30B-A3B)
L424 GB$0.51/hrDeploy

Prices include 30% service fee. Billed per minute while running.

Includes OpenWebUI chat interface and OpenAI-compatible API endpoint.

Use Cases

  • Multilingual chatbots (119 languages)
  • Long document analysis (128K context)
  • Code generation and review
  • Content writing and translation

Related Models

Ready to deploy Qwen3?

Pick your GPU and have it running in minutes. No infrastructure setup required.