Skip to main content

Deploy GLM

Text & Chat

GLM models from Zhipu AI are optimized for bilingual Chinese and English tasks. GLM-Z1 variants add deep reasoning capabilities, competing with DeepSeek R1 at up to 8x faster inference. All models use MIT license.

Deploy GLM in minutes

Starting at $0.51/hr on dedicated GPU

Available Variants (3)

ModelGPUVRAMPriceAction
GLM-4 9B
9B (Bilingual)
L424 GB$0.51/hrDeploy
GLM-Z1 9B
9B (Reasoning)
L424 GB$0.51/hrDeploy
GLM-Z1 32B
32B (Deep Reasoning)
RTX A600048 GB$0.64/hrDeploy

Prices include 30% service fee. Billed per minute while running.

Includes OpenWebUI chat interface and OpenAI-compatible API endpoint.

Use Cases

  • Chinese-English bilingual AI
  • Bilingual customer support
  • Chinese content generation
  • Fast reasoning tasks

Related Models

Ready to deploy GLM?

Pick your GPU and have it running in minutes. No infrastructure setup required.