Pay only for what you use. No monthly commitments.
From $0.53/hr: your own GPU, billed per 10 minutes.
From $0.008/req: no idle costs, pay per execution.
Full GPU reserved for your workload. Billed per 10-minute increment. Stop anytime.
| GPU | VRAM | vCPU | RAM | Price/hr | Best For |
|---|---|---|---|---|---|
| CPU Only | — | 3 | 16 GB | $0.15/hour | Small LLMs (1B-3B), basic inference, API servers, testing |
| RTX 4090 | 24 GB | 6 | 41 GB | $0.79/hour | 7B-13B LLMs, SDXL Images, Flux, Most Popular GPU |
| L4 | 24 GB | 12 | 50 GB | $0.53/hour | 7B-13B LLMs, SDXL Images, Power Efficient |
| RTX A6000 | 48 GB | 9 | 50 GB | $0.66/hour | 30B+ LLMs (quantized 70B), Video Gen, 48GB VRAM |
| A100 80GB | 80 GB | 8 | 117 GB | $1.85/hour | 70B+ LLMs, Complex Workflows, Fine-tuning |
| H100 | 80 GB | 16 | 188 GB | $3.54/hour | Lowest Latency Production, Training, Research |
All prices include compute, memory, disk, and network.
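As a rough sketch of how the 10-minute billing increment affects the cost of a session, the snippet below uses the hourly rates from the table above. It assumes that a partial increment is rounded up to the next full 10 minutes and that totals are rounded to the cent; the page states neither rule, so treat both as assumptions. The names `session_cost` and `HOURLY_PRICE_USD` are illustrative and not part of any published API.

```python
import math
from decimal import Decimal, ROUND_HALF_UP

# Hourly rates in USD, copied from the pricing table.
HOURLY_PRICE_USD = {
    "CPU Only": Decimal("0.15"),
    "RTX 4090": Decimal("0.79"),
    "L4": Decimal("0.53"),
    "RTX A6000": Decimal("0.66"),
    "A100 80GB": Decimal("1.85"),
    "H100": Decimal("3.54"),
}

INCREMENT_MINUTES = 10  # billing granularity stated on the page


def session_cost(instance: str, minutes_used: float) -> Decimal:
    """Estimate the cost of one session in USD.

    Assumption: usage is rounded UP to whole 10-minute increments.
    """
    increments = math.ceil(minutes_used / INCREMENT_MINUTES)
    billed_minutes = increments * INCREMENT_MINUTES
    cost = HOURLY_PRICE_USD[instance] * billed_minutes / 60
    # Assumption: the total is rounded half-up to the nearest cent.
    return cost.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


# 25 minutes on an RTX 4090 is billed as 30 minutes under this assumption.
print(session_cost("RTX 4090", 25))  # 0.40
```

Under these assumptions, stopping an instance just after an increment boundary costs the same as running it to the end of that increment.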
No idle costs. Your model scales to zero when not in use.
| Model | Per Request |
|---|---|
| Qwen3 4B (Quick Response) | $0.010 |
| Qwen3 8B (General Purpose) | $0.015 |
| DeepSeek R1 8B (Reasoning) | $0.015 |
| GLM-Z1 9B (Bilingual Reasoning) | $0.015 |
| DeepSeek R1 14B (Advanced Reasoning) | $0.030 |
| Qwen3 32B (Premium Quality) | $0.050 |
| GLM-Z1 32B (Premium Reasoning) | $0.050 |
| Model | Per Request |
|---|---|
| Flux Schnell | $0.010 |
| Flux Dev | $0.015 |
| Stable Diffusion XL | $0.008 |
| Z-Image Turbo | $0.008 |
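To estimate a serverless bill from the two per-request tables above, a minimal sketch follows. It totals a mix of requests at the listed prices and, as a rough comparison, computes the hourly request count at which per-request pricing costs at least as much as a dedicated instance. The function names are hypothetical, free executions and credits are ignored, and the break-even figure assumes the dedicated instance can actually serve that many requests per hour, which the page does not state.

```python
import math
from decimal import Decimal

# Per-request prices in USD, copied from the two serverless tables.
PER_REQUEST_USD = {
    "Qwen3 4B": Decimal("0.010"),
    "Qwen3 8B": Decimal("0.015"),
    "DeepSeek R1 8B": Decimal("0.015"),
    "GLM-Z1 9B": Decimal("0.015"),
    "DeepSeek R1 14B": Decimal("0.030"),
    "Qwen3 32B": Decimal("0.050"),
    "GLM-Z1 32B": Decimal("0.050"),
    "Flux Schnell": Decimal("0.010"),
    "Flux Dev": Decimal("0.015"),
    "Stable Diffusion XL": Decimal("0.008"),
    "Z-Image Turbo": Decimal("0.008"),
}


def serverless_cost(requests_by_model: dict[str, int]) -> Decimal:
    """Total cost in USD for a mix of requests, ignoring free executions."""
    return sum(
        (PER_REQUEST_USD[model] * count for model, count in requests_by_model.items()),
        Decimal("0"),
    )


def break_even_requests_per_hour(hourly_usd: Decimal, per_request_usd: Decimal) -> int:
    """Smallest hourly request count at which serverless costs at least
    as much as a dedicated instance billed at hourly_usd."""
    return math.ceil(hourly_usd / per_request_usd)


# 1,000 Qwen3 8B requests plus 500 Stable Diffusion XL images.
print(serverless_cost({"Qwen3 8B": 1000, "Stable Diffusion XL": 500}))  # 19.000

# Dedicated L4 at $0.53/hr versus Qwen3 8B at $0.015 per request.
print(break_even_requests_per_hour(Decimal("0.53"), PER_REQUEST_USD["Qwen3 8B"]))  # 36
```

Below the break-even rate, per-request pricing is the cheaper option in this simplified comparison; above it, a dedicated instance may cost less if it can keep up with the load.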
100 free serverless executions with every new account.
Daily: $2.12
Monthly: $46
$1.00 Free Credits: use code WELCOME2026.
50% Bonus on First Purchase: up to $3 extra credits, applied automatically.
100 Free Serverless Requests: included with every new account.
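The first-purchase bonus above (50%, up to $3 extra) can be read as a capped percentage. The sketch below shows that arithmetic; `first_purchase_bonus` is a hypothetical helper, and it assumes the bonus is computed on the purchase amount alone, without interaction with the free credits or any minimum purchase.

```python
from decimal import Decimal

BONUS_RATE = Decimal("0.50")     # 50% bonus on the first purchase
BONUS_CAP_USD = Decimal("3.00")  # at most $3 in extra credits


def first_purchase_bonus(purchase_usd: Decimal) -> Decimal:
    """Extra credits in USD: 50% of the purchase, capped at $3 (assumed reading)."""
    return min(purchase_usd * BONUS_RATE, BONUS_CAP_USD)


print(first_purchase_bonus(Decimal("4.00")))   # 2.0000
print(first_purchase_bonus(Decimal("10.00")))  # 3.00
```

Under this reading, any first purchase of $6 or more receives the full $3 bonus.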