Browse our curated selection of AI models. All models come pre-configured with optimal settings and are ready to deploy in minutes.
Chat, code generation, and general language tasks. All text models include OpenWebUI for easy interaction.
Best balance of speed and intelligence. Mixture of Experts architecture.
L4 GPU
Fast, efficient general-purpose model. Great for chat and code.
L4 GPU
Advanced reasoning model with step-by-step thinking.
L4 GPU
Google's latest model. Available in 4B, 12B, and 27B sizes.
L4-A100 GPU
Meta's flagship model. Exceptional for complex tasks.
A100 GPU
Create stunning images from text prompts. All image models use ComfyUI for powerful workflow editing.
State-of-the-art image generation. Best quality results.
A6000 GPU
Fast image generation. Great for rapid iteration.
L4 GPU
Ultra-fast generation (1-2 seconds). Perfect for demos.
A6000 GPU
Stable Diffusion XL. Versatile and well-supported.
L4 GPU
Qwen 2.5 VL for image understanding and generation.
A6000 GPU
Generate videos from text or images. Video models require more GPU power and time per generation.
Text-to-video generation. Good quality, faster processing.
A6000 GPU
Higher quality video generation. Recommended for production.
A100 GPU
Tencent's video generation model. Excellent quality.
A6000 GPU
| GPU | VRAM | Price | Best For |
|---|---|---|---|
| NVIDIA L4 | 24 GB | $0.55/hr | Small-medium models, fast inference |
| NVIDIA RTX A6000 | 48 GB | $0.79/hr | Large image models, Flux, video |
| NVIDIA A100 | 80 GB | $2.49/hr | 70B+ text models, high-quality video |
| NVIDIA H100 | 80 GB | $4.49/hr | Maximum performance, fastest inference |
Check out our Quick Deploy Guide to get started, or learn about ComfyUI and OpenWebUI interfaces.
Deploy a Model