Skip to main content

AI Model Catalog

Deploy 68+ open-source AI models on dedicated GPUs. Pick a model, choose your GPU, and deploy in minutes.

Text & Chat

Language models for chat, code generation, reasoning, and analysis.

DeepSeek R1

5 variants

DeepSeek R1 is an open-source reasoning model from DeepSeek AI. It demonstrates step-by-step chain-of-thought thinking and rivals GPT-4 on complex rea...

From $0.51/hrView details →

Qwen3

5 variants

Qwen3 is Alibaba Cloud's latest language model family supporting 119 languages with 128K context. Features dual thinking/non-thinking modes for flexib...

From $0.51/hrView details →

QwQ 32B

QwQ is a 32B reasoning model from Alibaba specialized in math and logic. It excels at mathematical proofs, competitive programming, and structured rea...

From $0.64/hrView details →

LLaMA 4

2 variants

LLaMA 4 is Meta's latest open-weight model family. Scout uses a 109B MoE architecture with 17B active parameters, 10M token context window, and native...

From $0.64/hrView details →

Mistral

3 variants

Mistral AI builds fast, efficient language models. Ministral 8B is their latest small model with excellent multilingual support under Apache 2.0. Mist...

From $0.51/hrView details →

GPT-OSS

2 variants

GPT-OSS is OpenAI's open-weight model family. The 20B model offers native function calling, while the 120B flagship provides visible chain-of-thought ...

From $0.51/hrView details →

Gemma 3

3 variants

Gemma 3 is Google's efficient open model family with the best quality-to-size ratio in its class. Available in 4B, 12B, and 27B sizes, these models pu...

From $0.51/hrView details →

Phi-4

Phi-4 is Microsoft's 14B parameter model that delivers top reasoning performance for its size class. It outperforms many larger models on math, coding...

From $0.51/hrView details →

GLM

3 variants

GLM models from Zhipu AI are optimized for bilingual Chinese and English tasks. GLM-Z1 variants add deep reasoning capabilities, competing with DeepSe...

From $0.51/hrView details →

Magistral 24B

Magistral is a 24B parameter model specialized in legal and financial analysis. It provides transparent reasoning chains suited for compliance review,...

From $0.64/hrView details →

Image Generation

Generate and edit images with state-of-the-art diffusion models.

Qwen-Image-2512

Qwen-Image-2512 is the #1 ranked open-source image generation model. It excels at realistic human faces, accurate text rendering, and complex composit...

From $0.64/hrView details →

Z Image Turbo

Z Image Turbo is a fast distilled image generation model from Alibaba Tongyi. It achieves sub-second generation times on high-end GPUs, making it idea...

From $0.51/hrView details →

Flux

3 variants

Flux is Black Forest Labs' flagship image generation family. Flux Dev delivers the best quality with excellent text rendering, while Flux Schnell offe...

From $0.51/hrView details →

FLUX.2

3 variants

FLUX.2 is Black Forest Labs' 32B parameter model with multi-image editing capabilities. The Dev FP8 variant runs on RTX 4090/5090, while the full mode...

From $0.51/hrView details →

FLUX.2 Klein

3 variants

FLUX.2 Klein is the fastest model in the Flux family. The 9B FP8 variant delivers sub-second generation in 4 steps, while the 4B model is fully open s...

From $0.51/hrView details →

HiDream I1

3 variants

HiDream I1 is a 17B parameter image generation model with excellent prompt following. Available in Dev, Full, and Fast variants with FP8 quantization ...

From $0.64/hrView details →

Stable Diffusion

5 variants

Stable Diffusion by Stability AI is the most widely adopted image generation family with the largest ecosystem of fine-tunes and LoRAs. SDXL generates...

From $0.51/hrView details →

Qwen Image

2 variants

Qwen Image models from Alibaba Tongyi provide precise text and semantic image editing alongside high-quality generation. They support bilingual Chines...

From $0.64/hrView details →

Video Generation

Create videos from text prompts or animate existing images.

Text-to-Speech

Convert text to natural speech with voice cloning and multilingual support.

Ready to deploy?

Pick any model and have it running on a GPU in minutes.

Go to Dashboard