Pricing

Simple, transparent pricing.

Pay for what you use. Per-second billing on compute, per-token on inference. No hidden fees.

Example: a fine-tuning job (train-llama-ft) on one H100.

Rate       $2.49/hr ($0.00069/s)
Runtime    00:04:23 (263 seconds)
Cost       $0.18

Billed exactly for the time used.
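The arithmetic behind the example above can be sketched in a few lines. This is an illustrative calculator, not the platform's billing code; the function name is ours, and the rate and runtime are the figures shown.

```python
def billed_cost(rate_per_hour: float, seconds_used: int) -> float:
    """Per-second billing: hourly rate divided by 3600, rounded at the end."""
    return round(rate_per_hour / 3600 * seconds_used, 2)

# 00:04:23 = 4 * 60 + 23 = 263 seconds on an H100 at $2.49/hr
print(billed_cost(2.49, 263))  # 0.18
```

A full hour at the same rate comes out to exactly the hourly price, so there is no rounding penalty for short runs.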
// Inference

Run models via API.

Pay per token. No GPU management.

Model            Input ($/1M tokens)   Output ($/1M tokens)
Llama 3.1 8B     $0.10                 $0.10
Llama 3.1 70B    $0.35                 $0.40
Llama 3.1 405B   $1.00                 $1.00
Mixtral 8x22B    $0.50                 $0.50
Qwen 2.5 72B     $0.40                 $0.45

More models in the dashboard.
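To estimate what a single request costs under the per-token prices above, you can multiply token counts by the table rates. A minimal sketch; the price dict mirrors the table, and the model choice and token counts are made-up examples.

```python
PRICES = {  # (input, output) in $ per 1M tokens, from the table above
    "Llama 3.1 8B":   (0.10, 0.10),
    "Llama 3.1 70B":  (0.35, 0.40),
    "Llama 3.1 405B": (1.00, 1.00),
    "Mixtral 8x22B":  (0.50, 0.50),
    "Qwen 2.5 72B":   (0.40, 0.45),
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost of one request in dollars."""
    inp, out = PRICES[model]
    return (input_tokens * inp + output_tokens * out) / 1_000_000

# 2,000 input tokens + 500 output tokens on Llama 3.1 70B
print(request_cost("Llama 3.1 70B", 2000, 500))  # 0.0009
```

At these rates, a typical chat-sized request costs a fraction of a cent; the per-million pricing only becomes material at sustained volume.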

Reserved GPU capacity. Consistent latency.

GPUs:
GPU    VRAM     Price
T4     16 GB    $0.39/hr
L40S   48 GB    $1.49/hr
A100   80 GB    $1.99/hr
H100   80 GB    $3.29/hr
H200   141 GB   $3.69/hr
B200   192 GB   $5.89/hr

Deploy any Hugging Face model.

// Training & Compute

Run code on GPUs.

Run Python or Docker. Per-second billing.

GPU    VRAM     Price
T4     16 GB    $0.39/hr
L40S   48 GB    $1.49/hr
A100   80 GB    $1.99/hr
H100   80 GB    $3.29/hr
H200   141 GB   $3.69/hr
B200   192 GB   $5.89/hr

Billed per second. No minimum.

Full root access. SSH in seconds.

GPUs:
GPU    VRAM     Price
T4     16 GB    $0.39/hr
L40S   48 GB    $1.05/hr
A100   80 GB    $1.39/hr
H100   80 GB    $2.49/hr
H200   141 GB   $3.19/hr
B200   192 GB   $4.29/hr

Storage: $0.10/GB/month.
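A rough monthly-bill estimate combines the instance rates above with the $0.10/GB/month storage price. This is an illustrative sketch; the usage figures (hours, GB) are invented, and the rate dict copies the instance table.

```python
INSTANCE_RATE = {"A100": 1.39, "H100": 2.49, "H200": 3.19}  # $/hr, from the table above
STORAGE_RATE = 0.10  # $ per GB per month

def monthly_cost(gpu: str, hours: float, storage_gb: float) -> float:
    """Estimated monthly bill: GPU-hours plus storage (hypothetical usage)."""
    return round(INSTANCE_RATE[gpu] * hours + STORAGE_RATE * storage_gb, 2)

# 40 hours on an H100 instance plus 200 GB of storage
print(monthly_cost("H100", 40, 200))  # 119.6
```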

8 to 8,000 GPUs with InfiniBand. Custom pricing.

GPU            VRAM     Price
NVIDIA GB300   288 GB   Custom
NVIDIA B300    288 GB   Custom
NVIDIA GB200   192 GB   Custom
NVIDIA B200    192 GB   Custom
NVIDIA H200    141 GB   Custom
NVIDIA H100    80 GB    Custom

Minimum 8 GPUs. InfiniBand included.

Start building today.

$20 free credits on approval. No credit card required.