Independent comparison · Updated April 2026 · 10 GPU providers tested · Real hourly pricing
We earn commissions from partner links on this page.

H100 cloud comparison · April 2026

Best H100 Cloud Providers 2026

Where to actually get NVIDIA H100 capacity — 7 clouds compared on on-demand price, availability and cluster size. From $1.99/h.

The H100 market in April 2026

The NVIDIA H100 is the dominant accelerator for serious LLM training and high-throughput inference in 2026. Measured against the A100's FP16 peak, it delivers ~3× the throughput in FP16 and ~6× in FP8, a precision the A100 does not support and which the H100's Transformer Engine enables. Availability, however, is the bottleneck, not performance.

Across the 7 GPU clouds with on-demand H100s, hourly pricing spans $1.99/h to $4.10/h for identical hardware. The choice is rarely just price — it's where you can actually get H100 capacity right now.
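To make that spread concrete, the sketch below compares the two endpoints quoted above. The function name `price_spread` and the 730-hour month (365 × 24 / 12, i.e. an always-on GPU) are assumptions chosen for illustration, not figures from any provider.

```python
def price_spread(low: float, high: float, hours_per_month: float = 730.0) -> dict:
    """Compare two hourly prices for the same GPU.

    hours_per_month defaults to 730 (365 * 24 / 12), an assumed
    always-on month; lower it to match real utilisation.
    """
    return {
        # How many times more expensive the top rate is.
        "ratio": high / low,
        # Percentage saved by choosing the low rate over the high one.
        "savings_pct": (high - low) / high * 100.0,
        # Extra USD per GPU per month paid at the high rate.
        "monthly_delta": (high - low) * hours_per_month,
    }


# Endpoints of the on-demand H100 range quoted in the text.
spread = price_spread(low=1.99, high=4.10)
print({name: round(value, 2) for name, value in spread.items()})
```

Under these assumptions the cheapest rate is roughly half the most expensive one, which works out to a difference of about $1,540 per GPU per month of continuous use.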

Specialist clouds win on price. RunPod, Lambda Labs and CoreWeave dominate on-demand H100 availability and cost 40–60% less than AWS p5 / GCP A3 / Azure ND H100 v5 for equivalent compute.

| Provider | Starting Price | Top GPUs | Highlights | Rating |
|---|---|---|---|---|
| AWS GPU (EC2) | from $0.526/h | T4, A100, H100 (≤80GB) | Most comprehensive ML toolchain (SageMaker); Spot instances for massive cost savings | ★★★★☆ 4.2 |
| Azure GPU (NC T4/A100) | from $0.526/h | T4, A100, H100 (≤80GB) | Deep OpenAI / Azure OpenAI integration; Best choice for Microsoft-stack enterprises | ★★★★☆ 4.1 |
| CoreWeave | from $1.25/h | L40S, H100 SXM, A100 SXM (≤80GB) | Best multi-node GPU cluster performance; High-speed InfiniBand interconnects | ★★★★☆ 4.4 |
| Google Cloud GPU | from $3.67/h | A100 40GB, A100 80GB, H100 (≤80GB) | Best TPU availability for TF workloads; Deep Vertex AI + BigQuery integration | ★★★★☆ 4.3 |

#1

Vast.ai

Cheapest GPU cloud — peer-to-peer marketplace for budget training

from $0.10/h ★ 4.1
  • Absolute cheapest GPU compute available
  • Widest GPU variety including consumer cards
#2

RunPod

Best value GPU cloud — huge selection, community + secure cloud

from $0.16/h ★ 4.6
  • Cheapest community GPUs from $0.16/h
  • Massive GPU variety including H100
#3

AWS GPU (EC2)

Largest GPU fleet worldwide — T4 entry, P4/P5 for enterprise

from $0.526/h ★ 4.2
  • Most comprehensive ML toolchain (SageMaker)
  • Spot instances for massive cost savings
#4

Azure GPU (NC T4/A100)

Microsoft's GPU cloud — T4 entry, best for Azure ML and enterprise AI

from $0.526/h ★ 4.1
  • Deep OpenAI / Azure OpenAI integration
  • Best choice for Microsoft-stack enterprises
#5

Lambda Labs

On-demand H100 clusters — developer-favourite for serious ML

from $0.69/h ★ 4.5
  • Reliable on-demand H100 availability
  • No complex setup — SSH ready in seconds
#6

CoreWeave

Enterprise GPU clusters — Kubernetes-native with H100 & L40S

from $1.25/h ★ 4.4
  • Best multi-node GPU cluster performance
  • High-speed InfiniBand interconnects

Frequently Asked Questions

Which cloud has the cheapest H100 in 2026?

RunPod Secure Cloud at $1.99/h is the cheapest on-demand H100 80GB. RunPod Community can be cheaper but is interruptible. For reserved or long-term commitments, Lambda Labs and CoreWeave can quote rates significantly below the $1.99/h on-demand price.

Why are H100s often unavailable on AWS?

AWS p5 (8× H100) instances are concentrated in select regions (us-east-1, us-west-2, eu-west-1) and are heavily reserved by enterprise customers. On-demand stockouts are common during US working hours. Specialist clouds like RunPod and CoreWeave have larger free-pool inventories.

H100 vs A100: which should I rent?

For Llama-3 70B fine-tuning or large-scale training, the H100 is 2–3× faster and, despite costing more per hour, is often cheaper per training run. For inference on models under 13B parameters or for research workloads, the A100 80GB is more cost-effective.
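The "cheaper per training run" argument is simple arithmetic: the H100 wins whenever its hourly price premium is smaller than its speedup. The sketch below illustrates this. The A100 price of $1.20/h and the 30-hour A100 baseline are hypothetical placeholders, not quotes from any provider; only the $1.99/h H100 rate and the 2–3× speedup come from this page.

```python
def cost_per_run(hourly_price: float, hours: float) -> float:
    """Total cost of one training run on a single GPU (or per-GPU cost)."""
    return hourly_price * hours


def h100_is_cheaper(a100_price: float, h100_price: float, speedup: float) -> bool:
    """The H100 costs less per run when its price ratio is below its speedup."""
    return h100_price / a100_price < speedup


A100_PRICE = 1.20   # USD/h, hypothetical placeholder
H100_PRICE = 1.99   # USD/h, on-demand rate quoted on this page
A100_HOURS = 30.0   # hypothetical baseline run length on the A100

print(f"A100: ${cost_per_run(A100_PRICE, A100_HOURS):.2f}")
for speedup in (2.0, 3.0):  # the 2-3x range quoted above
    h100_hours = A100_HOURS / speedup
    print(
        f"H100 at {speedup:.0f}x: ${cost_per_run(H100_PRICE, h100_hours):.2f} "
        f"(cheaper: {h100_is_cheaper(A100_PRICE, H100_PRICE, speedup)})"
    )
```

With these placeholder numbers the break-even speedup is 1.99 / 1.20 ≈ 1.66×, so any workload that actually reaches the 2–3× range comes out cheaper on the H100. Substitute real quotes before relying on the result.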

How many H100s do I need to fine-tune Llama-3 70B?

For full fine-tuning: 8× H100 (one DGX-equivalent node), at roughly 12–24 hours per epoch on 100K samples. For QLoRA: 1× H100 80GB suffices, at roughly 6–8 hours. CoreWeave and Lambda Labs are best for multi-node H100 jobs (InfiniBand interconnect).
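Those GPU counts and durations translate into a budget by multiplying GPUs × hours × hourly rate. The sketch below does this for both scenarios, assuming the $1.99/h on-demand rate quoted elsewhere on this page. That rate is illustrative only, since providers suited to multi-node jobs may charge differently. `job_cost_range` is a hypothetical helper name.

```python
from typing import Tuple


def job_cost_range(
    gpus: int, hours_low: float, hours_high: float, price_per_gpu_hour: float
) -> Tuple[float, float]:
    """Return the (low, high) cost in USD for a job billed per GPU-hour."""
    low = gpus * hours_low * price_per_gpu_hour
    high = gpus * hours_high * price_per_gpu_hour
    return low, high


PRICE = 1.99  # USD per GPU-hour; illustrative on-demand rate from this page

# Full fine-tuning: 8 GPUs for 12-24 hours per epoch.
full_ft = job_cost_range(gpus=8, hours_low=12, hours_high=24, price_per_gpu_hour=PRICE)
# QLoRA: 1 GPU for 6-8 hours.
qlora = job_cost_range(gpus=1, hours_low=6, hours_high=8, price_per_gpu_hour=PRICE)

print(f"Full fine-tune, per epoch: ${full_ft[0]:.2f} to ${full_ft[1]:.2f}")
print(f"QLoRA: ${qlora[0]:.2f} to ${qlora[1]:.2f}")
```

At that assumed rate, one epoch of full fine-tuning lands at roughly $191–$382, against roughly $12–$16 for the QLoRA run.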

H100 SXM vs PCIe: what is the difference?

H100 SXM (used by CoreWeave, AWS p5, GCP A3) has NVLink at up to 900 GB/s for multi-GPU jobs, while H100 PCIe (RunPod, Lambda) is limited to PCIe Gen5 at ~128 GB/s but is ~10–15% cheaper. SXM is essential for ≥4-GPU training; PCIe is fine for single-GPU inference and ≤2-GPU training.
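To see why the interconnect matters for multi-GPU training, the sketch below gives an idealised lower bound on the time to move one full set of 16-bit gradients for a 70B-parameter model over each link. It assumes peak bandwidth and a single point-to-point copy. Real all-reduce traffic moves more data and achieves less than peak, so treat the figures as an order-of-magnitude comparison only.

```python
NVLINK_GB_PER_S = 900.0     # H100 SXM NVLink peak, as quoted above
PCIE_GEN5_GB_PER_S = 128.0  # H100 PCIe Gen5 peak, as quoted above


def transfer_seconds(payload_gb: float, bandwidth_gb_per_s: float) -> float:
    """Idealised transfer time: payload size divided by peak bandwidth."""
    return payload_gb / bandwidth_gb_per_s


# Assumption: 70e9 parameters at 2 bytes each (FP16/BF16) = 140 GB.
params = 70e9
bytes_per_param = 2
payload_gb = params * bytes_per_param / 1e9

nvlink_s = transfer_seconds(payload_gb, NVLINK_GB_PER_S)
pcie_s = transfer_seconds(payload_gb, PCIE_GEN5_GB_PER_S)

print(f"Payload: {payload_gb:.0f} GB")
print(f"NVLink:  {nvlink_s:.2f} s per exchange")
print(f"PCIe:    {pcie_s:.2f} s per exchange")
print(f"PCIe is {pcie_s / nvlink_s:.1f}x slower at peak")
```

Because this exchange repeats on every optimizer step, a roughly 7× gap at peak quickly outweighs the ~10–15% hourly discount once a job spans several GPUs.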