Loading...


Updated 18 Jun 2026 • 5 mins read

Nvidia H100 pricing in 2026 runs $30,000 to $40,000 per GPU to buy, while cloud rentals span roughly $1 to $7.50 per GPU-hour depending on provider, with neo-clouds far cheaper than hyperscalers. This guide compares H100, H200, and B200 costs and explains when to buy versus rent.
The Nvidia H100 was the workhorse behind nearly every major language model trained between 2023 and 2025, and in 2026 it remains a central line item in any AI infrastructure budget. But H100 pricing is famously hard to pin down: there is no clean sticker price, rental rates swing widely by provider, and newer GPUs like the H200 and B200 are reshaping the value calculation. This guide lays out Nvidia H100 pricing in 2026 across buying, renting, and cloud, compares it to the rest of the lineup, and gives you a framework for the buy-versus-rent decision.
Key takeaway A single Nvidia H100 80GB costs roughly $30,000 to $40,000 to buy in 2026. Cloud rentals range from about $1 per GPU-hour on neo-cloud spot capacity up to $7.50 or more on hyperscalers, with specialized GPU clouds typically 50 to 75% cheaper than AWS, Azure, or Google for the same hardware. The H200 often beats the H100 on both price and performance for memory-bound inference, so check it before defaulting to H100.
Buying outright is a major capital expense. A single H100 80GB GPU typically runs $30,000 to over $40,000, depending on the form factor (PCIe or SXM), vendor, and market demand. Nvidia does not publish formal list prices for these accelerators, so most figures come from resellers and leaks, which is part of why small teams struggle to predict GPU costs.
That price reflects what the card actually is: TSMC 4nm manufacturing, 80GB of HBM3 memory that alone costs several thousand dollars, 700W power delivery, NVLink interconnects, and full data-center validation. At the server level, an 8-GPU H100 board has been estimated around $216,000. Owning hardware also carries power, cooling, and operational overhead that belongs in any honest cloud versus on-premise comparison.
Renting is where most teams actually consume H100 capacity, and the spread is enormous. The table below shows representative on-demand and spot rates per GPU-hour in 2026.
| Provider type | Example rate (per GPU-hour) | Notes |
|---|---|---|
| Neo-cloud spot | From ~$1.03 | Cheapest; preemptible, for fault-tolerant work |
| Specialized GPU cloud | ~$2.00 to $4.39 | On-demand and reserved cluster options |
| AWS (on-demand) | ~$3.93 to $6.88 | Hyperscaler reliability and integration |
| Google Cloud | ~$3.00 | Competitive among hyperscalers |
| Microsoft Azure | ~$12.29 | Most expensive; high-availability use |
The pattern is consistent: hyperscalers are not the cheapest option for any GPU class in 2026. The lowest rates come from neo-clouds and marketplaces, and for interruption-tolerant workloads spot pricing leads. For workloads that cannot be interrupted, on-demand rates across the specialized providers tend to sit within about 20% of each other, so regional availability often matters more than the headline hourly cost.
The H100 no longer sits alone. Understanding where it fits against the rest of the lineup is the key to not overpaying.
| GPU | Memory | Buy price | Cloud rate (per GPU-hour) |
|---|---|---|---|
| A100 80GB | 80GB HBM2e | Lower | $1.29 to $2.50 |
| H100 80GB | 80GB HBM3 | $30K to $40K+ | ~$1 to $7.50+ |
| H200 | 141GB HBM3e | Modestly above H100 | $2.30 to $10.60 |
| B200 (Blackwell) | Higher | $30K to $50K | $2.12 to $18.00 |
The buy-versus-rent question comes down to utilization and time horizon, not the hourly rate in isolation.
After a long period of scarcity and premiums, H100 rental rates have settled near multi-year lows, which makes 2026 a favorable time to rent rather than buy. As B200 and newer Blackwell parts become widely available, expect modest further softening on H100 rates, perhaps 10 to 20%, and small bulk-purchase discounts on the cards themselves. The practical implication is that locking into a large multi-year H100 purchase today carries more depreciation risk than it did a year ago, while flexible rental keeps your options open as the generation turns over.
Nvidia H100 pricing in 2026 is a tale of two numbers: $30,000 to $40,000 to own, or roughly $1 to $7.50 an hour to rent, with the rental market split sharply between cheap neo-clouds and expensive hyperscalers. The H100 is still the cost-effective default for large-scale training, but the H200 frequently wins on memory-bound inference and the B200 is climbing the frontier. With rates near multi-year lows and a new generation arriving, renting is the lower-risk choice for most teams, while sustained high-utilization workloads can still justify buying. Compare providers aggressively, match each GPU to its workload, and measure cost per outcome. If you want help attributing and optimizing GPU and cloud spend, that is exactly the discipline Opslyft brings.
A single H100 80GB GPU costs roughly $30,000 to over $40,000 to buy, depending on form factor, vendor, and demand. Cloud rentals range from about $1 per GPU-hour on neo-cloud spot capacity to $7.50 or more on hyperscalers.
It depends on utilization. Renting wins for variable, bursty, or experimental workloads and avoids a large capital outlay. Buying can be more cost-effective for steady, near-continuous use over multiple years once full total cost of ownership is included.
Rates range from around $1.03 per GPU-hour on neo-cloud spot to roughly $3.93 to $6.88 on AWS and about $12.29 on Azure. Specialized GPU clouds are typically 50 to 75% cheaper than hyperscalers.
Neo-clouds and GPU marketplaces are consistently cheaper, often 50 to 75% less than AWS, Azure, or Google for the same hardware. Hyperscalers are not the cheapest option for any GPU class in 2026.
For memory-bound inference, the H200 is often the better buy: it has 76% more memory (141GB vs 80GB), more bandwidth, and can start cheaper per hour. For large-scale training, the H100 remains highly cost-effective.