Tech & Science

Nvidia's NVL72 AI Server Rack Costs $7.8 Million Per Unit

Nvidia's NVL72 AI server rack, featuring Rubin GPUs and Vera CPUs, carries a component cost near $7.8 million, limiting buyers to major hyperscalers and sovereign AI programs.

·May 21, 2026 at 5:32 PM·2 min read

Nvidia's NVL72 AI Server Rack Costs $7.8 Million Per Unit

Priced at approximately $7.8 million per unit, Nvidia's upcoming NVL72 AI server rack is set to debut in the second half of 2026, targeting a select group of buyers such as US hyperscalers and sovereign AI programs. This figure, derived from a Morgan Stanley analysis, highlights the significant investment required for next-generation AI infrastructure.

NVL72 Hardware Specifications and Costs

The NVL72 rack integrates 72 Rubin GPUs alongside 36 Vera CPUs. Morgan Stanley estimates each Rubin GPU costs around $55,000, marking a 57% increase over the current Blackwell chips. The GPUs alone contribute roughly $4 million to the total component cost.

Memory expenses constitute a major portion of the price. Each Rubin GPU is equipped with 288 GB of HBM4 high-bandwidth memory, while each Vera CPU is paired with 1.5 TB of LPDDR5X. Altogether, a full rack contains 20.7 TB of video memory and 54 TB of system RAM. According to Morgan Stanley, memory costs have surged by 435% compared to the previous generation, largely due to the complexity of manufacturing HBM4 versus the HBM3e used in Blackwell systems.

Manufacturing Challenges and Supply Constraints

Printed circuit board (PCB) costs have risen by 233%, reflecting the stringent manufacturing tolerances needed to handle the data transfer speeds demanded by these systems. Nvidia is effectively establishing new production standards with each generation, which results in higher supplier charges.

Add Daily Beirut to your Google News feed to get the latest first.Follow

Additionally, supply constraints for HBM4 memory are impacting production. Analysts at KeyBanc report that neither Micron nor SK Hynix currently meet qualification targets or volume commitments, which could reduce 2026 rack production from an anticipated 12,000–14,000 units to about 6,000.

Market and Buyer Profile

Major cloud providers such as AWS, Google Cloud, Microsoft Azure, and CoreWeave have announced plans to deploy Rubin architecture in the latter half of 2026. The $7.8 million price tag is tailored for these hyperscalers.

OpenAI's April 2026 decision to pause its Stargate UK data center, citing energy expenses and regulatory delays, underscores the concentrated nature of this market. Electricity costs in the UK are roughly 4.4 times higher than in the US, and grid approval processes can extend project timelines by years.

Industry Implications for AI Hardware Costs

The Rubin platform exemplifies the scale of investment flowing into AI hardware infrastructure. Compared to this, Nvidia's consumer and workstation GPU business remains relatively small. Morgan Stanley analysts emphasize that such elevated costs are becoming standard in the industry, indicating that affordable AI will continue to be driven primarily by software and cloud services rather than hardware affordability.

Lebanon

NVL72 Hardware Specifications and Costs

Manufacturing Challenges and Supply Constraints

Market and Buyer Profile

Industry Implications for AI Hardware Costs

Latest news

Will Lebanon Redistribute the Frequency Spectrum to Boost the Internet?

Iranian MP Confirms No Reopening of Global Internet Network

American Host Semra Hunter Joins ITV's World Cup Team

Liverpool Owners Consider Firing Slut After Salah's Criticism