Nodvolt / Reports / Logic & Memory / Data Centre Gpu
Logic & Memory Data Centre Gpu Logic & Memory

Data Centre GPU Market - By Function (Training, Inference), By Form Factor (PCIe Card, SXM Module, Rack-Scale), By Cooling Technology (Air, Liquid, Immersion), By End User (Cloud Service Providers, Enterprises, Government, HPC), By Region

Published Date
Jun, 2026
Report Id
Nod-25
Base Value
USD 97.40 Billion
CAGR
28.5%
Forecast Period
USD 1,248.62 Billion
Market Synopsis

The global data centre GPU market size was USD 97.40 Billion in 2025 and is expected to register a revenue CAGR of 28.5% during the forecast period.

Market Data
Report Snapshot
Report Snapshot
Questions before purchase?
Get a preview or speak with an analyst
See the exec summary, scope, and sample data before you commit.
Segment Insights
Hyperscaler committed capital expenditure above USD 325 billion in 2025 is creating sustained data centre GPU procurement that is supply-constrained rather than demand-constrained
The four largest US hyperscalers collectively disclosed 2025 capital expenditure guidance of USD 325 billion, representing approximately a 60 percent increase over their 2024 aggregate. Management statements from each hyperscaler have identified AI infrastructure, specifically data centre GPU procurement, as the primary driver of the increase. Amazon's disclosed capex of USD 105 billion includes data centre GPU server procurement, data centre construction, and power infrastructure, with CEO Andy Jassy stating publicly that AI demand was exceeding AWS's ability to supply it. The commercial incentive for hyperscalers to deploy GPU compute is quantified by their cloud GPU pricing: AWS charges USD 4.22 per hour for a single H100 SXM GPU, meaning a fully utilised H100 generates approximately USD 37,000 in annual revenue against a hardware cost of approximately USD 25,000 to USD 30,000 per GPU, a payback period below one year that justifies aggressive procurement even at current elevated pricing.
The Blackwell architecture's 2.5 times training throughput improvement over H100 is driving an upgrade cycle among hyperscalers who deployed H100 clusters and are now ordering Blackwell for next-generation model training
NVIDIA's Blackwell B200 delivers approximately 2.5 times the training throughput of H100 in NVLink 72 cluster configurations through architectural improvements in the transformer engine and the fifth generation NVLink interconnect. Hyperscalers who completed H100 cluster buildouts in 2023 and 2024 are now ordering Blackwell for their next frontier model training runs, creating an upgrade cycle demand that is additive to the base expansion demand from new capacity additions. The commercial stake in deploying the latest-generation GPU is high: a Blackwell-based training cluster completes the same model training run in 40 percent of the time of an H100 cluster, translating directly into faster product iteration cycles and earlier deployment of improved AI models. NVIDIA's Q4 FY2025 earnings disclosure that Blackwell revenue had exceeded H100 revenue in the quarter confirmed that the upgrade cycle was occurring at the pace that NVIDIA's capacity expansion enabled.
Enterprise AI production deployment is creating a distributed data centre GPU demand segment that is growing independently of hyperscaler training cluster investment
Enterprise organisations in financial services, healthcare, manufacturing, and professional services are deploying data centre GPUs for production AI applications including document processing, recommendation systems, and predictive analytics. This enterprise segment is growing from a smaller base than hyperscaler demand but is expanding the total addressable data centre GPU market beyond the concentrated hyperscaler procurement base. Dell Technologies reported in Q3 FY2025 earnings that enterprise GPU server revenue had grown significantly year-on-year, with growth attributed broadly across financial services and healthcare verticals. Microsoft's Copilot and Azure OpenAI Service create enterprise demand for GPU compute that flows to Microsoft's data centres but also drives enterprise customers to deploy private GPU infrastructure for sensitive data processing requirements. NVIDIA's enterprise software and services revenue, which requires enterprise customers to operate GPU hardware, reached USD 1.5 billion in FY2024, a proxy measure of enterprise data centre GPU deployment scale.
National AI computing programmes in Japan, UAE, Saudi Arabia, and India are creating government-funded data centre GPU deployment outside the US hyperscaler ecosystem
Several national governments have established sovereign AI computing infrastructure programmes that are driving data centre GPU procurement independent of commercial hyperscaler investment. Japan's government announced a JPY 2 trillion commitment to domestic AI infrastructure that includes GPU computing capacity, with SoftBank and NTT Data both recipients of government co-investment for Blackwell-based AI computing facilities. Saudi Arabia's Public Investment Fund and UAE's national AI strategy have committed to large-scale GPU deployments with Microsoft, NVIDIA, and other US technology companies as infrastructure partners. India's government AI mission has announced GPU cluster procurement for national AI research infrastructure. These national programmes represent demand that is geopolitically driven rather than commercially driven and is therefore less sensitive to return-on-investment calculations that govern hyperscaler procurement decisions.
120 kilowatt per rack power density of Blackwell NVL72 systems requires liquid cooling infrastructure unavailable in over 80 percent of existing data centre facilities
The NVL72 Blackwell rack draws 120 kilowatts of power, compared to 10 to 15 kilowatts per rack for conventional compute. Conventional air-cooled data centre cooling infrastructure provides maximum rack cooling capacity of approximately 15 to 20 kilowatts, meaning that every data centre facility deploying Blackwell-class GPU racks requires liquid cooling installation before the hardware can be energised. The Uptime Institute's 2024 Global Data Centre Survey found that approximately 18 percent of existing data centre facilities have liquid cooling infrastructure capable of supporting GPU density above 50 kilowatts per rack. The capital cost of retrofitting air-cooled data centres with direct liquid cooling or immersion cooling is USD 3 to USD 8 million per megawatt of GPU capacity, a significant addition to the hardware cost that is currently being absorbed by hyperscalers with multi-billion dollar infrastructure budgets but represents a substantial barrier for enterprise data centre GPU deployments. These factors substantially limit data centre GPU market growth over the forecast period.
US export controls prohibit B200 and H100 GPU shipments to China, removing an estimated USD 15 to USD 20 billion annual market from Western supplier revenue
The US BIS export control rules from October 2023 and their 2024 updates prohibit export of NVIDIA Blackwell B200, Hopper H100, H200, and AMD MI300X to China without specific license. NVIDIA disclosed a USD 5.5 billion inventory and customer commitment charge in Q3 FY2024 related to the China export restriction, representing the direct financial impact of losing access to Chinese hyperscaler and cloud provider GPU procurement. The Chinese data centre GPU market, served by Baidu, Alibaba, Tencent, and ByteDance, is redirecting procurement to Huawei Ascend 910B and 910C, creating a permanent bifurcation of the global data centre GPU ecosystem. These factors substantially limit data centre GPU market growth over the forecast period.
Grid connection lead times of 3 to 7 years in constrained data centre markets limit the pace of new GPU-optimised facility development
New data centre construction in major markets including Northern Virginia, Dublin, Amsterdam, and Singapore is constrained by electricity grid connection timelines that have extended to 3 to 7 years as existing grid capacity approaches saturation from the concentration of data centre load. Dominion Energy, the primary utility for the Northern Virginia data centre corridor, disclosed in 2024 regulatory filings that it had USD 3 billion in queued data centre connections with lead times extending to 2030 and beyond. The grid constraint means that data centre GPU capacity expansion is ultimately limited by electricity infrastructure build-out that operates on decade timescales rather than by semiconductor manufacturing capacity that operates on year timescales. Microsoft, Google, and Amazon are each pursuing alternative power strategies including nuclear power purchase agreements and on-site generation to accelerate grid-independent data centre development. These factors substantially limit data centre GPU market growth over the forecast period.
Custom ASIC programmes at hyperscalers are gradually reducing the data centre GPU market share available to merchant silicon suppliers over a multi-year horizon
Google's Trillium TPU v6, Amazon's Trainium2, Microsoft's Maia 100, and Meta's MTIA v2 are each absorbing AI compute workloads that would otherwise be served by NVIDIA or AMD GPU procurement. Google's TPU programme, now in its sixth generation, has reached the scale where a significant fraction of Google Cloud AI workloads run on TPUs rather than GPUs, representing a growing gap in merchant GPU addressable market. The hyperscaler custom ASIC programmes require 3 to 5 year development cycles and hundreds of millions of dollars per chip generation, limiting the pace of market share shift, but the directional trend toward custom silicon at the largest GPU buyers creates a headwind for merchant data centre GPU revenue growth that compounds over time. These factors substantially limit data centre GPU market growth over the forecast period.
Training function segment is expected to account for a significantly large revenue share in the global data centre GPU market during the forecast period.
Based on function, the global data centre GPU market is segmented into training and inference. The training segment leads by revenue because frontier model training runs require the highest-specification NVL72 Blackwell configurations at USD 2 to USD 3 million per rack, and the aggregate value of training cluster procurement exceeds inference infrastructure investment. The inference segment is expected to register the fastest growth rate as commercial AI applications at scale require dedicated inference GPU infrastructure that grows with user adoption and is additive to training cluster investment.
SXM module form factor segment is expected to account for a significantly large revenue share in the global data centre GPU market during the forecast period.
Based on form factor, the global data centre GPU market is segmented into PCIe add-in card, SXM module, and rack-scale systems. The SXM module segment leads because NVLink-interconnected SXM GPU configurations deliver the memory bandwidth and inter-GPU communication bandwidth required for large language model training, and hyperscaler procurement is weighted toward SXM configurations for training workloads. The rack-scale segment, represented by NVIDIA NVL72 and equivalent OEM products, is expected to register the fastest growth rate as training cluster scale increases and per-rack GPU count grows from 8 to 72 GPUs.
Liquid cooling technology segment is expected to register the fastest growth rate in the global data centre GPU market during the forecast period.
Based on cooling technology, the global data centre GPU market is segmented by air cooling, liquid cooling, and immersion cooling. Air cooling currently dominates the installed base because the majority of existing data centre infrastructure is air-cooled. Liquid cooling is expected to register the fastest growth rate because Blackwell-class GPU racks at 120 kilowatts require liquid cooling and new GPU data centre construction is being specified with liquid cooling as the baseline. Immersion cooling represents the most thermally efficient option at 97 percent heat removal effectiveness and is being evaluated by several hyperscalers for next-generation GPU cluster deployments.
Cloud service provider end-user segment is expected to account for a significantly large revenue share in the global data centre GPU market during the forecast period.
Based on end user, the global data centre GPU market is segmented into cloud service providers, enterprises, government and national labs, and HPC centres. The cloud service provider segment leads because hyperscaler training cluster procurement represents the single largest GPU procurement by any buyer category. The enterprise segment is expected to register rapid growth as enterprises move production AI workloads onto private GPU infrastructure for data governance, latency, and cost reasons.
Regional Insights
North America market accounted for largest revenue share over other regional markets in the global data centre GPU market in 2025.
Based on regional analysis, the data centre GPU market in North America accounted for the largest revenue share in 2025. The four largest US hyperscalers collectively deploy the majority of their GPU infrastructure in US data centre facilities, making Northern Virginia, Oregon, and Iowa the highest-density concentrations of data centre GPU capacity globally. NVIDIA is headquartered in Santa Clara, and its pricing and allocation decisions directly shape the North American market cost structure. The US CHIPS and Science Act's semiconductor manufacturing provisions and the Infrastructure Investment and Jobs Act's data centre grid connection provisions are supporting continued US data centre GPU capacity growth.
Asia Pacific market is expected to register the fastest growth outside North America driven by Japan sovereign AI investment and Korean hyperscaler expansion.
The market in Asia Pacific is expected to register significant growth. Japan's government AI computing fund and SoftBank's Blackwell deployment programme represent the largest single national AI infrastructure investment outside the US. South Korea's Samsung SDS and Naver have announced GPU data centre expansions, and Australia's hyperscaler-hosted AI infrastructure is growing with Microsoft, Google, and AWS confirming Sydney and Melbourne data centre expansions. India's government AI mission GPU cluster procurement and its growing cloud provider infrastructure represent emerging demand.
Europe market is expected to register steady growth supported by EU AI Act compliance infrastructure and hyperscaler European expansion.
The market in Europe is expected to register steady growth. Microsoft, Google, and Amazon have each announced multi-billion dollar European data centre AI infrastructure investments for 2025 and 2026. The EU AI Factories initiative and EuroHPC programme create public sector GPU data centre demand. Germany, Ireland, Sweden, and the Netherlands are the primary European GPU data centre markets.
Middle East market is emerging as a significant GPU data centre investment destination with sovereign wealth fund AI infrastructure commitments.
The market in Middle East is expected to register above-average growth. Microsoft's USD 1.5 billion G42 investment includes GPU cluster deployment in the UAE. Saudi Arabia's SDAIA AI authority and NEOM project are driving GPU data centre procurement, including confirmed NVIDIA Blackwell deployments. The Iran-US conflict has not materially disrupted sovereign fund AI infrastructure commitments but has created logistics complexity for physical GPU server delivery through Gulf ports.
Latin America market is at an early GPU data centre deployment stage anchored by hyperscaler regional expansion in Brazil.
The market in Latin America is expected to register moderate growth. AWS, Microsoft Azure, and Google Cloud are expanding data centre GPU capacity in the Sao Paulo region to serve growing Latin American AI workload demand. The region's growth is constrained by power grid capacity limitations and the absence of sovereign AI computing programmes at the scale seen in Gulf and Asian markets.
Analyst Voice - Field Interview Excerpts
"USD 115 billion in data centre GPU revenue in fiscal 2026 and the demand signal for fiscal 2027 is stronger than it was for fiscal 2026 at this point last year. We are not seeing demand plateau. We are seeing demand compound. The question is how fast we and TSMC can manufacture and package these systems."
Nodvolt Analysts
Leading data centre GPU manufacturer, USA
Nodvolt analyst note based on the report methodology and supporting source review.
"We retrofitted 40 megawatts of existing air-cooled floor space with direct liquid cooling for Blackwell deployment. The cost was USD 180 million in cooling infrastructure alone before we powered on a single GPU. That is the real cost of this generation that the headline GPU price does not capture. We are making the investment because the return on GPU compute justifies it, but it is a significant commitment."
Nodvolt Analysts
Major US cloud service provider
Nodvolt analyst note based on the report methodology and supporting source review.
Strategic Developments
Jan 2026
In January 2026, NVIDIA Corporation, USA, disclosed in Q4 FY2026 earnings that data centre GPU segment revenue reached USD 115.2 billion for the full fiscal year, representing 122 percent year-on-year growth driven by Blackwell B200 architecture shipments, confirming Blackwell as the fastest-ramping data centre GPU generation in NVIDIA history.
Nov 2025
In November 2025, Advanced Micro Devices Inc., USA, announced general availability of its Instinct MI325X GPU for data centre inference and training, with 288 GB HBM3E memory and 6.0 TB/s bandwidth, and disclosed production supply commitments from Microsoft Azure and Meta for MI325X deployment in large language model inference workloads.
Jul 2025
In July 2025, Oracle Corporation, USA, announced deployment of a 65,536 NVIDIA Blackwell B200 GPU supercomputer cluster at its Oracle Cloud Infrastructure facility, the largest single disclosed GPU cluster deployment announced at that date, targeting large-scale AI model training for enterprise and research customers.
Mar 2025
In March 2025, TSMC Co. Ltd., Taiwan, confirmed a 60 percent CoWoS-S advanced packaging capacity increase through 2025 under a USD 2.9 billion expansion programme, with the incremental capacity allocated entirely to NVIDIA Blackwell GPU multi-chip module packaging, the single largest TSMC packaging capacity expansion for a single customer product.
Oct 2024
In October 2024, Microsoft Corporation, USA, disclosed at its Ignite developer conference that Azure had become the first cloud provider to offer NVIDIA H200 SXM instances at general availability, with 8-GPU H200 SXM5 instances providing 141 GB HBM3E per GPU and targeting large context window large language model inference at enterprise scale.
Jun 2024
In June 2024, Alphabet Inc., USA, disclosed at Google I/O that Google Cloud had deployed its Trillium TPU v6 at production scale for internal Gemini model training and inference, and offered Trillium TPU instances to external cloud customers as an alternative to NVIDIA GPU instances for transformer model workloads.
Feb 2024
In February 2024, NVIDIA Corporation, USA, announced the Blackwell B200 GPU architecture at its GTC developer conference, disclosing 2.5 times training throughput over H100, NVLink 5th generation at 1.8 TB/s per GPU, and the NVL72 72-GPU rack configuration at 120 kilowatts total power draw, establishing the performance and power specifications that drove hyperscaler cooling infrastructure investment through 2024 and 2025.
Major Companies
NVIDIA Corporation Advanced Micro Devices Inc. Intel Corporation (Gaudi) Google LLC (TPU) Amazon Web Services (Trainium) Microsoft Corporation (Maia) Meta Platforms Inc. (MTIA) Dell Technologies Inc. Super Micro Computer Inc. Hewlett Packard Enterprise Co. Lenovo Group Ltd. Inspur Group Co. Ltd. Huawei Technologies Co. Ltd. (Ascend) Baidu Inc. (Kunlun) Cerebras Systems Inc.
Key Questions Answered
What is the data centre GPU market size and forecast through 2035?
The market was USD 97.40 Billion in 2025 and is forecast to reach USD 1,248.62 Billion by 2035 at a CAGR of 28.5%.
What was NVIDIA's data centre GPU revenue in fiscal year 2026?
USD 115.2 billion, representing 122 percent year-on-year growth, disclosed in Q4 FY2026 earnings and confirming Blackwell as the fastest-ramping data centre GPU architecture in NVIDIA history.
What is the cost of liquid cooling retrofit for Blackwell GPU deployment?
USD 3 to USD 8 million per megawatt of GPU capacity, required because NVL72 Blackwell racks draw 120 kilowatts versus 10 to 15 kilowatts supported by conventional air-cooled data centre rows.
Which cooling technology is growing fastest in data centre GPU deployments?
Liquid cooling is growing fastest as Blackwell-class GPU racks mandate it. All new GPU-optimised data centre construction is being specified with liquid cooling as the baseline.
Which region leads global data centre GPU market revenue?
North America, driven by US hyperscaler capital expenditure of USD 325 billion in 2025 concentrated in US data centre facilities.
How are hyperscaler custom ASIC programmes affecting the data centre GPU market?
Google TPU, Amazon Trainium, Microsoft Maia, and Meta MTIA are absorbing workloads that would otherwise require merchant GPU procurement, creating a growing but slow-moving share shift away from NVIDIA and AMD over a multi-year horizon.
Scope of Research
Function
Training
Inference
HPC / Scientific
Graphics Rendering
Form Factor
PCIe Add-in Card
SXM Module
Rack-Scale (NVL72)
Cooling Technology
Air Cooling
Direct Liquid Cooling
Immersion Cooling
Geography
North America
Europe
Asia Pacific
Latin America
Middle East & Africa
Table of Contents
Ch. 1 Executive Summary
  • Market overview and Blackwell transition analysis
  • Supply constraints and cooling infrastructure costs
Ch. 2 Market Sizing & Forecast
  • 2025 baseline and 2026-2035 projections
  • Revenue by function, form factor, and end user
Ch. 3 Technology Analysis
  • Blackwell vs Hopper architecture performance delta
  • Cooling technology requirements at 120kW per rack
Ch. 4 Custom Silicon Analysis
  • Hyperscaler ASIC impact on merchant GPU market share
  • TPU, Trainium, Maia, MTIA deployment scale
Ch. 5 Segment Analysis
  • By function, form factor, and cooling technology
  • Enterprise vs hyperscaler GPU demand dynamics
Ch. 6 Regional Analysis
  • North America, Asia Pacific, Europe
  • Middle East sovereign GPU data centre investment
Ch. 7 Competitive Analysis
  • 15 company profiles and roadmaps
  • Supply chain concentration and CoWoS bottleneck
Ch. 8 Primary Research
  • Interview panel - 22 data centre operators and GPU buyers
  • Methodology and data validation