AI is clearly accelerating demand for cloud computing, but not in the way many expected. Is the biggest story right now about software innovation? No. It’s about the extraordinary amount of capital flowing into the physical infrastructure needed to support AI at scale. Chips, networking gear, power systems, and massive data centers are becoming the strategic center of gravity for the cloud market as providers race to support model training and inference workloads.

The numbers are hard to ignore. US technology companies, including Alphabet, Amazon, Meta, and Microsoft, are expected to spend about $650 billion on AI-related infrastructure in 2026, up from roughly $410 billion in 2025, according to analysis cited by Reuters. That kind of growth tells us something important. AI is not just another software wave that sits neatly atop the existing cloud stack. It is forcing a redesign of the stack itself.

That redesign reaches deep into the networking and data movement. Nvidia recently announced plans to invest $2 billion each in photonics companies Lumentum and Coherent, which underscores where the pressure points are emerging. The issue is no longer only raw compute. It is also how quickly data can move between processors, racks, and clusters without creating unacceptable bottlenecks or power inefficiencies. As AI systems scale, latency, throughput, and energy usage become first-order economic concerns.