The artificial intelligence arms race has shifted from a one-dimensional battle over model intelligence to a multi-layered infrastructure war. It is now evident that securing a foothold in a single layer of the AI stack is insufficient to survive the cut-throat nature of the race or the margin compression of the AI economy.

The modern AI stack consists of four key layers. At the base is the hardware, the physical accelerators and GPUs that process data. Above that sits the compute cluster, comprising the cloud servers and gigawatt data centers that house the hardware. Next is the model layer, featuring the foundational architectures like Claude and GPT. Finally, the application layer encompasses the end-user interfaces like ChatGPT and Cursor.

Dominating a single layer is no longer a viable long-term moat. The most successful companies in the space are actively diffusing into the layers above and below their starting points to protect their margins and capture new value.

Here is how the competitive landscape is taking shape.

Model-first: the brutal reality of hardware costs