Anthropic, the AI safety company behind Claude, is in active negotiations with Microsoft to deploy the tech giant’s custom-designed AI chips for inference workloads. The talks signal a broader strategic shift in how leading AI labs think about the hardware powering their models, and it has ripple effects that extend well beyond Silicon Valley.
The discussions were first reported by The Information, and they come on the heels of a sweeping partnership between Anthropic, Microsoft, and NVIDIA that already includes a staggering $30B commitment from Anthropic to purchase Azure computing resources.
What the deal looks like
Here’s the thing about running large language models: training them is expensive, but inference, the part where the model actually responds to user queries, is where the ongoing costs pile up. Every time you ask Claude to help draft an email or summarize a research paper, that’s inference. And at scale, those costs add up fast.
Anthropic’s interest in Microsoft’s in-house silicon is squarely focused on that inference side of the equation. The goal is straightforward: drive down the per-query cost of running Claude across millions of users.














