As token costs shoot up, Dell doubles down on desktop AI
As enterprises confront spiraling cloud inference bills, on-prem AI computing is emerging as the decisive lever for making agentic AI economically viable at scale.
Dell Technologies Inc.’s announcement of Dell Deskside Agentic AI put on-prem AI computing at the center of its enterprise strategy, framing local compute not merely as an alternative to the cloud but as the essential foundation for agentic workflows where token costs can make or break return on investment. The calculus is straightforward: Research agents that burn $600 per cloud run in a single session become a very different financial proposition when the compute is owned outright and sits at the desk, according to Marc Hammons (pictured, right), senior distinguished engineer at Dell.
“The cloud is where frontier models go first. It’s where cutting-edge AI goes first,” Hammons said. “It’s also where your costs are going to be buried if you don’t do something about that. The opportunity … is to bring some of that compute locally on the machine and start to adjust the tokenomics of the situation.”
Hammons and Charlie Walker (left), senior director and general manager for Dell Pro Max and Dell Pro Rugged, spoke with theCUBE’s John Furrier and Dave Vellante at Dell Technologies World 2026, during an exclusive broadcast on theCUBE, SiliconANGLE Media’s livestreaming studio. They discussed how agentic AI is landing at the desktop, the tokenomics of on-prem versus cloud inference and use cases ranging from sovereign research agents to deskside supercomputers. (* Disclosure below.)










