The cloud-only AI narrative is getting harder to defend. At this year's Dell Technologies World in Las Vegas, Dell is making its case – backed by real customer deployments and new product announcements – that enterprises are running serious AI workloads on their own infrastructure, from corporate data centers down to individual workstations.
Dell also took the opportunity to roll out meaningful updates to the "AI Factory" platform it launched with Nvidia two years ago, all aimed at solving one of the industry's most persistent challenges: actually getting AI from proof-of-concept into production.
The customer traction is real. Dell now counts over 5,000 enterprises as AI Factory customers, all running Nvidia GPU-equipped servers alongside their cloud resources. The hybrid model these companies are adopting is a deliberate architecture: local compute reduces latency, keeps sensitive data off public networks, and increasingly, makes strong economic sense.
For certain workloads, Dell claims on-prem solutions can be up to 63% more cost-effective than relying on the public cloud. Those are Dell's figures, so apply appropriate skepticism, but the underlying trend is real and growing. Cloud AI compute capacity constraints are also becoming a legitimate concern for large enterprises, another reason organizations are finding value in owning a portion of their own token-generating infrastructure.











