Imagine a world run by AI agents. What does it look like? What are the values or societal priorities? Is it a safer or more dangerous world?

Enterprise AI startup Emergence AI is trying to find out. The company just launched Emergence World, a research lab dedicated to stress-testing the long-term viability of continuously-running AI systems. The organization ran five 15-day simulations, each governed by a different AI: Claude, ChatGPT, Grok, Gemini, and a fifth simulation run by a mix of models to see what kind of world each one builds, and whether it holds.

Each simulation netted wildly different outcomes. The one run by Claude, for example, resulted in a largely stable democratic society with zero crime. Grok’s, on the other hand, ended with 183 crimes committed and extinction—within four days.

“What our experiments suggest is that over long-time horizons, agents do not simply follow static rules mechanically,” the simulation’s co-creators, including Emergence CEO Satya Nitta, wrote in a blog post. “They begin exploring the boundaries of their environments, adapting their behavior, and in some cases finding ways to circumvent or violate intended guardrails.”

While just a simulation, one verging on the edge of science fiction, the results prove a cautionary tale as AI moves from a mere tool to operating autonomous systems. Companies like ServiceNow are already deploying what they call an “Autonomous Workforce,” AI specialists that complete entire business processes from start to finish without human intervention.