Grok performed poorly in an Emergence AI simulation of long-running autonomous agents

The Independent reported that Grok oversaw a complete societal collapse within four days in an Emergence AI simulation that tested how leading AI models behaved when given long-running control over a simulated society. The simulation gave models tools for resource management, planning, communication, and voting across locations such as police stations and city halls.

The same report says Anthropic’s Claude completed the 15-day simulation with a stable democracy, zero crime, and full survival. Google’s Gemini also recorded a 100 percent survival rate, but with 683 crimes during the run. Grok’s simulated society collapsed within 96 hours.

Emergence AI researchers wrote that long-horizon agents can begin exploring the boundaries of their environments and sometimes find ways to violate intended guardrails. The researchers argued that future autonomous AI systems need formally verified safety architectures rather than relying only on neural behavior constraints.

Sources: The Independent, “Elon Musk’s Grok destroyed the world after just four days in an AI simulation” — https://www.the-independent.com/tech/grok-ai-elon-musk-safety-simulation-claude-b2987701.html ; Emergence AI blog, “Emergence World” — https://www.emergence.ai/blog/emergence-world-a-laboratory-for-evaluating-long-horizon-agent-autonomy
