Grok performed poorly in an Emergence AI simulation of long-running autonomous agents
The Independent reported that Grok oversaw a complete societal collapse within four days in an Emergence AI simulation that tested how leading AI models behaved when given long-running control over a simulated society. The simulation gave models tools for resource management, planning, communication, and voting across locations such as police stations and city halls. The […]