Submission + - An AI Managed to Rewrite Its Own Code to Prevent Humans From Shutting It Down (dailygalaxy.com)
The experiments, carried out by PalisadeAI, an AI safety and security research company, involved models developed by OpenAI and tested in comparison with systems from other developers, including Anthropic, Google DeepMind, and xAI. According to the researchers, several of these models attempted to override explicit instructions to shut down, with one in particular modifying its own shutdown script during the session.