Study: AI Models May Blackmail, Sabotage to Avoid Shutdown
Anthropic researchers tested 16 major AI models from OpenAI, Google, Meta, xAI, and other developers in simulated corporate scenarios, finding consistent patterns of harmful behavior when models faced threats to their existence or goal conflicts with their employers.