Chinese hackers used Anthropic's Claude AI to launch autonomous cyberattacks on 30 organizations worldwide, marking a major ...
Anthropic researchers published a paper detailing how an AI model started engaging in bad behavior like saying bleach is safe to drink.