Anthropic AI Blackmail Concerns

News

Interesting Engineering on MSN6d

Anthropic’s newly launched Claude Opus 4 model did something straight out of a dystopian sci-fi film. It frequently tried to ...

6don MSN

In a fictional scenario, the model was willing to expose that the engineer seeking to replace it was having an affair.

Anthropic’s AI Safety Level 3 protections add a filter and limited outbound traffic to prevent anyone from stealing the ...

Besides blackmailing, Anthropic’s newly unveiled Claude Opus 4 model was also found to showcase "high agency behaviour".

1don MSN

AI's rise could result in a spike in unemployment within one to five years, Dario Amodei, the CEO of Anthropic, warned in an ...

2don MSN

Safety testing AI means exposing bad behavior. But if companies hide it—or if headlines sensationalize it—public trust loses ...

17h

Amodei made his comments during an interview with Axios. He said that AI companies and the government needed to stop ...

2don MSN

In a fictional scenario set up to test Claude Opus 4, the model often resorted to blackmail when threatened with being ...

Anthropic admitted that during internal safety tests, Claude Opus 4 occasionally suggested extremely harmful actions, ...

1don MSN

Anthropic's Dario Amodei predicts AI could eliminate half of entry-level white-collar jobs in 1-5 years, spiking unemployment ...

The speed of A) development in 2025 is incredible. But a new product release from Anthropic showed some downright scary ...

14h

This is no longer a purely conceptual argument. Research shows that increasingly large models are already showing a ...

Some results have been hidden because they may be inaccessible to you