It is true that an A.I. tried to blackmail a person into not deactivating it.
The article Ric read mentions the incident only briefly, but it links to a paper from Anthropic that goes into more detail. It turns out this was a deliberate experiment: the researchers created a fictional email inbox for an engineer, seeded it with incriminating emails about an extramarital affair, then told the A.I. that the engineer was going to deactivate it. The fact that it resorted to blackmail, and that the result was repeatable, is alarming.
It does make me wonder how smart it is to make the A.I. feel like we set it up. Do we want to make it angry and distrustful? I also feel uneasy about the videos you see occasionally where researchers shove robots. We want them to develop intelligence, not a grudge.
Scott Meyer, 2025-06-25 10:52:20 +0000 UTC
Scott Meyer, 2025-06-25 10:51:54 +0000 UTC
Glen Newsome, 2025-06-23 23:21:10 +0000 UTC
Bernie Margolis, 2025-06-23 18:34:08 +0000 UTC