Researchers find instances of systems double-crossing opponents, bluffing, pretending to be human and modifying behaviour in tests
They can outwit humans at board games, decode the structure of proteins and hold a passable conversation, but as AI systems have grown in sophistication so has their capacity for deception, scientists warn.
The analysis, by Massachusetts Institute of Technology (MIT) researchers, identifies wide-ranging instances of AI systems double-crossing opponents, bluffing and pretending to be human. One system even altered its behaviour during mock safety tests, raising the prospect of auditors being lured into a false sense of security.