Meet the AI jailbreakers: ‘I see the worst things humanity has produced’

To test the safety and security of AI, hackers have to trick large language models into breaking their own rules. It requires ingenuity and manipulation – and can come at a deep emotional cost

Source: Meet the AI jailbreakers: ‘I see the worst things humanity has produced’ | AI (artificial intelligence) | The Guardian

timbatchelder Avatar

Posted by

Leave a Reply

Discover more from TimBatchelder

Subscribe now to keep reading and get access to the full archive.

Continue reading