To test the safety and security of AI, hackers have to trick large language models into breaking their own rules. It requires ingenuity and manipulation - and can come at a deep emotional cost ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results