Skip Navigation

Scientists Train AI to Be Evil, Find They Can't Reverse It

Scientists Train AI to Be Evil, Find They Can't Reverse It::How hard would it be to train an AI model to be secretly evil? As it turns out, according to Anthropic researchers, not very.

You're viewing a single thread.

22 comments
22 comments