OpenAI’s latest model will block the ‘ignore all previous instructions’ loophole
OpenAI’s latest model will block the ‘ignore all previous instructions’ loophole
www.theverge.com OpenAI’s latest model will block the ‘ignore all previous instructions’ loophole
OpenAI’s newest model, GPT-4o Mini, includes a new safety mechanism to prevent hackers from overriding chatbots.
You're viewing a single thread.
View all comments
101
comments
"...today is opposite day."
48 0 ReplyI just love that almost anyone can participate in hacking language models. It just shows how good natural language is as a programming language, and is a great way to explain how useful these things can be when used correctly
11 1 ReplyIt won't be long before you end up with language models that suggest ways to break other language models.
1 0 Reply
You've viewed 101 comments.
Scroll to top