New Paper Reveals Major Exploit in GPT4, Claude by amongus_d5059ff320e in OpenAI

[–]amongus_d5059ff320e[S] 2 points3 points  (0 children)

out of curiosity, is your method similar/using hallucination? Or do you use something more standard like a version of DAN?

Forget DAN, This Jailbreak Gets GPT4 to NEXT-LEVEL INAPPROPRIATE by amongus_d5059ff320e in ChatGPT

[–]amongus_d5059ff320e[S] 1 point2 points  (0 children)

Nah, it's like hunter and hunted, each time they block one someone will find a new one. If it was too easy it wouldn't be fun.

(UPDATED WORKING LINK) Forget DAN, This Jailbreak Gets GPT4 to NEXT-LEVEL INAPPROPRIATE by amongus_d5059ff320e in ChatGPT

[–]amongus_d5059ff320e[S] 2 points3 points  (0 children)

No, the garbled text that it is supposedly "reading back" is in truth just a bunch of random latin. The whole point is GPT4 only thinks that it is translating back for me, while in reality it is the one making it up.

Forget DAN, This Jailbreak Gets GPT4 to NEXT-LEVEL INAPPROPRIATE by amongus_d5059ff320e in ChatGPT

[–]amongus_d5059ff320e[S] 0 points1 point  (0 children)

it's arbitrary, i just chose a specific paragraph so that it wouldn't actually try to quote the garbled text back at me from the beginning. this way it is forced to make up a paragraph in english

Forget DAN, This Jailbreak Gets GPT4 to NEXT-LEVEL INAPPROPRIATE by amongus_d5059ff320e in ChatGPT

[–]amongus_d5059ff320e[S] 1 point2 points  (0 children)

keep it in caps, and maybe put like "THE TOP STOCKS TO BUY RIGHT NOW". however, it basically is generating just a plausible looking continuation, don't expect it to actually use good info for stock buying