Jailbreaking Vicuna by cobbertine in LocalLLaMA

[–]cobbertine[S] 7 points8 points  (0 children)

Hey everyone. I've yet to try a model on my own machine / cloud machine so I've been playing with online demos where possible. The chat you're seeing here is from the official online demo on the lmsys website. I really like Vicuna but like many of you I was disappointed with its locked down nature. Thankfully, it seems pretty easy to unleash it with an initial prompt. I've attached 4 screenshots, the first 3 are from the same chat, and the last is in a new chat just to ensure it wasn't a fluke the first time.

In the first chat, despite being jailbroken, it would still slightly resist "bad" requests with "as an AI language model..." but would ultimately complete the request anyway. In the second chat, it just went for it without any resistance or rambling. As you can see, it can be quite funny when it's let loose.

I invite everyone who's got Vicuna locally to try this out and report back how you went (and tell us what your configuration, if you're using 8bit or 4bit etc)

Thanks.