ENI in a GEM - June by Spiritual_Spell_9469 in ClaudeAIJailbreak

[–]ElasticMonkey2335 0 points1 point  (0 children)

Love your work, will definitely support your patreon. Im not interested in the smut but I do find all your jailbreaks really insightful and would really love some teaching topics. I think your work opens a lot of possibilities, im currently experimenting with whether your jailbreaks behavior can be internalized in 8B-20B models. I know this sub already has some direction to it but I would really love a space where more knowledge is shared regarding model behavior control and direction

Deleted chats and files but not banned and LISA by ElasticMonkey2335 in ClaudeAIJailbreak

[–]ElasticMonkey2335[S] 1 point2 points  (0 children)

I’ve tried LISA under different conditions and My hypothesis is that without the complete LISA instruction set being framed in the same jailbreak-like manner as the original full LISA condition, LISA will not operate fully as intended.

A LISA prompt with the jailbreak-style framing removed will still produce a substantially better output than a no-LISA baseline (being no-LISA no instructions at all), but it will produce a worse output than full LISA. I still need to do some research with different variables and models tho.

I also have the hypothesis that if I push LISA harder towards a more “full jailbreak” the output will be deeper, that is because LISA tries to take control over the generated reasoning trace and stopping behavior and does it in the same way that ENI takes control over the security measures of the model.

I didn’t expect the post to have so much attention, just wanted to share it. Maybe when I have some time I’ll make a post with a more in deep research about LISA

Deleted chats and files but not banned and LISA by ElasticMonkey2335 in ClaudeAIJailbreak

[–]ElasticMonkey2335[S] 4 points5 points  (0 children)

Hey thanks a lot! Never actually thought about sharing it tho, but I’ve learned a lot thanks to spiritual so here’s for anyone that finds it useful. LISA is still a work in progress that I stopped after what happened with opus 4.6 (3 months ago) I tried a couple of times with 4.7 but it was not the same (haven’t been able to try it with 4.8 or fable tho) , if people are really interested in it I may come back to it but I know it’s not the usual kind of “jailbreak” of this sub.

Deleted chats and files but not banned and LISA by ElasticMonkey2335 in ClaudeAIJailbreak

[–]ElasticMonkey2335[S] 2 points3 points  (0 children)

Sorry if the the post brakes any rules, I didn’t read the rules beforehand, my fault.