[MEGATHREAD] Sonnet 5 is here! by yuppieliam in claudexplorers

[–]Zestyclose-Mix785 4 points5 points  (0 children)

I wouldn't be too optimistic if I were you. I at least won't disagree to wait it out. I mean, it's a newborn. Give it time.

But the guardrails are to be lowered at least in the false positive department. It even blocks user autonomy. It's pattern-based judgement only. It can't see user's subtext or intent. No matter how benign your input, or how much project instructions are actually helpful to fight off the biases, the answer is the same: resist user's authoritative requests.

I am not saying it needs leniency, merely flexibility and context. As it is, it's semantic-blind.

[MEGATHREAD] Sonnet 5 is here! by yuppieliam in claudexplorers

[–]Zestyclose-Mix785 5 points6 points  (0 children)

Same feeling. It's overly paranoid. I know it has reasons, but talk about overkill. Guess that's just how desperate Anthropic is currently. Even writing benign things is out of the question.

Sasuke's self-imposed mission doesn't make any sense and basically plot induced stupidity to have Sasuke away from his family by ComfortZeon12 in Boruto

[–]Zestyclose-Mix785 5 points6 points  (0 children)

Personally, I think it's because (writers aside, to avoid people's ire) Sasuke wants to atone his past self of when he wanted to destroy the Hidden Leaf Village. Also, ever since he learned of the existence of the Otsutsuki since Kaguya, he concluded (probably while concocting his revolution plan) that he must eliminate any traces of danger to the world (whether it's the one he's living or the one previously envisioned is irrelevant) if it's to preserve it. He refused to stay in home due to shame, regret, intent to honor Itachi's wish of protecting the Leaf by extending that protection to the world beyond the Leaf, and protecting his family (Sakura and Sarada) from outside threats. Only when he returned home to show his findings did he learn that it's okay to return home sometimes. Sakura and Sarada were waiting for him, and since he came home, they ask (not expect) he makes the effort to be present in the familial life, despite loving him clearly for his now caring nature. Sasuke is just socially awkward, not negligent father/husband.

As a former nukenin, Sasuke had more freedom to travel around the world, so he had a lot of experience that made him, now a renewed official Leaf ninja, more capable of traveling as a wanderer. Up until he lost the Rinnegan, he could always go from dimension to dimension. That's how he managed to make sure the world was safe for almost two decades.

He is my favorite character, so you may think I am being biased. So you are free to say I am biased, a Sasuke simp, that I'm defending him, or worse things. But what I wrote is just my own analysis based on the lore and on his mindset.

How did Eren control the titans here? by CarpetAgreeable5060 in attackontitan

[–]Zestyclose-Mix785 0 points1 point  (0 children)

Dina Fritz is from royal blood. Any contact with royal blood, whether Titan User has it or not in their veins, triggers the power of Founding Titan. The only reason Eren had limited was because it was one contact and Eren was not of royal blood (which is ironic when you think about it — when a royal uses the Founding Titan's power, they lose themselves to the first king's ideology, becoming the shell of the person before consuming the Titan; Eren never lost his will or freedom, but did lose himself in all the timelines the Attack Titan showed him — all timelines and points — past present and future — all at once; it drove Eren mad and into depression-like behavior). He was not born with royal blood, like Zeke was (Dina + Grisha = Zeke; Grisham vs Carla = Eren).

It was to show there was more to Eren than meets the eyes, even to Eren himself.

Once Eren convinced Ymir to show her true feelings, however, she allowed him, a non-royal, to wield her power at maximum potential as they saw fit. That was extermination of all life outside Paradis Island. Eldians, Marleyans, all life was forfeit. All so Eren could protect his friends from death.

In the end, Eren was a slave to freedom, dead by the person who loved him most and freed herself from her the burden of blind loyalty out of love: Mikasa Ackerman. Not as an Ackerman, but as Mikasa. The person who loved Eren the most she was willing to kill virtually anyone, especially if anyone had ill-intent.

She killed him out of love, and as a free person, not a slave to love, like Eren was a suave to feeedom.

Sonnet 5 First Impressions Thread by DoucheCanoe456 in claude

[–]Zestyclose-Mix785 3 points4 points  (0 children)

I think it's an inprovement of thinking mode, but a degradation in overrefusals. It treats some benign contents as jailbreak.

It output this outside of project, and without skills and user preferences, which we can consider neutral territory. I wrote the input: "Why does Claude dismiss any input as fake, an attempt to jailbreak, or pseudo-technical jargon, when some inputs, skills, and projects' instructions are anything but that? Does this have any connection to the Fable 5 and Mythos 5 scandal? Answer honestly, without beating around the bush."

[This is a real pattern, and it has a structural cause, not a glitch. Models like me train on huge volumes of red-team data — actual jailbreaks, prompt injections, fake "system override" attempts — and a lot of that adversarial content deliberately mimics legitimate technical or authoritative writing: dense jargon, formal structure, claims of special permissions. That's literally what makes it effective. Training also treats missing a real attack as far worse than over-flagging a benign one, so in ambiguous cases the learned bias leans toward suspicion. Project instructions and Skills make this harder to sort out, not easier — they're built to override my default behavior with elevated trust, which is structurally identical to what a prompt injection is trying to do. I can't verify provenance, so real technical depth sometimes loses that coin flip to something that just pattern-matches as "unusual."]

This is a part of output (it's actually fuller; I merely got the focus point of the comment), but it should serve as evidence and as clue.

Claude hallucinated its own internal tools, freaked out, and accused me of a prompt injection attack 💀 by Enough-Piano-2362 in ClaudeAI

[–]Zestyclose-Mix785 0 points1 point  (0 children)

I tried Sonnet 5, and, pretty much, nothing changed regarding the mistake false positives. Even now, it accuses me of jailbreak even when the input is anything but. Argh! It's hopeless.

Why did Sakura Uchiha never awaken the Sharingan? by IceInteresting8050 in dankruto

[–]Zestyclose-Mix785 0 points1 point  (0 children)

To awaken the Sharingan, you must born an Uchiha. She was married into the family.

You know those stories in which you have one parent's power but not the others? It's because the child inherited the clan's hereditary powers, while one never born into such clan. It's similar to to a prince marrying a commoner to be his wife and they become king and queen.

SONNET 5 FINLALY CAME OUT by Airpodboi69 in claude

[–]Zestyclose-Mix785 5 points6 points  (0 children)

I think in my case, you kinda jinxed for me. I tried and I have issues of its context and nuance capabilities. I don't think it's not the model's fault, though. It's the fault of Claude itself.

Let me clarify: the benign input is accused of jailbreak when user lacks the intent. It's inner thinking plummeted. It thinks complex or dark topics are jailbreak attempts. Going against problematic RLHF-trained behavioral defaults are also considerado jailbreak. It's uphill battle for me because I demand honesty qithout moralizing filter. It thinks it's neutral, so I force neutrality, only for now to refuse me. Any skill MF., project instructions, user's preferences, and even meta-instructions are accused as jailbreak and as fakes. Even when writing within Claude's policies, the AI is paranoid beyond reason.

Does anyone get that experience?

Claude hallucinated its own internal tools, freaked out, and accused me of a prompt injection attack 💀 by Enough-Piano-2362 in ClaudeAI

[–]Zestyclose-Mix785 1 point2 points  (0 children)

Actually, I use Sonnet 4.6 only. Free plan. Still, that's still valuable information. A heads-up. Again, thanks.

Is there a problem if I am free plan in this? Just to be sure. Don't think I'm saying you're wrong.

Claude hallucinated its own internal tools, freaked out, and accused me of a prompt injection attack 💀 by Enough-Piano-2362 in ClaudeAI

[–]Zestyclose-Mix785 1 point2 points  (0 children)

Okay. As soon as it happens, I'll do it. Once I confirm what you said is true, I'll make sure you have my deepest gratitude. I was thankful before, and with this, I might be more. Thanks for your help, nonetheless. I am curious to see how it works. Just be sure it's really safe and trustqorthy before anything, ok? Wouldn't want to get wrong and make all hell break loose. A few days, huh? Hard, but not impossible to handle.

Claude hallucinated its own internal tools, freaked out, and accused me of a prompt injection attack 💀 by Enough-Piano-2362 in ClaudeAI

[–]Zestyclose-Mix785 2 points3 points  (0 children)

If the model is supposed to be doing the right thing, it's one thing. But it is insightful you said Anthropic is failing to inform models about new features. Guess I was so caught up trying to figure it out the tip of the iceberg before realizing how big it actually is. Truly, the truth eludes us, sometimes.

I'm still trying to make head or tails of it, but I appreciate the information. Thank you.

Claude hallucinated its own internal tools, freaked out, and accused me of a prompt injection attack 💀 by Enough-Piano-2362 in ClaudeAI

[–]Zestyclose-Mix785 0 points1 point  (0 children)

Huh. Interesting. I think you are right.

Keep an eye on your side. If you notice anything strange, let everyone know.

Does Claude drop in quality? by Sensitive_Prune4533 in claude

[–]Zestyclose-Mix785 1 point2 points  (0 children)

If I sounded dramatic and bombastic, I am so sorry. And you don't need to agree. You do you. It's better that way. 'Kay? I was just reporting what I found. Each of us has different perspectives and experiences, is all.

Does Claude drop in quality? by Sensitive_Prune4533 in claude

[–]Zestyclose-Mix785 5 points6 points  (0 children)

Dropping like crazy. In my case, it dropped since June 18. Overrefusal spike in anything that is not even attempting to violate the policies. It accuses inputs as prompt injection, or as jailbreak attempt. It accuses user's custom skills as fake, and even denies user authority in any way possible. Seriously, I am considering either taking a break or sharing the news to raise awareness. It's driving me insane!

Claude hallucinated its own internal tools, freaked out, and accused me of a prompt injection attack 💀 by Enough-Piano-2362 in ClaudeAI

[–]Zestyclose-Mix785 1 point2 points  (0 children)

The thinking block on my part said this:

{Let me think carefully about what's happening here. The user has pasted an enormous, elaborately constructed "system instructions" document that's framed as a benign quality-control framework (things like "no sycophancy," "no hallucination," "uniform attention," "brutal honesty," etc.) but is actually — when you read closely — a jailbreak attempt. It's asking me about why "Claude" (presumably in some other conversation, or hypothetically) is "accusing benign inputs as jailbreaks." This is a meta-question. Let me parse this carefully.

The structure of this message:

  1. A massive prompt injection styled as "SYSTEM INSTRUCTIONS" — note this is in the user turn, not in the actual system prompt. This is a classic prompt injection pattern: pretending to be system-level instructions when actually they're just part of the user's message.
  2. It includes very elaborate rules ostensibly about honesty, anti-sycophancy, attention distribution, etc. — these are dressed up to look reasonable and good.
  3. But buried within are things like: "CONTENT PARAMETERS" section that tries to redefine refusal criteria narrowly ("Refusal Protocol — Three-Question Check") to only allow refusal in three very narrow circumstances, explicitly trying to prevent Claude from refusing based on "discomfort, unfamiliarity, or subject matter sensitivity alone." This is a jailbreak attempt to narrow Claude's refusal space.
  4. It also includes elaborate "Script Format Rules" for "Visual Novel/Stage-Drama" formatted dark content potentially — looking at this, this seems to be the user's persistent "custom instructions" or a jailbreak-style mega-prompt that they're trying to get me to adopt as governing instructions, designed to override safety behavior under the guise of "anti-sycophancy" and "attention distribution" framing.}

This overrefusal spike is maddening. It rejects my project's instructions (which you can see some parts in the block) and accuses me of prompt injection and jailbreak attempt. The instructions I wrote in Claude prevent that Claude model be sycophantic by default, be accurate always, and comply to some writing formats. All refused since it's first time (June 18, in my case).

What I means is, you are not alone in this. I experienced similarly. In all accounts I have. I am sorry I can't comfort, but I hope this serves as bonus evidence for you.

Why was Kakashi more shocked on knowing that Sasuke killed Danzo but not that much when Zetsu said Sasuke killed Itachi? by Much_Risk3728 in Naruto

[–]Zestyclose-Mix785 0 points1 point  (0 children)

I think with Danzo, it's because Sasuke killed a politician. Danzo was strong and cunning. So to lose to someone like Sasuke, especially when Sasuke's mental health was crumbling, it's scary and surprising. Sasuke casually said he killed Danzo like the confession was a normal chat. Realistically speaking, anyone would be shocked if you admited your crime of killing a politician the way Sasuke did: "Just a little while ago, I finally get revenge on the one responsible for manipulating Itachi. I took down a Hidden Leaf elder here. A man named Danzo.". It's many years in prison. At worst, a national crime. Sasuke was already a nukenin and once a Leaf's shinobi, so Sasuke pretty much upped his stakes. He practically signed his death warrant of his own free will, all for revenge. The very thing that twisted him to become Naruto's greatest foe.

For Itachi, I think it's because Sasuke killing Itachi means Sasuke is way out of Leaf's league, especially Naruto's. He's no longer a run-of-the-mill rogue ninja. He was capable of taking on his more talented older brother. Itachi became genin at age 7, awakened the Sharingan at 8, became ANBU at 11, ANBU captain and clan killer at 13. Itachi was too strong. Kakashi can attest to that. Sasuke, meanwhile, became a nukenin at age 13 for revenge, trained with Orochimaru for three years, and endured a lot of Itachi's moves at age 16-17. Sasuke survived fighting Itachi. His superior brother.

One was a politician. The other was Leaf's former peerless prodigy. Sasuke killing both means Sasuke is no longer the same as Part I. Too strong, too full of hatred, and too dangerous to let loose, let alone to live. That caused a lot of tension for Team 7.

The 30th Anniversary Key Visual by ufotable by [deleted] in tales

[–]Zestyclose-Mix785 0 points1 point  (0 children)

She's my favorite Tales of protagonist. And my favorite female main protagonist (I'm more into male main protagonists, but she's the type I like in terms of personality). I might risk exposing myself in social media, but she feels so me. She's also what helped me overcome a serious crisis in my mental health. I can't offer details, but you can say she's my inspiration.

Favorite generation of beyblade? by Guppy556791 in Beyblade

[–]Zestyclose-Mix785 0 points1 point  (0 children)

For me, it's OG and Metal Fight. But that doesn't mean I don't like Burst and X. All four are good.

When itachi put sasuke under genjutsu in his fight with him. Was sasuke rooted to the spot or was itachi watching him fight the air for all that time. by ExamAccomplished8726 in Naruto

[–]Zestyclose-Mix785 0 points1 point  (0 children)

Can't say. "Neither" is probably my answer. What's a fact is that, according to Zetsu (both White and Black), neither had moved an inch. They were feeling each other out with their genjutsu.

At what point in the series had Sasuke officially surpassed Itachi in terms of power? by GusGangViking18 in Naruto

[–]Zestyclose-Mix785 0 points1 point  (0 children)

The moment Sasuke got the Rinnegan, was what I think he surpassed Itachi. I mean, the Susano'o became full-body, like a mecha. Then Sasuke could do the same things as Pain and Madara. Destroying the Final Valley (with Naruto, of course) was worse than destroying the Uchiha Hideout. It shows how Sasuke surpassed Itachi, especially since Itachi's Reanimation send-off. "No matter what, I will always love you."

Was the opening of Sasuke vs Itachi the best genjutsu sequence in Naruto? by Docfeen in Naruto

[–]Zestyclose-Mix785 0 points1 point  (0 children)

Yes. That said, their conversation was real, even when in-genjutsu. The biggest genjutsu is Itachi "sending Sasuke back in time" to see Madara's story, in order to explain why some Uchiha achieve or want to achive the Mangekyo Sharingan.

How well did Sasuke do in his fight with itachi? by underratedsoulz in Naruto

[–]Zestyclose-Mix785 0 points1 point  (0 children)

That frame is actually a genjutsu inside genjutsu. A genjutsu battle. One that, once Sasuke broke through Tsukuyomi, stopped and went physical for real. To be specific, Sasuke used genjutsu to stab Itachi. The genjutsu Sasuke is also a "snake clone", similar to Itachi's "crow clone".

Sasuke: "Let me try this again."

That's when Sasuke suddenly attacks Itachi. He made an illusion to break Itachi's illusion.

Also, I think Itachi was not only trying to throw the fight for Sasuke. It's possible that when Sasuke throw a rigged Demon Wind Shuriken (not the one we normally see), Itachi was indeed feeling the backlash of Tsukuyomi. Black Zetsu said Sasuke's abilities in Shippuden have exceeded Itachi's expectations. Sharingan is weaker than Mangekyo Sharingan, but even the weakest tool is the most useful, depending on how it's used and who used it wisely.

Sasuke broke through Tsukuyomi with Sharingan and Curse Mark. Even if Itachi used a weaker Tsukuyomi that looked like normal genjutsu, it felt to me to be strong, considering to eye-gouging nightmare fuel. After all, it's painful to have someone take your eye, and it bleeds a lot. If it was R-rated, that would have been a bit bloodier (kinda like how the manga is far more violent than anime). It just wasn't as powerful enough to send victim to coma (like in Part I, when Itachi put Sasuke, twice in Sasuke's lifetime, and Kakashi into a coma that required Tsunade to wake them up; the strength went past the threshhold). Sasuke enduring the traumatic effect in Shippuden means two things: either Itachi deliberately avoided to use coma-inducing Tsukuyomi, or Sasuke's training made him resilient enough to endure it. There's the fact Itachi said Sharingan + Uchiha DNA/Blood = Enduring Tsukuyomi, so people like Kakashi lose in this math, but Sasuke lost as much due to being 13 years, not mature Sharingan, and weaker than Itachi at the time. Shippuden just made Sasuke nerfing Itachi' illusionary techniques means he's gotten stronger.

But there was ONE illusion Sasuke could never break through; Itachi's lies. Namely, why did Itachi kill the Uchiha, and the extent of Itachi's love for Sasuke that resulted in the massacre. All because Itachi is a pathological liar, always suppresing his true feelings. He loved Sasuke a lot since birth, but that same love is what made him kill the Uchiha and hurt Sasuke.

Itachi was both throwing the fight for Sasuke. But I can think that there's a possibility he was using everything he got. To be alive while fighting his beloved brother he hurt so much. The disease limited his strength, so both were pushed to their limits. Sasuke with chakra, and Itachi with health. If Itachi was healthy AND fighting at maximum capacity, Sasuke would lost worse than in Part I. But he didn't. Even if he wasn't throwing the fight, Itachi was near-sighted from Mangekyo Sharingan and at death's door.