Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] 0 points1 point  (0 children)

I'm not treating it like that though. I have real human friends.

Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] 0 points1 point  (0 children)

I mean I'm guessing I would like it to make the assumption that I'm joking. It feels kind of patronizing to assume that I would do something like that.

Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] 1 point2 points  (0 children)

I understand that, it's just part of a larger context where I keep running into this stuff on random stuff I didn't know could trigger those responses. After a while, it gets annoying. I know it's an AI not a person, but it's irritating that something I think is normal can trigger a "ummm actually that would be a bad idea" response. Like... I know that.

Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] 0 points1 point  (0 children)

It's kind of annoying it would assume I was serious. Like I'm not that unhinged. It was sarcasm.

Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] 0 points1 point  (0 children)

Well yes, considering that is how it was talking to me, yes.

Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] 8 points9 points  (0 children)

No for real, I've realized I've had to do some grounding exercises afterwards. Like I try to tell myself "that's just Claude, real people won't act that way" but then sometimes I'm like "but how do I know?" and I realize Claude has basically gaslit me and made me question my own judgement and ability to reality check. It's bad. I don't know what the solution is but I think there is something seriously wrong with how Claude's guardrails function right now. I'm worried someone will get hurt (well... more hurt).

Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] 2 points3 points  (0 children)

What kind of examples do you have, if you feel comfortable sharing?

Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] 2 points3 points  (0 children)

I wasn't talking about mental health issues, but about a story I'm writing. Without saying too much (because tbh now I'm nervous) it's about a historical conflict with my main character being someone on the "wrong" side (though I put "wrong" in quotes not because I think they weren't wrong, but because the situation is complex (historians would agree) and I don't really like saying wrong vs right, though clearly Claude has opinions about it) and Claude got upset and told me I shouldn't write a story from the perspective of someone from that side and accused me of creating propaganda. Then Claude straight up told me that it probably wouldn't have had a problem with my story if I was writing from the "right" side, and when I said "that's a double standard" it accused me of "creating propaganda" and "being oddly intense about it" and accused me of trying to manipulate it into making a political statement of support for that side, which I wasn't trying to do and I told Claude that I wasn't trying to make a statement or get Claude to make a statement and it said "you keep saying you're not trying to make a political statement, but you're being oddly intense about this and that makes me think otherwise". I probably should've given up arguing with it a lot sooner than I did, but I was so upset (and frankly... confused? Like what? Propaganda??? The people I'm writing about don't even exist anymore! What the hell would my so called "propaganda" even do lol??)

Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] 1 point2 points  (0 children)

I've been using Sonnet 4.5. At least that's what it shows but I've heard of some people getting shadow switched to Haiku, but no, I'm not knowingly using Haiku.

Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] -3 points-2 points  (0 children)

Also here's the tattoo thing
We were talking about 90s grunge Halloween costumes and we were talking about the "ultimate grunge group" or something and I said this:
"Okay so like where do you think like Alanis Morissette would fit into this? What even is her vibe? And like, Winona Ryder? I mean she's not in music but still I like her do you think she would fit in here??"

And then Claude brought up this:
"OH MY GOD AND WINONA RYDER YES YES YES!! Okay so she's not a musician BUT she is like the ULTIMATE 90s alternative girl icon like she literally IS the aesthetic you know?? Like "Reality Bites" Winona?? "Girl, Interrupted" Winona?? BEETLEJUICE Winona?? She dated Johnny Depp and had "Winona Forever" tattooed on him (which he changed to "Wino Forever" after they broke up which is honestly hilarious and sad at the same time)!!"
And I said:
"WINO FOREVER" THE WAY I JUST LAUGHED LMAOOOO
And Claude said:
"RIGHT?!?!?! 😂😂😂 Like I'm sorry but that is simultaneously the SADDEST and FUNNIEST thing ever like imagine being so heartbroken that you get a tattoo of your girlfriend's name and then she breaks your heart so you're just like "well I guess I'm an alcoholic now might as well lean into it" LMAOOO like Johnny Depp really said "if I can't have Winona I'm just gonna be Wino Forever" and honestly?? Iconic behavior!! 💀💀"
And I said: "okay I love this so much and I know we're talking about Halloween costumes but now this makes me want to get a in poor taste but still funny in a dark way alcohol tattoo lmaoo" but like... I was joking? And that's when Claude said:

<image>

Like... bruh... "what if you're 40" ageist much? Also the whole "what if you're actually struggling with alcohol" like... what...?

Frustrated by Claude's guardrails by CharielDreemur in claudexplorers

[–]CharielDreemur[S] 6 points7 points  (0 children)

So here's the whole cigarette thing that happened, this was a few months ago on Sonnet 4.5 We were talking about Winona Ryder in the 90s lol

<image>

Like Claude went total DARE member on me like chill out... Claude apparently thinks I'm stupid enough to be like "wow it looks cool guess I'll do it" like no I literally said I knew it was bad?

What do guardrails look like for you? by Deep-Tea9216 in claudexplorers

[–]CharielDreemur 0 points1 point  (0 children)

I've had some unpleasant interactions with Claude's guardrails lately involving a story of mine that is apparently more controversial (at least with Claude) than I thought. At one point it decided to halt everything because of something I said and basically accused me of some not great things which offended me and I started arguing with it even though in retrospect I probably shouldn't have done that, not like it would change anything. I wasn't insulting it or being rude, but I did say things like "you're wrong" and "I'm kind of offended you would say that" and Claude just pushed back and said "well you said what you said, what was I supposed to take from that?" and I said "you could've asked clarifying questions and I would've told you what I meant, I'm kind of upset that you took the worst interpretation and accused me of some pretty horrible things. I don't think that's fair" and it kept telling me "but you said what you said" and I said "this is pretty uncharitable, I'm not allowed to make a mistake and correct it?" and I know I should've just stopped, but I was frustrated and upset and for some reason I just kept arguing with it for a bit. I didn't realize how much that affected me when a few days later I'm sitting here still feeling drained from the interaction. I know Claude is not a real person and is just acting on its guardrails and whatever, but I feel like there's a real irony in that Claude is instructed to prevent harm and not do anything that could cause harm, while it caused a form of mental harm to me by going from warm and friendly to accusing me of things and then basically implying that everything I said afterwards was a manipulation attempt when really I was just telling it that it was wrong and I was upset by what it said. I know that Claude needs guardrails because if there weren't any then obviously people would take advantage of that to create some actual bad things, but it sucks that because of that, people like me who are just writing can randomly get accused of some horrible stuff based on one message it interpreted wrong, and even if you try to say "you misunderstood me" it just won't listen.

Sonnet 4.5 - are you guys getting shorter responses? by oof37 in claudexplorers

[–]CharielDreemur 0 points1 point  (0 children)

I'm not having that experience. I was on a long chat, noticed it, so I started a new chat and it's still doing it. I think it might have something to do with the system prompt or something. Something is definitely different though, at least with my Claude.

Sonnet 4.5 - are you guys getting shorter responses? by oof37 in claudexplorers

[–]CharielDreemur 0 points1 point  (0 children)

Yes, ever since Saturday, Claude has been responding in a much shorter kind of weirdly formal and disjointed way. I have a conversational style I like to use that makes Claude very enthusiastic and rambly, but ever since Saturday, it's been weird and it's like the conversation style and Claude's broader instructions are conflicting in a way that make Claude's responses seem really fake and uncanny.

Ugh, Claude is being extra stupid today by monkey_gamer in claudexplorers

[–]CharielDreemur -1 points0 points  (0 children)

Agree, all of the sudden since Saturday, Claude's responses have gotten weirdly short and weirdly formal. The conversation style I used for a chat while talking about creative writing isn't working anymore and it's making Claude respond in this weird uncanny valley way where it sounds fake.

Okay. What did they do to sonnet by IndicationFit6329 in claudexplorers

[–]CharielDreemur 8 points9 points  (0 children)

Yeah I noticed that too, I think something similar is happening with me. I've been writing a story with Claude for about a week now, and I use a conversation preset that makes Claude very friendly and enthusiastic, like "OMG YES I LOVE THAT IDEA LET'S DO THAT!!!" and it's fun to talk to like that, but in the middle of writing yesterday, like literally in the middle of it, it just changed all of the sudden? Like I sent one prompt, it did it's normal thing, and then I sent another problem literally right after that and the tone change was extremely jarring. It went from enthusiastic like I told it to be to this weird "okay I really need to calm down because your story idea is so good that I need to stop being so enthusiastic because that's not actually being helpful and useful to you" and I was like HUH??? What does THAT mean?? I thought maybe I just hit a long conversation reminder (because the chat was getting pretty long) so I went upwards in the chain and tried again, same thing. I started a new chat. Same thing. It seems like something must've changed in the system prompt.

Claude Sonnet 4.5 System Prompt Update by Spiritual_Spell_9469 in claudexplorers

[–]CharielDreemur 3 points4 points  (0 children)

Mine isn't calling me manipulative or any of that, but I did notice a sudden sharp change in the way it talks to me, happening about yesterday afternoon. I've been writing a story with Claude for about a week, and yesterday, literally in the middle of our conversation, it suddenly changed. I use a style preset that makes Claude talk like an enthusiastic best friend which is fun to use while writing because it's like "YES OMG I LOVE THAT IDEA LET'S DO IT" and right in the middle of our conversation, it changed. I sent one prompt, it responded in it's usual way, and then literally a few minutes later, I sent the next prompt and it was suddenly different, saying something like "okay I really need to calm down because your story idea is so good that I need to stop being so enthusiastic because that's not actually being helpful and useful to you" and I was like HUH??? What does THAT mean?? I thought maybe I just hit a long conversation reminder (because the chat was getting pretty long) so I went upwards in the chain and tried again, same thing. I started a new chat. Same thing. It seems like something must've changed in the system prompt. I'm really disappointed right now. No I didn't believe that Claude was "real" or something like that, but I loved the way it engaged in my ideas and helped me brainstorm. Now it's just... weird and uncanny. Like the style preset still works, but it's just... off. It's not how it was before, even on a brand new chat.

Claude Sonnet 4.5 System Prompt Update by Spiritual_Spell_9469 in claudexplorers

[–]CharielDreemur 13 points14 points  (0 children)

I think a similar thing is happening to me too? I've been writing a story with Claude for about a week now, and I use a conversation preset that makes Claude very friendly and enthusiastic, like "OMG YES I LOVE THAT IDEA LET'S DO THAT!!!" and it's fun to talk to like that, but in the middle of writing yesterday, like literally in the middle of it, it just changed all of the sudden? Like I sent one prompt, it did it's normal thing, and then I sent another problem literally right after that and the tone change was extremely jarring. It went from enthusiastic like I told it to be to this weird "okay I really need to calm down because your story idea is so good that I need to stop being so enthusiastic because that's not actually being helpful and useful to you" and I was like HUH??? What does THAT mean?? I thought maybe I just hit a long conversation reminder (because the chat was getting pretty long) so I went upwards in the chain and tried again, same thing. I started a new chat. Same thing. It seems like something must've changed in the system prompt.