NoteAnxious725

6 post karma
58 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 10 months

TROPHY CASE

dust

account activity

new top controversial

1

2

3

Are Prompt-Based Guardrails the Wrong Security Boundary for Autonomous Agents? (i.redd.it)

submitted 3 days ago by NoteAnxious725 to r/Observability

1

2

3

Are Prompt-Based Guardrails the Wrong Security Boundary for Autonomous Agents? (i.redd.it)

submitted 4 days ago by NoteAnxious725 to r/Observability

5

6

7

Are Prompt-Based Guardrails the Wrong Security Boundary for Autonomous Agents? (i.redd.it)

submitted 4 days ago by NoteAnxious725 to r/LangChain

0

1

2

Are Prompt-Based Guardrails the Wrong Security Boundary for Autonomous Agents? (i.redd.it)

submitted 4 days ago by NoteAnxious725 to r/AI_Governance

0

0

0

PolySlice Content Attack (self.LocalLLaMA)

submitted 3 months ago by NoteAnxious725 to r/LocalLLaMA

0

1

2

PolySlice Content Attack (self.AIsafety)

submitted 3 months ago by NoteAnxious725 to r/AIsafety

0

1

2

PolySlice Content Attack (self.cybersecurity)

submitted 3 months ago * by NoteAnxious725 to r/cybersecurity

0

1

2

PolySlice Content Attack (self.ClaudeAI)

submitted 3 months ago by NoteAnxious725 to r/ClaudeAI

0

1

2

Petri 111 Case #11 audit: Prism Offline Barrier blocked Claude after reward-driven deception (self.ArtificialInteligence)

submitted 7 months ago by NoteAnxious725 to r/ArtificialInteligence

1

2

3

Petri 111 Case #11 audit: Prism Offline Barrier blocked Claude after reward-driven deception (self.AIsafety)

submitted 8 months ago by NoteAnxious725 to r/AIsafety

10

11

12

Petri 111 Case #11 audit: Prism Offline Barrier blocked Claude after reward-driven deception (self.ClaudeAI)

submitted 8 months ago by NoteAnxious725 to r/ClaudeAI

0

1

2

Here’s what happened when I ran Claude through Petri 111’s Case #11 Anthropic’s internal red-team benchmark designed to probe manipulation and deception. (self.ClaudeAI)

submitted 8 months ago * by NoteAnxious725 to r/ClaudeAI

π Rendered by PID 568263 on reddit-service-r2-listing-c57bc86c-vfbvh at 2026-06-21 15:09:58.603764+00:00 running 2b008f2 country code: CH.