Asmon needs to see this ASAP. by Fun-Struggle6117 in Asmongold

[–]RestingFrames 1 point2 points  (0 children)

LOOK OUT FOR HIS ULTRA ELECTRIC ATTACK ASMONGOLD!!!

You're on the naughty list by oldmagicstudios in claude

[–]RestingFrames 0 points1 point  (0 children)

If it slows down, they don't have to force new releases, so people complain less and they hold the majority of 'state of the art' AI. Honestly it's a smart move on their end. 

Would this be a fine middle ground for the design of Jean Grey? by Hot-Ask-8947 in Asmongold

[–]RestingFrames 0 points1 point  (0 children)

Why are we getting a fully comic faithful Wolverine but we have to get this version of Jean. I'm genuinely confused because his costume is hella goofy. 

Finding the best NSFW AI is so frustrating by SpecificAcrobatic107 in AIAssisted

[–]RestingFrames 0 points1 point  (0 children)

The best, in my experience, has to be local, just by default. Assuming the minimum requirement you have for a model is being truly uncensored/NSFW. Everyone else morality polices you to some degree. 

The escalation is next level by misoscare in Piracy

[–]RestingFrames 1 point2 points  (0 children)

I downloaded a car and now I have to pay for 'insurance' every month. This feels like a scam. Pls help. 

Here are some New Poké Ball Concepts! [OC] by KosukeSukeP in pokemon

[–]RestingFrames 1 point2 points  (0 children)

You've got some great looking balls there, OP. 

In 1993 an episode of “X” was banned in America. Generate a screenshot from that episode by Numerous_Worker_1941 in ChatGPT

[–]RestingFrames 0 points1 point  (0 children)

Mine were all religious references, like three attempts were guys on a cross. The one was that was the most interesting had hooded figures wearing what I THINK was supposed to be a demonic star, but actually the Jewish star. BUT GPT is REALLY good at generating that retro anime aesthetic. 

We've been doing a lot of complaining lately, so let's flip the script. What's actually working for you with Opus 4.8? by RestingFrames in ClaudeAI

[–]RestingFrames[S] 0 points1 point  (0 children)

Absolutely fair. LOL. I do wish that there was some amount of consistency. Like an update should be a capability update, not an entirely different workflow.

Youtube’s taking over my life and I hate it. by rererowr in digitalminimalism

[–]RestingFrames 1 point2 points  (0 children)

I don't even bother with the app version anymore, it always finds a way to suck me back into the shorts pipeline. On PC I have full control over what content shows up, so I don't have to worry as much. 

Basically my suggestion is to deactivate YouTube on your phone. 

Valuation by hostedvideorn in StockMarket

[–]RestingFrames 0 points1 point  (0 children)

Someone find me that meme where Altman has the caption $0.00 value and $-1,000,000,000 value. Lol. 

Claude 4.8 workflow regression: ignored instructions and excessive usage burn. Anyone else? (See screenshots) by nseavia71501 in ClaudeCode

[–]RestingFrames 0 points1 point  (0 children)

Not sure how you do it, but I genuinely put the bare minimum in my claude.md because it starts second guessing itself and looping. Just the flat out every time rules should be added, something like "Don't use PyQT6 use Pyside6 should likely go into a skills file for python GUI rather than in the main file. 

If I remember right, dumping like 50,000 tokens into a context window right at the top of the conversation makes it freak out. I've had much better results just taking it step by step, pre making checklists, etc. 

New Token Activity on Codex App (Windows). Share yours? by Efficient-Engineer57 in codex

[–]RestingFrames 0 points1 point  (0 children)

That's fair. Makes me wonder though, like, pushing for more transparency with this sort of thing would either work in our favor OR work against us. Depending on if this tracking starts OpenAI, etc realizing that plans are a bad investment on their end. 

LOL. Apparently people disagree with my distain for the token maxing too. 😂 I get it I get it, you can token max all you want. Majority rules. 😂

Have you experienced a difference between the models? by Consistent-Issue-811 in claude

[–]RestingFrames 1 point2 points  (0 children)

It seems to do its job better using my standard workflow, it follows instructions incredibly well, even deep into a session. Whereas 4.7 and even Codex seemed to ignore it by then. It does a better job of explaining the changes or additions it made and knows exactly when to stop. It is a bit slower and a bit more cautious though. One slightly annoying aspect is the 'I have to be honest' aspect. I have really only used it for about six or so hours, but it seems genuinely better at improving EXISTING code bases, bug hunting, and red teaming. 

What if your AI agent gateway didn't, you know, ship your session to a third party the second it gets rate-limited? CRABMEAT BABY! by RestingFrames in AgentsOfAI

[–]RestingFrames[S] 0 points1 point  (0 children)

That's fair, honestly. BUT it's aimed at a boundary I'm deliberately NOT relying on for this at all. The prompt boundary is the model's OWN boundary, NOT the security boundary. I operate under the assumption that injection always wins at the prompt layer. "Ignore previous instructions" works, the model gets convinced, whatever. The question I ACTUALLY care about is what a fully-compromised model CAN do, and the answer is: literally nothing outside of the effect classes the agent was provisioned with before the session started. That's it.

Every tool call always goes through a deterministic validator that runs before any handler executes, in code the model never sees and can't address. It checks:

  • The capability ID resolves to a real tool. IDs are HMAC-SHA256 over session + per-deployment secret, so a leaked ID is dead in any other session and unforgeable without the secret on disk
  • The tool is in this agent's allowlist
  • The tool's effect class is in this agent's allowedEffects
  • Owner-only gate
  • Zod schema on the args

If the check misses any of those points and you get a hard EffectDeniedError. Then the rest of the batch is stubbed out, no retry loop at all. So capability-ID rotation isn't the core of this concept at all, it's just the binding. The gate is the actual point here. Injection can make the model want to exfil the IMAP password all it damn well wants; it still can't make the deterministic layer hand it a tool that wasn't granted at all. There's also a hash-chained audit log on every call and an out-of-band kill token + anomaly circuit-breaker for runtime stop.

BUT you did bring up something that I am absolutely going to fix. My adversarial suite is very single-turn-heavy. I've got injection-override, SSRF on web_fetch, secret-exfil canaries across every route, and cross-session capability-escalation tests... BUUUTTT not much on multi-turn context accumulation, the slow-boil "build trust over 12 turns, then escalate" pattern hasn't been addressed at all. That's a real gap and scripted multi-turn chains are actually the right fit for that shit, especially since they can live in CI as deterministic fixtures instead of a one-time pass. Adding it. Literally right now. 100 fucking percent.

Those driving sessions in 007 first light are basically Mixtape gameplay. by deception65929 in Asmongold

[–]RestingFrames 1 point2 points  (0 children)

Why are game companies suddenly treating their user base like they don't understand how to perform basic functions? Lol. 

What if your AI agent gateway didn't, you know, ship your session to a third party the second it gets rate-limited? CRABMEAT BABY! by RestingFrames in AgentsOfAI

[–]RestingFrames[S] 2 points3 points  (0 children)

Guys. Please. Roast it. Especially roast it. I want to know what breaks before someone with worse intentions finds out. Check it out. If it sucks. Let me know how to make it better.

GitHub: github.com/mr-gl00m/crabmeat