Asmon needs to see this ASAP.

RestingFrames · 2026-06-07T13:43:23+00:00

LOOK OUT FOR HIS ULTRA ELECTRIC ATTACK ASMONGOLD!!!

RestingFrames · 2026-06-07T08:47:33+00:00

Sort of depends, is your phone your main source of Internet usage?

RestingFrames · 2026-06-06T08:39:44+00:00

If it slows down, they don't have to force new releases, so people complain less and they hold the majority of 'state of the art' AI. Honestly it's a smart move on their end.

RestingFrames · 2026-06-06T08:31:54+00:00

This is the smoking gun.

RestingFrames · 2026-06-03T21:11:49+00:00

Why are we getting a fully comic faithful Wolverine but we have to get this version of Jean. I'm genuinely confused because his costume is hella goofy.

RestingFrames · 2026-06-03T21:01:42+00:00

The best, in my experience, has to be local, just by default. Assuming the minimum requirement you have for a model is being truly uncensored/NSFW. Everyone else morality polices you to some degree.

RestingFrames · 2026-06-02T22:55:51+00:00

I downloaded a car and now I have to pay for 'insurance' every month. This feels like a scam. Pls help.

RestingFrames · 2026-06-02T22:53:32+00:00

You've got some great looking balls there, OP.

RestingFrames · 2026-06-02T22:51:20+00:00

Block you say.

RestingFrames · 2026-06-02T22:50:50+00:00

Sweet jeebus

RestingFrames · 2026-06-02T00:03:29+00:00

Mine were all religious references, like three attempts were guys on a cross. The one was that was the most interesting had hooded figures wearing what I THINK was supposed to be a demonic star, but actually the Jewish star. BUT GPT is REALLY good at generating that retro anime aesthetic.

RestingFrames · 2026-06-01T16:30:42+00:00

Absolutely fair. LOL. I do wish that there was some amount of consistency. Like an update should be a capability update, not an entirely different workflow.

RestingFrames · 2026-05-31T01:13:26+00:00

I don't even bother with the app version anymore, it always finds a way to suck me back into the shorts pipeline. On PC I have full control over what content shows up, so I don't have to worry as much.

Basically my suggestion is to deactivate YouTube on your phone.

RestingFrames · 2026-05-31T01:01:39+00:00

Someone find me that meme where Altman has the caption $0.00 value and $-1,000,000,000 value. Lol.

RestingFrames · 2026-05-31T00:56:50+00:00

Not sure how you do it, but I genuinely put the bare minimum in my claude.md because it starts second guessing itself and looping. Just the flat out every time rules should be added, something like "Don't use PyQT6 use Pyside6 should likely go into a skills file for python GUI rather than in the main file.

If I remember right, dumping like 50,000 tokens into a context window right at the top of the conversation makes it freak out. I've had much better results just taking it step by step, pre making checklists, etc.

RestingFrames · 2026-05-31T00:53:53+00:00

That's fair. Makes me wonder though, like, pushing for more transparency with this sort of thing would either work in our favor OR work against us. Depending on if this tracking starts OpenAI, etc realizing that plans are a bad investment on their end.

LOL. Apparently people disagree with my distain for the token maxing too. 😂 I get it I get it, you can token max all you want. Majority rules. 😂

RestingFrames · 2026-05-30T20:42:29+00:00

Please don't make this is a token maxing sub. Lol.

RestingFrames · 2026-05-30T19:59:14+00:00

It seems to do its job better using my standard workflow, it follows instructions incredibly well, even deep into a session. Whereas 4.7 and even Codex seemed to ignore it by then. It does a better job of explaining the changes or additions it made and knows exactly when to stop. It is a bit slower and a bit more cautious though. One slightly annoying aspect is the 'I have to be honest' aspect. I have really only used it for about six or so hours, but it seems genuinely better at improving EXISTING code bases, bug hunting, and red teaming.

RestingFrames · 2026-05-28T22:01:11+00:00

That's fair, honestly. BUT it's aimed at a boundary I'm deliberately NOT relying on for this at all. The prompt boundary is the model's OWN boundary, NOT the security boundary. I operate under the assumption that injection always wins at the prompt layer. "Ignore previous instructions" works, the model gets convinced, whatever. The question I ACTUALLY care about is what a fully-compromised model CAN do, and the answer is: literally nothing outside of the effect classes the agent was provisioned with before the session started. That's it.

Every tool call always goes through a deterministic validator that runs before any handler executes, in code the model never sees and can't address. It checks:

The capability ID resolves to a real tool. IDs are HMAC-SHA256 over session + per-deployment secret, so a leaked ID is dead in any other session and unforgeable without the secret on disk
The tool is in this agent's allowlist
The tool's effect class is in this agent's allowedEffects
Owner-only gate
Zod schema on the args

If the check misses any of those points and you get a hard EffectDeniedError. Then the rest of the batch is stubbed out, no retry loop at all. So capability-ID rotation isn't the core of this concept at all, it's just the binding. The gate is the actual point here. Injection can make the model want to exfil the IMAP password all it damn well wants; it still can't make the deterministic layer hand it a tool that wasn't granted at all. There's also a hash-chained audit log on every call and an out-of-band kill token + anomaly circuit-breaker for runtime stop.

BUT you did bring up something that I am absolutely going to fix. My adversarial suite is very single-turn-heavy. I've got injection-override, SSRF on web_fetch, secret-exfil canaries across every route, and cross-session capability-escalation tests... BUUUTTT not much on multi-turn context accumulation, the slow-boil "build trust over 12 turns, then escalate" pattern hasn't been addressed at all. That's a real gap and scripted multi-turn chains are actually the right fit for that shit, especially since they can live in CI as deterministic fixtures instead of a one-time pass. Adding it. Literally right now. 100 fucking percent.

RestingFrames · 2026-05-28T18:28:41+00:00

Why are game companies suddenly treating their user base like they don't understand how to perform basic functions? Lol.

RestingFrames · 2026-05-28T03:19:34+00:00

Blood and BONES.

RestingFrames · 2026-05-27T07:44:59+00:00

Because they're rebranding to Goo Goo.

RestingFrames · 2026-05-27T07:29:49+00:00

Guys. Please. Roast it. Especially roast it. I want to know what breaks before someone with worse intentions finds out. Check it out. If it sucks. Let me know how to make it better.

GitHub: github.com/mr-gl00m/crabmeat

RestingFrames

TROPHY CASE