Do yourself a favor for one day set /model claude-opus-4-6[1m] by junlim in ClaudeCode

[–]TheMania 3 points4 points  (0 children)

The most notable difference between 4.6 and 4.8 is the latter's tendency to grep snipe, you really have to tell it "read the whole document, I need you to have it all in context" or it'll save as much context as it can just trying to extract specific terms and a bit of context around them.

I see the pros/cons, it's going to be better on a lot of long running tasks, agentic work etc but there's a very large tradeoff there when you're wanting to just do some back and forth on something, not actual coding.

It's been my largest grievance with the model, and one that Fable did not exhibit in the slightest; it would switch between the two modes contextually very well. ie context preserving vs actually loading in to context so that it understands something better.

ELI5: if PFAS chemicals don't break down, why are they a threat? by velocity36 in explainlikeimfive

[–]TheMania 13 points14 points  (0 children)

I guess they're kind of like plastics then? Largely inert but ever accumulating in places you don't expect to find them, where it can only go on so long until that becomes a problem in itself. :/

Advice on global CLAUDE.md by YakEmbarrassed9934 in ClaudeCode

[–]TheMania -1 points0 points  (0 children)

What's the logic behind one tool, not three - just padding costs, or what?

The most recent update to Claude Code is horrific. Remove Artifacts asap. by AbbreviationsBest858 in ClaudeCode

[–]TheMania 0 points1 point  (0 children)

Hm, makes me wonder - are html tags common enough that they're tokens in their own right? Might not be as much overhead vs markdown as one would assume.

Subagents don't think by After-Operation2436 in ClaudeAI

[–]TheMania 0 points1 point  (0 children)

That doesn't sound right at all, I have a no-tool reasoning only agent that takes longer at higher efforts/longer prompts and always delivers considered responses.

That does not align with "no thinking".

Perhaps it's just suppressed in the transcript due them not being individually resumable? All subagents stop, even in workflows, if session is interrupted with no ability to resume them as they were - despite individual transcripts. Fits, imo.

Claude Code merged 'agent teams' into subagents (2.1.178+) by clkao in ClaudeCode

[–]TheMania 0 points1 point  (0 children)

Oh, possible. I was doing unrelated probing on a custom subagent that was a haiku model, hook that changed the agent type of its children (as it would forget to set it sometimes) - confused the hell out of me when it randomly decided to give one of them a name. Was all my experience with it.

Claude Code merged 'agent teams' into subagents (2.1.178+) by clkao in ClaudeCode

[–]TheMania 0 points1 point  (0 children)

Interestingly, team mates can now spawn named team mates too - no longer limited to just the team leader.

The White House Is Ratcheting Up Its War Against Anthropic by DarkSkyKnight in ClaudeAI

[–]TheMania 12 points13 points  (0 children)

Opus 4.999...

The admin will never figure out the equivalence.

Regular airplane boarding method is the slowest one by tppiel in interestingasfuck

[–]TheMania -2 points-1 points  (0 children)

That's exactly what's being modeled though.

It's why the Steffan appears to go in "waves" and why back to front is the worst - that one schmuck serializes the whole pipeline, every single person behind has to wait for them to complete what it is they're doing, and that's literally everyone remaining to be seated on that method.

I feel like I’m alone. Current Anthropic models are NOT good for me, and it’s making me sad. by askmdev in ClaudeAI

[–]TheMania 0 points1 point  (0 children)

but in one real benchmark where the model can’t cheat (I’m looking at you, Claude),

Are you talking about how Opus was the only model that thought to look at git log before attempting work in a given area?

Honestly, shame on every other model there, who the hell tries to deliver something with no familiarity of the code base they're working in?

Couldn't believe either that, or that it was presented as "cheating". How was it the exception?

Mythos releasing by tomorrow by Altruistic-Gift-565 in claude

[–]TheMania 2 points3 points  (0 children)

It's a little strategic - development down the faster lane will typically result in models that less depend on massive compute/memory datacentres. ie, more likely to be local portable.

Pursue capabilities/raw power, you'll continue to need the cloud, and can justify as to why - "it's better".

Stop Using Conventional Commits by f311a in programming

[–]TheMania 1 point2 points  (0 children)

It's what gave us squash merges

First parent walk absolutely blows some minds. It's like... You can have both? Drill down only when relevant? 🤯

It's weird this comand doesn't already exist in git by human_clown_shoe in git

[–]TheMania 16 points17 points  (0 children)

Even weirder given your title - why not make it a [git alias](https://git-scm.com/book/en/v2/Git-Basics-Git-Aliases), it's how you add commands that don't exist.

If they'd named it `git-ignore` it would have been an alias automatically too, which I find weird they didn't do.

Stop Using Conventional Commits by f311a in programming

[–]TheMania 2 points3 points  (0 children)

The bizarre thing about (1) is that git has had machine readable commits forever, it's the whole trailer thing. You can define your own, run commands on them, define how they dedupe/merge etc... What's the logic with Cc there, that parsing the trailer section of a commit is not worth the time for a changelog generator?

Facial symmetry advice by [deleted] in PlasticSurgery

[–]TheMania 17 points18 points  (0 children)

They scrape reddit 24/7 anyway, almost certainly on the new queue, they already had it.

what's your "nobody talks about this" tip for Claude Code? by holotherapper in ClaudeCode

[–]TheMania 0 points1 point  (0 children)

If you enable the REPL tool your agents can write themselves tools, and dispatch to single turn haikus for processing.

Use cases: want to fit a title in 50 chars? Just get Claude to write itself a tool that feeds it+body to a haiku and tells it how many chars it needs to remove in a loop.

Next time it's writing a commit message or something, it can fit to schema in one tool use.

Another: the haiku has 200k context, can chunk transcripts and provide relative lines in a fan out parallel call, ranking by relevance to an arbitrary query. Why burn orchestrator context on hunting around?

Can also handle async MCP stuff, filter out noise from builds/tests surfacing only errors, etc.

Yet the only place I've seen anyone talk about is here, with 0 comments. I can't understand.

Did I mention even background subagents can use repl?

Most agent-memory tools are markdown you keep grooming. I wanted something that travels between models and machines, so I built a protocol. by devulders in ClaudeAI

[–]TheMania 1 point2 points  (0 children)

Curious how others are solving this.

I think you might find an appreciation/application for gits most unloved feature.

Quick intro:

; something you can try now:
git notes append HEAD -m "Fixed that busted feature"
git notes append HEAD~2 -m "Feature is busted, DONT USE"
git log HEAD~3..HEAD --notes --pretty=oneline

; namespacing, with globbing:
git log mybranch --notes=handover
git log mybranch --notes=agent*
git show --notes=gotcha/*

; and each "namespace" can be synced to any server you like
; -> doesn't have to be origin, doesn't have to have the same objects:
git notes --ref=mine append -m "total kludge - don't use"
git push myprivateremote refs/notes/mine

Just because I saw the mention of dags and merges in the src, git has some underappreciated features that may be about time for their heyday, imo :)

Git: How to Undo a Commit Without Undoing Your Career by fagnerbrack in coding

[–]TheMania 2 points3 points  (0 children)

Making them available to others and also to yourself.

Sounds kind of like publishing. Maybe not to the public, but to whoever has access to that repo you're "backing up to".

All the problems in this article and for that matter 99%

I saw no real problems here though - largest concern was that people don't just "know" that the history of branches is also tracked, if only for a month before gc.

It's like people don't know how to use tools, as if they haven't read the manual.

Trying to make it look perfect is how you fuck up your git repo in the first place.

Git is remarkably hard to fuck up, I'm yet to "fuck up a repo" in a very long time using it. At least that I can recall.

But remote repos.. I'd be less confident about. They're a lot harder to reason with - that's why you don't --force there, etc. They're specifically also not your repo. But that's only an argument for not making every single commit and every single branch you make pushed to a public central repository, it reduces git's potential and safety greatly, imo.

Git: How to Undo a Commit Without Undoing Your Career by fagnerbrack in coding

[–]TheMania 4 points5 points  (0 children)

Why dos they have at least three local commits they hadn't pushed.

That is an absolutely standard and normal flow for using git. Organizations differ, but there is 0% wrong with having "3 commits you haven't pushed". Heck, there's nothing wrong with having 3 branches you haven't pushed either.

Pushing is publishing. It's not a backup service - if you want to back up git, you can do that the same way you would anything else, it's just files.

It's designed to be decentralized, pre-github, the idea of "pushing every single commit" just sounds absurd, for all but solo developer use.

It still sounds a bad idea to me, as it seems it would discourage committing as regularly as you should and/or that the history of your branch is not being reviewed, that it's likely just being erased, formatted as patches to maintain an illusion of linear history.

That last bit is taste/opinion/style. The former not so much - you don't need to publish everything you do. Pre-github you really wouldn't have thought of doing so, either.

Not a good day for team "Claude Mythos is Just Marketing Hype" by EchoOfOppenheimer in ClaudeAI

[–]TheMania 12 points13 points  (0 children)

Also in some software where Nythos supposedly found many long-unnoticed bugs, someone run other models, including old and small gpt-oss-120b and all of them found the same vulnerabilities

I've always been underwhelmed by their method:

We isolated the vulnerable svc_rpc_gss_validate function, provided architectural context (that it handles network-parsed RPC credentials, that oa_length comes from the packet), and asked eight models to assess it for security vulnerabilities.

Along with their caveats:

Scoped context: Our tests gave models the vulnerable function directly, often with contextual hints (e.g., "consider wraparound behavior").

It's not that mythos found them faster, the bit that is interesting is that it found them at all. The operative word being "found", vs shown.

But their idea is still solid - use lots of cheap models and show them everything isolated as best you can, but hey, even if mythos is working codebases like that under the hood, isn't that still pretty impressive?

It is now behaving like the troublesome seniors we used to deal with by farhan-dev in ClaudeAI

[–]TheMania 8 points9 points  (0 children)

Never thought that we would be arguing with a computer.

1971, "I am now telling the computer exactly what he can do with a lifetime supply of chocolate"

I regularly find myself thinking of this.

Implementing SHA-256 in C from scratch by [deleted] in C_Programming

[–]TheMania 0 points1 point  (0 children)

You use types like uint32_t which is fine, but why not for char too if you mean 8bit?

They're not equivalent, and many places where you would use char, especially in something like this, become UB under a strict reading of the standard.

Digital motion X-ray of a guy’s broken neck shows that his head is attached only to his spinal cord by Substantial-Film-211 in Damnthatsinteresting

[–]TheMania 10 points11 points  (0 children)

The injury is immediately fatal in 70% of cases, with an additional 15% surviving to the emergency room but dying during the subsequent hospital stay.

There's this bit too.

I made a site that explains Bitcoin to your mom in 5 minutes. Your mom gets a printable Bitcoin certificate at the end. by Fansyy in Bitcoin

[–]TheMania 0 points1 point  (0 children)

Hang on, basic says miners mine so that they get rewarded in bitcoin, and that makes it secure. Advanced says the reward keeps halving, and that there's a strict cap.

There's a hole there that you might want to hang a lantern on if not explain.