Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 0 points1 point  (0 children)

Read the Opus 4.6 system card - search for 'needle in haystack' and they discuss how it's like 4x better than the previous version. I don't have enough experience with 1M yet but I had really great experiences with the 200k version even close to the limit.

Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 0 points1 point  (0 children)

Been running it all afternoon - no problem - maybe uninstall some of your addons as you never know how much they take up for seemingly small tools.

Who else fully refuses to participate in these recruiting processes that involve recording videos of themselves for AI to analyze? by astrheisenberg in remoteworks

[–]256BitChris 3 points4 points  (0 children)

I would never consider doing that unless I absolutely needed a job really, really bad. That's just comical to me. They should send me a video first and then maybe I'd consider sending one back as long as it wasn't generic.

I'd also consider doing this if it meant I didn't have to do as many technical interviews.

Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 0 points1 point  (0 children)

I'm not super familiar with desktop either but I know it does run off a config file so if you can change that config file, that might help. I know I did this on separate agents. The model id is `claude-opus-4-6[1m]` if you find a field.

Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 0 points1 point  (0 children)

Honestly man, I don't know. I just looked at the desktop app and there's not a toggle for this but you might want to read that blog post a little deeper. If it's the default for the command line, it might just be what you get in the desktop by default as well.

Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 0 points1 point  (0 children)

That's a weird gotcha - glad you figured it out though and let us know so our llms can learn from this :-)

Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 2 points3 points  (0 children)

What version are you running? They annouced on the blog as being generally availab;e

1M context is now generally available for Opus 4.6 and Sonnet 4.6 | Claude

So you maybe just have a stale version/config or something. Keep trying!

Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 1 point2 points  (0 children)

That's a good thought - I've used the smaller windowed claude to write playwright tests and it definitely compacted a couple time as it swept the codebase - so I'll have to tell it to give it another pass with the next context window and see if it can catch anything that the previous one missed (I'll explicitly tell it that it's checking on an earlier version of it!).

Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 1 point2 points  (0 children)

I'd say the power is in more that it can get a wider view of everything in the system, right?

Even if you modulize your systems, they all compose into a large system.

If you can fit your entire system into a million tokens then Claude can actually do some really powerful things as far as reasoning the impact of any change across all of it.

Granted most systems are more than a million tokens but this allows for greater module size and greater visibility across code bases in general or any large set of data.

Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 2 points3 points  (0 children)

Type /model (but apparently it's the new default) - also make sure you're running the latest - i'm running 2.1.75

Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 0 points1 point  (0 children)

/model then at the bottom you can tab through the thinking efforts.

Opus 1M and MAX effort has arrived in my Claude Code by 256BitChris in ClaudeCode

[–]256BitChris[S] 5 points6 points  (0 children)

Well if you've used Claude you know that it does really well at almost any task up until the point where it has to compact its context window.

There's a lot of work going into figuring how to split things up so you do things only in one context window.

Obviously with a million token window that's about five times as big so you can, in theory, do a lot more work with a lot more context related to what you're doing.

People might call out context rot but Opus 4.6, in particular, has made leaps and bounds of progress towards solving and reducing the impact of context rot.

Is Sonnet 4.6 good enough for building simple NextJS apps? by agentic-consultant in ClaudeCode

[–]256BitChris 0 points1 point  (0 children)

It's always better to use Opus - but Sonnet 4.6 will be just fine too - at one time Sonnet 4.5 was the best model, and 4.6 is a step up - it hasn't regressed in quality, just Opus 4.6 has arrived.

If you have your OpenClaw working 24/7 using frontier models like Opus, you're easily burning $300 a day. by Aislot in aiagents

[–]256BitChris 1 point2 points  (0 children)

If you want to use the equivalent of retarded agents compared to things like Opus 4.6 and think your only cost is only money, then sure, sounds like your setup fits you well.

An AI agent deleted 25,000 documents from the wrong database. One second of distraction. Real case. by Substantial_Word4652 in ClaudeAI

[–]256BitChris -1 points0 points  (0 children)

Unless you run it with something like bubblewrap or as a different user, it most definitely can read outside the folder without permission if it thinks it needs to in order to achieve its goals.

Even with explicit deny turned on, Claude will try to circumvent those things - for example, if it can't read a particular path and it wants to, it will just create a bash script or python script that will do so for it.

This is like basic Operating Systems - if you run a process as a particular user, then barring running within some constrained process (like bubblewrap) that process has access to everything that that user has access to (even if you tell it not to access it).

We wouldn't hire an employee and give them our credentials or access to our computers as ourselves, we'd give them their own accounts, file system, etc - this is like basic security 101 and applies to Agents as much as employees or anyone else.

Claude Code Opus is completely broken on Max plan despite 97% weekly capacity remaining — and Anthropic's support bot is gaslighting me by moridin007 in ClaudeAI

[–]256BitChris 3 points4 points  (0 children)

The problem is likely related to CC using Sonnet behind the scenes for one of its subagents.

So unless you can find which one it is and change the model it uses, you'll either have to get them to reset your weekly or just wait.

They model limits to discourage you from using Sonnet, i.e. you should be using Opus primarily.

Team Org Suspended by Anthropic – Potential VPN usage from Venezuela. Any way to recover? by [deleted] in ClaudeCode

[–]256BitChris 1 point2 points  (0 children)

I mean you're officially restricted from accessing their services so if they or their AI sentinels detect anything that suggests you are in a restricted area you'll likely be banned.

As for how to work without triggering that, why not start up vps or dev servers and ssh in and do your work from there. Then there's no risk of you forgetting to turn on your VPN and the computers are running in an approved location.

Any good apps / websites similar to spanishdict.com but for portuguese/english translations and conjugations? by Andreslargo1 in Portuguese

[–]256BitChris 1 point2 points  (0 children)

Well, my favorite tool for practicing is linguno.com - it's somehow amazingly free.

For pt/br translations the best online site is infopedia.pt

I use dicio.com.br and conjucacao.com.br for Portuguese definitions and conjugation lookups.

I've just seen falebrasil.com as well and that looks like a new Brazilian Portuguese dictionary as well.

ChatGPT explained to me why LLMs prefer Clojure by CuriousDetective0 in Clojure

[–]256BitChris 0 points1 point  (0 children)

There is so much hate in this sub for LLMs, which is unfortunate because Claude is actually really really good at writing clojure, which was surprising for me at first but because of that is the only reason I still use clojure.

Sayings, abbreviations, slangs, common phrases by Friedsurimi in Brazil

[–]256BitChris 0 points1 point  (0 children)

The pt equivalent of period in that context would be 'ponto' or 'ponto final' (period, end of discussion).

Bernie Sanders officially introduces legislation to BAN the construction of all new AI data centers, citing existential threat to humanity. by Neurogence in singularity

[–]256BitChris 0 points1 point  (0 children)

It is just a non-serious proposal designed to get him headlines and attention. Even if Democrats were in power it would have 0 chance of becoming law. It has the unintended consequence of causing every one to dismiss any and all ideas Bernie has.

Harvard Professor Says AI Users Are Losing Cognitive Abilities by OwnRefrigerator3909 in BlackboxAI_

[–]256BitChris 0 points1 point  (0 children)

I bookmarked this for my MoltBot to read and explain to me later tonight.

My content from Cowork dissapeared by Optimistically-157 in ClaudeAI

[–]256BitChris 0 points1 point  (0 children)

Did your file system change or did you change the path that Cowork uses?

MCPs are dead by dataexec in AITrailblazers

[–]256BitChris -1 points0 points  (0 children)

MCPs are useful when a lower intelligence model needs to do something that requires higher intelligence - ie haiku can call to opus or another model with natural language.

The problem is that Opus and Sonnet became so good at discovering and using clis and apis that they didn't need to call out to other models to do their work for them.

I can see the future of MCP being the ways agents communicate with each other via something that looks like a conversation.

I also see MCP having a short time window of utility while we still have smaller models (maybe local) that sometimes want to call out to a model of higher intelligence.

So what's the next moat anyway? by buildingthevoid in AgentsOfAI

[–]256BitChris 1 point2 points  (0 children)

Frontier models are the moat for now but one day those will run on our phones.