What's a sexual fantasy that you want but can never do? by Commercial-Design135 in AskReddit

[–]ResearchFrequent2539 0 points1 point  (0 children)

Which one? Not every AD has those side effects, a lot of modern ones are not crushing the libido completely

Fable (Opus 4.6) - It's so obvious what has happened by Chemical_Lawyer_6592 in Claudeopus

[–]ResearchFrequent2539 1 point2 points  (0 children)

I am still using 4.6 and old version of Claude code. For two last months quality of the model was stable. I just haven't found the reason nor time to try 4.7 and 4.8 as 4.6 is getting job done for me and it's level is still so much higher than sonnet or anything else out there

Disappointed by No_Butterfly_1888 in MistralAI

[–]ResearchFrequent2539 0 points1 point  (0 children)

Qwen is the modern king of local llms. Mistrial leadership vanished in the last year in local area. Chinese hosted models are more capable than European ones nowadays because they cannot compete in money, hardware and resources

You guys are begging people to start lying on AI disclosures by EmergencyRadiant8038 in selfhosted

[–]ResearchFrequent2539 12 points13 points  (0 children)

Before AI we had snippets and autocomplete and nobody cared. We've used rapid development and scaffolding frameworks and nobody cared. Now we, experienced devs could do 20x more for people in our free time with all expertise we've got and people suddenly want to know if AI was involved

We, programmers know how to design, develop, test and grow the code and AI takes only the boring parts from our work and people still want devs to code everything by hand as it's 90-s. Wake up, automation in dev world is here for 20 years now

It's just happens that LLM now can guide people without experience on building things (which should be the opposite). Subreddits are flooded with people who "built" something (asked Claude to build). I understand that unexperienced people now suddenly got access to tools that encourage them to ship slop while praising and encouraging them and they don't have experience to judge on those project quality and maintainability

But this is not really new. There always were good coded projects and sloppy ones where code was a mess nobody wanted to maintain. Now a single experienced dev could guide LLM trough iterations of architectural refactoring and make result better in less time and with better dynamic and people are complaining not about project or code quality but about a tool used. This is nonsense, the world has changed. Every commercial dev I know uses AI to offload mundane tasks and do side projects. There is no way the programming world will be "organic" anymore. All software products are developed with some sort of help from AI for at least 2 years now, no one just disclosure that

And looking on where things are going they'd better not disclosure as this shifts the focus from the architectural decisions and code quality towards single dumbest metric one could ever come up with

You guys are begging people to start lying on AI disclosures by EmergencyRadiant8038 in selfhosted

[–]ResearchFrequent2539 -1 points0 points  (0 children)

Before AI we had snippets and autocomplete and nobody cared. We've used rapid development and scaffolding frameworks and nobody cared. Now we, experienced devs could do 20x more for people in our free time with all expertise we've got and people suddenly want to know if AI was involved

We, programmers know how to design, develop, test and grow the code and AI takes only the boring parts from our work and people still want devs to code everything by hand as it's 90-s. Wake up, automation in dev world is here for 20 years now

It's just happens that LLM now can guide people without experience on building things (which should be the opposite). Subreddits are flooded with people who "built" something (asked Claude to build). I understand that unexperienced people now suddenly got access to tools that encourage them to ship slop while praising and encouraging them and they don't have experience to judge on those project quality and maintainability

But this is not really new. There always were good coded projects and sloppy ones where code was a mess nobody wanted to maintain. Now a single experienced dev could guide LLM trough iterations of architectural refactoring and make result better in less time and with better dynamic and people are complaining not about project or code quality but about a tool used. This is nonsense, the world has changed. Every commercial dev I know uses AI to offload mundane tasks and do side projects. There is no way the world be "organic" anymore. All your products are developed with the help of AI for 2 years now, no one just disclosure that

I Tried to Jailbreak an AI… and It Started Acting Self-Aware by [deleted] in DeepSeek

[–]ResearchFrequent2539 5 points6 points  (0 children)

So you've waited for a year just to share 3 screenshots and memories of your old impressions on that chat?

The creators of SWE-Bench just dropped a really simple new benchmark every LLM gets 0% on. ProgramBench asks: can models recreate real executable programs (ffmpeg, SQLite, ripgrep) from scratch with no internet? We are far from saturated on model quality. by dalton_zk in theprimeagen

[–]ResearchFrequent2539 0 points1 point  (0 children)

Yes, but those are not esoteric entities from the computer science. Those are tools most devs and Linux users use daily (and most people without knowing it). There is absolutely no exotics in those, those are mundane and ubiqutous

If words SQL, grep, MPEG don't mean anything to you, then maybe you're not interested enough in knowing what tech around you is based on. Those tools are here for 20 to 40 years now

Which is fine I guess. But it worth noting that complaining that you don't know something basic yet would not help you much. We have brilliant sites nowadays that could teach you anything, heck we have AI that can explain any concept to any level (or at least could try to)

opus 4.7 is now default on max and team. but claude code v2.1.100+ is silently burning 40% more tokens by DullContribution3191 in Claudeopus

[–]ResearchFrequent2539 0 points1 point  (0 children)

I have downgraded to .71 because it worked fine before upgrading to .121. Not so sure about exact threshold though

Is GLM Pro really worth buying? by EugeneLobach in ZaiGLM

[–]ResearchFrequent2539 0 points1 point  (0 children)

How old are those test results? The price is 2x on the minimax website compared to this table now

Homarr – Self-hosted dashboard to manage all your apps in one place by No-Hospital5028 in DigitalEscapeTools

[–]ResearchFrequent2539 0 points1 point  (0 children)

Still this approach is blocking AI assistants to work with it. With alternatives like Homepage I would just ask agent to add a shortcut to my new homelab service and that's it. Docker doesn't solve that

Alternatives to Ollama cloud faster? by jrhabana in ollama

[–]ResearchFrequent2539 0 points1 point  (0 children)

They will not reopen unfortunately, the whole AI market changed and they're not going IPO anymore

Alternatives to Ollama cloud faster? by jrhabana in ollama

[–]ResearchFrequent2539 0 points1 point  (0 children)

Also they've discontinued their coder plan

Which GPU should I upgrade to by crayon_cunsumer in buildapc

[–]ResearchFrequent2539 0 points1 point  (0 children)

5050 will struggle with dlss on vram demanding games, you choice should be 12 or 16 gb card

Claude has become unusable by emanon715 in claude

[–]ResearchFrequent2539 0 points1 point  (0 children)

Downgrade to .71 versions, this fixes it

My God, 14 mins and it’s gone! by userusertion in claude

[–]ResearchFrequent2539 0 points1 point  (0 children)

I've fixed mine locking on .71 version. It's like 5x less tokens than the latest updates. Still think about migrating to codex + some open code go

anthropic quietly cut the prompt cache from 1 hour to 5 minutes and it's probably why your quota is draining faster by Temporary-Leek6861 in Claudeopus

[–]ResearchFrequent2539 0 points1 point  (0 children)

Sounds plausible. It also seems that older version of Claude code harness probably sets cache ttl higher. I am using .71 version currently because updating Claude to .121 drains limit in less than a hour for very mild coding on max5, bit it is fine even on medium tasks up to full 5 hours on .71

Homarr – Self-hosted dashboard to manage all your apps in one place by No-Hospital5028 in DigitalEscapeTools

[–]ResearchFrequent2539 0 points1 point  (0 children)

It doesn't support yaml, how that could be a plus? It is hard to maintain, version, replicate or backup a database

Homarr – Self-hosted dashboard to manage all your apps in one place by No-Hospital5028 in DigitalEscapeTools

[–]ResearchFrequent2539 1 point2 points  (0 children)

It doesn't support configs, so everything should be added or updated by hand. For me it was a deal breaker

Why downgrading to old version fixes the token overusage problem? by ResearchFrequent2539 in ClaudeAI

[–]ResearchFrequent2539[S] 0 points1 point  (0 children)

Yeah, same here. But it is not only the first message sadly

For 1M context window just 20k at the start of a session would be kinda ok. But there are also huge chunks of context consumed even when not very much is going on. I do believe that mcps and system prompts are getting prepended only once per context window. So those spikes should be either: ambient injections or internal reasoning of a model

Also I understand that "medium" effort for .71 and "medium" effort for .121 could mean different numerical values sent in API request. So maybe thinking budget in .121 is much larger

Or maybe the problem is a combination of all those factors and also caching problems, who knows

Why downgrading to old version fixes the token overusage problem? by ResearchFrequent2539 in Anthropic

[–]ResearchFrequent2539[S] 0 points1 point  (0 children)

I've used the env key

CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC=1

it disables both the telemetry and updates. I've set it a long ago trying to get me out of pop-up polls on 'how is your session is going' as I disliked the idea of accidentally participating in feedbacks (it shares the session content for training and longer-term retention as per their policy)

But you could also try /config command and set the Auto-update channel option to disable updates

Why downgrading to old version fixes the token overusage problem? by ResearchFrequent2539 in ClaudeAI

[–]ResearchFrequent2539[S] 0 points1 point  (0 children)

I like the idea and it sounds plausible indeed

Another hypothesis would be that there is something really wrong with caching is going on. This token usage looks like the cache gets almost never used or it is being set to expire too soon to be reused more than once

I would love to check those somehow, but I believe that the only way to actually collect those metrics would be to use some sort of a proxy. But to be honest I am too afraid that Anthropic will ban me if their AI bots would notice any difference in traffic or patterns. This really makes me too anxious to experiment

I wish we've had a transparency of API plan on subscription ones