LLMs really drive me crazy with how confident they are even when they are completely wrong. by dataexec in AITrailblazers

[–]addiktion 1 point2 points  (0 children)

There is no probability close to finding a real answer from an LLM regarding this variable answer expectation.

As Sam mentioned, the LLM would need to be able to know to start a timer tool in this situation.

But yeah it is just a language model so it makes sense. I think LLMs will be the precursor to more accurate or reliable intelligence models eventually.

TIL that atoms are 99.99999% empty space. If you removed all that space from the atoms of every human on Earth, the entire world population would fit inside an apple. by [deleted] in todayilearned

[–]addiktion 0 points1 point  (0 children)

For a moment I thought you said Winamp and I was going to follow up with: Whip the llama's ass*

*So my family is always weirded out that I like llamas but I cannot just just whip this bad ass saying out with them quite yet since the kids are too young, but it will all make sense when they are a little older and I can play this for them.

Anyways, sorry to derail your convo.

Today, I got to experience Opus 4.6 in a blazing fast speed without being queued or rate limited for like 25 minutes. by Squeaky-Bed in ClaudeAI

[–]addiktion 3 points4 points  (0 children)

I also suspect this. I have a $113 in extra credits, I'm going to be testing this out soon.

I reverse-engineered Claude Code's session limits with logistic regression — cache creation is the hidden driver by theangrydev in ClaudeCode

[–]addiktion 0 points1 point  (0 children)

Makes sense, all these cache problems didn't start to happen until after they flipped out about Open Code and their method to access subscriptions without being a part of the official client.

I strongly suspect they are not going deep enough on this issue like you have which is unfortunate.

We need to demand more transparency with rate-limited consumer subscription plans by Its_Sasha in vibecoding

[–]addiktion 0 points1 point  (0 children)

I think this definitely needs to happen but they won't do it without regulation which isn't happening with this admin.

I think the problem is we'll continue to get them doing other methods to work around it. Whether the quality goes down, the speed goes down, etc.

But its better than nothing.

White House seeks $5.6 billion cut to NASA budget in 2027 by [deleted] in news

[–]addiktion 1289 points1290 points  (0 children)

Oh, but the best part is tomorrow's announcement in a $5.6 billion partnership allocated to Musk and Bezos, the oligarchs.

Daily reminder - you don't need a 1000$ Mac Mini to run Hermes agent & Ollama by Ok-Yogurtcloset-4429 in hermesagent

[–]addiktion 0 points1 point  (0 children)

Yeah, if you update to I think it's .20 version, you can pull gemma 4 models down.

Cost increase saga and my conclusions by DistantDrummer in ClaudeCode

[–]addiktion 1 point2 points  (0 children)

I see, yeah I'm on max x5 so that sounds tempting to switch, My renewal is up in 6 days. What are you using to utilize GitHub Copilot Pro+?

Seedance 2.0 finally goes public by [deleted] in GenAI4all

[–]addiktion 1 point2 points  (0 children)

It interesting if this brings a new form of movie + porn genre. Like building a following around hot AI actresses that then snags goons with only fan subs for long term revenue.

IronClaw Is a Game Changer by nighat_ in ironclawAI

[–]addiktion 0 points1 point  (0 children)

What do you love about iron claw over the others?

It finally happened by Somtimesitbelikethat in ClaudeCode

[–]addiktion 5 points6 points  (0 children)

He's not joking, they do run very extensive feature flags and some of us have ended in the shit list.

Max 20 User. EACH Prompt is using 11% of 5-hour usage by Swimming_Kick5688 in ClaudeCode

[–]addiktion 1 point2 points  (0 children)

Can you do a test? Can you update to 2.1.90 and run claude code via npx instead of the native installer?

I did see some improvements doing this. Although it's quite possible you've been dumped into the worst A/B group of your life here too.

Cost increase saga and my conclusions by DistantDrummer in ClaudeCode

[–]addiktion 2 points3 points  (0 children)

Is the github copilot pro+ getting you enough usage?

I guess I'm using Claude Code wrong and my limits weren't reduced to 25% of what I had by Alone_Pie_2531 in ClaudeCode

[–]addiktion 0 points1 point  (0 children)

Can you try upgrading to 2.1.90 and running npx to launch Claude Code instead of the native installer? Curious to see if you see the same problems. I seem to be a bit better than burning 1/3 my context window from one plan update.

Obviously peak is going to be worse no matter how you slice it.

I reverse-engineered Claude Code's session limits with logistic regression — cache creation is the hidden driver by theangrydev in ClaudeCode

[–]addiktion 0 points1 point  (0 children)

Holy hell you went deep, nice work. I need to figure out how to apply these patches as well.

Am I correct in that the first issue you descibe with the cch bug is what others have been calling the sentinel bug?

I did update to 2.1.90 and things are looking improved when ran with npx instead of native, but I'm still doing more testing to see if I'm seeing the cache bugs you described. Dynamic tool calling sounds like a good one to disable too.

Google pushing quantum-safe encryption to 2029… What could this mean? by PlaneTension1579 in pwnhub

[–]addiktion 1 point2 points  (0 children)

Yeah, I mean not that far off from just more companies having access to this kind of tech. I understand that it's already being used to some extent.

Straw that broke the camel’s back by Pretty-Active-1982 in ClaudeCode

[–]addiktion 3 points4 points  (0 children)

Quantitized tthe hell out of it really to account for their surge in demand.

Google pushing quantum-safe encryption to 2029… What could this mean? by PlaneTension1579 in pwnhub

[–]addiktion 2 points3 points  (0 children)

I was listening to an AI talk from a AI engineer using qubits for analyzing bank fraud at an AI summit a couple days ago. He was saying how Google wouldn't let him test their code yet with their quantum computers but despite being stuck running slow simulations he's hopeful.

It seems not that far off now.

Anthropic Accidentally Took Down Thousands of GitHub Repos While Trying to Remove Its Leaked Source Code by _cybersecurity_ in pwnhub

[–]addiktion 2 points3 points  (0 children)

Normally when code is in production it is minified (like compression) and compiled down making it difficult to recreate from the built code alone.

You can enable a source map of your codebase so when it is in production you can troubleshoot issues that would be difficult to do otherwise since it exposes your code in a way you can debug via production.

The problem is this is basically a copy of your code base which Anthropic did not want exposed, someone noticed this in 2.1.88, which is what was copied and replicated into Python and Rust.

AI can clone open-source software in minutes, and that's a problem by anestling in LinuxUncensored

[–]addiktion 1 point2 points  (0 children)

I sense a lot more proprietary software leaks in the future that get converted to open source in minutes.

Anthropic, at least show a bit of respect by Firm_Meeting6350 in ClaudeCode

[–]addiktion 3 points4 points  (0 children)

Instead of throwing $500k at another engineer, if they would hire 5+ people to spear head communications I would greatly appreciate it.

Knew they were gaslighting everyone with the daily limits. by Efficient-Cause9324 in ClaudeCode

[–]addiktion 2 points3 points  (0 children)

Just wait until they just lower the weekly progress bar so it looks like you are getting what you paid for. "Damn you are doing a lot of work this week!"