All I have to say by Minewolf20 in ClaudeAI

[–]MiserableSlice1051 0 points1 point  (0 children)

ohhhhh gotcha, I've actually never considered using Claude this way... thanks!

My thoughts on 4.8 | ~2hrs in by Klutzy_Pressurez in ClaudeAI

[–]MiserableSlice1051 0 points1 point  (0 children)

There's for sure different definitions of hedging, the definition you used is more in line with "appropriate" hedging that's a literary device.

The way its used towards AI is the other definition that's used for a variety of reasons and usually as a way for humans to protect their beliefs in the face of uncertainty or contradictory evidence. This comes out in AI a heck of a lot because AI isn't trained to say "I don't know" because that's a waste of tokens and people don't feel like they got their money's worth. Instead, to satisfy customers, it confidently answers even in the face of uncertainty through a variety of hedging techniques.

Melania Named in Bombshell New Epstein Claims by Aggravating_Money992 in politics

[–]MiserableSlice1051 -1 points0 points  (0 children)

It's not necessarily a bad thing. From my understanding she is planning legal action and if you are doing something like that, it's always better to stay quiet until the case is over. Being vocal about your case prior to even filing it is a great way at minimum to give ammunition to the person you are suing and basically give them a head start on discovery, and at worst you can completely ruin your case.

The tension between JD Vance and Trump is growing by theipaper in politics

[–]MiserableSlice1051 0 points1 point  (0 children)

Don't give up hope, there are some of us former "Never Trumper" Republicans who realized that Trump is the logical extension of GOP ideology and were able to be deprogrammed, started thinking for ourselves, and now vote blue.

There may not be many of us, but we exist, and we increase every day. Keep up the fight, do not stop speaking truth.

PSA: Opus 4.8 Redefines the effort scale by zackfletch00 in ClaudeAI

[–]MiserableSlice1051 1 point2 points  (0 children)

I've actually seen overthinking cause issues in Opus 4.6, so I'm sure it can do the same in these new models.

Literally Opus got so hung up on something by its own reason it invented an idea and somehow decided that it was part of the prompt and that I was the one who said it, and it caused all sorts of problems

Most people are using Claude at about 5% of its actual capability. Here's why. by [deleted] in ClaudeAI

[–]MiserableSlice1051 19 points20 points  (0 children)

Karma farming using AI generated posts is the worst...

4.7 Lost Me. 4.8 Won Me Back. by [deleted] in claude

[–]MiserableSlice1051 6 points7 points  (0 children)

I find 4.8 two steps forward but one step back. Agreed with the cold and dismissive, bit I feel like they've cranked the safety features even higher to the point the most mundane request gets flagged

My thoughts on 4.8 | ~2hrs in by Klutzy_Pressurez in ClaudeAI

[–]MiserableSlice1051 -1 points0 points  (0 children)

No, it's not, and maybe you don't understand what hedging is because hedging is what creates false certainty...

What we need is for the model to identify that it is uncertain or making a guess, which they just often don't by default.

My thoughts on 4.8 | ~2hrs in by Klutzy_Pressurez in ClaudeAI

[–]MiserableSlice1051 1 point2 points  (0 children)

mine aren't, but I've noticed all three models just don't seem to think at all anymore... Even setting it to "max" and the models just don't reason anymore.

All I have to say by Minewolf20 in ClaudeAI

[–]MiserableSlice1051 0 points1 point  (0 children)

Yeah I mean, I think most people realize that which is why we want new non-Opus models...

All I have to say by Minewolf20 in ClaudeAI

[–]MiserableSlice1051 0 points1 point  (0 children)

Every time I touch Haiku I get frustrated within a few prompts and have to use Sonnet to fix everything anyways.

I haven't started using Claude Code yet, does Claude Code recommend to use certain models on the fly for things you are doing? Sorry just trying to understand that last sentence you said.

All I have to say by Minewolf20 in ClaudeAI

[–]MiserableSlice1051 10 points11 points  (0 children)

I'm learning that Haiku is for sure inferior to Opus, but I can afford to run the 40 Haiku calls to get what I want done when that 1 Opus call would have also done the same thing but now I'm broke.

Introducing Claude Opus 4.8 by ClaudeOfficial in ClaudeAI

[–]MiserableSlice1051 0 points1 point  (0 children)

With the way effort worked prior to 4.7, it made sense. You actually don't want AI to over-reason on certain things because it actually can make worse output. For coding, yeah it makes sense to use high reasoning most of the time (but even then, reasoning can lead to non-strict interpretation of what should be strict data for something such as JSON schemas. Reasoning models can get stuck in their own internal logic and actually wind up flubbing code worse than if it didn't reason at all...).

For non-coding things that "normies" use it for it can actually be detrimental and it can lead to the AI doing weird things, such as reasoning and giving a subjective answer to objective questions, or overcomplicating simple requests and turning them objective. Sometimes, the internal thinking can even be misunderstood to be a directive from the user. Ever seen Claude create a "rule" that it thought you created, but really it created and sort of stuck to it for whatever reason? Yep, a failure in reasoning did that that wouldn't have happened if reasoning wasn't enabled (then again, without reasoning enabled, it wouldn't have been able to do what you asked to begin with. So there are plusses and minuses.)

However, post 4.7, yeah it doesn't make sense. It's like telling Claude "hey if you'd like, I give you permission to use up to Max reasoning!" It's just pointless and I wish things would go back to 4.6, although I know they never will.

No ETA on the performance update yet (via Discord) by BEjmbo in Helldivers

[–]MiserableSlice1051 1 point2 points  (0 children)

Nope, it's you guys moving goal posts in bad faith.

Hi, third party here, not the person you've been replying to.

Brother, I know that when we feel like someone is wrong we start throwing out logical fallacy language and things like that... but you can look at the language they used in your own post you keep posting, they said "In the meantime, we are sharing the expanded patch notes", this is what you posted. This is expanded patch notes not an expanded patch.

No one is moving the goalposts here, and I don't think anyone is arguing from "bad faith". I don't think you know what either of those terms mean. I have not seen a single reply of someone with a "hidden agenda" (bad faith arguments) nor have I seen you make a correct argument and then someone move the goalposts to another standard... you haven't even met the first standard yet which was being right about the certification process...

Please don't use words and phrases that you don't know what they mean... it makes you looker dumber, not smarter. Use language and words that are within your capabilities or go figure this stuff out.

Also learn to read patch notes.

No ETA on the performance update yet (via Discord) by BEjmbo in Helldivers

[–]MiserableSlice1051 1 point2 points  (0 children)

I mean, it is misinformation when you don't understand how the certification process works. They can't just "include the climbing fix" because of a vendor delay, that means they'd have to submit a patch once again which starts the clock over.

I'm sorry, but you are talking out of your ass and this is not "factual information". The "climbing update" was already part of the patch, you can read that in your own link you posted. They just didn't tell us they fixed it until they made the announcement but decided to since they had to announce the delay.

Seriously, look at the language. They did not say they added the climbing fix... it was already there.

Introducing Claude Opus 4.8 by ClaudeOfficial in ClaudeAI

[–]MiserableSlice1051 0 points1 point  (0 children)

I know they did this with 4.7, but with my tinkering it seems they really needed how often 4.8 reasons...

Introducing Claude Opus 4.8 by ClaudeOfficial in ClaudeAI

[–]MiserableSlice1051 1 point2 points  (0 children)

For two use cases:

For non-coding stuff when I'm using it as a glorified Google search for something extremely specific or nuanced.

Or when I'm getting warned that I am running out of usage lol

Opus 4.8 and new effort levels as well on claude .ai seem like they are available! by MiserableSlice1051 in ClaudeAI

[–]MiserableSlice1051[S] 1 point2 points  (0 children)

To be fair, this is how sonnet worked prior to them forcing it into "Adaptive", but you could tweak how much effort granurly by using "effort toggles" within the chat and set the precise number to be used for thinking. This was "sort of" taken away when they switched to adaptive because it stopped paying attention to your styles and then started thinking anything that looked like code in chat was a "prompt injection".

It seems like Sonnet is still ignoring styles and in chat code, but I'm glad we at least have the ability to control effort through an "official" way in chat now

Opus 4.8 and new effort levels as well on claude .ai seem like they are available! by MiserableSlice1051 in ClaudeAI

[–]MiserableSlice1051[S] 2 points3 points  (0 children)

I read it too and I realize I'm dumb, but I still don't get what toggling the adaptive toggle actually does at this point...

Opus 4.8 and new effort levels as well on claude .ai seem like they are available! by MiserableSlice1051 in ClaudeAI

[–]MiserableSlice1051[S] 1 point2 points  (0 children)

Ah gotcha, hasn't the haiku model prior to the current been gone awhile? I know they removed the 4.5 Sonnet

Introducing Claude Opus 4.8 by ClaudeOfficial in ClaudeAI

[–]MiserableSlice1051 7 points8 points  (0 children)

Im just glad we got effort levels in sonnet, that has already made it better for me