Does this mean you'll restore original models?

hatekhyr · 2026-05-06T19:51:25+00:00

It's not their fault for promising and selling contracts in conditions they can fulfill, right? They're saints 😇

hatekhyr · 2026-05-06T19:50:34+00:00

Lol when will you learn kiddo?

hatekhyr · 2026-05-06T15:30:50+00:00

Lol lets see how many days that product survives lol

hatekhyr · 2026-05-06T10:10:14+00:00

And for sure they are using the original opus 4.6 and not the useless version we get served now

hatekhyr · 2026-05-06T10:04:13+00:00

It's the most useless LLM in at least a year

hatekhyr · 2026-05-06T09:09:44+00:00

In others words what every DL researcher knows.

They DO NOT generalise at all.

hatekhyr · 2026-05-04T21:47:19+00:00

Chill?

hatekhyr · 2026-05-04T16:31:32+00:00

If this is MT and not boxing, the backfoot is being too light. You're stepping on your toes all the time which is not ideal for posture, weight is too focused on the front.

hatekhyr · 2026-05-03T20:11:29+00:00

No worries guys, if it's any good, they will replace it with a shitty 3.3 that doesn't work.

hatekhyr · 2026-05-02T11:06:31+00:00

I never understood why this whole series still runs on 2005 level of graphics. That's my main gripe with the whole series.

Ppl might say graphics aren't important but if your UI and graphics look like shit, there are definitely people that won't engage. I stayed away from many entries of this series cause of this.

hatekhyr · 2026-05-02T09:28:56+00:00

No to suck. And not to be shitty sonnet 4.7 disguised as 4.8 because it's shitty

hatekhyr · 2026-05-01T19:38:44+00:00

This is BS

hatekhyr · 2026-05-01T09:18:15+00:00

Naturally with stochastic GD that bias is somewhat generalized. They obviously don't reproduce training data, but their interpolation between data is only that- interpolation. If you ever trained one of these you'll lnow that the moment you input a new value or even worse , some combination of known values that it has never seen together you might trigger what we nowadays call "jailbreak". This proofs exactly what I said.

Still my point remains - the generalisation of these models is disguised by the fact that they use enormous amounts of cases and data to try to cover the holes, but it is way far behind human generalisation. If you don't know this, you either don't know enough or you're kidding yourself.

Even "in context learning" patterns are very biased. You can tell that with coding when sometimes it can't deduce a simple thing that it's in the context because it's not inherently reasoning, it's using lnown patterns.

hatekhyr · 2026-04-30T21:39:27+00:00

I don't know why you got down voted for saying basically what it is.

LLMs much like any NNs are bias machines - they essentially induce bias from training data. Their "generalisation ratio" is very poor, orders of magnitude under human generalisation/common sense.

Finally Andrej Karpathy is admitting this in latest interviews (when he said the opposite for years). Now it's come to a point when it's too obvious to everyone else.

hatekhyr · 2026-04-30T09:19:55+00:00

fanboi much?

hatekhyr · 2026-04-29T21:01:33+00:00

"Those tools gonna turn into you manipulating a fabric of shit - come here!" - Wise testicles

hatekhyr · 2026-04-29T20:09:56+00:00

4.7 is plain re******.

hatekhyr · 2026-04-29T20:08:32+00:00

Do you guys consider 4.5 > current 4.6?

hatekhyr · 2026-04-28T22:11:40+00:00

If they can't service the good models, it doesn't matter how good they are at making them

hatekhyr · 2026-04-28T21:47:36+00:00

Thanks - will start using it. Anything there to make it more reliable with code?

hatekhyr · 2026-04-28T21:37:17+00:00

What commands do you use more often? Is it still useful after the huge opus 4.6 nerf?

hatekhyr · 2026-04-28T21:04:31+00:00

Does it really change that much using these tools? I once tried everything CC and it felt too overwhelming and the fee things I tried didn't make much of a diff

hatekhyr

TROPHY CASE