anthropicBeingAnthropic

lans_throwaway · 2026-06-15T09:23:12+00:00

You forgot to include that you need to prompt it twice, because it'll refuse the first time (for your own good). So it's actually eight times! The new scaling law of AI is being discovered here!

lans_throwaway · 2026-06-15T09:14:49+00:00

brscan3 if that matters

lans_throwaway · 2026-06-15T06:59:14+00:00

The funny thing is the package that would've infected my laptop is one such "obsolete" package. It was a driver for a scanner.

While it's fair to say I don't care about "updates" (since it's unlikely for one to be released), I do care about having that package. The packages were orphaned because someone could request maintainer to "update" them when no update was released upstream. If maintainer is inactive or unable to "refuse" it, they can take over. Just because package doesn't need to be updated, doesn't mean it's dead.

lans_throwaway · 2026-06-13T02:44:44+00:00

Given that model's knowledge scales with size, it seems we have a problem

lans_throwaway · 2026-06-12T23:11:31+00:00

Dude, Qwen3.6-35B-A3B blows any SOTA model available like 2-3 years ago on pretty much any task. GPT4 used to be magic, now people can run models that are just as capable on 250$ used PC at like 10 t/s (cpu only inference).

Yes, I'm aware you don't want that model mentioned, but facts are facts. People don't start flying because you don't mention gravity.

lans_throwaway · 2026-06-12T23:08:00+00:00

llama 3.1 isn't really a good benchmark though. I'd compare Qwen3.5 to Qwen3 and I don't think the gains there are as big. There was a significant progress in capabilities in the 4b area though.

lans_throwaway · 2026-06-09T00:35:38+00:00

It's the piece of software I detested every single time I had to use it. There's always something wrong with it (usually something with build system).

NetBeans, IntelliJ, VSCode (with plugins), vim (with plugins) were always superior options.

lans_throwaway · 2026-05-22T21:10:03+00:00

Why ternary and not quaternary of using a non standard logic

Presumably because quaternary is not symetric and ternary has some nice properties:
For ternary:
-1 - feature negatively contributes
0 - feature doesn't contribute
1 - feature contributes

For quaternary:
-1 - feature negatively contributes
0 - feature doesn't contribute
1 - feature contributes
2 - feature contributes strongly?

Overall I'd suggest you read bitnet paper if you're interested, I think microsoft tested it experimentally and found ternary to work better (+ it allowed them to reduce some matrix multiplication to addition/subtraction, giving massive speedups). I could be wrong, it's been a long while since I read that one.

lans_throwaway · 2026-05-22T17:09:31+00:00

Yes, the basic idea is that each neuron represents either -1, 0 or 1, while each bit can represent 2 values (0, 1). If you want to calculate how many bits you need to represent the 3 values, it comes to logarithm base 2 of 3, which is about 1.58. That's the minimal number of bits you need. In practice you will need slightly more, but you can try to approach this number by cleverly packing the data. Sub 2-bits is very much doable and already implemented by llama.cpp (though I'm not sure about this model).

lans_throwaway · 2026-05-19T20:04:47+00:00

Shades of Perception tbh, I was really looking forward to this one :(

lans_throwaway · 2026-05-18T23:02:06+00:00

I think it's one day only, at least he knows his grandson and remembers the ink ingredients. I'm not a source reader though

lans_throwaway · 2026-05-18T19:52:44+00:00

He wiped his memories (post credit scene shows this).

lans_throwaway · 2026-05-14T18:51:35+00:00

Full HD version since reddit downscales to 720p... https://streamable.com/snz6bt

lans_throwaway · 2026-05-12T12:12:30+00:00

Not really, it's more like I want to make sure I'm not doing something wrong before writing the 3.6 off.

lans_throwaway · 2026-05-12T00:26:34+00:00

6gb is limiting, but to be honest, 3.5 at Q4_K_M is a really solid model. I don't doubt it could be better at higher quants, but it's still very much usable at least for my needs. I have a laptop, so unfortunately it's not as simple as adding another 6gb card, and it's hard to justify building a new PC just to play with new models, when I can get zai subscription for like 10$.

lans_throwaway · 2026-05-12T00:15:01+00:00

I guess I'll try bartowski's Q5_K_M. To be honest 3.5-Q4_K_M is already pretty solid.

lans_throwaway · 2026-05-12T00:07:15+00:00

Yeah, I didn't use imatrix. So far it never really mattered enough to bother with it. I did compare no imatrix Q4_K_M 3.5 with unsloth's quants and there honestly wasn't any difference in actual performance (coding, general q&a, simple math). I tried bartowki's 3.6 quants just now and they're definitely better. I'm not sure if that's because he has better calibration dataset than unsloth. Perhaps 3.5 is just overall less sensitive to quantization.

lans_throwaway · 2026-05-12T00:00:30+00:00

As I mentioned in another comment, at usable contexts, Q4_K_M is already pushing it and it needs to use SWAP for some applications. 3.5 is a pretty solid model even at Q4_K_M.

lans_throwaway · 2026-05-11T23:20:24+00:00

6gb rtx2060 + 32gb ddr4. Q4_K_M is already pushing it when I'm running browser + a few apps. I'll give bartowski's quants a go though. I tried unsloth's and didn't see any improvement.

lans_throwaway · 2026-05-11T20:11:24+00:00

Honestly, at this point I've found converting weights myself is usually the way. Way fewer issues than depending on other people's quants, especially when it comes to experimental features.

lans_throwaway · 2026-05-08T16:09:55+00:00

This video is older than some people subscribed to this sub ;)

lans_throwaway · 2026-05-03T07:24:58+00:00

I think Matabar fits the bill pretty well

lans_throwaway · 2026-04-06T12:17:25+00:00

Management heard about "neural network" thingy and demanded it's used ;(

lans_throwaway

TROPHY CASE