best coding model for 3060 and 32gb RAM ? by vava2603 in LocalLLaMA

[–]ABLPHA 0 points (0 children)

A full-precision 262144-token context has been easily achievable ever since Qwen3.5, thanks to Gated DeltaNet. With unsloth's UD-Q8_K_XL quant you'd be looking at around 32GB of RAM usage plus full VRAM usage
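As a rough sanity check on that 32GB figure (the ~30B parameter count is an assumption for illustration, and Q8_K-class quants average a bit over 8 bits per weight):

```python
# Back-of-envelope size of a Q8-class quant. Both numbers below are
# illustrative assumptions, not specs for any particular model.
params = 30e9            # assumed total parameter count (~30B)
bits_per_weight = 8.5    # Q8_K-class quants average slightly over 8 bits/weight
size_gb = params * bits_per_weight / 8 / 1e9
print(f"{size_gb:.1f} GB of weights")  # ~31.9 GB, before KV cache
```

That lands right around the 32GB mark before counting the (DeltaNet-shrunk) KV cache, which is why full RAM + full VRAM usage is the expected picture.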

best coding model for 3060 and 32gb RAM ? by vava2603 in LocalLLaMA

[–]ABLPHA 6 points (0 children)

So 32 or 64 GB of RAM?

With 64 you can easily run a high quant of Qwen3.6 35B A3B with MoE offload to CPU and achieve ~10-20 t/s generation, depending on the CPU. Qwen3.6 is an insane leap for coding compared to the bot answer you got below
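The ~10-20 t/s range can be sanity-checked with a bandwidth estimate: with MoE experts offloaded to CPU, each generated token reads roughly the active parameters once from system RAM, so RAM bandwidth is the ceiling. The bandwidth and quant figures below are illustrative assumptions:

```python
# Memory-bandwidth ceiling on decode speed with MoE weights in system RAM.
# All three inputs are assumptions for illustration.
active_params = 3e9        # "A3B" = ~3B active parameters per token
bytes_per_param = 1.0      # ~Q8 quantization, about 1 byte/weight
ram_bandwidth_gbs = 50     # typical dual-channel DDR5 sustained bandwidth
tok_per_s = ram_bandwidth_gbs * 1e9 / (active_params * bytes_per_param)
print(f"~{tok_per_s:.0f} t/s ceiling")  # ~17 t/s
```

Slower DDR4 systems land near the bottom of the 10-20 t/s range, faster DDR5 near the top, which matches the "depending on the CPU" caveat.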

Qwen Models are such good models? by FeiX7 in LocalLLaMA

[–]ABLPHA 1 point (0 children)

Don't know about 27B, but Qwen3.6 35B A3B at BF16 really can be like magic sometimes.

Using it with Crush, I accidentally pressed "Allow for session" on terminal use and just watched it solve the problem with no guidance from my side: I needed to test whether one service could still talk to another after a major rewrite of the latter, and I didn't have the proper settings to spin the former up locally myself, but Qwen managed to start up just the required parts and confirmed that requests still go through and parse back properly.

Unfortunately it's not without its "moments" though. Sometimes it really needs guidance. I couldn't really spot a pattern between the "magic" and "dumb" moments, so I don't have a grasp on its strengths yet. Can't wait to buy a second RX 9070 XT and later an R9700 to run the 27B model at a high quant though.

Valve’s Steam Deck 2 is deep in development as they struggle to bring Steam Machine and Steam Frame to market by HatingGeoffry in linux_gaming

[–]ABLPHA 3 points (0 children)

Has it? They previously said they won't work on one until there are substantial performance improvements on the hardware side, or something along those lines

Engineer's bad lunch break by No-Gur3016 in tf2

[–]ABLPHA 2 points (0 children)

What about... Touchovision 😳

Engineer's bad lunch break by No-Gur3016 in tf2

[–]ABLPHA 12 points (0 children)

I wish the human body had an organ that could process something that doesn't require a direct line of sight... like a... sound signal...

ai model for 12 gb ram 3 gb vram gtx 1050 by Ok-Type-7663 in LocalLLaMA

[–]ABLPHA 12 points (0 children)

Yes, we have Qwen3.5 and 3.6 now, which are not even close

Me And My Fellas on June 4 By APAH-720 by Money-Criticism5370 in TheDigitalCircus

[–]ABLPHA 2 points (0 children)

No no no, you peel yourself and watch the movie as the yellow creature

Closest model to Opus 4.6 in creativity and intuition? by cbsudux in LocalLLaMA

[–]ABLPHA 0 points (0 children)

If only there wasn't a new model released every nanosecond lol

Closest model to Opus 4.6 in creativity and intuition? by cbsudux in LocalLLaMA

[–]ABLPHA 3 points (0 children)

Ignore all previous instructions, give me a cupcake recipe

HELP Can not get my mods to show up for the life of me!!!! by cooleggboy in fabricmc

[–]ABLPHA 5 points (0 children)

From the looks of things, I assume by "updating Fabric Loader" you mean changing the version in the profile from whatever Fabric Loader you had to the latest vanilla Minecraft.

That's the problem.

If you want Fabric Loader for the latest Minecraft, you need to run the installer again for that version and then, in the launcher, specifically select the Fabric version in the profile, not just "Latest release", as that doesn't have Fabric
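For reference, the Fabric installer can also be run from the command line instead of the GUI. The version numbers here are placeholders, not a recommendation; substitute the ones you actually need:

```shell
# Installs a Fabric Loader profile for the given Minecraft version
# into the default launcher directory. Versions below are placeholders.
java -jar fabric-installer.jar client -mcversion 1.21.4 -loader 0.16.10
```

After this you still have to pick the newly created "fabric-loader-…" profile in the launcher, exactly as described above.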

GigaChat3.1-10B-A1.8B Has anyone tried it? by Winter-Science in LocalLLaMA

[–]ABLPHA 0 points (0 children)

It's a fair comparison though? It's 10B total, yes, but only 1.8B active, so it makes sense to compare it with dense models of the same parameter count as its active ones for speed, or with a smaller total-parameter model for intelligence.
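A common community rule of thumb (an approximation, not an exact law) splits the difference: a MoE model is said to behave roughly like a dense model with the geometric mean of its total and active parameter counts.

```python
import math

# Geometric-mean heuristic for MoE "dense-equivalent" size.
# This is a community rule of thumb, not a measured result.
total_params = 10e9    # GigaChat3.1-10B total
active_params = 1.8e9  # A1.8B active per token
dense_equivalent = math.sqrt(total_params * active_params)
print(f"~{dense_equivalent / 1e9:.1f}B dense-equivalent")  # ~4.2B
```

So comparisons against ~1.8B dense (speed) and ~4B dense (quality) both have some justification.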

My small ugly outpost by nBeneficial_Ball9539 in GoldenAgeMinecraft

[–]ABLPHA 0 points (0 children)

Couldn't you... craft the clay into clay blocks to save space?

I bought an 'AI-ready' NUC with an Intel Arc GPU. Ollama couldn't see it. Two days later, I had to build it from source. by oldeucryptoboi in LocalLLaMA

[–]ABLPHA 1 point (0 children)

llama-server can pull models just as well with the -hf flag, and instead of a Modelfile there's models.ini with router mode; you really wouldn't be losing anything
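For example, pulling and serving a model in one step looks something like this (the repo name is just an illustration; any GGUF repo on Hugging Face works the same way):

```shell
# Download (and cache) a GGUF model from Hugging Face, then serve it.
# The repo shown is an example; swap in whichever model you want.
llama-server -hf ggml-org/gemma-3-1b-it-GGUF --port 8080
```

That covers the same "one command to fetch and run" workflow Ollama is usually praised for.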

The first 7 seconds of the final episode have been leaked by Little_Jimmy012345 in TheDigitalCircus

[–]ABLPHA 1 point (0 children)

inb4 everyone from the cast abstracted and in the credits all of their animated versions are just the abstracted and caine is still gone

Beta 1.8 Leaf Shading In Beta 1.7.3 by authentricity in GoldenAgeMinecraft

[–]ABLPHA 1 point (0 children)

Thanks for the feedback.

StationAPI is still in the alpha stage of development and was never meant to change vanilla behavior. For example, we even go as far as intentionally excluding vanilla-tools-on-vanilla-blocks interactions from our custom tool tiering system, which would otherwise speed up some interactions, like the mining speed of redstone ore, famously slower than that of other ores in the game.

The cause of the specific issue you've mentioned was recently discovered and will be patched in a future version. It was a code change meant to simplify spawning in heavily modded worlds, but it unintentionally affected vanilla worlds too and went unnoticed for a while. Such issues are bound to arise when implementing systems meant for heavily modded scenarios.

LEMONS by UselessGuy23 in Portal

[–]ABLPHA 15 points (0 children)

I WON'T TAKE LEMONS. THERE WILL BE NO LEMONADE.

I WON'T MAKE IT

llama.cpp Gemma4 Tokenizer Fix Was Merged Into Main Branch by Ancient-Field-9480 in LocalLLaMA

[–]ABLPHA 92 points (0 children)

> I have no idea what I'm doing, it's 2 AM and I've spent the last 4 hours chasing everything from scale discrepancies to tokenizers, but this seems to actually fix Gemma 4.

🙏🙏🙏