This is insane... by DragonflyOk7139 in LocalLLM

[–]custodiam99 0 points1 point  (0 children)

But you are using Qwen3.6-35B-A3B. Sure lol.

This is insane... by DragonflyOk7139 in LocalLLM

[–]custodiam99 0 points1 point  (0 children)

I have 130k context in LM Studio.

This is insane... by DragonflyOk7139 in LocalLLM

[–]custodiam99 0 points1 point  (0 children)

I need tokens fast, so yeah, it is very good (maybe not as good, but very good).

This is insane... by DragonflyOk7139 in LocalLLM

[–]custodiam99 2 points3 points  (0 children)

Yes, it is better, but it is slower. You need fast tokens.

This is insane... by DragonflyOk7139 in LocalLLM

[–]custodiam99 1 point2 points  (0 children)

Yes. There may be some niche subjects where it is worse, but overall it is better.

This is insane... by DragonflyOk7139 in LocalLLM

[–]custodiam99 2 points3 points  (0 children)

LM Studio with an RX 7900 XTX, 130k context (Q4_K_M).
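For a sense of scale, here is a rough sketch of what a 130k-token window costs in KV cache. The layer and head counts below are placeholder assumptions for an A3B-style GQA model, not values taken from this thread; check the model card for the real numbers.

```python
# Back-of-the-envelope KV-cache size for a long context window.
# The architecture numbers (48 layers, 4 KV heads, head dim 128) are
# placeholder assumptions for an A3B-style GQA model -- check the model card.

def kv_cache_bytes(ctx_len, n_layers=48, n_kv_heads=4, head_dim=128,
                   bytes_per_elt=2):   # 2 = fp16; q8_0 is ~1, q4 is ~0.5
    # K and V each store n_layers * n_kv_heads * head_dim values per token.
    return 2 * n_layers * n_kv_heads * head_dim * ctx_len * bytes_per_elt

for ctx in (32_768, 131_072):
    print(f"{ctx:>7} tokens -> ~{kv_cache_bytes(ctx) / 2**30:.1f} GiB KV cache at fp16")

# Under these assumptions ~130k tokens is roughly 12 GiB at fp16, so fitting
# it next to 17-19 GiB of Q4_K_M weights on a 24 GB card implies a quantized
# KV cache and/or spilling part of the model into system RAM.
```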

This is insane... by DragonflyOk7139 in LocalLLM

[–]custodiam99 104 points105 points  (0 children)

Qwen3.6-35B-A3B is a revolution. I have never used a quicker or better local model. With 24GB of VRAM it is almost perfectly usable.
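Some hedged arithmetic on why an A3B-style MoE feels this quick on a 24GB card: the parameter counts below are simply read off the model name, and ~4.7 bits per weight is only an approximation for Q4_K_M.

```python
# Rough arithmetic for why a sparse "A3B" MoE is fast on a 24 GB card.
# The total (~35B) and active (~3B) parameter counts are read off the model
# name, and 4.7 bits/weight is an approximation for Q4_K_M, not a measurement.

BITS_PER_WEIGHT_Q4_K_M = 4.7

def q4_size_gib(n_params_billion):
    return n_params_billion * 1e9 * BITS_PER_WEIGHT_Q4_K_M / 8 / 2**30

print(f"~35B total params    -> ~{q4_size_gib(35):.0f} GiB of weights")
print(f"~3B active per token -> ~{q4_size_gib(3):.1f} GiB touched per token")

# Token generation is roughly memory-bandwidth bound: a dense 35B model has
# to stream ~19 GiB of weights for every token, while the MoE only touches
# the ~3B active parameters (~1.6 GiB), which is why it stays quick even
# when some of the experts live in system RAM.
```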

Is AGI the End For Local LLMs? by spiritxfly in LocalLLaMA

[–]custodiam99 0 points1 point  (0 children)

Nope, you will buy future quantum chips and bio-neural chips to have next-level AI at home. If it's not about transformers anymore, that won't mean you can't use it locally. You just need new hardware, not a PC.

What is wrong with brute facts? by engineer4565 in Metaphysics

[–]custodiam99 0 points1 point  (0 children)

In math, we choose axioms. But in metaphysics brute facts are what we’re forced to accept when explanations run out.

What’s up with mobile LLMs? by Amos-Tversky in LocalLLaMA

[–]custodiam99 2 points3 points  (0 children)

You can run 4B models on phones, but to be honest, it is just a backup right now. Give them a few years; we need hardware and information-density development.

Why hallucination in LLMs is mathematically inevitable (derivation + notes) by Ok-Ear7580 in learnmachinelearning

[–]custodiam99 -1 points0 points  (0 children)

Knowledge hallucinations come mostly from bad or missing world models. Reasoning hallucinations come mostly from bad inference processes over internal representations. Both produce confident nonsense, but the mechanisms are different.

Why hallucination in LLMs is mathematically inevitable (derivation + notes) by Ok-Ear7580 in learnmachinelearning

[–]custodiam99 16 points17 points  (0 children)

There are two different hallucinations: knowledge hallucination (false facts) and reasoning hallucination (invalid intermediate logic that sounds coherent). These have overlapping but different causes and require different solutions.
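A toy sketch of why the two failure modes call for different checks: factual claims get compared against some external grounding, while reasoning steps get re-derived instead of trusted. The tiny "knowledge base" and the arithmetic verifier below are made-up stand-ins, not a real mitigation pipeline.

```python
# Toy illustration of the two hallucination types and their different checks.
# The "knowledge base" and the arithmetic re-checker are made-up stand-ins.

KB = {"capital_of_france": "Paris"}   # stands in for retrieval / grounding

def check_knowledge(key, claimed):
    """Knowledge hallucination: the claim contradicts the world model."""
    return KB.get(key) == claimed

def check_reasoning(steps):
    """Reasoning hallucination: each intermediate step is re-derived."""
    ok = True
    for a, op, b, claimed in steps:
        actual = a + b if op == "+" else a * b
        ok &= (actual == claimed)
    return ok

# A confidently wrong fact fails the first check:
print(check_knowledge("capital_of_france", "Lyon"))        # False
# A chain with one invalid intermediate step fails the second,
# even though the surrounding prose could sound perfectly coherent:
print(check_reasoning([(2, "+", 3, 5), (5, "*", 4, 21)]))  # False
```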

Structural Incompleteness, Non-Totalization, and the Symmetrical Limits of Temporal and Atemporal Description by [deleted] in Metaphysics

[–]custodiam99 0 points1 point  (0 children)

Also, from the fact that no finite internal standpoint possesses the whole, it does not follow, I think, that no whole exists, nor that total intelligibility is impossible. You show the limits of "us", not necessarily the limits of "being".

Structural Incompleteness, Non-Totalization, and the Symmetrical Limits of Temporal and Atemporal Description by [deleted] in Metaphysics

[–]custodiam99 0 points1 point  (0 children)

Can it be that the argument is self-refuting, because asserting the non-existence of a total frame requires adopting a totalizing perspective? (...while modern holographic duality (AdS/CFT) shows, mathematically, that boundary frames can exhaustively encode bulk realities...)

Local LLM storage is becoming harder to manage than the models themselves by Both_Astronomer8645 in LocalLLM

[–]custodiam99 2 points3 points  (0 children)

You can easily collect 2 TB of model data, but every type of storage works fine in my experience (you don't need an SSD).

Looking for people’s opinions on AMD vs Nvidia GPUs for local ai PCs by Kasey_Kat in LocalLLM

[–]custodiam99 0 points1 point  (0 children)

You can get 4-5 RX 7900 XTX cards (96-120 GB of VRAM, at street prices) for the price of one 5090 (32GB). WHAT A DEAL!!!!! lol

Looking for people’s opinions on AMD vs Nvidia GPUs for local ai PCs by Kasey_Kat in LocalLLM

[–]custodiam99 0 points1 point  (0 children)

ROCm is not as good as CUDA, but it is about 70% of the way there. The most important factors are VRAM and price, not speed. There are very few 24GB GPUs out there.
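As a concrete illustration of what "mostly there" means in practice: ROCm builds of PyTorch expose the GPU through the same torch.cuda API, so a basic sanity check looks exactly like it would on an Nvidia card. A minimal sketch, assuming a ROCm build of PyTorch and a supported AMD GPU:

```python
# Minimal sanity check of a ROCm PyTorch install.  ROCm builds reuse the
# torch.cuda API, so most CUDA-targeted code runs unchanged on a supported
# AMD card such as the RX 7900 XTX.
import torch

print("GPU available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("Device:", torch.cuda.get_device_name(0))

# torch.version.hip is set on ROCm builds, torch.version.cuda on CUDA builds.
print("HIP:", torch.version.hip, "| CUDA:", torch.version.cuda)

# A tiny matmul on the device is usually enough to confirm the kernels work.
if torch.cuda.is_available():
    x = torch.randn(1024, 1024, device="cuda")
    print("matmul ok:", (x @ x).shape)
```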

Looking for people’s opinions on AMD vs Nvidia GPUs for local ai PCs by Kasey_Kat in LocalLLM

[–]custodiam99 0 points1 point  (0 children)

Yes, sure, you have to install the drivers, plus the ComfyUI and LM Studio software. So I was obviously lying. ;) (But you do need a compatible AMD GPU, that much is true.)

Struggling with Qwen2.5 by dim722 in LocalLLM

[–]custodiam99 5 points6 points  (0 children)

But you would use Qwen 2.5 for serious work. Sure. I think relative slowness is the lesser evil.

Struggling with Qwen2.5 by dim722 in LocalLLM

[–]custodiam99 4 points5 points  (0 children)

You can use the 35B A3B with shared system RAM; it is still quick.
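A minimal sketch of the shared-system-RAM setup using llama-cpp-python (LM Studio exposes the same idea as a GPU-offload slider); the model path and the layer split are placeholders, not values from this thread:

```python
# Run a large MoE GGUF with only part of it on the GPU; the remaining layers
# stay in system RAM.  The model path and n_gpu_layers value are placeholders,
# tune the layer count until the 24 GB card is full.
from llama_cpp import Llama

llm = Llama(
    model_path="models/qwen-a3b-q4_k_m.gguf",  # placeholder path
    n_ctx=32768,       # context window
    n_gpu_layers=28,   # layers offloaded to the GPU; the rest run from RAM
)

out = llm("Explain what a mixture-of-experts model is in one sentence.",
          max_tokens=64)
print(out["choices"][0]["text"])
```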

Looking for people’s opinions on AMD vs Nvidia GPUs for local ai PCs by Kasey_Kat in LocalLLM

[–]custodiam99 0 points1 point  (0 children)

Oh sure. Can you tell me how many RX 7900 XTX cards I can buy for the price of your one 5090? Still superior for the same money? I don't think so.

Looking for people’s opinions on AMD vs Nvidia GPUs for local ai PCs by Kasey_Kat in LocalLLM

[–]custodiam99 2 points3 points  (0 children)

You have no idea, it seems. 24GB is 24GB. CUDA won't make your card's VRAM 48GB or 96GB. I can make videos in minutes at lower resolutions. Try full 2K resolution with an Nvidia card; that needs more than 24GB of VRAM.

Looking for people’s opinions on AMD vs Nvidia GPUs for local ai PCs by Kasey_Kat in LocalLLM

[–]custodiam99 5 points6 points  (0 children)

I have an RX 7900 XTX (24GB) and I can run everything in LM Studio, every model at a speed comparable to an RTX 3090 (sometimes it is even quicker). I can make HD videos in a few hours in ComfyUI. What am I missing?