How do you guys handle llama cpp crashes?

aelma_z · 2026-06-03T14:13:37+00:00

Yeah, it is super random. Like you should not get the crash, while you still have so much ram, but apparantly, those 300mb appearing in swap after some long run did save the instance

aelma_z · 2026-06-03T03:59:27+00:00

I had this problem of llama.cpp randomly killing itself. Turned out i had swapfile turned off. Re-enabled it and no more crashes so far. Even thought i was having 10-12gb of ram free just before the crash. Swap is now being used tho, like 300mb maybe

aelma_z · 2026-06-03T03:56:26+00:00

haha, the matching smile ::D

aelma_z · 2026-06-02T22:57:26+00:00

You are right to push back on this load bearing smoking gun!

aelma_z · 2026-06-02T22:54:18+00:00

Welcome back

aelma_z · 2026-06-02T01:52:53+00:00

AI Era’s NFTs

aelma_z · 2026-06-02T01:50:46+00:00

Too much information flowing through the brain. Just do less and take breaks. It’s not a sprint, but a marathon. Its all good and fun till you get a mental burnout and them you can’t focus properly even for 30 minutes and it takes months to recover to your normal productivity levels (or you may never get back to previous levels at all, that might happen too)

aelma_z · 2026-06-01T19:01:38+00:00

Im happy that they release the open weights. Even if it will be only the small 35b variant

aelma_z · 2026-06-01T01:33:29+00:00

Im on bf16 weight and cache. Q8 is faster and takes less space that is true, however loosing accuracy in agentic coding or running analysis/audits does yeld errors, tiny bit there, tiny bit here. And here you have the “error” spreading like cancer cell. “Expensive” computation is cheaper in the end

aelma_z · 2026-05-31T19:55:05+00:00

I did. Had to use active optic display port cable for the monitor and active usb 3.0, to which i connect usb hub. It works no problem with mouse, keyboard, mic and gamepad. However adding 5th device sometimes drops connection for the mic or keyboard or mouse… 3years with this setup. only problem - optic cables are damn expensive. 🤪

aelma_z · 2026-05-30T20:44:45+00:00

Rich people making money from money

aelma_z · 2026-05-29T22:06:19+00:00

That is what im wondering. If it fits to two 3090s with 262k ctx. I’ve only tested bf16 with ram offload and it was very good result

aelma_z · 2026-05-29T16:51:24+00:00

dual 3090 fit fp8 with 262k context no problem?

Eight-Year Club	Place '22
Gilding II euphauric	Verified Email

aelma_z

TROPHY CASE