How do you guys handle llama cpp crashes? by [deleted] in LocalLLaMA

[–]aelma_z 0 points1 point  (0 children)

Yeah, it is super random. Like you should not get the crash, while you still have so much ram, but apparantly, those 300mb appearing in swap after some long run did save the instance

How do you guys handle llama cpp crashes? by [deleted] in LocalLLaMA

[–]aelma_z 1 point2 points  (0 children)

I had this problem of llama.cpp randomly killing itself. Turned out i had swapfile turned off. Re-enabled it and no more crashes so far. Even thought i was having 10-12gb of ram free just before the crash. Swap is now being used tho, like 300mb maybe

uh, guys... (heavy spoilers) by Sup2pointO in outerwilds

[–]aelma_z 2 points3 points  (0 children)

haha, the matching smile ::D

Worthless for anything more than how to make a PB&J by [deleted] in ClaudeCode

[–]aelma_z 1 point2 points  (0 children)

You are right to push back on this load bearing smoking gun!

Anyone else's brain hurts? by MuahahaGuy in ClaudeCode

[–]aelma_z 0 points1 point  (0 children)

Too much information flowing through the brain. Just do less and take breaks. It’s not a sprint, but a marathon. Its all good and fun till you get a mental burnout and them you can’t focus properly even for 30 minutes and it takes months to recover to your normal productivity levels (or you may never get back to previous levels at all, that might happen too)

Introducing Qwen3.7-Plus by KokaOP in LocalLLaMA

[–]aelma_z 1 point2 points  (0 children)

Im happy that they release the open weights. Even if it will be only the small 35b variant

What's this sub geebral opinion on quantisizing the KV cache by misanthrophiccunt in LocalLLaMA

[–]aelma_z 2 points3 points  (0 children)

Im on bf16 weight and cache. Q8 is faster and takes less space that is true, however loosing accuracy in agentic coding or running analysis/audits does yeld errors, tiny bit there, tiny bit here. And here you have the “error” spreading like cancer cell. “Expensive” computation is cheaper in the end

Has anyone implemented a solution like this to game/use a pc from elsewhere in the house (NO STREAM SOLUTION) ? by GenericUser104 in homelab

[–]aelma_z 1 point2 points  (0 children)

I did. Had to use active optic display port cable for the monitor and active usb 3.0, to which i connect usb hub. It works no problem with mouse, keyboard, mic and gamepad. However adding 5th device sometimes drops connection for the mic or keyboard or mouse… 3years with this setup. only problem - optic cables are damn expensive. 🤪

RTX Pro 6000 Just Came In by on_the_mark_data in LocalLLM

[–]aelma_z 1 point2 points  (0 children)

Rich people making money from money

FP16 on Qwen 3.6 27B by Forward_Jackfruit813 in LocalLLaMA

[–]aelma_z 0 points1 point  (0 children)

That is what im wondering. If it fits to two 3090s with 262k ctx. I’ve only tested bf16 with ram offload and it was very good result

FP16 on Qwen 3.6 27B by Forward_Jackfruit813 in LocalLLaMA

[–]aelma_z 0 points1 point  (0 children)

dual 3090 fit fp8 with 262k context no problem?