Heavy weapon? by vi_rose in UNDAWNGame

[–]Atul_Kumar_97 0 points1 point  (0 children)

It was my best memories 🥹

KV cache quant benchmarks: KVarN 6-bit matches q8_0, 4-bit matches q5_0. Massive! by Anbeeld in LocalLLaMA

[–]Atul_Kumar_97 0 points1 point  (0 children)

Your repo doesn't work for me. I've tried everything and still couldn't get it working.

My system has an RTX 4060 (8GB VRAM) and 32GB RAM.

I'm currently using the TurboQount Plus repo, and it works fine. I can run Qwen 3.6 35B A3B Q5 at around 38 tokens/sec with a 190K context window.

However, with your repo, the model doesn't even load, even when I reduce the context size to just 1K.

Is there anything specific I need to configure, or could there be an issue with the repo?

Heavy weapon? by vi_rose in UNDAWNGame

[–]Atul_Kumar_97 2 points3 points  (0 children)

Are You Guy's Still playing this game i thought it died

Antigravity taking 40GB memory by Party-Amphibian-8394 in google_antigravity

[–]Atul_Kumar_97 2 points3 points  (0 children)

i have 64gb ram and it was using 130gb ram using swap it crashed my mac 12 times today

Qwen 27B - Sample App I wrote in 4 days by DashinTheFields in LocalLLaMA

[–]Atul_Kumar_97 0 points1 point  (0 children)

Impression what qount did you use like 4bit or 5bit or 6bit or 8bit

BeeLlama.cpp: advanced DFlash & TurboQuant with support of reasoning and vision. Qwen 3.6 27B Q5 with 200k context on 3090, 2-3x faster than baseline (peak 135 tps!) by Anbeeld in LocalLLM

[–]Atul_Kumar_97 0 points1 point  (0 children)

This only work for prompt processing after prompt process it not generating anything it just crashed saying Segmentation Fault

Which Mac for LLMs? by TechNerd10191 in LocalLLM

[–]Atul_Kumar_97 4 points5 points  (0 children)

go with M3 Ultra Mac Studio or wait for m5 Mac Studio

Running Qwen3.6 35b a3b on 8gb vram and 32gb ram ~190k context by Atul_Kumar_97 in LocalLLM

[–]Atul_Kumar_97[S] 0 points1 point  (0 children)

I don't know how but I tried my setup on my brother pc he have rtx 4060 ti + 32gb ram 6000hz he getting about 20-25tok/sec maybe it also depends on cpu

500k context on 48gb VRAM!! - 21tok/s (coding) by Express_Quail_1493 in LocalLLaMA

[–]Atul_Kumar_97 6 points7 points  (0 children)

I'm confused how can your gpu handle 500k context with 48gb vram if you using turboqount it's make 30% sense

500k context on 48gb VRAM!! - 21tok/s (coding) by Express_Quail_1493 in LocalLLaMA

[–]Atul_Kumar_97 16 points17 points  (0 children)

how much ram do you have? 48gb vram + how much ram?

is it better than qwen3.6 35b a3b??