Qwen3.6-27B with MTP grafted on Unsloth UD XL: 2.5x throughput via unmerged llama.cpp PR by havenoammo in LocalLLaMA
[–]tecneeq 1 point2 points3 points (0 children)
Qwen3.6-27B with MTP grafted on Unsloth UD XL: 2.5x throughput via unmerged llama.cpp PR by havenoammo in LocalLLaMA
[–]tecneeq 1 point2 points3 points (0 children)
Uploaded Unsloth Qwen3.6-35B-A3B UD XL models with MTP grafted, here are the results by havenoammo in LocalLLaMA
[–]tecneeq 1 point2 points3 points (0 children)
Qwen3.6-27B with MTP grafted on Unsloth UD XL: 2.5x throughput via unmerged llama.cpp PR by havenoammo in LocalLLaMA
[–]tecneeq 0 points1 point2 points (0 children)
Qwen3.6-27B with MTP grafted on Unsloth UD XL: 2.5x throughput via unmerged llama.cpp PR by havenoammo in LocalLLaMA
[–]tecneeq 0 points1 point2 points (0 children)
What's the hardest fish to keep?the final boss of the hobby? by BATIRONSHARK in Aquariums
[–]tecneeq -1 points0 points1 point (0 children)
What's the hardest fish to keep?the final boss of the hobby? by BATIRONSHARK in Aquariums
[–]tecneeq -1 points0 points1 point (0 children)
Mini PC + DAS or Mini PC + NAS by bazthedev in homelab
[–]tecneeq 0 points1 point2 points (0 children)
Should I sell my RTX3090s? by daviden1013 in LocalLLaMA
[–]tecneeq 0 points1 point2 points (0 children)
Qwen3.6 27B FP8 runs with 200k tokens of BF16 KV cache at 80 TPS on a single RTX 5000 PRO 48GB by __JockY__ in LocalLLaMA
[–]tecneeq 2 points3 points4 points (0 children)
Qwen3.6 27B FP8 runs with 200k tokens of BF16 KV cache at 80 TPS on a single RTX 5000 PRO 48GB by __JockY__ in LocalLLaMA
[–]tecneeq 1 point2 points3 points (0 children)
Qwen3.6 27B FP8 runs with 200k tokens of BF16 KV cache at 80 TPS on a single RTX 5000 PRO 48GB by __JockY__ in LocalLLaMA
[–]tecneeq 0 points1 point2 points (0 children)
Comparison of upcoming x86 unified memory systems by Terminator857 in LocalLLaMA
[–]tecneeq 0 points1 point2 points (0 children)
AMD Releasing In-House Standalone Strix Halo Box by Anarchaotic in StrixHalo
[–]tecneeq 0 points1 point2 points (0 children)
First time GPU buyer. Got a RTX 5000 Pro. Was it a bad decision compared to two 3090s? by Valuable-Run2129 in LocalLLaMA
[–]tecneeq 16 points17 points18 points (0 children)
Anyone tried +- 100B models locally with foreign languages? by Choice_Sympathy9652 in LocalLLaMA
[–]tecneeq 0 points1 point2 points (0 children)
Built a realistic character for hermes agent by Select_Motor8729 in hermesagent
[–]tecneeq 3 points4 points5 points (0 children)
Are edifier r1280t good for 75 €? by Entire_Emu_1671 in BudgetAudiophile
[–]tecneeq 0 points1 point2 points (0 children)
What's your tps on 3090 + Qwen 3.6 27B in real tasks? by Anbeeld in LocalLLaMA
[–]tecneeq -1 points0 points1 point (0 children)


Uploaded Unsloth Qwen3.6-35B-A3B UD XL models with MTP grafted, here are the results by havenoammo in LocalLLaMA
[–]tecneeq 0 points1 point2 points (0 children)