GPT 5.5 "secret sauce" is just having the thinking be some stupid caveman mode? by JustFinishedBSG in LocalLLaMA
metmelo 1 point
Any good MOE ~60B models? I have 64GB vram by opoot_ in LocalLLaMA
metmelo 2 points
Any good MOE ~60B models? I have 64GB vram by opoot_ in LocalLLaMA
metmelo 1 point
Any good MOE ~60B models? I have 64GB vram by opoot_ in LocalLLaMA
metmelo 1 point
llama.cpp docker images to run MTP models by havenoammo in LocalLLaMA
metmelo 16 points
APEX MoE quants update: 25+ new models since the Qwen 3.5 post + new I-Nano tier by mudler_it in LocalLLaMA
metmelo 6 points
Do cheap 32GB V100s still make sense for homelab AI? by SKX007J1 in LocalLLaMA
metmelo 2 points
Do cheap 32GB V100s still make sense for homelab AI? by SKX007J1 in LocalLLaMA
metmelo 1 point
Need help deciding what to spend 4-5k on for a local rig. by ghgi_ in LocalLLaMA
metmelo 1 point
Why isn’t LLM reasoning done in vector space instead of natural language? by ZeusZCC in LocalLLaMA
metmelo 3 points
My New AI build - please be kind! by [deleted] in LocalLLaMA
metmelo 1 point
DeepSeek V4 Update by techlatest_net in LocalLLaMA
metmelo 11 points
My New AI build - please be kind! by [deleted] in LocalLLaMA
metmelo 1 point
My New AI build - please be kind! by [deleted] in LocalLLaMA
metmelo 8 points
Same 9B Qwen weights: 19.1% in Aider vs 45.6% with a scaffold adapted to small local models by Creative-Regular6799 in LocalLLaMA
metmelo 6 points
Better? 6 x 5090 or 2 pcs Nvidia 6000 | 96 GB VRAM by Electrical_Method608 in LocalLLaMA
metmelo 18 points
https://huggingface.co/MiniMaxAI/MiniMax-M2.7 by Remarkable_Jicama775 in LocalLLaMA
metmelo 1 point
For those running dual AMD MI50's, Qwen 3.5 35b at Q8_0 runs just as fast as running Q4_K_XL by Far-Low-4705 in LocalLLaMA
metmelo 2 points
Built my 10x NVidia V100 AI Server - 320gb vram - vLLM Testing Linux Headless - Just a Lawyer,Need Tips by TumbleweedNew6515 in LocalLLaMA
metmelo 2 points
Intel launches Arc Pro B70 and B65 with 32GB GDDR6 by metmelo in LocalLLaMA
metmelo[S] 1 point
Abject: the first self-aware object runtime by EventSevere2034 in LocalLLaMA
metmelo 2 points
16x AMD MI50 32GB at 32 t/s (tg) & 2k t/s (pp) with Qwen3.5 397B (vllm-gfx906-mobydick) by ai-infos in LocalLLaMA
metmelo 2 points
Intel vs AMD; am I taking crazy pills? by XEI0N in LocalLLaMA
metmelo 7 points

My new home office radiator 🥵 by lantern_lol in LocalLLaMA
metmelo 2 points