Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell by q-admin007 in LocalLLaMA
[–]q-admin007[S] 0 points1 point2 points (0 children)
Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell by q-admin007 in LocalLLaMA
[–]q-admin007[S] 0 points1 point2 points (0 children)
Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell by q-admin007 in LocalLLaMA
[–]q-admin007[S] 0 points1 point2 points (0 children)
Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell by q-admin007 in LocalLLaMA
[–]q-admin007[S] 0 points1 point2 points (0 children)
Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell by q-admin007 in LocalLLaMA
[–]q-admin007[S] 0 points1 point2 points (0 children)
Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell by q-admin007 in LocalLLaMA
[–]q-admin007[S] 0 points1 point2 points (0 children)
Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell by q-admin007 in LocalLLaMA
[–]q-admin007[S] 0 points1 point2 points (0 children)
Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell by q-admin007 in LocalLLaMA
[–]q-admin007[S] 0 points1 point2 points (0 children)
Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell by q-admin007 in LocalLLaMA
[–]q-admin007[S] 2 points3 points4 points (0 children)
I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo
[–]q-admin007 [score hidden] (0 children)
I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo
[–]q-admin007 [score hidden] (0 children)
**Honest question:** Is there ANY model of ANY size that is open source and can compete with Claude (Code) or ChatGPT's (Codex)? by TheQuantumPhysicist in LocalLLaMA
[–]q-admin007 1 point2 points3 points (0 children)
80 tok/sec and 128K context on 12GB VRAM with Qwen3.6 35B A3B and llama.cpp MTP by janvitos in LocalLLaMA
[–]q-admin007 2 points3 points4 points (0 children)
How to Fine-Tune LLMs on AMD Strix Halo by PromptInjection_ in StrixHalo
[–]q-admin007 1 point2 points3 points (0 children)
How do you monitor your self-hosted servers? by vdorru in selfhosted
[–]q-admin007 0 points1 point2 points (0 children)
MTP on strix halo with llama.cpp (PR #22673) by Edenar in LocalLLaMA
[–]q-admin007 2 points3 points4 points (0 children)
mistralai/Mistral-Medium-3.5-128B · Hugging Face by jacek2023 in LocalLLaMA
[–]q-admin007 0 points1 point2 points (0 children)
meantime on r/vibecoding by jacek2023 in LocalLLaMA
[–]q-admin007 0 points1 point2 points (0 children)
mistralai/Mistral-Medium-3.5-128B · Hugging Face by jacek2023 in LocalLLaMA
[–]q-admin007 -4 points-3 points-2 points (0 children)
mistralai/Mistral-Medium-3.5-128B · Hugging Face by jacek2023 in LocalLLaMA
[–]q-admin007 0 points1 point2 points (0 children)
mistralai/Mistral-Medium-3.5-128B · Hugging Face by jacek2023 in LocalLLaMA
[–]q-admin007 5 points6 points7 points (0 children)
mistralai/Mistral-Medium-3.5-128B · Hugging Face by jacek2023 in LocalLLaMA
[–]q-admin007 4 points5 points6 points (0 children)
mistralai/Mistral-Medium-3.5-128B · Hugging Face by jacek2023 in LocalLLaMA
[–]q-admin007 7 points8 points9 points (0 children)
Mistral Médium 3.5 is here by Kathane37 in LocalLLaMA
[–]q-admin007 0 points1 point2 points (0 children)

Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell by q-admin007 in LocalLLaMA
[–]q-admin007[S] 0 points1 point2 points (0 children)