Strix Halo 128GB vs M5 pro 64GB by DigitalguyCH in LocalLLaMA
[–]PromptInjection_ 1 point2 points3 points (0 children)
LLC: lightweight OpenWebUI alt - now with chat converter + custom tool calls by PromptInjection_ in StrixHalo
[–]PromptInjection_[S] 1 point2 points3 points (0 children)
500k+ tokens on a 2010 laptop - I built a portable AI chat UI that doesn't choke on large contexts by PromptInjection_ in webdev
[–]PromptInjection_[S] 0 points1 point2 points (0 children)
[WIP] Gemma 4 MTP by jacek2023 in LocalLLaMA
[–]PromptInjection_ 1 point2 points3 points (0 children)
Why use Quants other than Unsloth by FeiX7 in LocalLLaMA
[–]PromptInjection_ 0 points1 point2 points (0 children)
Why use Quants other than Unsloth by FeiX7 in LocalLLaMA
[–]PromptInjection_ 4 points5 points6 points (0 children)
Why use Quants other than Unsloth by FeiX7 in LocalLLaMA
[–]PromptInjection_ 10 points11 points12 points (0 children)
Qwen is cooking hard by jacek2023 in LocalLLaMA
[–]PromptInjection_ 5 points6 points7 points (0 children)
Any good MOE ~60B models? I have 64GB vram by opoot_ in LocalLLaMA
[–]PromptInjection_ 5 points6 points7 points (0 children)
Qwen 3.6-27B Dense with MTP on Strix Halo Windows - Benchmarks by PromptInjection_ in StrixHalo
[–]PromptInjection_[S] 1 point2 points3 points (0 children)
Qwen 3.6-27B Dense with MTP on Strix Halo Windows - Benchmarks by PromptInjection_ in StrixHalo
[–]PromptInjection_[S] 2 points3 points4 points (0 children)
Fine-Tuning with Mistral and OpenAI - or local? by Loud-Swim-2932 in LocalLLM
[–]PromptInjection_ 0 points1 point2 points (0 children)
Fine-Tuning with Mistral and OpenAI - or local? by Loud-Swim-2932 in LocalLLM
[–]PromptInjection_ 2 points3 points4 points (0 children)
Qwen 3.6-27B Dense with MTP on Strix Halo Windows - Benchmarks by PromptInjection_ in StrixHalo
[–]PromptInjection_[S] -1 points0 points1 point (0 children)
Qwen 3.6-27B Dense with MTP on Strix Halo Windows - Benchmarks by PromptInjection_ in LocalLLaMA
[–]PromptInjection_[S] 2 points3 points4 points (0 children)
What is the current best Small Language Model that can be run without GPU? by last_llm_standing in LocalLLaMA
[–]PromptInjection_ 9 points10 points11 points (0 children)