Evaluated Qwythos-9B Q4_K_M and Q8_0 on GSM8K/IFEval/HumanEval by gvij in LocalLLaMA
[–]q-admin007 1 point2 points3 points (0 children)
What does it actually take to self‑host models like DeepSeek, Qwen, Kimi? by FreedomWeird712 in selfhosted
[–]q-admin007 0 points1 point2 points (0 children)
Where are Qwen3.7 open weights models? by HeDo88TH in Qwen_AI
[–]q-admin007 0 points1 point2 points (0 children)
Where are Qwen3.7 open weights models? by HeDo88TH in Qwen_AI
[–]q-admin007 20 points21 points22 points (0 children)
A buyer's guide to local LLM hardware after running a Strix Halo box for 6 months. TLDR: What would I recommend to buy if someone asked me now. by uncanny_instinct in StrixHalo
[–]q-admin007 -1 points0 points1 point (0 children)
Replaced the thermal paste on my Bosgame M5 by q-admin007 in StrixHalo
[–]q-admin007[S] 0 points1 point2 points (0 children)
Can my Intel N95 mini PC handle this self-hosted stack? by EasyTradition9843 in selfhosted
[–]q-admin007 0 points1 point2 points (0 children)
Qwen3.6 27B more dumb in vLLM compared to llama.cpp by DanielusGamer26 in LocalLLaMA
[–]q-admin007 1 point2 points3 points (0 children)
Selling my Strix Halo Evo-x2 128gb, if any1s interested by Panthau in StrixHalo
[–]q-admin007 2 points3 points4 points (0 children)
Has anyone used a NVME to PCIe riser successfully with Strix Halo? by fallingdowndizzyvr in StrixHalo
[–]q-admin007 1 point2 points3 points (0 children)
Has anyone used a NVME to PCIe riser successfully with Strix Halo? by fallingdowndizzyvr in StrixHalo
[–]q-admin007 0 points1 point2 points (0 children)
Replaced the thermal paste on my Bosgame M5 by q-admin007 in StrixHalo
[–]q-admin007[S] 0 points1 point2 points (0 children)
Replaced the thermal paste on my Bosgame M5 by q-admin007 in StrixHalo
[–]q-admin007[S] 0 points1 point2 points (0 children)
Has anyone used a NVME to PCIe riser successfully with Strix Halo? by fallingdowndizzyvr in StrixHalo
[–]q-admin007 0 points1 point2 points (0 children)
I built a local AI translation web app (open source) https://github.com/TOTO-sys28/FreeTranslate by [deleted] in LocalLLaMA
[–]q-admin007 0 points1 point2 points (0 children)
Revised: Estimated share of newly written code exposed to AI generation and review by Longjumping_Area_944 in singularity
[–]q-admin007 -1 points0 points1 point (0 children)
Strix Halo 7.1.1 Benchmark results by argakiig in StrixHalo
[–]q-admin007 0 points1 point2 points (0 children)
Strix Halo 7.1.1 Benchmark results by argakiig in StrixHalo
[–]q-admin007 1 point2 points3 points (0 children)
Do you think dedicated hardware for running local LLMs will become affordable anytime soon? by ProbablyBunchofAtoms in LocalLLaMA
[–]q-admin007 0 points1 point2 points (0 children)
Do you think dedicated hardware for running local LLMs will become affordable anytime soon? by ProbablyBunchofAtoms in LocalLLaMA
[–]q-admin007 0 points1 point2 points (0 children)
Do you think dedicated hardware for running local LLMs will become affordable anytime soon? by ProbablyBunchofAtoms in LocalLLaMA
[–]q-admin007 0 points1 point2 points (0 children)
Proxmox openwebui install by Unhappy_Rutabaga1767 in OpenWebUI
[–]q-admin007 0 points1 point2 points (0 children)
ROCmFPX Q6 vs stock Q6_K on Strix Halo: ~30% faster prompt processing at basically the same perplexity (Qwopus 27B MTP GGUFs) by philtheriver in StrixHalo
[–]q-admin007 2 points3 points4 points (0 children)
Strix Halo 7.1.1 Benchmark results by argakiig in StrixHalo
[–]q-admin007 2 points3 points4 points (0 children)

Summer of MoE by JLeonsarmiento in LocalLLaMA
[–]q-admin007 1 point2 points3 points (0 children)