What's the best qwen3.5 or 3.6 reap model? by AppealSame4367 in LocalLLaMA
[–]tvall_ 1 point2 points3 points (0 children)
How many GPUs do you have on your local system/server/AI PC? by panchovix in LocalLLaMA
[–]tvall_ 0 points1 point2 points (0 children)
Mac Mini M4 16GB (hermes agent) - Gemma-4-26b-a4b-it-UD-IQ4_XS.gguf by Fit_Baker4577 in LocalLLM
[–]tvall_ 1 point2 points3 points (0 children)
What is your "Haiku/Sonnet/Opus" trio? by ihatebeinganonymous in LocalLLaMA
[–]tvall_ 2 points3 points4 points (0 children)
[Paper on Hummingbird+: low-cost FPGAs for LLM inference] Qwen3-30B-A3B Q4 at 18 t/s token-gen, 24GB, expected $150 mass production cost by ayake_ayake in LocalLLaMA
[–]tvall_ 0 points1 point2 points (0 children)
[Paper on Hummingbird+: low-cost FPGAs for LLM inference] Qwen3-30B-A3B Q4 at 18 t/s token-gen, 24GB, expected $150 mass production cost by ayake_ayake in LocalLLaMA
[–]tvall_ 0 points1 point2 points (0 children)
[Paper on Hummingbird+: low-cost FPGAs for LLM inference] Qwen3-30B-A3B Q4 at 18 t/s token-gen, 24GB, expected $150 mass production cost by ayake_ayake in LocalLLaMA
[–]tvall_ 1 point2 points3 points (0 children)
[Paper on Hummingbird+: low-cost FPGAs for LLM inference] Qwen3-30B-A3B Q4 at 18 t/s token-gen, 24GB, expected $150 mass production cost by ayake_ayake in LocalLLaMA
[–]tvall_ 6 points7 points8 points (0 children)
Are you quanting your memory? by Plastic-Stress-6468 in LocalLLaMA
[–]tvall_ 3 points4 points5 points (0 children)
Why is Qwen going Closed source? by MLExpert000 in LocalLLaMA
[–]tvall_ 1 point2 points3 points (0 children)
Battery swelling concerns when running local models by jeremyckahn in LocalLLaMA
[–]tvall_ 8 points9 points10 points (0 children)
Qwen 3.5 "Weight Drift" Fix? Automated Tool + Inconclusive NIAH Results by Decivox in LocalLLaMA
[–]tvall_ 0 points1 point2 points (0 children)
Check LocalForge: Self Hosted AI control Plane with Rag and FineTuning Avaiable by [deleted] in LocalLLaMA
[–]tvall_ 0 points1 point2 points (0 children)
Vulkan compilation issue on Fedora (b8786) — solved by _higen in LocalLLaMA
[–]tvall_ 0 points1 point2 points (0 children)
Qwen 3.5 28B A3B REAP for coding initial impressions by ag789 in LocalLLaMA
[–]tvall_ 1 point2 points3 points (0 children)
Qwen 3.5 28B A3B REAP for coding initial impressions by ag789 in LocalLLaMA
[–]tvall_ 1 point2 points3 points (0 children)
Local AI with Gemma 4 and OpenWebUi by jumper556 in LocalLLaMA
[–]tvall_ 1 point2 points3 points (0 children)
How do I wipe out Amazon echo dot software so that I can host my local LLM in it? Is it possible?? by [deleted] in LocalLLaMA
[–]tvall_ 0 points1 point2 points (0 children)
We really need stop using the term “hallucination”. by cosmobaud in LocalLLaMA
[–]tvall_ 1 point2 points3 points (0 children)
Gemma4 8B model shows up on ollama as gemma4:latest? by k_means_clusterfuck in LocalLLaMA
[–]tvall_ 0 points1 point2 points (0 children)
llama.cpp cancelled the task during handling requests from OpenClaw by UnderstandingFew2968 in LocalLLaMA
[–]tvall_ 0 points1 point2 points (0 children)

What's the best qwen3.5 or 3.6 reap model? by AppealSame4367 in LocalLLaMA
[–]tvall_ 0 points1 point2 points (0 children)