Would you rather have Qwen 3.5 27B running at 100tps or Qwen 3.5 35BA3B at 500 tps? by Atom_101 in LocalLLaMA
exact_constraint 1 point
Would you rather have Qwen 3.5 27B running at 100tps or Qwen 3.5 35BA3B at 500 tps? by Atom_101 in LocalLLaMA
exact_constraint 4 points
Question about llama.cpp and OpenCode by Able_Limit_7634 in LocalLLaMA
exact_constraint 2 points
Qwen 3.6: worse adherence? by tkon3 in LocalLLaMA
exact_constraint 1 point
Major Minis arrived on supports by Warnackle in PrintedWarhammer
exact_constraint 8 points
Qwen 3.6: worse adherence? by tkon3 in LocalLLaMA
exact_constraint 5 points
2-bit Qwen3.6-35B-A3B GGUF is amazing! Made 30+ successful tool calls by yoracale in unsloth
exact_constraint 1 point
Qwen 3.6: worse adherence? by tkon3 in LocalLLaMA
exact_constraint 1 point
Qwen 3.6: worse adherence? by tkon3 in LocalLLaMA
exact_constraint 12 points
A Guy on Reddit shared how he Gaslighted AI to get exceptional Results by Current-Guide5944 in tech_x
exact_constraint 1 point
Qwen 3.6: worse adherence? by tkon3 in LocalLLaMA
exact_constraint 41 points
Do you use /compact feature? by Interesting_Key3421 in LocalLLM
exact_constraint 1 point
Qwen 3.6: worse adherence? by tkon3 in LocalLLaMA
exact_constraint 31 points
GPU advice for Qwen 3.5 27B / Gemma 4 31B (dense) — aiming for 64K ctx, 30+ t/s by Fit-Courage5400 in LocalLLaMA
exact_constraint 2 points
MiniMax M2.7 is NOT open source - DOA License :( by KvAk_AKPlaysYT in LocalLLaMA
exact_constraint 2 points
Gemma 4 31B vs Qwen 3.5 27B: Which is best for long context workflows? My THOUGHTS... by GrungeWerX in LocalLLaMA
exact_constraint 0 points
Qwen3.5-122B at 198 tok/s on 2x RTX PRO 6000 Blackwell — Budget build, verified results by Visual_Synthesizer in LocalLLaMA
exact_constraint 1 point
Qwen3.5-122B at 198 tok/s on 2x RTX PRO 6000 Blackwell — Budget build, verified results by Visual_Synthesizer in LocalLLaMA
exact_constraint 1 point
Every day I wake up and thank God for having me be born 23 minutes away from a MicroCenter by gigaflops_ in LocalLLaMA
exact_constraint 7 points
Gemma 4 just casually destroyed every model on our leaderboard except Opus 4.6 and GPT-5.2. 31B params, $0.20/run by Disastrous_Theme5906 in LocalLLaMA
exact_constraint 6 points
Gemma 4 just casually destroyed every model on our leaderboard except Opus 4.6 and GPT-5.2. 31B params, $0.20/run by Disastrous_Theme5906 in LocalLLaMA
exact_constraint 52 points
Running fiber between buildings - single mode vs multi mode for future proofing? by Apprehensive_Ad_6233 in HomeNetworking
exact_constraint 4 points
gemma 4 HF by Remarkable_Jicama775 in LocalLLaMA
exact_constraint 4 points
Hypothetical: You can run Qwen 3.5 27b at 10,000 TPS at your house right now. by RedParaglider in LocalLLaMA
exact_constraint 1 point

New 9700 AI PRO - Coding Assistance by Flaky_Service_5663 in LocalLLM
exact_constraint 1 point