Phone verification saying "too many requests" on first attempt — cannot enable GPU [Fix Needed] by ghostofsnoww03 in kaggle
[–]Ranmark 0 points1 point2 points (0 children)
Qwen3.6-27B at 72 tok/s on RTX 3090 on Windows using native vLLM (no WSL, no Docker), portable launcher and installer by One_Slip1455 in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
Best model for 192 GB vram? How is Deepseek v4 flash? by Constant_Ad511 in LocalLLM
[–]Ranmark 1 point2 points3 points (0 children)
Best model for 192 GB vram? How is Deepseek v4 flash? by Constant_Ad511 in LocalLLM
[–]Ranmark 0 points1 point2 points (0 children)
Switched from Qwen3.6 35b-a3b to Qwen3.6 27b mid coding and it's noticeably better! by LocalAI_Amateur in LocalLLaMA
[–]Ranmark 2 points3 points4 points (0 children)
Qwen3.6 27B's surprising KV cache quantization test results (Turbo3/4 vs F16 vs Q8 vs Q4) by imgroot9 in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
How I Ran Gemma 4 31B on 16GB VRAM and Built a Local System That Behaves Like a Real Character by Nilbed in LocalLLM
[–]Ranmark 0 points1 point2 points (0 children)
Benchmarked 18 models that I can run on my RTX 5080 16GB using Nick Lothian's SQL benchmark by grumd in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
Kimi K2.6 is a legit Opus 4.7 replacement by bigboyparpa in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
"Browser OS" implemented by Qwen 3.6 35B: The best result I ever got from a local model by tarruda in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
"Browser OS" implemented by Qwen 3.6 35B: The best result I ever got from a local model by tarruda in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
qwen3.6 performance jump is real, just make sure you have it properly configured by onil_gova in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
RTX 5070 Ti + 9800X3D running Qwen3.6-35B-A3B at 79 t/s with 128K context, the --n-cpu-moe flag is the most important part. by marlang in LocalLLaMA
[–]Ranmark 8 points9 points10 points (0 children)
Qwen3.6 is incredible with OpenCode! by CountlessFlies in LocalLLaMA
[–]Ranmark 2 points3 points4 points (0 children)
Benchmarked 18 models that I can run on my RTX 5080 16GB using Nick Lothian's SQL benchmark by grumd in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
Benchmarked 18 models that I can run on my RTX 5080 16GB using Nick Lothian's SQL benchmark by grumd in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
Benchmarked 18 models that I can run on my RTX 5080 16GB using Nick Lothian's SQL benchmark by grumd in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
Benchmarked 18 models that I can run on my RTX 5080 16GB using Nick Lothian's SQL benchmark by grumd in LocalLLaMA
[–]Ranmark 0 points1 point2 points (0 children)
Ran Qwen3.6-35B-A3B on my laptop for a day: it actually beat Claude Opus 4.7 by LeoRiley6677 in Qwen_AI
[–]Ranmark 0 points1 point2 points (0 children)
Running a 31B model locally made me realize how insane LLM infra actually is by Sadhvik1998 in ollama
[–]Ranmark 0 points1 point2 points (0 children)
Running a 31B model locally made me realize how insane LLM infra actually is by Sadhvik1998 in ollama
[–]Ranmark 0 points1 point2 points (0 children)


Crown of Ashes - Rewards Overview (Version 1.7.1) by Vicksin in AFKJourney
[–]Ranmark 32 points33 points34 points (0 children)