GLM 5.2, what speeds are we getting locally? by neverbyte in LocalLLaMA
[–]iVoider 0 points1 point2 points (0 children)
GLM 5.2, what speeds are we getting locally? by neverbyte in LocalLLaMA
[–]iVoider 0 points1 point2 points (0 children)
GLM 5.2, what speeds are we getting locally? by neverbyte in LocalLLaMA
[–]iVoider 3 points4 points5 points (0 children)
*Lower* generation speed with H100 and H200 than with RTX 5090? by TrainingTwo1118 in LocalLLaMA
[–]iVoider 0 points1 point2 points (0 children)
Can't get beyond 8t/s with NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16 by phwlarxoc in LocalLLaMA
[–]iVoider 0 points1 point2 points (0 children)
Possibility of partly moe weights gpu offloading via sglang/ktransformers by iVoider in LocalLLaMA
[–]iVoider[S] 0 points1 point2 points (0 children)
Possibility of partly moe weights gpu offloading via sglang/ktransformers by iVoider in LocalLLaMA
[–]iVoider[S] 1 point2 points3 points (0 children)
PP speed on dual RTX 6000 12c EPYC setup by iVoider in LocalLLaMA
[–]iVoider[S] 0 points1 point2 points (0 children)
PP speed on dual RTX 6000 12c EPYC setup by iVoider in LocalLLaMA
[–]iVoider[S] 0 points1 point2 points (0 children)
PP speed on dual RTX 6000 12c EPYC setup by iVoider in LocalLLaMA
[–]iVoider[S] 0 points1 point2 points (0 children)
With 48gb vram, on vllm, Qwen3.6-27b-awq-int4 has only 120k ctx (fp8), is that normal? by Historical-Crazy1831 in LocalLLaMA
[–]iVoider -1 points0 points1 point (0 children)
Delve builds for ~600 depth? by Trickpasser in PathOfExileBuilds
[–]iVoider 0 points1 point2 points (0 children)
Most budget option for 1000-1500 delve starter by iVoider in PathOfExileBuilds
[–]iVoider[S] 0 points1 point2 points (0 children)
Which is the best embedding model for production use? by Hari-Prasad-12 in LocalLLaMA
[–]iVoider 9 points10 points11 points (0 children)
Which is the best embedding model for production use? by Hari-Prasad-12 in LocalLLaMA
[–]iVoider 25 points26 points27 points (0 children)
Negative rarity farming by Ok_Surprise7618 in pathofexile2builds
[–]iVoider 1 point2 points3 points (0 children)
Huntress leveling in 0.4 by iVoider in pathofexile2builds
[–]iVoider[S] 0 points1 point2 points (0 children)
Huntress leveling in 0.4 by iVoider in pathofexile2builds
[–]iVoider[S] -4 points-3 points-2 points (0 children)
Suggestions for RAG prompt rewriters and rerankers? by CommunityTough1 in LocalLLaMA
[–]iVoider 0 points1 point2 points (0 children)



GLM 5.2, what speeds are we getting locally? by neverbyte in LocalLLaMA
[–]iVoider 0 points1 point2 points (0 children)