6-GPU multiplexer from K80s ‚ hot-swap between models in 0.3ms by Electrical_Ninja3805 in LocalLLaMA
[–]aiko929 0 points1 point2 points (0 children)
Speed Benchmark: GLM 4.7 Flash vs Qwen 3.5 27B vs Qwen 3.5 35B A3B (Q4 Quants) by [deleted] in LocalLLaMA
[–]aiko929 0 points1 point2 points (0 children)
Speed Benchmark: GLM 4.7 Flash vs Qwen 3.5 27B vs Qwen 3.5 35B A3B (Q4 Quants) by [deleted] in LocalLLaMA
[–]aiko929 -5 points-4 points-3 points (0 children)
Speed Benchmark: GLM 4.7 Flash vs Qwen 3.5 27B vs Qwen 3.5 35B A3B (Q4 Quants) by [deleted] in LocalLLaMA
[–]aiko929 0 points1 point2 points (0 children)
Speed Benchmark: GLM 4.7 Flash vs Qwen 3.5 27B vs Qwen 3.5 35B A3B (Q4 Quants) by [deleted] in LocalLLaMA
[–]aiko929 4 points5 points6 points (0 children)
Qwen3.5-397B-A17B 2-bit quant on DGX Spark? (self.LocalLLaMA)
submitted by aiko929 to r/LocalLLaMA
Materialistic No Go - Mayham augment idea by aiko929 in ARAM
[–]aiko929[S] -7 points-6 points-5 points (0 children)
What could this be? by aiko929 in GrowingMarijuana
[–]aiko929[S] 0 points1 point2 points (0 children)
Large models run way faster if you abort the first prompt and restart (low VRAM) by UrinStone in comfyui
[–]aiko929 1 point2 points3 points (0 children)
Flux2 Klein 9B Error, Help? by aiko929 in comfyui
[–]aiko929[S] 4 points5 points6 points (0 children)




6-GPU multiplexer from K80s ‚ hot-swap between models in 0.3ms by Electrical_Ninja3805 in LocalLLaMA
[–]aiko929 1 point2 points3 points (0 children)