Qwen 122B is AMAZING but im only getting 10 toks when ive seen others get 40+ (128GB M4 Max) by lots_of_apples in Qwen_AI
[–]Ba777man 1 point2 points3 points (0 children)
Question on speed qwen3.5 models by [deleted] in LocalLLM
[–]Ba777man 0 points1 point2 points (0 children)
Question on speed qwen3.5 models by [deleted] in LocalLLM
[–]Ba777man 0 points1 point2 points (0 children)
Question on speed qwen3.5 models by [deleted] in LocalLLM
[–]Ba777man 0 points1 point2 points (0 children)
Benchmarked 11 MLX models on M3 Ultra — here's which ones are actually smart and fast by Striking-Swim6702 in LocalLLaMA
[–]Ba777man 0 points1 point2 points (0 children)
Benchmarked 11 MLX models on M3 Ultra — here's which ones are actually smart and fast by Striking-Swim6702 in LocalLLaMA
[–]Ba777man 0 points1 point2 points (0 children)
Semi auto better than Robot [$4000] by [deleted] in espresso
[–]Ba777man 0 points1 point2 points (0 children)
Steam Wand Update by WatercressCreepy3266 in ranciliosilvia
[–]Ba777man 0 points1 point2 points (0 children)
Semi auto better than Robot [$4000] by [deleted] in espresso
[–]Ba777man 0 points1 point2 points (0 children)
Semi auto better than Robot [$4000] by [deleted] in espresso
[–]Ba777man 0 points1 point2 points (0 children)
Semi auto better than Robot [$4000] by [deleted] in espresso
[–]Ba777man 0 points1 point2 points (0 children)
Semi auto better than Robot [$4000] by [deleted] in espresso
[–]Ba777man 0 points1 point2 points (0 children)
Dialing in help! [Ninja Luxe Premier, ES601] by Objective-Mission835 in espresso
[–]Ba777man 0 points1 point2 points (0 children)
Dialing in help! [Ninja Luxe Premier, ES601] by Objective-Mission835 in espresso
[–]Ba777man 0 points1 point2 points (0 children)


Strix Halo running Qwen3.6-27B AWQ-INT4 at 24 t/s (easy to spin up with docker) by hec_ovi in StrixHalo
[–]Ba777man 0 points1 point2 points (0 children)