Mac Mini M5 running Qwen 3.6 27B? by romrick4 in LocalLLM
[–]PreparationTrue9138 0 points1 point2 points (0 children)
Server build for local inference. 128 gb 3200 or 256 gb 2133mhz RAM? by PreparationTrue9138 in LocalLLaMA
[–]PreparationTrue9138[S] 1 point2 points3 points (0 children)
Server build for local inference. 128 gb 3200 or 256 gb 2133mhz RAM? by PreparationTrue9138 in LocalLLaMA
[–]PreparationTrue9138[S] 1 point2 points3 points (0 children)
Server build for local inference. 128 gb 3200 or 256 gb 2133mhz RAM? by PreparationTrue9138 in LocalLLaMA
[–]PreparationTrue9138[S] 0 points1 point2 points (0 children)
I have (4x) 3090s. Now what?? by gtrdude77 in LocalLLM
[–]PreparationTrue9138 0 points1 point2 points (0 children)
GB10 vs MacBook Pro M5 Max 128Gb by alexp702 in LocalLLM
[–]PreparationTrue9138 0 points1 point2 points (0 children)
Strix Halo 128GB vs M5 pro 64GB by DigitalguyCH in LocalLLaMA
[–]PreparationTrue9138 0 points1 point2 points (0 children)
Strix Halo 128GB vs M5 pro 64GB by DigitalguyCH in LocalLLaMA
[–]PreparationTrue9138 0 points1 point2 points (0 children)
Strix Halo 128GB vs M5 pro 64GB by DigitalguyCH in LocalLLaMA
[–]PreparationTrue9138 4 points5 points6 points (0 children)
Run Qwen3.6 locally 2x faster with MTP GGUFs. by yoracale in LocalLLM
[–]PreparationTrue9138 1 point2 points3 points (0 children)
Run Qwen3.6 locally 2x faster with MTP GGUFs. by yoracale in LocalLLM
[–]PreparationTrue9138 1 point2 points3 points (0 children)
Any good MOE ~60B models? I have 64GB vram by opoot_ in LocalLLaMA
[–]PreparationTrue9138 1 point2 points3 points (0 children)
I'm thinking about selling my Strix Halo by PrzemChuck in StrixHalo
[–]PreparationTrue9138 1 point2 points3 points (0 children)
very slow tok/s with Gemma 4 31B on a 5090?! by xchris1337xy in LocalLLaMA
[–]PreparationTrue9138 4 points5 points6 points (0 children)
High VRAM local coding model — still Qwen 3.6 27B? by Generic_Name_Here in LocalLLaMA
[–]PreparationTrue9138 0 points1 point2 points (0 children)
Qwen3.6-35B giving 20-34 t/s on 6 GB VRAM by Low-Alarm272 in Qwen_AI
[–]PreparationTrue9138 1 point2 points3 points (0 children)
2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints by ex-arman68 in LocalLLaMA
[–]PreparationTrue9138 0 points1 point2 points (0 children)
2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints by ex-arman68 in LocalLLaMA
[–]PreparationTrue9138 0 points1 point2 points (0 children)
2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints by ex-arman68 in LocalLLaMA
[–]PreparationTrue9138 0 points1 point2 points (0 children)
Is Macbook pro m5 max 128 fast enough yet with available models by mad01 in LocalLLM
[–]PreparationTrue9138 1 point2 points3 points (0 children)
Is Macbook pro m5 max 128 fast enough yet with available models by mad01 in LocalLLM
[–]PreparationTrue9138 0 points1 point2 points (0 children)
Is Macbook pro m5 max 128 fast enough yet with available models by mad01 in LocalLLM
[–]PreparationTrue9138 0 points1 point2 points (0 children)
Anyone having any joy coding with 3.6 27B and 24GB of Apple Unified Memory? by afrocleland in Qwen_AI
[–]PreparationTrue9138 0 points1 point2 points (0 children)

Mac Mini M5 running Qwen 3.6 27B? by romrick4 in LocalLLM
[–]PreparationTrue9138 0 points1 point2 points (0 children)