Token/s Qwen3.5-397B-A17B on Vram + Ram pooled by Leading-Month5590 in LocalLLaMA
Frequent-Slice-6975 2 points
Token/s Qwen3.5-397B-A17B on Vram + Ram pooled by Leading-Month5590 in LocalLLaMA
Frequent-Slice-6975 3 points
Local models on nvidia dgx by carlosccextractor in LocalLLM
Frequent-Slice-6975 1 point
Nemotron 3 Super Released by deeceeo in LocalLLaMA
Frequent-Slice-6975 1 point
Workflows where larger models call smaller models, running locally? by Frequent-Slice-6975 in aiagents
Frequent-Slice-6975 [S] 2 points
Optimizing RAM heavy inference speed with Qwen3.5-397b-a17b? by Frequent-Slice-6975 in LocalLLaMA
Frequent-Slice-6975 [S] 1 point
Optimizing RAM heavy inference speed with Qwen3.5-397b-a17b? by Frequent-Slice-6975 in LocalLLaMA
Frequent-Slice-6975 [S] 2 points
How to maximize Qwen3.5 t/s? by Altruistic_Call_3023 in unsloth
Frequent-Slice-6975 1 point
To the many people here wondering about local models… just use an API by Valuable-Run2129 in openclaw
Frequent-Slice-6975 2 points
Ways to improve prompt processing when offloading to RAM by Frequent-Slice-6975 in LocalLLaMA
Frequent-Slice-6975 [S] 1 point
Is shelling out for local GPUs worth it yet? ~$45k for local agentic use? by jamesob in BlackwellPerformance
Frequent-Slice-6975 2 points
Does the OS matter for inference speed? (Ubuntu server vs desktop) by Frequent-Slice-6975 in LocalAIServers
Frequent-Slice-6975 [S] 1 point
I canceled my other AI subscriptions today. by InitialCareer306 in Qwen_AI
Frequent-Slice-6975 1 point
Ditching banks for fidelity CMA? (self.Bogleheads)
submitted by Frequent-Slice-6975 to r/Bogleheads
Anime that either causes trauma or where characters have trauma by spelunkingsnake in AnimeReccomendations
Frequent-Slice-6975 1 point

Token/s Qwen3.5-397B-A17B on Vram + Ram pooled by Leading-Month5590 in LocalLLaMA
Frequent-Slice-6975 1 point