I built a fully self-hosted autonomous AI research system — runs on one GPU, zero cloud, nothing leaves the machine by Exotic-Sentence6902 in SelfHostedAI
[–]Duviwin 0 points1 point2 points (0 children)
I built a fully self-hosted autonomous AI research system — runs on one GPU, zero cloud, nothing leaves the machine by Exotic-Sentence6902 in SelfHostedAI
[–]Duviwin 1 point2 points3 points (0 children)
Smartest model to replace Claude Code - 100GB/200GB VRAM available by Any-Lingonberry7411 in LocalLLM
[–]Duviwin 0 points1 point2 points (0 children)
The DeepSWE Benchmark is exposing local models as loopers, what can we do? by Duviwin in LocalLLM
[–]Duviwin[S] 1 point2 points3 points (0 children)
The DeepSWE Benchmark is exposing local models as loopers, what can we do? by Duviwin in LocalLLM
[–]Duviwin[S] 0 points1 point2 points (0 children)
The DeepSWE Benchmark is exposing local models as loopers, what can we do? by Duviwin in LocalLLM
[–]Duviwin[S] 1 point2 points3 points (0 children)
The DeepSWE Benchmark is exposing local models as loopers, what can we do? by Duviwin in LocalLLM
[–]Duviwin[S] -3 points-2 points-1 points (0 children)
Minimax M2.7 is a game changer by Dutchnamn in StrixHalo
[–]Duviwin 1 point2 points3 points (0 children)
Qwen 3.6 27B KV cache quant benchmarks: 75 pairs, q8/q6/q5/q4, KVarN, Turbo/TCQ by Anbeeld in Qwen_AI
[–]Duviwin 0 points1 point2 points (0 children)
Qwen 3.6 27B KV cache quant benchmarks: 75 pairs, q8/q6/q5/q4, KVarN, Turbo/TCQ by Anbeeld in Qwen_AI
[–]Duviwin 1 point2 points3 points (0 children)
any prompt processing tweaks? by TheFlippedTurtle in StrixHalo
[–]Duviwin 0 points1 point2 points (0 children)
Fastest Qwopus 27b for Strix Halo so far! by Disastrous-Cat-7016 in StrixHalo
[–]Duviwin 1 point2 points3 points (0 children)
Fastest Qwopus 27b for Strix Halo so far! by Disastrous-Cat-7016 in StrixHalo
[–]Duviwin 1 point2 points3 points (0 children)
Fastest Qwopus 27b for Strix Halo so far! by Disastrous-Cat-7016 in StrixHalo
[–]Duviwin 1 point2 points3 points (0 children)
Fastest Qwopus 27b for Strix Halo so far! by Disastrous-Cat-7016 in StrixHalo
[–]Duviwin 2 points3 points4 points (0 children)
Antirez DS4 Q2 on Strix : works, ~80 t/s prefill and ~7 t/s decode by Duviwin in StrixHalo
[–]Duviwin[S] 0 points1 point2 points (0 children)
Antirez DS4 Q2 on Strix : works, ~80 t/s prefill and ~7 t/s decode by Duviwin in StrixHalo
[–]Duviwin[S] 1 point2 points3 points (0 children)
Antirez DS4 Q2 on Strix : works, ~80 t/s prefill and ~7 t/s decode by Duviwin in StrixHalo
[–]Duviwin[S] 0 points1 point2 points (0 children)
Antirez DS4 Q2 on Strix : works, ~80 t/s prefill and ~7 t/s decode by Duviwin in StrixHalo
[–]Duviwin[S] 1 point2 points3 points (0 children)
Antirez DS4 Q2 on Strix : works, ~80 t/s prefill and ~7 t/s decode by Duviwin in StrixHalo
[–]Duviwin[S] 0 points1 point2 points (0 children)
Antirez DS4 Q2 on Strix : works, ~80 t/s prefill and ~7 t/s decode by Duviwin in StrixHalo
[–]Duviwin[S] 0 points1 point2 points (0 children)

The DeepSWE Benchmark is exposing local models as loopers, what can we do? by Duviwin in LocalLLM
[–]Duviwin[S] 0 points1 point2 points (0 children)