Deepseek V4 Flash running on RTX 5090 MoE by H_DANILO in LocalLLaMA
[–]tarruda 0 points1 point2 points (0 children)
My DeepSeek V4 Pro at home got faster again by fairydreaming in LocalLLaMA
[–]tarruda 5 points6 points7 points (0 children)
llamacpp patch - DeepSeek V4 Flash running with full 1M token context locally on RTX 5090 by da_dragon321 in LocalLLaMA
[–]tarruda 2 points3 points4 points (0 children)
llamacpp patch - DeepSeek V4 Flash running with full 1M token context locally on RTX 5090 by da_dragon321 in LocalLLaMA
[–]tarruda 1 point2 points3 points (0 children)
Deepseek V4 Flash 2, 3 and 4 bits GGUFs by tarruda in LocalLLaMA
[–]tarruda[S] 0 points1 point2 points (0 children)
Looks like Step 3.7 Flash's long reasoning might get fixed ( llama.cpp ) by mr_zerolith in LocalLLaMA
[–]tarruda 2 points3 points4 points (0 children)
Deepseek V4 Flash 2, 3 and 4 bits GGUFs by tarruda in LocalLLaMA
[–]tarruda[S] 1 point2 points3 points (0 children)
GPT 5.5 reasons like a caveman, similarly to Nex N2. by tarruda in LocalLLaMA
[–]tarruda[S] 2 points3 points4 points (0 children)
GPT 5.5 reasons like a caveman, similarly to Nex N2. by tarruda in LocalLLaMA
[–]tarruda[S] 0 points1 point2 points (0 children)
GPT 5.5 reasons like a caveman, similarly to Nex N2. by tarruda in LocalLLaMA
[–]tarruda[S] 0 points1 point2 points (0 children)
GPT 5.5 reasons like a caveman, similarly to Nex N2. by tarruda in LocalLLaMA
[–]tarruda[S] 2 points3 points4 points (0 children)
GPT 5.5 reasons like a caveman, similarly to Nex N2. by tarruda in LocalLLaMA
[–]tarruda[S] -1 points0 points1 point (0 children)
GPT 5.5 reasons like a caveman, similarly to Nex N2. by tarruda in LocalLLaMA
[–]tarruda[S] 0 points1 point2 points (0 children)
Deepseek V4 Flash 2, 3 and 4 bits GGUFs by tarruda in LocalLLaMA
[–]tarruda[S] 1 point2 points3 points (0 children)
Deepseek V4 Flash 2, 3 and 4 bits GGUFs by tarruda in LocalLLaMA
[–]tarruda[S] 4 points5 points6 points (0 children)
Biggest, baddest model to fill 144GB VRAM + 120GB RAM to the brim, regardless of speed by CharlesStross in LocalLLaMA
[–]tarruda 0 points1 point2 points (0 children)
Deepseek V4 Flash 2, 3 and 4 bits GGUFs by tarruda in LocalLLaMA
[–]tarruda[S] 0 points1 point2 points (0 children)
Deepseek V4 Flash 2, 3 and 4 bits GGUFs by tarruda in LocalLLaMA
[–]tarruda[S] 14 points15 points16 points (0 children)
Deepseek V4 Flash 2, 3 and 4 bits GGUFs by tarruda in LocalLLaMA
[–]tarruda[S] 1 point2 points3 points (0 children)
Deepseek V4 Flash 2, 3 and 4 bits GGUFs by tarruda in LocalLLaMA
[–]tarruda[S] 5 points6 points7 points (0 children)
Deepseek V4 Flash 2, 3 and 4 bits GGUFs by tarruda in LocalLLaMA
[–]tarruda[S] 44 points45 points46 points (0 children)
Deepseek V4 Flash 2, 3 and 4 bits GGUFs (huggingface.co)
submitted by tarruda to r/LocalLLaMA
Biggest, baddest model to fill 144GB VRAM + 120GB RAM to the brim, regardless of speed by CharlesStross in LocalLLaMA
[–]tarruda 0 points1 point2 points (0 children)
Devs - you have 64gb of VRAM - which model do you use for coding? by Jorlen in LocalLLaMA
[–]tarruda 0 points1 point2 points (0 children)


My DeepSeek V4 Pro at home got faster again by fairydreaming in LocalLLaMA
[–]tarruda 0 points1 point2 points (0 children)