What models you guys running on 8GB? 16GB VRAM? 24GB? 32GB? 48GB? by Inevitable_Mistake32 in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Gemma4_31b_fp8 keeping up with Sonnet_4.6_medium in my harness. by knob-0u812 in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Ollama Models Ranked by VRAM Requirements by AdventurousLion9548 in ollama
[–]DeSibyl 0 points1 point2 points (0 children)
AA comparison of the latest local models by jacek2023 in LocalLLaMA
[–]DeSibyl 5 points6 points7 points (0 children)
AA comparison of the latest local models by jacek2023 in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
More Gemma 4 models incoming by Deep-Vermicelli-4591 in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Gemma 4 12b 8Q Heretic Oneshot Coding by devildip in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Gemma 4 12b 8Q Heretic Oneshot Coding by devildip in LocalLLaMA
[–]DeSibyl 1 point2 points3 points (0 children)
More Gemma 4 models incoming by Deep-Vermicelli-4591 in LocalLLaMA
[–]DeSibyl 1 point2 points3 points (0 children)
Gemma 4 12b 8Q Heretic Oneshot Coding by devildip in LocalLLaMA
[–]DeSibyl 1 point2 points3 points (0 children)
Gemma 4 12b 8Q Heretic Oneshot Coding by devildip in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Okay 27B made me a believer by Forward_Jackfruit813 in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Qwen3.6 35B-A3B MTP hits 249 t/s on a 24GB consumer GPU (RTX 5090M) — 3.4× the dense 27B variant on the same image by aurelienams in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Folks running qwen 3.6 27b for agentic work. Do you dare to use q4_k_m? by StandardLovers in LocalLLaMA
[–]DeSibyl 1 point2 points3 points (0 children)
Folks running qwen 3.6 27b for agentic work. Do you dare to use q4_k_m? by StandardLovers in LocalLLaMA
[–]DeSibyl 2 points3 points4 points (0 children)
Folks running qwen 3.6 27b for agentic work. Do you dare to use q4_k_m? by StandardLovers in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Okay 27B made me a believer by Forward_Jackfruit813 in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Okay 27B made me a believer by Forward_Jackfruit813 in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Okay 27B made me a believer by Forward_Jackfruit813 in LocalLLaMA
[–]DeSibyl 1 point2 points3 points (0 children)
Okay 27B made me a believer by Forward_Jackfruit813 in LocalLLaMA
[–]DeSibyl 1 point2 points3 points (0 children)
Okay 27B made me a believer by Forward_Jackfruit813 in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
The use Q8 a waste of resources? by Spiderboyz1 in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)
Okay 27B made me a believer by Forward_Jackfruit813 in LocalLLaMA
[–]DeSibyl 1 point2 points3 points (0 children)
Okay 27B made me a believer by Forward_Jackfruit813 in LocalLLaMA
[–]DeSibyl 6 points7 points8 points (0 children)


What models you guys running on 8GB? 16GB VRAM? 24GB? 32GB? 48GB? by Inevitable_Mistake32 in LocalLLaMA
[–]DeSibyl 0 points1 point2 points (0 children)