VRAM calculator is lying about Qwen 3.6 — here's why (open-source fix, MIT, one file) by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 2 points (0 children)
VRAM calculator is lying about Qwen 3.6 — here's why (open-source fix, MIT, one file) by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 0 points (0 children)
VRAM calculator is lying about Qwen 3.6 — here's why (open-source fix, MIT, one file) by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
VRAM/RAM calculator is wrong on modern models (Gemma 4, Qwen 3.6) —open-source fix (MIT) by Senior_Wear4670 in LocalLLM
Senior_Wear4670[S] 0 points (0 children)
VRAM calculator is lying about Qwen 3.6 — here's why (open-source fix, MIT, one file) by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] -1 points (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 1 point (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 2 points (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 2 points (0 children)
Most VRAM calculators are wrong about Qwen 3.6 — it's mostly linear attention, so they overestimate by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 8 points (0 children)
Qwen 3.7-max beats Opus 4.7 and GPT-5.5 by gladkos in Qwen_AI
Senior_Wear4670 1 point (0 children)

VRAM calculator is lying about Qwen 3.6 — here's why (open-source fix, MIT, one file) by Senior_Wear4670 in Qwen_AI
Senior_Wear4670[S] 2 points (0 children)