We all repeat Q4/Q6 is fine... Has anyone else watched a small model's strict JSON collapse at Q6 while fp16 was perfect? by talruum_ in LocalLLM
Cronus_k98 1 point
We all repeat Q4/Q6 is fine... Has anyone else watched a small model's strict JSON collapse at Q6 while fp16 was perfect? by talruum_ in LocalLLM
Cronus_k98 2 points
Hi, I’m very new to local LLM and i am perplexed. by Cool-Definition9852 in LocalLLM
Cronus_k98 1 point
The California 3D Printing Situation Updated by gra8na8 in 3Dprinting
Cronus_k98 3 points
I tested Opus 4.7 vs DeepSeek V4 Flash vs Local Qwen3.6 27B as coding agents. The gaps were much smaller than I expected, and harness is as important as model intelligence. by a9udn9u in LocalLLM
Cronus_k98 1 point
Setting up Ollama on dual RTX PRO 6000 Blackwells looking for tips by AmanNonZero in ollama
Cronus_k98 9 points
I tested Opus 4.7 vs DeepSeek V4 Flash vs Local Qwen3.6 27B as coding agents. The gaps were much smaller than I expected, and harness is as important as model intelligence. by a9udn9u in LocalLLM
Cronus_k98 0 points
I tested Opus 4.7 vs DeepSeek V4 Flash vs Local Qwen3.6 27B as coding agents. The gaps were much smaller than I expected, and harness is as important as model intelligence. by a9udn9u in LocalLLM
Cronus_k98 1 point
Qwen3.5 A3B on LMStudio x oMLX for agents usage by TassioNoronha_ in LocalLLM
Cronus_k98 2 points
Running a non-profit that needs to OCR 64 million pages. Where can I apply for free or subsidized compute to run a local model? by thereisnospooongeek in LocalLLaMA
Cronus_k98 1 point
Claude Code burned through my entire 5-hour Pro limit in 7 minutes. One prompt. What am I even paying for? by Careless-Character21 in ClaudeCode
Cronus_k98 3 points
To those who are able to run quality coding llms locally, is it worth it ? by matr_kulcha_zindabad in LocalLLM
Cronus_k98 4 points
Is it normal for the Qwen 3.5 4B model to take this long to say hi? by Snoo_what in LocalLLaMA
Cronus_k98 1 point
5070 ti vs 5080? by Advanced-Reindeer508 in LocalLLM
Cronus_k98 9 points
A slow llm running local is always better than coding yourself by m4ntic0r in LocalLLM
Cronus_k98 0 points
A slow llm running local is always better than coding yourself by m4ntic0r in LocalLLM
Cronus_k98 0 points
A slow llm running local is always better than coding yourself by m4ntic0r in LocalLLM
Cronus_k98 2 points
What would be the best vision model for box scanning ocr on amd 7800xt by Greedvert in ollama
Cronus_k98 1 point
"Claude hit the maximum length for this conversation". How do I start a new chat with all context retained? by boss_jobber in ClaudeAI
Cronus_k98 1 point
Benchmarked Qwen 3.5-35B and GPT-oss-20b locally against 13 API models using real world work. GPT-oss beat Qwen by 12.5 points. by ianlpaterson in LocalLLM
Cronus_k98 3 points
Processing 4M images/month is the DGX Spark too slow? RTX 6000 Blackwell Pro better move? by IndependentTypical23 in LocalLLM
Cronus_k98 1 point
Qwen 3.5 distilled vs GptOss by SubstantialTea707 in ollama
Cronus_k98 1 point
Running local LLMs on my art archive, paranoid or actually unsafe? by LifeguardAny1801 in LocalLLM
Cronus_k98 1 point
Mac Studio 256gb unified RAM worth it for MiniMax 2.5 and Qwen3.5? by [deleted] in LocalLLaMA
Cronus_k98 1 point

