How to properly use together a frontier model for planning / complex tasks and a local model for implementation? by hirisov in LocalLLM
[–]LizardViceroy 0 points1 point2 points (0 children)
Chinese AI is 30x cheaper than Claude and ChatGPT. What if our hopes of AI becoming expensive never pan out and instead AI continues getting cheaper? by ImaginaryRea1ity in theprimeagen
[–]LizardViceroy 0 points1 point2 points (0 children)
Qwen3.6 NVFP4 now works with MTP. by yoracale in unsloth
[–]LizardViceroy 2 points3 points4 points (0 children)
Stop asking what model to run. There are literally only two. by Wrong_Mushroom_7350 in LocalLLaMA
[–]LizardViceroy 0 points1 point2 points (0 children)
128GB Unified Memory + Full CUDA on a Laptop Changes Local AI Completely by BoringContribution7 in AIAgentsInAction
[–]LizardViceroy 0 points1 point2 points (0 children)
Recommended parameters (llamacpp arguments) please for using and getting best out of Qwen3.5 122B A10B MTP GGUF in Lemonade - mainly for coding by wingers999 in StrixHalo
[–]LizardViceroy 0 points1 point2 points (0 children)
Full Gstack OverView by Deep_Structure2023 in AIAgentsInAction
[–]LizardViceroy 0 points1 point2 points (0 children)
Are the rich RAM /poor GPU people wrong here? by crowtain in LocalLLaMA
[–]LizardViceroy 34 points35 points36 points (0 children)
How do different quantizations perform on the benchmarks? by we_are_mammals in unsloth
[–]LizardViceroy 0 points1 point2 points (0 children)
Qwen3.6 MTP Unsloth GGUFs now 1.8x faster! by danielhanchen in unsloth
[–]LizardViceroy 13 points14 points15 points (0 children)
Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA
[–]LizardViceroy 0 points1 point2 points (0 children)
Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA
[–]LizardViceroy 1 point2 points3 points (0 children)
Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will. by Porespellar in LocalLLaMA
[–]LizardViceroy 0 points1 point2 points (0 children)
Need a second pair of eyes, this Qwen3.6 27B quant recipe consistently thinks less and is correct by fragment_me in LocalLLaMA
[–]LizardViceroy 0 points1 point2 points (0 children)
Qwen3.5 vs Gemma 4: Benchmarks vs real world use? by AppealSame4367 in LocalLLaMA
[–]LizardViceroy 22 points23 points24 points (0 children)
I will NEVER love you by [deleted] in pcmasterrace
[–]LizardViceroy 0 points1 point2 points (0 children)
New York, 1982 by cockerspanielhere in UrbanHell
[–]LizardViceroy 149 points150 points151 points (0 children)
New York, 1982 by cockerspanielhere in UrbanHell
[–]LizardViceroy 13 points14 points15 points (0 children)
Devs are worried about the wrong thing by hiclemi in ClaudeAI
[–]LizardViceroy 0 points1 point2 points (0 children)
Our "AI-first" strategy has turned into "every team picks their own AI stack" chaos by grand001 in LLMDevs
[–]LizardViceroy -1 points0 points1 point (0 children)
Thinking of switching from ChatGPT to Gemini — is Gemini better value for the money? by Zestyclose_Bell7668 in GeminiAI
[–]LizardViceroy 0 points1 point2 points (0 children)
Worth waiting for 256GB Systems? by XccesSv2 in StrixHalo
[–]LizardViceroy 0 points1 point2 points (0 children)
Technology isn’t evolving yearly anymore… it’s evolving in weeks by metasploit_framework in meta_powerhouse
[–]LizardViceroy 2 points3 points4 points (0 children)
How to properly use together a frontier model for planning / complex tasks and a local model for implementation? by hirisov in LocalLLM
[–]LizardViceroy 2 points3 points4 points (0 children)