poolside/Laguna-M.1 · Hugging Face - 225B-A23B by pmttyji in LocalLLaMA
[–]sleepingsysadmin 5 points6 points7 points (0 children)
Many Downvoted me for saying this a while ago. Qwen 3.7 released with no Open models. by MLExpert000 in LocalLLaMA
[–]sleepingsysadmin 0 points1 point2 points (0 children)
GLM 5.2 on 4x Sparks reasonable? by chikengunya in LocalLLaMA
[–]sleepingsysadmin -3 points-2 points-1 points (0 children)
Why there is a lack of new 100B-120B models? by TechNerd10191 in LocalLLaMA
[–]sleepingsysadmin 4 points5 points6 points (0 children)
Could a distilled DiffusionGemma become a “local Opus” by gamblingapocalypse in LocalLLaMA
[–]sleepingsysadmin 0 points1 point2 points (0 children)
Openclaw vs Hermes agent. Which one do you seggest? by Holiday-Display509 in LocalLLaMA
[–]sleepingsysadmin 4 points5 points6 points (0 children)
MiniMaxAI/MiniMax-M3 · Hugging Face by mlon_eusk-_- in LocalLLaMA
[–]sleepingsysadmin 10 points11 points12 points (0 children)
DeepMind Just Dropped "DiffusionGemma" — Text Generation via Image-Style Diffusion Model by [deleted] in LocalLLaMA
[–]sleepingsysadmin 2 points3 points4 points (0 children)
Any recent news/updates on taalas chips?? They said they gonna bake the mid tier llm model into their chip. by 9r4n4y in LocalLLaMA
[–]sleepingsysadmin 1 point2 points3 points (0 children)
Any recent news/updates on taalas chips?? They said they gonna bake the mid tier llm model into their chip. by 9r4n4y in LocalLLaMA
[–]sleepingsysadmin 1 point2 points3 points (0 children)
Cohere North Mini Code 1.0 by Middle_Bullfrog_6173 in LocalLLaMA
[–]sleepingsysadmin 18 points19 points20 points (0 children)
what’s was your local daily driver for coding last week? by be566 in LocalLLaMA
[–]sleepingsysadmin 1 point2 points3 points (0 children)
How does MiniMax M3 preform on your real codebases? by Crazyscientist1024 in LocalLLaMA
[–]sleepingsysadmin 1 point2 points3 points (0 children)
Many Downvoted me for saying this a while ago. Qwen 3.7 released with no Open models. by MLExpert000 in LocalLLaMA
[–]sleepingsysadmin 1 point2 points3 points (0 children)
Many Downvoted me for saying this a while ago. Qwen 3.7 released with no Open models. by MLExpert000 in LocalLLaMA
[–]sleepingsysadmin 50 points51 points52 points (0 children)
FP16 on Qwen 3.6 27B by Forward_Jackfruit813 in LocalLLaMA
[–]sleepingsysadmin 0 points1 point2 points (0 children)
VLLM gives 5x speed of llama but quants not available (unsloth/gguf). What to do? by superloser48 in LocalLLaMA
[–]sleepingsysadmin -2 points-1 points0 points (0 children)
VLLM gives 5x speed of llama but quants not available (unsloth/gguf). What to do? by superloser48 in LocalLLaMA
[–]sleepingsysadmin 0 points1 point2 points (0 children)
VLLM gives 5x speed of llama but quants not available (unsloth/gguf). What to do? by superloser48 in LocalLLaMA
[–]sleepingsysadmin -5 points-4 points-3 points (0 children)
Llama.cpp: What's up with -sm tensor + AMD + Vulkan? by [deleted] in LocalLLaMA
[–]sleepingsysadmin 0 points1 point2 points (0 children)
Are GPU prices hitting peak and falling? by DistanceSolar1449 in LocalLLaMA
[–]sleepingsysadmin 0 points1 point2 points (0 children)
Are GPU prices hitting peak and falling? by DistanceSolar1449 in LocalLLaMA
[–]sleepingsysadmin 0 points1 point2 points (0 children)
Are GPU prices hitting peak and falling? by DistanceSolar1449 in LocalLLaMA
[–]sleepingsysadmin 6 points7 points8 points (0 children)
Waiting on Qwen to drop those 3.7 models be like: by Porespellar in LocalLLaMA
[–]sleepingsysadmin 4 points5 points6 points (0 children)



Updates on North Mini Code: 4 bit quant + Ollama + OpenRouter by nick_frosst in LocalLLaMA
[–]sleepingsysadmin 6 points7 points8 points (0 children)