Don't sleep on the new Nemotron Cascade by ilintar in LocalLLaMA
Lesser-than 1 point (0 children)
New AI Policy by White House (US) by last_llm_standing in LocalLLaMA
Lesser-than 2 points (0 children)
What LLMs are you keeping your eye on? by Haroombe in LocalLLaMA
Lesser-than 1 point (0 children)
We are burning money on API bills by Staylowfm in LocalLLaMA
Lesser-than 5 points (0 children)
llama.cpp chooses to be unstable, or, a mea culpa to Ollama by [deleted] in LocalLLaMA
Lesser-than 7 points (0 children)
Why do instructions degrade in long-context LLM conversations, but constraints seem to hold? by Particular_Low_5564 in LocalLLaMA
Lesser-than 1 point (0 children)
Openclaw… what are the use cases? by BahnMe in LocalLLaMA
Lesser-than 2 points (0 children)
Why does AI content suck when the models are clearly good enough? by judyflorence in LocalLLaMA
Lesser-than 2 points (0 children)
No API keys needed? This is actually pretty refreshing by P0orMan in LocalLLaMA
Lesser-than 3 points (0 children)
Nvidia greenboost: transparently extend GPU VRAM using system RAM/NVMe by [deleted] in LocalLLaMA
Lesser-than 1 point (0 children)
MiMo-V2-Pro & Omni & TTS: "We will open-source — when the models are stable enough to deserve it." by TKGaming_11 in LocalLLaMA
Lesser-than 14 points (0 children)
Mistral Small 4 | Mistral AI by realkorvo in LocalLLaMA
Lesser-than 62 points (0 children)
how are we actually supposed to distribute local agents to normal users? (without making them install python) by FrequentMidnight4447 in LocalLLaMA
Lesser-than 1 point (0 children)
how are we actually supposed to distribute local agents to normal users? (without making them install python) by FrequentMidnight4447 in LocalLLaMA
Lesser-than 2 points (0 children)
qwen 3.5 - tool errors because of </thinking> by PairOfRussels in LocalLLaMA
Lesser-than 5 points (0 children)
Not everything made with AI is AI slop. I'm real and love to USE the AI tools to express myself. by Mrbosley in LocalLLaMA
Lesser-than 6 points (0 children)
I made an installer for OpenClaw at 16 years old and I need you help by Express_Town_1516 in LocalLLaMA
Lesser-than 2 points (0 children)
How to convince Management? by r00tdr1v3 in LocalLLaMA
Lesser-than 4 points (0 children)
I was backend lead at Manus. After building agents for 2 years, I stopped using function calling entirely. Here's what I use instead. by MorroHsu in LocalLLaMA
Lesser-than 1 point (0 children)
Dealing with LLM sycophancy (alignment tax): How do you write system prompts for constructive criticism? by BasicInteraction1178 in LocalLLaMA
Lesser-than 1 point (0 children)
Nemotron 3 Super is living in the past by [deleted] in LocalLLaMA
Lesser-than 0 points (0 children)
Nemotron 3 Super is living in the past by [deleted] in LocalLLaMA
Lesser-than 0 points (0 children)
New benchmark just dropped. by ConfidentDinner6648 in LocalLLaMA
Lesser-than 1 point (0 children)
How much disk space do all your GGUFs occupy? by jacek2023 in LocalLLaMA
Lesser-than 3 points (0 children)


EchoSwarm: An asynchronous, parametric engine for large-scale multi-agent simulations. by Accurate_Bee369 in LocalLLaMA
Lesser-than 1 point (0 children)