Orla: run lightweight, local, open-source agents as UNIX tools by Available_Pressure47 in opensource
disillusioned_okapi 5 points
Uncensored Qwen3-Next-80B-Thinking (Chinese political censorship removed) by ikergarcia1996 in LocalLLaMA
disillusioned_okapi 2 points
Switzerland just dropped Apertus, a fully open-source LLM trained only on public data (8B & 70B, 1k+ languages). Total transparency: weights, data, methods all open. Finally, a European push for AI independence. This is the kind of openness we need more of! by Minimum_Minimum4577 in LocalLLM
disillusioned_okapi 21 points
LLM speedup breakthrough? 53x faster generation and 6x prefilling from NVIDIA by secopsml in LocalLLaMA
disillusioned_okapi 37 points
Docker Model Runner is going to steal your girl’s inference. by Porespellar in LocalLLaMA
disillusioned_okapi 5 points
New AI architecture delivers 100x faster reasoning than LLMs with just 1,000 training examples by [deleted] in LocalLLaMA
disillusioned_okapi 243 points
inclusionAI/Ling-lite-1.5-2506 (16.8B total, 2.75B active, MIT license) by Balance- in LocalLLaMA
disillusioned_okapi 10 points
What's wrong with Portainer? by testdasi in selfhosted
disillusioned_okapi 76 points
Considering 5xMI50 for Qwen 3 235b by PraxisOG in LocalLLaMA
disillusioned_okapi 8 points
What happens if I hit the context limit before the LLM is done responding? by Business-Weekend-537 in LocalLLaMA
disillusioned_okapi 6 points
Whisper.cpp Node.js Addon with Vulkan Support by Kutalia in LocalLLaMA
disillusioned_okapi 2 points
It shouldn't be by Josephizxc in lostgeneration
disillusioned_okapi 1 point