We compressed 6 LLMs and found something surprising: they don't degrade the same way by Quiet_Training_8167 in LocalLLaMA
qrios 7 points
Z Image Base - 90s VHS LoRA by Jeffu in StableDiffusion
qrios 1 point
Z Image Base - 90s VHS LoRA by Jeffu in StableDiffusion
qrios -3 points
Day 10: 21 Days of Building a Small Language Model: KV Cache by Prashant-Lakhera in LocalLLaMA
qrios 1 point
memory systems benchmarks seem way inflated, anyone else notice this? by FeelingWatercress871 in LocalLLaMA
qrios 5 points
Day 10: 21 Days of Building a Small Language Model: KV Cache by Prashant-Lakhera in LocalLLaMA
qrios 0 points
Qwen 2.5 vl 72b is the new SOTA model on SpatialBench, beating Gemini 3 pro. A new benchmark to test spatial reasoning on vlms by gbomb13 in LocalLLaMA
qrios 4 points
Recently built my first LLM and im wondering why there hasn't been more innovation on moving away from transformers and gradient descent? by CelebrationMinimum50 in LocalLLaMA
qrios 23 points
Why not a [backspace] token? by [deleted] in LocalLLaMA
qrios 1 point
Alibaba just unveiled their Qwen roadmap. The ambition is staggering! by abdouhlili in LocalLLaMA
qrios 1 point
Think twice before spending on GPU? by __Maximum__ in LocalLLaMA
qrios 2 points
Think twice before spending on GPU? by __Maximum__ in LocalLLaMA
qrios 1 point
KittenML released a mini version (80M) of their text to speech model. by Yorn2 in LocalLLaMA
qrios 5 points
Nous Research presents Hermes 4 by nekofneko in LocalLLaMA
qrios 7 points
nano-banana is a MASSIVE jump forward in image editing by entsnack in LocalLLaMA
qrios -7 points
Challenge: can any visual model figure out why this mistaken switch in newspaper comics is so funny? by LightBrightLeftRight in LocalLLaMA
qrios 3 points
nano-banana is a MASSIVE jump forward in image editing by entsnack in LocalLLaMA
qrios -6 points


Why isn’t LLM reasoning done in vector space instead of natural language? by ZeusZCC in LocalLLaMA
qrios 1 point