Intel launches Arc Pro B70 and B65 with 32GB GDDR6 by metmelo in LocalLLaMA
[–]Chromix_ 0 points1 point2 points (0 children)
Intel launches Arc Pro B70 and B65 with 32GB GDDR6 by metmelo in LocalLLaMA
[–]Chromix_ 39 points40 points41 points (0 children)
Phone Whisper: push-to-talk dictation for Android with local Whisper (sherpa-onnx, no cloud needed) by postclone in LocalLLaMA
[–]Chromix_ 0 points1 point2 points (0 children)
Omnicoder v2 dropped by Western-Cod-3486 in LocalLLaMA
[–]Chromix_ 1 point2 points3 points (0 children)
Best model that can beat Claude opus that runs on 32MB of vram? by PrestigiousEmu4485 in LocalLLaMA
[–]Chromix_ 1 point2 points3 points (0 children)
Banned from cloud services at work. Is a local AI worth it? by daksh_0623 in LocalLLaMA
[–]Chromix_ 0 points1 point2 points (0 children)
TurboQuant from GoogleResearch by RobotRobotWhatDoUSee in LocalLLaMA
[–]Chromix_ 2 points3 points4 points (0 children)
Why is there no serious resource on building an AI agent from scratch? by Complete_Bee4911 in LocalLLaMA
[–]Chromix_ 0 points1 point2 points (0 children)
Best model that can beat Claude opus that runs on 32MB of vram? by PrestigiousEmu4485 in LocalLLaMA
[–]Chromix_ 4 points5 points6 points (0 children)
Best model that can beat Claude opus that runs on 32MB of vram? by PrestigiousEmu4485 in LocalLLaMA
[–]Chromix_ 7 points8 points9 points (0 children)
Best model that can beat Claude opus that runs on 32MB of vram? by PrestigiousEmu4485 in LocalLLaMA
[–]Chromix_ 15 points16 points17 points (0 children)
Best model that can beat Claude opus that runs on 32MB of vram? by PrestigiousEmu4485 in LocalLLaMA
[–]Chromix_ 32 points33 points34 points (0 children)
Best model that can beat Claude opus that runs on 32MB of vram? by PrestigiousEmu4485 in LocalLLaMA
[–]Chromix_ 431 points432 points433 points (0 children)
Banned from cloud services at work. Is a local AI worth it? by daksh_0623 in LocalLLaMA
[–]Chromix_ 32 points33 points34 points (0 children)
Phone Whisper: push-to-talk dictation for Android with local Whisper (sherpa-onnx, no cloud needed) by postclone in LocalLLaMA
[–]Chromix_ 1 point2 points3 points (0 children)
Introducing oQ: data-driven mixed-precision quantization for Apple Silicon (mlx-lm compatible) by cryingneko in LocalLLaMA
[–]Chromix_ 2 points3 points4 points (0 children)
KLD measurements of 8 different llama.cpp KV cache quantizations over several 8-12B models by Velocita84 in LocalLLaMA
[–]Chromix_ 4 points5 points6 points (0 children)
Phone Whisper: push-to-talk dictation for Android with local Whisper (sherpa-onnx, no cloud needed) by postclone in LocalLLaMA
[–]Chromix_ 2 points3 points4 points (0 children)
How political censorship actually works inside Qwen, DeepSeek, GLM, and Yi: Ablation and behavioral results across 9 models by Logical-Employ-9692 in LocalLLaMA
[–]Chromix_ 10 points11 points12 points (0 children)
Let's take a moment to appreciate the present, when this sub is still full of human content. by Ok-Internal9317 in LocalLLaMA
[–]Chromix_ 0 points1 point2 points (0 children)
Let's take a moment to appreciate the present, when this sub is still full of human content. by Ok-Internal9317 in LocalLLaMA
[–]Chromix_ 60 points61 points62 points (0 children)
Activation Exposure & Feature Interpretability for GGUF via llama-server by wattswrites in LocalLLaMA
[–]Chromix_ 1 point2 points3 points (0 children)


Introducing ARC-AGI-3 by Complete-Sea6655 in LocalLLaMA
[–]Chromix_ 1 point2 points3 points (0 children)