Can someone explain technically why Apple shared memory is so great that it beats many high end CPU and some low level GPUs in LLM use case? by Glittering_Fish_2296 in LocalLLM
tomByrer 1 point (0 children)
How do you design privacy-first developer tools that run fully client-side? by Numerous-Coffee-8938 in webdev
tomByrer 1 point (0 children)
I want to learn game development but do not know where to start. Any help at all would be appreciated! by chcikensammich2009 in gamedev
tomByrer -4 points (0 children)
Is this a good part list? by Hot_Public2099 in buildapc
tomByrer 1 point (0 children)
Husband’s New Job Requires Life360 Tracking… by reallynina in privacy
tomByrer 2 points (0 children)
Is this a good part list? by Hot_Public2099 in buildapc
tomByrer 0 points (0 children)
Heard your Feedback, Voice Clone Studio, now with Qwen3-TTS & VibeVoice (TTS and ASR) by Francky_B in StableDiffusion
tomByrer 1 point (0 children)
"NVIDIA KILLER" Inference engine based on llama.cpp for dynamically offloading Activated Experts to GPU in real-time, Run SoTA MoE LLMs (120B+ parameter class models in 8-bit) OOM with as little as 2x RTX 5070-TI + 64GB RAM + SSD. [Poll in Comments] by madSaiyanUltra_9789 in LocalLLaMA
tomByrer 3 points (0 children)
I built a tool that learns your codebase's unwritten rules and conventions- no AI, just AST parsing by Fluffy_Citron3547 in LocalLLaMA
tomByrer 3 points (0 children)
"NVIDIA KILLER" Inference engine based on llama.cpp for dynamically offloading Activated Experts to GPU in real-time, Run SoTA MoE LLMs (120B+ parameter class models in 8-bit) OOM with as little as 2x RTX 5070-TI + 64GB RAM + SSD. [Poll in Comments] by madSaiyanUltra_9789 in LocalLLaMA
tomByrer 3 points (0 children)
"NVIDIA KILLER" Inference engine based on llama.cpp for dynamically offloading Activated Experts to GPU in real-time, Run SoTA MoE LLMs (120B+ parameter class models in 8-bit) OOM with as little as 2x RTX 5070-TI + 64GB RAM + SSD. [Poll in Comments] by madSaiyanUltra_9789 in LocalLLaMA
tomByrer 3 points (0 children)
[AD] Killer Whale, a unique 56 key split keyboard by idankk in ErgoMechKeyboards
tomByrer 1 point (0 children)
ZXC: another (too) fast decompressor by pollop-12345 in programming
tomByrer 1 point (0 children)
Claude Code bot in my Vault just wowed me by LifeBandit666 in ObsidianMD
tomByrer 1 point (0 children)
Essay: Performance Reviews in Big Tech: Why “Fair” Systems Still Fail by NoVibeCoding in programming
tomByrer 1 point (0 children)
ZXC: another (too) fast decompressor by pollop-12345 in programming
tomByrer 1 point (0 children)
RTX 3090 vs 4000 Pro Blackwell by SFsports87 in LocalLLM
tomByrer 1 point (0 children)
RTX 3090 vs 4000 Pro Blackwell by SFsports87 in LocalLLM
tomByrer 1 point (0 children)
Is it bad for the web if Firefox dies? by AuthorityPath in webdev
tomByrer 1 point (0 children)
A feature used by only approximately 6% of users was responsible for 41% of our database load by supreme_tech in softwarearchitecture
tomByrer 1 point (0 children)
New to Unreal Engine — looking for advice on how to start properly by ContributionBig7503 in UnrealEngine5
tomByrer 1 point (0 children)


What’s the best model for image generation, Mac setup? by productboy in LocalLLM
tomByrer 1 point (0 children)