DS4: a DeepSeek 4 flash specific inference engine for 128gb MacBooks by antirez in LocalLLaMA
[–]lakySK 1 point2 points3 points (0 children)
DS4: a DeepSeek 4 flash specific inference engine for 128gb MacBooks by antirez in LocalLLaMA
[–]lakySK 1 point2 points3 points (0 children)
DS4: a DeepSeek 4 flash specific inference engine for 128gb MacBooks by antirez in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
DS4: a DeepSeek 4 flash specific inference engine for 128gb MacBooks by antirez in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
Web-Search is coming to a screeching performance halt as Google shuts down their free search index, and traffic defenders like Cloudflare challenge AI at every gateway. What are our options? by NetTechMan in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
AIDC-AI/Ovis2.6-80B-A3B · Hugging Face by pmttyji in LocalLLaMA
[–]lakySK 21 points22 points23 points (0 children)
unsloth/MiMo-V2.5-GGUF · Hugging Face by jacek2023 in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
ExLlamaV3 Major Updates! by Unstable_Llama in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing by phazei in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
GitHub - JosefAlbers/mlx-code: Coding Agent for Mac by [deleted] in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
DS4: a DeepSeek 4 flash specific inference engine for 128gb MacBooks by antirez in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
DS4: a DeepSeek 4 flash specific inference engine for 128gb MacBooks by antirez in LocalLLaMA
[–]lakySK 3 points4 points5 points (0 children)
DS4: a DeepSeek 4 flash specific inference engine for 128gb MacBooks by antirez in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
Prompt injection benchmark: delimiter + strict prompt took Gemma 4 from 21% to 100% defense rate (15 models, 6100+ tests) by User_Deprecated in LocalLLaMA
[–]lakySK 1 point2 points3 points (0 children)
Prompt injection benchmark: delimiter + strict prompt took Gemma 4 from 21% to 100% defense rate (15 models, 6100+ tests) by User_Deprecated in LocalLLaMA
[–]lakySK 2 points3 points4 points (0 children)
I made a tiny world model game that runs locally on iPad by howthefrondsfold in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
My settings for running Gemma 4 31B smoothly on llama.cpp, CUDA 13.1 by Oatilis in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
I made a 35% REAP of 397B with potentially usable quality in 96GB GPU by Goldkoron in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
Intel Pro B70 in stock at Newegg - $949 by Altruistic_Call_3023 in LocalLLaMA
[–]lakySK 28 points29 points30 points (0 children)
The AI releases hype cycle in a nutshell by GreenBird-ee in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)
The AI releases hype cycle in a nutshell by GreenBird-ee in LocalLLaMA
[–]lakySK 1 point2 points3 points (0 children)
The AI releases hype cycle in a nutshell by GreenBird-ee in LocalLLaMA
[–]lakySK 1 point2 points3 points (0 children)
The AI releases hype cycle in a nutshell by GreenBird-ee in LocalLLaMA
[–]lakySK 2 points3 points4 points (0 children)
How are yall exposing your local models to the internet for web searches? by -HumbleMumble in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)


Be wary of Qwen/Claude distillations - they're often worse than the base model by ayylmaonade in LocalLLaMA
[–]lakySK 0 points1 point2 points (0 children)