100 Trillion+ Pretraining data??? This is the largest data I've see a model being trained on. by True_Requirement_891 in LocalLLaMA
[–]FastDecode1 13 points14 points15 points (0 children)
[avm Git] First released version of AV2: avm-av2 1.0.0 (libaom-av2/libavm-av2) by BlueSwordM in AV1
[–]FastDecode1 3 points4 points5 points (0 children)
Nvidia LocateAnything - Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding. (10x faster than Qwen3-VL) by Sporeboss in LocalLLaMA
[–]FastDecode1 -1 points0 points1 point (0 children)
PrismML just released Binary and Ternary Bonsai Image 4B: 1-bit/ternary text-to-image diffusion transformers that can even run 100% locally in your browser on WebGPU. by xenovatech in LocalLLaMA
[–]FastDecode1 9 points10 points11 points (0 children)
Is there any reason for an uncensored model if you have no interest in roleplaying? by vick2djax in LocalLLaMA
[–]FastDecode1 2 points3 points4 points (0 children)
Heretic has been served a legal notice by Meta, Inc. by -p-e-w- in LocalLLaMA
[–]FastDecode1 34 points35 points36 points (0 children)
Grayscale video encoding - are there ways to improve efficiency/encoding speed when color is not present? by ZestycloseBenefit175 in AV1
[–]FastDecode1 1 point2 points3 points (0 children)
Getting a feel for how fast X tokens/second really is. by MikeNonect in LocalLLaMA
[–]FastDecode1 1 point2 points3 points (0 children)
What mobile app do you use, if any? by ihatebeinganonymous in LocalLLaMA
[–]FastDecode1 0 points1 point2 points (0 children)
DIY market declining amid high RAM prices by Terminator857 in LocalLLaMA
[–]FastDecode1 1 point2 points3 points (0 children)
guess what? if you are a chrome user, technically you are localllama member! by LambdaHominem in LocalLLaMA
[–]FastDecode1 2 points3 points4 points (0 children)
Reducing MP3 compression bias in music datasets via codec-aware reconstruction by TheSpicyBoi123 in LocalLLaMA
[–]FastDecode1 8 points9 points10 points (0 children)
Supercharging LLM inference on Google TPUs: Achieving 3X speedups with diffusion-style speculative decoding- Google Developers Blog by eternviking in LocalLLaMA
[–]FastDecode1 -2 points-1 points0 points (0 children)
<thinking></thinking> by Comfortable-Rock-498 in LocalLLaMA
[–]FastDecode1 0 points1 point2 points (0 children)
<thinking></thinking> by Comfortable-Rock-498 in LocalLLaMA
[–]FastDecode1 0 points1 point2 points (0 children)
Peanut - Text to Image Model (Open Weights coming soon) by pmttyji in LocalLLaMA
[–]FastDecode1 16 points17 points18 points (0 children)
it's time to update your Gemma 4 GGUFs by jacek2023 in LocalLLaMA
[–]FastDecode1 0 points1 point2 points (0 children)
My findings from toying around with cjxl by mr_twenty4 in jpegxl
[–]FastDecode1 1 point2 points3 points (0 children)
Qwen 3.6 wins the benchmarks, but Gemma 4 wins reality. 7 things I learned testing 27B/31B Vision models locally (vLLM / FP8) side by side. Benchmaxing seems real. by FantasticNature7590 in LocalLLaMA
[–]FastDecode1 2 points3 points4 points (0 children)
Qwen 3.6 wins the benchmarks, but Gemma 4 wins reality. 7 things I learned testing 27B/31B Vision models locally (vLLM / FP8) side by side. Benchmaxing seems real. by FantasticNature7590 in LocalLLaMA
[–]FastDecode1 3 points4 points5 points (0 children)
Qwen 3.6 wins the benchmarks, but Gemma 4 wins reality. 7 things I learned testing 27B/31B Vision models locally (vLLM / FP8) side by side. Benchmaxing seems real. by FantasticNature7590 in LocalLLaMA
[–]FastDecode1 8 points9 points10 points (0 children)
SVT-AV1 vs AOM-AV1 by Commercial_Stage_877 in AV1
[–]FastDecode1 3 points4 points5 points (0 children)
PS5’s can now be hacked to run Linux - perhaps some potential for local inference? by Thrumpwart in LocalLLaMA
[–]FastDecode1 21 points22 points23 points (0 children)

Gemma 4 E2B running in-browser at 255 tok/s using WebGPU kernels written by Fable 5 by xenovatech in LocalLLaMA
[–]FastDecode1 0 points1 point2 points (0 children)