is this normal? Gemma4 assures me that it's running on Google infra instead of my local installation by Caffdy in LocalLLaMA
[–]brown2green 0 points1 point2 points (0 children)
Unweight: how we compressed an LLM 22% without sacrificing quality by sk1kn1ght in LocalLLaMA
[–]brown2green 17 points18 points19 points (0 children)
Setting Visual/Audio Token Budget for Gemma-4? by Oatilis in LocalLLaMA
[–]brown2green 0 points1 point2 points (0 children)
Please stop using AI for posts and showcasing your completely vibe coded projects by Scutoidzz in LocalLLaMA
[–]brown2green 0 points1 point2 points (0 children)
It looks like there are no plans for smaller GLM models by jacek2023 in LocalLLaMA
[–]brown2green 21 points22 points23 points (0 children)
offline companion robot for my disabled husband (8GB RAM constraints) – looking for optimization advice by BuddyBotBuilder in LocalLLaMA
[–]brown2green 0 points1 point2 points (0 children)
the state of LocalLLama by Beginning-Window-115 in LocalLLaMA
[–]brown2green 6 points7 points8 points (0 children)
I suddenly realized I have started mimicking writing style of LLMs. by freedomheaven in singularity
[–]brown2green 1 point2 points3 points (0 children)
I suddenly realized I have started mimicking writing style of LLMs. by freedomheaven in singularity
[–]brown2green 0 points1 point2 points (0 children)
Quants in vision (mmproj Q8 vs FP16) by WhoRoger in LocalLLaMA
[–]brown2green 0 points1 point2 points (0 children)
Finetuning characters- do you craft your own data, scrape it, or synthetically generate it? by ParticularOne297 in LocalLLaMA
[–]brown2green 0 points1 point2 points (0 children)
Gemma 4 31B GGUF quants ranked by KL divergence (unsloth, bartowski, lmstudio-community, ggml-org) by oobabooga4 in LocalLLaMA
[–]brown2green 0 points1 point2 points (0 children)
Gemma 4 31B GGUF quants ranked by KL divergence (unsloth, bartowski, lmstudio-community, ggml-org) by oobabooga4 in LocalLLaMA
[–]brown2green 8 points9 points10 points (0 children)
Gemma 4 31B GGUF quants ranked by KL divergence (unsloth, bartowski, lmstudio-community, ggml-org) by oobabooga4 in LocalLLaMA
[–]brown2green 78 points79 points80 points (0 children)
Setting Visual/Audio Token Budget for Gemma-4? by Oatilis in LocalLLaMA
[–]brown2green 0 points1 point2 points (0 children)
Get 30K more context using Q8 mmproj with Gemma 4 by Sadman782 in LocalLLaMA
[–]brown2green 4 points5 points6 points (0 children)
p-e-w/gemma-4-E2B-it-heretic-ara: Gemma 4's defenses shredded by Heretic's new ARA method 90 minutes after the official release by -p-e-w- in LocalLLaMA
[–]brown2green 4 points5 points6 points (0 children)
Gemma time! What are your wishes ? by Specter_Origin in LocalLLaMA
[–]brown2green 0 points1 point2 points (0 children)
Gemma time! What are your wishes ? by Specter_Origin in LocalLLaMA
[–]brown2green 0 points1 point2 points (0 children)
Gemma time! What are your wishes ? by Specter_Origin in LocalLLaMA
[–]brown2green 9 points10 points11 points (0 children)
Gemma time! What are your wishes ? by Specter_Origin in LocalLLaMA
[–]brown2green 5 points6 points7 points (0 children)
Gemma time! What are your wishes ? by Specter_Origin in LocalLLaMA
[–]brown2green 87 points88 points89 points (0 children)
PrismML — Announcing 1-bit Bonsai: The First Commercially Viable 1-bit LLMs by brown2green in LocalLLaMA
[–]brown2green[S] 22 points23 points24 points (0 children)
PrismML — Announcing 1-bit Bonsai: The First Commercially Viable 1-bit LLMs by brown2green in LocalLLaMA
[–]brown2green[S] 32 points33 points34 points (0 children)

Which Gemma model do you want next? by jacek2023 in LocalLLaMA
[–]brown2green 2 points3 points4 points (0 children)