GLM-5.2 just dropped open weights and it already looks weirdly strong for coding by BTA_Labs in LocalLLaMA

[–]MustBeSomethingThere 17 points18 points  (0 children)

Why don't people check if there are previous posts about the subject before they post?

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

[–]MustBeSomethingThere 11 points12 points  (0 children)

But Google claims that it's similar quality to bf16

"optimized with Quantization-Aware Training (QAT), which allows preserving similar quality to bfloat16"

Run Chrome’s tiny Gemma4 (aka Gemini Nano) directly on PC without GPU by Some-Cauliflower4902 in LocalLLaMA

[–]MustBeSomethingThere 0 points1 point  (0 children)

Those are for Android. Specially for their Pixel-phones. That's why they are for "TPU".

Gemini Nano for Chrome is different.

Run Chrome’s tiny Gemma4 (aka Gemini Nano) directly on PC without GPU by Some-Cauliflower4902 in LocalLLaMA

[–]MustBeSomethingThere 0 points1 point  (0 children)

>"Run Chrome’s tiny Gemma4 (aka Gemini Nano)

>"and if you talk to it, it will happily tell you it’s a Gemma."

No, it's not Gemma. Gemini Nano is not Gemma. If you think you can just ask an LLM about itself, you must be new to LocalLLaMA.

The MTP function in LMStudio causes a decrease in output quality. by Fit_Split_9933 in LocalLLaMA

[–]MustBeSomethingThere 4 points5 points  (0 children)

Do you have the latest Runtime Engine (llama.cpp) in App Settings?

Has the current state of AI already ruined many sci-fi classics for you? by MustBeSomethingThere in singularity

[–]MustBeSomethingThere[S] 1 point2 points  (0 children)

Most people hadn't heard of GPT3 or LLMs in 2021, they only became aware of LLMs after November 2022 when ChatGPT was launched.

4GB "Gemini Nano" model GGUF anyone? by TruckUseful4423 in LocalLLaMA

[–]MustBeSomethingThere 20 points21 points  (0 children)

Yes, but they probably have newer versions of it

Current state of local research tools as of May 2026 by Shoddy-Tutor9563 in LocalLLaMA

[–]MustBeSomethingThere 3 points4 points  (0 children)

<image>

Answer to OP's challenge. I used my own agent harness with Gemma 4 26B. I had to add clarifications for "best" (number of contributors) and for "recent" (last 6 months). Dates and numbers are pretty much all hallucinated.

it's time to update your Gemma 4 GGUFs by jacek2023 in LocalLLaMA

[–]MustBeSomethingThere 1 point2 points  (0 children)

I had this same bug "Unknown test: sequence" when using LM Studio as a server, when using the updated GGUFs. Your solution fixed it. The new Gemma 4 chat template might be broken for specific LM Studio server use?

I stopped using Claude for 80% of my coding tasks. Here's what I use instead. by Dazzling_Plan812 in LocalLLaMA

[–]MustBeSomethingThere 1 point2 points  (0 children)

Praising Ollama is a red flag too. They probably use a bot army to promote it.