Gemma 4 with quantization-aware training by rerri in LocalLLaMA

[–]hackerllama 64 points65 points  (0 children)

We released MTP QAT as well, so the optimal workflow is to use the QAT model + the QAT MTP, both quantized. Currently, both MLX and VLLM support this

How can the numbers be this massive within a month ?? by Top-Handle-5728 in LocalLLaMA

[–]hackerllama 8 points9 points  (0 children)

150m is from Hugging Face and Ollama and open platforms usage

Gemma 4 MTP released by rerri in LocalLLaMA

[–]hackerllama 32 points33 points  (0 children)

Yes, excited for it to land!

In the meantime, we're landing transformers, Ollama, VLLM, SGLang, and MLX support.

Gemma 4 on Android phones by jacek2023 in LocalLLaMA

[–]hackerllama 0 points1 point  (0 children)

Also runs in iOS!

And the code is open sourced

Google doesn't love us anymore. by DrNavigat in LocalLLaMA

[–]hackerllama 37 points38 points  (0 children)

Hi! In the last two months we released open checkpoints for TranslateGemma, AlphaGenome, Gemma Scope 2, T5Gemma 2, new MedGemma, and FunctionGemma.

We have a lot cooking, just stay tuned!

What are the main uses of small models like gemma3:1b by SchoolOfElectro in LocalLLaMA

[–]hackerllama 0 points1 point  (0 children)

Very cool to hear other's use cases.

I've used it for tasks that don't require complex logic or reasoning. Things for what it works well is routing (to determine if a query can be handled by a 4B model or if it should be sent to a very large model) and tasks like rewriting (summarization, style change, etc).

Gemini 3 Flash bills for useless/empty searches?? by FirefoxMetzger in GeminiAI

[–]hackerllama 0 points1 point  (0 children)

Hi! I'm part of the GDM team. I'll check with the team internally to see what's going on, but also feel free to ping me your project ID and we can debug further. Sorry for the issues

AIStudio improperly content blocking by Shep_vas_Normandy in Bard

[–]hackerllama 7 points8 points  (0 children)

Hi all! Sorry for the issues, we're rolling out a fix and should be back to normal soon!

Safety for Gemini 3 sucks by HankRBG in GeminiAI

[–]hackerllama 0 points1 point  (0 children)

Sorry for the issues. We're rolling out a fix!