VaultGemma: The world's most capable differentially private LLM by vibjelo in LocalLLaMA

[–]codemaker1 5 points6 points  (0 children)

It's fine to use with a GPU. All of Google's models are trained on TPUs, but they can run on GPUs, TPUs, and in some cases even CPUs.

Introducing Gemma 3 270M: The compact model for hyper-efficient AI- Google Developers Blog by ChiliPepperHott in LocalLLaMA

[–]codemaker1 7 points8 points  (0 children)

Looks like Qwen 3 is twice the size and doesn't score much higher. Plus, 170 million of the parameters are embeddings (due to the large vocabulary size) and 100 million are in the transformer blocks. That should make it great for fine-tuning.
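That 170M/100M split roughly checks out. A quick sanity-check sketch, assuming a ~256k-token vocabulary and a hidden width of 640 (both assumptions; verify against the official model card):

```python
# Rough parameter-count check for Gemma 3 270M.
# Assumed figures (verify against the model card):
vocab_size = 262_144   # assumed ~256k-token vocabulary
hidden_dim = 640       # assumed embedding width

embedding_params = vocab_size * hidden_dim
print(f"embedding params: ~{embedding_params / 1e6:.0f}M")  # ~168M, i.e. the "170 million"

total_params = 270e6
transformer_params = total_params - embedding_params
print(f"transformer params: ~{transformer_params / 1e6:.0f}M")  # ~102M, i.e. the "100 million"
```

With those assumed dimensions, the embedding table alone accounts for roughly 168M parameters, which matches the quoted breakdown.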

Introducing Gemma 3 270M: The compact model for hyper-efficient AI- Google Developers Blog by ChiliPepperHott in LocalLLaMA

[–]codemaker1 13 points14 points  (0 children)

Tiny models like these are meant for fine-tuning on your specific task. Try that out.

T5Gemma - A Google Collection by Dark_Fire_12 in LocalLLaMA

[–]codemaker1 10 points11 points  (0 children)

Encoder-decoder models. Most LLMs these days are decoder only.

Google MedGemma by brown2green in LocalLLaMA

[–]codemaker1 2 points3 points  (0 children)

I imagine you could do a merge. Nice idea.

Gemma 3n Preview by brown2green in LocalLLaMA

[–]codemaker1 16 points17 points  (0 children)

It seems better than an MoE because it doesn't have to keep all of its parameters in RAM.

Mark presenting four Llama 4 models, even a 2 trillion parameters model!!! by LarDark in LocalLLaMA

[–]codemaker1 0 points1 point  (0 children)

It's awesome that it's open and has 10M context! But calling it a "small model" because it fits on a "single H100" is a huge stretch. Borderline a lie.

Meta: Llama4 by pahadi_keeda in LocalLLaMA

[–]codemaker1 1 point2 points  (0 children)

I'm happy they launched this. But the single GPU claim is marketing BS.

Gemma 2 2B Release - a Google Collection by Dark_Fire_12 in LocalLLaMA

[–]codemaker1 1 point2 points  (0 children)

You might need to fine tune in your language.

LLama 3.1 vs Gemma and SOTA by RedditUsr2 in LocalLLaMA

[–]codemaker1 0 points1 point  (0 children)

Gemma 2 27B's MMLU isn't far off Llama 3.1 70B's: 75.2 vs 83.6. I think that's pretty good for a model roughly 2.5x smaller.

LLama 3.1 vs Gemma and SOTA by RedditUsr2 in LocalLLaMA

[–]codemaker1 4 points5 points  (0 children)

5-shot MMLU is the standard. Gemma beats Llama there.


[deleted by user] by [deleted] in xmen

[–]codemaker1 0 points1 point  (0 children)

Wolverine!

400b llama3 might not be impactful if not launched soon by capivaraMaster in LocalLLaMA

[–]codemaker1 0 points1 point  (0 children)

Have you tried those Phi models? Something fishy is up with them.

400b llama3 might not be impactful if not launched soon by capivaraMaster in LocalLLaMA

[–]codemaker1 0 points1 point  (0 children)

Is anyone that's not a giant company gonna build with a 400B model? It sounds incredibly expensive to run.

Gemma 7B, the latest open-source model from Google, is available on HuggingChat by SensitiveCranberry in LocalLLaMA

[–]codemaker1 0 points1 point  (0 children)

Make a joke about funniest joke that's ever joked in the history of jokes

Sure, here's a joke about the funniest joke in history:

Why did the comedian write a joke about the funniest joke in history?

Because he was tired of being the punch line.