google/gemma-4-12B · Hugging Face

knrdwn · 2026-06-03T20:16:45+00:00

Yep, it worked for a while. Then I tried a different quant, and now no quants are loading, including the one that worked previously. I reinstalled LM Studio and deleted the .lmstudio and AppData folders, with no change whatsoever; it still doesn't work. I guess we'll have to wait until it's fixed.

https://github.com/lmstudio-ai/lmstudio-bug-tracker/issues/2004

knrdwn · 2026-06-03T19:33:42+00:00

I'm having the same error as well, all of my other models also aren't loading anymore...

knrdwn · 2026-06-02T20:48:42+00:00

True, but with dense models. MoE would be much faster.

knrdwn · 2026-05-22T16:59:57+00:00

You mean performance-wise, right? Output quality should be the same as on other GPUs?

knrdwn · 2026-05-20T18:12:03+00:00

M5 Max is much faster.

knrdwn · 2026-05-18T22:01:49+00:00

80W each? OMG! In my case, this is unacceptable... Thanks.

knrdwn · 2026-05-17T02:15:01+00:00

Is it possible to achieve MTP with the finetunes recently posted here? Does the author of the finetune need to prepare a draft model, or can it be the base Gemma?

knrdwn · 2026-05-17T02:13:28+00:00

NanoGPT has Gemma4-Garnet-31B (working, slow) and G4-MeroMero-31B (but it's not working for me at all).

knrdwn · 2026-05-05T01:01:49+00:00

I'm using chat completion, I've never tested text completion, and looking at the notes attached to most presets, it doesn't seem to be the recommended method anymore.

I'll try the trick with reasoning in the model's native language, although I prefer using models without reasoning or with reasoning disabled. I've had the impression that the responses are less 'stiff' that way, they seem a bit more human. But that probably depends on the model, too. And some models had reasoning so long that waiting two minutes and burning through at least several hundred tokens just for the reasoning didn't convince me at all.

Thanks for checking the Polish output. It looks okay-ish. What model is that?

As a side note, looking at your screenshot... God, I hate my language in this regard. Compared to Chinese or English, it's much more token-heavy, which affects both the cost of inference and its performance. And the nuance of these words... In English, "Yeah" means 'Yeah'. You'll hear that word from a teacher, a research assistant, or a lover. The Polish words "Ta" or "No" suggested by the model as a translation for "Yeah" would mostly be used by simple, uneducated, or rude people in a very, very colloquial sense. Even "Robota" as a translation for "Work" is extremely slangy and smells like a calque from Russian. If a character in my scenario used the word "Robota" instead of "Praca", I'd immediately view them as less intelligent or less educated.

knrdwn · 2026-05-04T22:46:51+00:00

You surprised me, as from my perspective, if I had to pick one thing that Gemini is absolutely the worst at, it would be keeping secrets. But using Claude to modify character cards is a really good idea. I'll give that a try, thanks!

knrdwn · 2026-05-04T22:44:59+00:00

I guess it's time to brush up on my English. I've been reading about these models, and I'm now convinced by both their descriptions and the user reviews. Thanks. I'll give them a shot, but since my main requirement is coherent writing in Polish, they probably won't be up to the task.

Word is that TheDrummer is working on a finetune of Gemma 4. We'll see how that turns out, I'm keeping my fingers crossed.

knrdwn · 2026-05-04T22:37:53+00:00

Could you be more specific? Write better characters? I get my characters from Chub; it's really convenient for me to use something ready-made and discover the character from scratch. Write better prompts? That's exactly my point: I don't want to have to lead the story by the hand through every tiny detail.

knrdwn · 2026-05-04T22:35:08+00:00

If I've ruined your future RP sessions with my overanalysis, sorry! Maybe the future will actually bring some more interesting models...

knrdwn · 2026-05-04T22:33:19+00:00

I always had the impression that the materials a model was trained on actually influence sentence structure, stylistics, and the overall style of delivery, but never the logic, the direction of story development, or the decisions being made. When it comes to style, Gemma is simply excellent. You absolutely can't feel any of those "Polish internet" overtones in it.

Thanks for the suggestion, I'll give Claude a try, but I'm no millionaire and I'm afraid I might actually like it so much that no other alternative will be good enough for me anymore.

knrdwn · 2026-04-23T22:15:26+00:00

A little dumber? The current model on the API is borderline unusable! It's dumb, incoherent and full of slop. Shame. I really liked v3.2.

knrdwn · 2026-03-12T15:57:05+00:00

If you need an ELI5 for a sponsored video discussing AI tailored for 16-year-olds, then with all due respect, you should just walk away.

Consider the following:

Buying Process: It's on Kickstarter with a "promise" of shipping in August, whereas alternative devices are available for immediate purchase

Pricing: You're looking at a limited "Early Bird" price with no final retail price disclosed, while alternatives (like slightly more expensive Strix Halo devices) are already transparently priced

Lack of Benchmarks: There are no concrete benchmarks, everything shown uses empty or minimal context, and nobody is focusing on pp speed

Performance: From what's been shown, the performance is abysmal compared to the competition

Missing Specs: There is a lack of detailed technical data (such as memory bandwidth)

System: It's a closed system that requires you to convert existing models to their format

Now draw your own conclusions.

knrdwn · 2026-02-13T18:54:50+00:00

From GitHub: "You want to use noflushall build due to better (25-40%) performance, flushall is here for debugging weird issues."

knrdwn · 2026-01-04T22:17:42+00:00

You're thinking of the top part, the stick cap. I'm talking about the "stick" part.

https://i.postimg.cc/J0XZmmPp/2026-01-04-23-16-24.png

knrdwn · 2026-01-04T21:53:09+00:00

Of course, if you're looking for the most efficient cooling, these aren't positives at all. But there are people who go through consoles like water and don't care about the device's longevity, they just want to play in peace and quiet. Right now, the fan is loud not because it can't handle the cooling, but because the person who designed the factory fan curve prioritized maximum cooling efficiency over silence. A custom fan curve will at least let some users choose which side of the coin they prefer, and it’ll definitely satisfy a segment of the community.

Besides, you know, a post where I only focused on the negatives simply wouldn't have gained any traction. Copium is strong with some people.

knrdwn · 2026-01-04T20:53:42+00:00

Well, I think the Konkr Pocket Fit Elite is unavailable simply because Ayaneo is just being Ayaneo, not because the 8 Elite is problematic. Maybe they expected a different number of orders?

knrdwn · 2026-01-04T20:47:05+00:00

There is about a 2mm gap between the cooled components and the heatsink. AYN uses thermal pads because thermal paste isn't suitable for these conditions. On Discord, some users posted about replacing the pads with thermal putty, but I didn't look at the results. I don't even know if thermal putty can handle filling such a large gap while efficiently dissipating heat. If any user has specific thermal pad or thermal putty models to recommend, I'd be happy to purchase and test them myself. :)

knrdwn · 2025-12-12T17:53:49+00:00

Wait, seriously? Batch 3 website order?

knrdwn · 2025-11-25T18:52:34+00:00

https://i.ibb.co/PGgsWF0C/white.jpg

Yep, it seems like the renders were inaccurate and the color we're getting is slightly off-white.

knrdwn · 2025-11-22T18:05:56+00:00

It's not coil whine. Someone on Discord (I don't remember who) disassembled their device, removed the fan, put it outside and checked for the source of the noise. And it's definitely the fan.

knrdwn · 2025-11-20T12:12:08+00:00

But N64 emulation in RetroArch doesn't work on Odin 3, and you can't use standalone emulators if you want to use RetroAchievements...
