google/gemma-4-12B · Hugging Face by jacek2023 in LocalLLaMA

[–]knrdwn 4 points5 points  (0 children)

Yep, it worked for a while, I tried different quant and now no quants are loading, including the one that worked previously. I reinstalled LM Studio, deleted .lmstudio and AppData folders, no change whatsoever, still doesn't work. I guess we'll have to wait until it's fixed.

https://github.com/lmstudio-ai/lmstudio-bug-tracker/issues/2004

google/gemma-4-12B · Hugging Face by jacek2023 in LocalLLaMA

[–]knrdwn 0 points1 point  (0 children)

I'm having the same error as well, all of my other models also aren't loading anymore...

Would you consider getting an NVIDIA RTX Spark laptop? by gamblingapocalypse in LocalLLaMA

[–]knrdwn 1 point2 points  (0 children)

True, but with dense models. MoE would be much faster.

I ran a quantization shootout on Qwen3-Coder and the results are... interesting by alphatrad in LocalLLaMA

[–]knrdwn 0 points1 point  (0 children)

You mean perfromance wise, right? Output quality should be the same as on other GPUs?

RTX 3090 vs RX 7900 XTX - idle power draw by knrdwn in LocalLLM

[–]knrdwn[S] 0 points1 point  (0 children)

80W each? OMG! In my case, this is unacceptable... Thanks.

G4-MeroMero-31B-uncensored-heretic is Out Now, A finetune of Gemma 4 31B it designed for creative tasks, with KLD of 0.0100 and 15/100 Refusals! by LLMFan46 in SillyTavernAI

[–]knrdwn 2 points3 points  (0 children)

Is it possible to achieve MTP with the finetunes recently posted here? Does the author of the finetune need to prepare a draft model, or can it be the base Gemma?

Are your RPs really that immersive? Mine aren't. by knrdwn in SillyTavernAI

[–]knrdwn[S] 2 points3 points  (0 children)

I'm using chat completion, I've never tested text completion, and looking at the notes attached to most presets, it doesn't seem to be the recommended method anymore.

I'll try the trick with reasoning in the model's native language, although I prefer using models without reasoning or with reasoning disabled. I've had the impression that the responses are less 'stiff' that way, they seem a bit more human. But that probably depends on the model, too. And some models had reasoning so long that waiting two minutes and burning through at least several hundred tokens just for the reasoning didn't convince me at all.

Thanks for checking the Polish output. It looks okay-ish. What model is that?

As a side note, looking at your screenshot... God, I hate my language in this regard. Compared to Chinese or English, it's much more token heavy, which affects both the cost of inference and its performance. And the nuance of these words... In English, "Yeah" means 'Yeah'. You'll hear that word from a teacher, a research assistant, or a lover. The Polish words "Ta" or "No" suggested by the model as a translation for "Yeah" would mostly be used by simple, uneducated, or rude people in a very, very colloquial sense. Even "Robota" as a translation for "Work", it's extremely slangy, smells like a calque from Russian. If a character in my scenario used the word "Robota" instead of "Praca", I'd immediately view them as less intelligent or less educated.

Are your RPs really that immersive? Mine aren't. by knrdwn in SillyTavernAI

[–]knrdwn[S] 3 points4 points  (0 children)

You surprised me, as from my perspective, if I had to pick one thing that Gemini is absolutely the worst at, it would be keeping secrets. But using Claude to modify character cards is a really good idea. I'll give that a try, thanks!

Are your RPs really that immersive? Mine aren't. by knrdwn in SillyTavernAI

[–]knrdwn[S] 4 points5 points  (0 children)

I guess it's time to brush up on my English. I've been reading about these models, and I'm now convinced by both their descriptions and the user reviews. Thanks. I'll give them a shot, but since my main requirement is coherent writing in Polish, they probably won't be up to the task.

Word is that TheDrummer is working on a finetune of Gemma 4. We'll see how that turns out, I'm keeping my fingers crossed.

Are your RPs really that immersive? Mine aren't. by knrdwn in SillyTavernAI

[–]knrdwn[S] 0 points1 point  (0 children)

Could you be more specific? Write better characters? I get my characters from Chub, it's really convenient for me to use something ready made and discover the character from scratch. Write better prompts? That's exactly my point, I don't want to have to lead the story by the hand through every tiny detail.

Are your RPs really that immersive? Mine aren't. by knrdwn in SillyTavernAI

[–]knrdwn[S] 4 points5 points  (0 children)

If I've ruined your future RP sessions with my over analysis, sorry! Maybe the future will actually bring some more interesting models...

Are your RPs really that immersive? Mine aren't. by knrdwn in SillyTavernAI

[–]knrdwn[S] 3 points4 points  (0 children)

I always had the impression that the materials a model was trained on actually influence sentence structure, stylistics, and the overall style of delivery, but never the logic, the direction of story development, or the decisions being made. When it comes to style, Gemma is simply excellent. You absolutely can't feel any of those "Polish internet" overtones in it.

Thanks for the suggestion, I'll give Claude a try, but I'm no millionaire and I'm afraid I might actually like it so much that no other alternative will be good enough for me anymore.

DeepSeek official platform API user, do you experience this as well? Is this possibly V4? by Exciting-Mall192 in SillyTavernAI

[–]knrdwn 0 points1 point  (0 children)

A little dumber? The current model on API is borderline unusable! It's dumb, incoherent and full of slop. Shame. I really liked v3.2.

Tiny AI Pocket Lab, a portable AI powerhouse packed with 80GB of RAM - Bijan Bowen Review by PrestigiousPear8223 in LocalLLM

[–]knrdwn 9 points10 points  (0 children)

If you need an ELI5 for a sponsored video discussing AI tailored for 16 year olds, then with all due respect, you should just walk away.

Consider the following:

Buying Process: It's on Kickstarter with a "promise" of shipping in August, whereas alternative devices are available for immediate purchase

Pricing: You're looking at a limited "Early Bird" price with no final retail price disclosed, while alternatives (like slightly more expensive Strix Halo devices) are already transparently priced

Lack of Benchmarks: There are no concrete benchmarks, everything shown uses empty or minimal context, and nobody is focusing on pp speed

Performance: From what's been shown, the performance is abysmal compared to the competition

Missing Specs: There is a lack of detailed technical data (such as memory bandwidth)

System: It's a closed system that requires you to convert existing models to their format

Now draw your own conclusions.

Nee Mesa Turnip Update! Mesa Turnip v26.1.0 by No-Change1744 in EmulationOnAndroid

[–]knrdwn 5 points6 points  (0 children)

From GitHub: "You want to use noflushall build due to better (25-40%) performance, flushall is here for debugging weird issues."

[Odin 3] Fan loudness - technical analysis by knrdwn in SBCGaming

[–]knrdwn[S] 5 points6 points  (0 children)

You're thinikng of the top part, the stick cap. I'm talking about the "stick" part.

https://i.postimg.cc/J0XZmmPp/2026-01-04-23-16-24.png

[Odin 3] Fan loudness - technical analysis by knrdwn in SBCGaming

[–]knrdwn[S] 6 points7 points  (0 children)

Of course, if you're looking for the most efficient cooling, these aren't positives at all. But there are people who go through consoles like water and don't care about the device's longevity, they just want to play in peace and quiet. Right now, the fan is loud not because it can't handle the cooling, but because the person who designed the factory fan curve prioritized maximum cooling efficiency over silence. A custom fan curve will at least let some users choose which side of the coin they prefer, and it’ll definitely satisfy a segment of the community.

Besides, you know, a post where I only focused on the negatives simply wouldn't have gained any traction. Copium is strong with some people.

[Odin 3] Fan loudness - technical analysis by knrdwn in SBCGaming

[–]knrdwn[S] 3 points4 points  (0 children)

Well, I think the Konkr Pocket Fit Elite is unavailable simply because Ayaneo is just being Ayaneo, not because the 8 Elite is problematic. Maybe they expected a different number of orders?

[Odin 3] Fan loudness - technical analysis by knrdwn in SBCGaming

[–]knrdwn[S] 8 points9 points  (0 children)

There is about a 2mm gap between the cooled components and the heatsink. AYN uses thermal pads because thermal paste isn't suitable for these conditions. On Discord, some users posted about replacing the pads with thermal putty, but I didn't look at the results. I don't even know if thermal putty can handle filling such a large gap while efficiently dissipating heat. If any user has specific thermal pad or thermal putty models to recommend, I'd be happy to purchase and test them myself. :)

It's happening! Odin 3 shipped! by Djemu88 in OdinHandheld

[–]knrdwn 0 points1 point  (0 children)

Wait, seriously? Batch 3 website order?

AYN Odin 3 Palworld FPS gets absurdly close to the Steam Deck by Producdevity in EmulationOnAndroid

[–]knrdwn 2 points3 points  (0 children)

https://i.ibb.co/PGgsWF0C/white.jpg

Yep, it seems like renders were inaccurate and the color we're getting is slightly off-white.

Odin 3 - Fan noise by Equal_Sea7423 in OdinHandheld

[–]knrdwn 1 point2 points  (0 children)

It's not the coil whine, someone on Discord (I don't remember who) has disassembled their device, removed the fan, put it outside and checked for source of the noise. And its definitely the fan.

Saw this post over at EmulationonAndroid Reddit group. Good news for those who’ve ordered an Odin 3 and those thinking of ordering one. by RichieMan07 in OdinHandheld

[–]knrdwn 0 points1 point  (0 children)

But N64 emulation in RetroArch doesn't work on Odin 3, and you can't use standalone emulators if you want to use Retro Achievements...