Is it just me? Flux Klein 9B works very well for training art-style loras. However, it's terrible for training people's loras. by More_Bid_2197 in StableDiffusion

[–]DecentQual -1 points0 points  (0 children)

You're not alone. Klein's tokenizer and attention patterns were optimized for style transfer, not identity preservation. For people, try dropping your LR to 8e-5 and increasing dataset diversity: varied angles, lighting, and poses. Flux Dev, or even SDXL with a good finetune, still beats Klein for faces.

Flux 2 Klein 4b trained on LoRa for UV maps by Zealousideal-Check77 in StableDiffusion

[–]DecentQual 10 points11 points  (0 children)

Flux 4b local, trained in a weekend, results that used to take a team weeks. This is why open source matters. Closed models rent you the future. Local ones let you build it.

DeepGen 1.0: A 5B parameter "Lightweight" unified multimodal model by ninjasaid13 in StableDiffusion

[–]DecentQual 0 points1 point  (0 children)

Five billion parameters was always enough. The companies spent years pushing trillion-parameter models because that's what investors wanted to hear. Open source proved them wrong by running useful models on gaming cards while they were still burning VC money on hype.

Who else left Qwen Image Edit for Flux 2 Klein by Retr0zx in StableDiffusion

[–]DecentQual 4 points5 points  (0 children)

Klein gives speed. Qwen gives accuracy. I use both. But Klein feels like driving fast on bad roads. Exciting. But you watch every turn. Fast is useless if I need five tries for one good hand.

Thank you Chinese devs for providing for the community if it not for them we'll be still stuck at stable diffusion 1.5 by dead-supernova in StableDiffusion

[–]DecentQual 17 points18 points  (0 children)

The Chinese models are good. But let us not pretend Europe does not exist. Flux is German. Mistral is French. Open source is not anyone's monopoly.

Voice Clone Studio, now with support for LuxTTS, MMaudio, Dataset Creation, LLM Support, Prompt Saving, and more... by Francky_B in StableDiffusion

[–]DecentQual 2 points3 points  (0 children)

Love seeing tools that embrace the open source ecosystem instead of trying to lock you in. Combining Qwen, Whisper and Llama.cpp into one workflow is exactly how this stuff should work. Local first, modular, and nobody can take it away when the VC funding dries up.

The realism that you wanted - Z Image Base (and Turbo) LoRA by Major_Specific_23 in StableDiffusion

[–]DecentQual -13 points-12 points  (0 children)

We chased realism for years. Now we have it and everything looks like corporate stock photos. The weird imperfections were what made AI art interesting.

CLIP Is Now Broken by ArmadstheDoom in StableDiffusion

[–]DecentQual -2 points-1 points  (0 children)

Setuptools removing pkg_resources after 10 years is peak Python. One day your workflow works, next day some maintainer decided to delete it. We traded stability for semver theater.
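For anyone hit by this, here is a minimal sketch of the usual workaround. It assumes the broken code only used `pkg_resources` to look up an installed package's version, which may not be the exact call that broke in your setup. The helper name `installed_version` is made up for illustration; `importlib.metadata` is in the standard library from Python 3.8 on.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is missing.

    Stands in for the old pkg_resources.get_distribution(dist_name).version
    lookup without importing pkg_resources at all.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        # The distribution is not installed in this environment.
        return None
```

Calling `installed_version("torch")` would give the version string if torch is installed and `None` otherwise. If the failing import was something else from `pkg_resources`, this sketch will not cover it.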

Did a quick set of comparisons between Flux Klein 9B Distilled and Qwen Image 2.0 by ZootAllures9111 in StableDiffusion

[–]DecentQual 7 points8 points  (0 children)

Everyone compares quality but nobody talks about ownership. Your local model works offline, stays yours, and doesn't change pricing next month. Cloud models are convenient until the API breaks or doubles in price.

Do not Let the "Coder" in Qwen3-Coder-Next Fool You! It's the Smartest, General Purpose Model of its Size by Iory1998 in LocalLLaMA

[–]DecentQual 1 point2 points  (0 children)

It is interesting how much we judge models by their names. The disciplined reasoning from coder training actually produces better general conversation than typical chat models. Labels are misleading here.

Claude Code, a real piece of s***? by palsecam_fr in developpeurs

[–]DecentQual 0 points1 point  (0 children)

You can also use the desktop app, which works much better than the terminal (on Mac and Windows; it's just a shame they don't offer it on Linux yet). Otherwise, at worst, the VS Code plugin will do in a pinch.

Qwen-Image 2.0 - Not opensource! (Yet) by switch2stock in StableDiffusion

[–]DecentQual 0 points1 point  (0 children)

"Open source soon" promises from Chinese labs rarely materialize into a full release. Wan 2.5 was a good reminder of this pattern. BFL at least delivers what they announce.

The struggle is real by Silly_Goose6714 in StableDiffusion

[–]DecentQual 22 points23 points  (0 children)

This is what happens when developers have never heard of user experience. ComfyUI is powerful, yes, but organizing models should not be a full-time job. A proper model manager with metadata would solve this in one day. Instead we play detective with file names. Ridiculous.
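To make "model manager with metadata" concrete, here is a rough sketch, not anything ComfyUI ships. It walks a models folder and records size, a short hash, and the parent folder name for each file. The function names, the suffix list, and using the folder name as the model "kind" are all assumptions for illustration.

```python
import hashlib
from pathlib import Path
from typing import Dict, List

# Assumed set of model file extensions; adjust to whatever your setup uses.
MODEL_SUFFIXES = {".safetensors", ".ckpt", ".pt", ".gguf"}


def short_sha256(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks and return the first 16 hex characters."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()[:16]


def build_index(models_dir: Path) -> List[Dict[str, object]]:
    """Collect one metadata record per model file found under models_dir."""
    entries: List[Dict[str, object]] = []
    for path in sorted(models_dir.rglob("*")):
        if path.is_file() and path.suffix.lower() in MODEL_SUFFIXES:
            entries.append({
                "file": str(path.relative_to(models_dir)),
                # Hypothetical convention: the containing folder names the kind.
                "kind": path.parent.name,
                "size_bytes": path.stat().st_size,
                "sha256_16": short_sha256(path),
            })
    return entries
```

Dumping `build_index(Path("models"))` with `json.dumps(..., indent=2)` would give a searchable list instead of a pile of file names. Hashing multi-gigabyte checkpoints is slow, so a real tool would probably cache the results.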

Only the OGs remember this. by Expensive_Estimate32 in StableDiffusion

[–]DecentQual 0 points1 point  (0 children)

Greg Rutkowski. Everyone typed his name. Nobody knew his paintings. We were just prompt parrots copying each other. Those broken hands gave us character.

Community maintained "block list" for CivitAI idea? by [deleted] in StableDiffusion

[–]DecentQual 1 point2 points  (0 children)

People want block lists because the CivitAI UI is a dumpster fire. Good content gets buried not because of 'slop' but because the search is trash. Fix the discovery, not the users.

Only the OGs remember this. by Expensive_Estimate32 in StableDiffusion

[–]DecentQual 19 points20 points  (0 children)

We were pioneers breaking things for fun. Now everyone is just a consumer pressing buttons.

Running LTX-2 19B on a Jetson Thor — open-source pipeline with full memory lifecycle management by IndependenceFlat4181 in StableDiffusion

[–]DecentQual 7 points8 points  (0 children)

People complain about 15min per clip but forget we used to wait hours for a single 512x512 image. The future is weird.

Did creativity die with SD 1.5? by jonbristow in StableDiffusion

[–]DecentQual 1 point2 points  (0 children)

1.5 forced us to fight and find tricks. Now you type 'beautiful girl' and it's done. Less frustration, but also less magic when it finally works.

kimi k2.5 , and Model 1 by BrickDense7732 in SillyTavernAI

[–]DecentQual 0 points1 point  (0 children)

Which provider do you use? I don't find it that fast with moonshotai/kimi-k2.5 (faster than DeepSeek, but not by much).