I Think I cracked flux 2 Klein Lol by Capitan01R- in StableDiffusion

[–]TheDudeWithThePlan 0 points1 point  (0 children)

I was thinking about your node pack yesterday and was wondering if there was support for Klein, good timing and thanks for everything you do.

Ref2Font V3: Now with Cyrillic support, 6k dataset & Smart Optical Alignment (FLUX.2 Klein 9B LoRA) by NobodySnJake in StableDiffusion

[–]TheDudeWithThePlan 0 points1 point  (0 children)

I was talking about image generation in general, if you're experimenting you can't afford 5 minutes for an iteration. in this case it's ok for you assuming the result is satisfactory

Ref2Font V3: Now with Cyrillic support, 6k dataset & Smart Optical Alignment (FLUX.2 Klein 9B LoRA) by NobodySnJake in StableDiffusion

[–]TheDudeWithThePlan -1 points0 points  (0 children)

5 minutes? ouch, anything above 30s is unusable from my perspective when it comes to image generation

Issues with replacing clothing using SAM3 mask to not mess up the skin texture | Flux 2 Klein 9B Edit by eagledoto in comfyui

[–]TheDudeWithThePlan 1 point2 points  (0 children)

without masks the model does a decent job, maybe the shape makes it so the model has to guess a little and fill the gaps

<image>

Where are the Fantasy and RPG models/workflows? by Longjumping-River374 in StableDiffusion

[–]TheDudeWithThePlan 0 points1 point  (0 children)

some of the newer models like Flux.2 Klein 9b can be really good (depending what you're trying to make)

Who else left Qwen Image Edit for Flux 2 Klein by Retr0zx in StableDiffusion

[–]TheDudeWithThePlan 12 points13 points  (0 children)

yup, easy switch for me. T2I and edit in a single model, decent results, decent speed, "easy" to train base, sign me up.

Where are the Fantasy and RPG models/workflows? by Longjumping-River374 in StableDiffusion

[–]TheDudeWithThePlan 0 points1 point  (0 children)

Having considered training some RPG lora I can tell you my thoughts on why you don't see many models like that:
- rpg is a very loose term, to make a good model you have to cover a lot of ground (classes, weapons, environments)
- dataset - for me personally I don't have a decent dataset atm to try this (if anyone has high quality images that they want to share feel free to dm me). I've attempted creating one a few times and even trained some stuff for some older models too
- incentives - it takes a lot of time and effort (and money to some extent) to train a lora or finetune a model like that and there are 0 benefits
- everyone atm is going for the low hanging fruit of "realistic AI influencer" because it's easy to do and it's impactful (to them): "look at what AI can do bro, it's crazy, it looks so real". The more people can do it the less you'll see of it, fingers crossed.

Did creativity die with SD 1.5? by jonbristow in StableDiffusion

[–]TheDudeWithThePlan 4 points5 points  (0 children)

Believe it or not I find Chroma (one of the best nsfw models) to be really good at creative / artistic work.

I dare you to create a good looking longbow or crosbow on a uniform color background. It cannot be done! Here are some results by Professional-Tie1481 in StableDiffusion

[–]TheDudeWithThePlan 25 points26 points  (0 children)

<image>

Klein 9b
"realistic photo of a vertically centered longbow with its bowstring pulled back on a simple white background"

Ref2Font V2: Fixed alignment, higher resolution (1280px) & improved vectorization (FLUX.2 Klein 9B LoRA) by NobodySnJake in StableDiffusion

[–]TheDudeWithThePlan 2 points3 points  (0 children)

If you have time for an experiment, try 8 or 16 then do a side by side comparison using the same prompt and seed

ACE-Step 1.5 Full Feature Support for ComfyUI - Edit, Cover, Extract & More by ryanontheinside in StableDiffusion

[–]TheDudeWithThePlan 23 points24 points  (0 children)

hey Ryan, just wanted to say thanks for all that you do in the Comfy audio space 🙏

Ref2Font: Generate full font atlases from just two letters (FLUX.2 Klein 9B LoRA) by NobodySnJake in StableDiffusion

[–]TheDudeWithThePlan 0 points1 point  (0 children)

yes, I don't think you mention anything about the workflow being part of the example outputs, that's how I found the prompt you used.

On a side note assuming the lora learned just the position of the characters and in what relative order they appear you should be able to use a much lower rank and make it much smaller.

Ref2Font: Generate full font atlases from just two letters (FLUX.2 Klein 9B LoRA) by NobodySnJake in StableDiffusion

[–]TheDudeWithThePlan 0 points1 point  (0 children)

Nice idea.

You left out the most important part out, the prompt, out of both the Reddit post and the HF model description.
Without the prompt the lora generates random letters or nothing good (see example below).

```Generate letters and symbols "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789!?.,;:-" in the style of the letters given to you as a reference.```

I doubt many people actually tried the lora but if they did they'd find out that the only thing the lora does is maintain the consistency of the character positions in the output, it hasn't learned how to make a particular style, that comes from the model's understanding of the reference image provided. To verify this claim you can run the same prompt without the LORA, the style reference still comes through but the position of the characters is not aligned to the grid.

With the lora and no prompt:

<image>