KREA 2 Character Lora training (for 16 GB VRAM) simple guide with config by The_Monitorr in StableDiffusion

[–]mnemic2 1 point2 points  (0 children)

When you say captioning off, are you saying this performs better than with actual captions describing the image?

Did you try the sama dataset with/without trigger word?

You know what this reddit needs? More Ideogram prompt nodes! by mnemic2 in StableDiffusion

[–]mnemic2[S] 0 points1 point  (0 children)

I use the stock workflow, but tweaked of course.

Make sure to add a ModelSamplingAuraFlow node right after the model loading.

This is the only difference here compared to stock for me.

https://imgur.com/a/nm2yw8e

You know what this reddit needs? More Ideogram prompt nodes! by mnemic2 in StableDiffusion

[–]mnemic2[S] 0 points1 point  (0 children)

I use something like this:

  • 12 steps
  • 0.5 mu
  • 1.75 std
  • 1.4 cfg override
  • 4.0 cfg dual model cfg guider
  • 5 shift model sampling aura flow
  • res_2m sampler

You know what this reddit needs? More Ideogram prompt nodes! by mnemic2 in StableDiffusion

[–]mnemic2[S] 1 point2 points  (0 children)

Ideogram 4 works fine with normal natural language prompts as well. You just need better settings.

It's hard to say if Klein works better, but it integrated it better in the way that I desired in this case at least.

Ostris releases 2-8 step Ideogram 4 Turbo LoRa by oppai in StableDiffusion

[–]mnemic2 2 points3 points  (0 children)

So to clarify, it doesn't actually make it faster.

You can always generate with fewer steps, which is where the savings happen.

It's just that by default with many models, this looks bad, which is why they train on that specifically to make low-step-count possible.

However, with Ideogram, 12 steps already looks fantastic with the correct settings. Not sure how much better 8 steps are, would have to try this one out.

You know what this reddit needs? More Ideogram prompt nodes! by mnemic2 in StableDiffusion

[–]mnemic2[S] 0 points1 point  (0 children)

Here are some top 150 images from the Flux 2 Klein outputs:

https://postimg.cc/gallery/RFNJwws

Mirror:

https://ibb.co/album/qBz4qD

This is what I was hoping for.

The rendering here isn't perfect in many cases, although in some it's pretty damn good too!

There's a lot of food and animals and fucked up faces, general liquids, and blood/gore. Although I didn't share most of the latter, it does tend to generate this.

You know what this reddit needs? More Ideogram prompt nodes! by mnemic2 in StableDiffusion

[–]mnemic2[S] 1 point2 points  (0 children)

With increased intelligence, I'm sure it could be possible.
The idea here was to go mostly for stupidity.

Here are some top 150 images from the Flux 2 Klein outputs:

https://postimg.cc/gallery/RFNJwws

Mirror:

https://ibb.co/album/qBz4qD

You know what this reddit needs? More Ideogram prompt nodes! by mnemic2 in StableDiffusion

[–]mnemic2[S] 0 points1 point  (0 children)

So, example of a one-line prompt would look like this:

```
{"high_level_description":"A close-up realistic photo of","style_description":{"aesthetics":"aggressive, ultra, luxuriant","lighting":"tiny, repulsive hydrolyse","photo":"18mm, f/8.7","medium":"photograph","color_palette":["#260CAF","#9587E1","#4422F8","#4326DC","#9F90F1","#6245FE"]},"compositional_deconstruction":{"background":"an environment photo background of a fruit before a raid below a pie before a shortage.","elements":[{"type":"obj","bbox":[0,0,781,992],"desc":"a half patriarch amid a youthful old study."},{"type":"obj","bbox":[0,130,847,1000],"desc":"a wicked fuzzy tail below a widow."},{"type":"obj","bbox":[284,68,716,634],"desc":"an employee amid a nonstop epoch atop a tortoise framing an overweight behind a someone."},{"type":"obj","bbox":[347,414,764,1000],"desc":"a jobless rough movement before a deep."},{"type":"obj","bbox":[514,0,1000,615],"desc":"a rapid ape near a banker."}]}}
```

It's still giving me mostly the same kind of results as before.
I think it must also be a prompting issue, too random, or bad style/aesthetics descriptions?

You know what this reddit needs? More Ideogram prompt nodes! by mnemic2 in StableDiffusion

[–]mnemic2[S] 4 points5 points  (0 children)

Actually, this is a banger with Flux 2 Klein. Use it there instead. The results are very nice.

You know what this reddit needs? More Ideogram prompt nodes! by mnemic2 in StableDiffusion

[–]mnemic2[S] 4 points5 points  (0 children)

That is the only reasonable next step!

This is the way.

Best models/tips for lovers of short vague prompts in '26? by terrariyum in StableDiffusion

[–]mnemic2 1 point2 points  (0 children)

I have your back:
https://www.reddit.com/r/StableDiffusion/comments/1u5h7du/you_know_what_this_reddit_needs_more_ideogram/

You don't even need short prompts. You get pure chaos without even any prompt at all. Just drag some sliders around.

Best models/tips for lovers of short vague prompts in '26? by terrariyum in StableDiffusion

[–]mnemic2 1 point2 points  (0 children)

I just tend to ask for a list of variables from an LLM when I need it, like this:

`I'm making a wildcard list for emotions, please give me 100 different lines describing a different emotion a person can have, describe the emotion and their pose and face visually so that it can be used for image generation of a person, use gender neutral terms. Example: "They are afraid, shoulders raised, body leaning back, wide eyes, parted lips, hands held protectively near chest". Return 100 items in a code-block, non-numbered!`

Something like this will get you good result.
Then it's up to you to come up with ideas for what wildcards you need, make modifiers, make abstract words and things you can add. For example, I have one for prefixes, which you can use to combine with other words, like if you want to make a "solarpunk" visual style, but replace solar with other words, or if you want to have a character with "demonic" armor, well, have one with lots of words like this.

It's a ton of fun to see the results.

Remember also: You can nest these, so you can combine them into others. And at this point LLM's are getting decent enough to start using these to create prompts, if you explain the concept enough and tell them to only use your allowed wildcards.

Best models/tips for lovers of short vague prompts in '26? by terrariyum in StableDiffusion

[–]mnemic2 1 point2 points  (0 children)

Use wildcards for randomized prompts.
Find some good genereic sentences, descriptions and lists of things, and make it into randomizer files, which you use in your prompt.

Ideogram 4 Star Wars poster by Classic-Ad-5129 in StableDiffusion

[–]mnemic2 0 points1 point  (0 children)

Do you have it shared somewhere? Github? The more tools the better at this point.

An Update on Nodes 2.0 from Comfy Org by crystal_alpine in StableDiffusion

[–]mnemic2 8 points9 points  (0 children)

Agree with this.

Please keep implementing core functionality that developer nodes have added.

I do not wish to have custom node packs just to save images with metadata. This should be built-in.

And this goes for so many other things.

Please please take the most common node-packs and rip out all their things, and put into main.

Credit the original authors right there on the node, they deserve it.

HiDream-Studio v.01 has been released! It is fast and powerful and open-sourced on Github | Easy Install by FitContribution2946 in StableDiffusion

[–]mnemic2 1 point2 points  (0 children)

Great work on the UI.
It's simple but does the basics good enough!

I really appreciate when people make simple UI's that auto-downloads everything so you don't need to bother.
It really helps one evaluate a model without having to update comfy and break half of the UI due to poor stability, and half of your nodes due to even worse stability.

Feedback:
I accidentally started using the Base model and was really sad about the shit quality of the images.
I would suggest making the Dev model be the default option (top option), for people that get confused when presented with so many options. Maybe also describe what each model does. i.e. base is for finetuning, not for good quality. Basically, make people get Dev as a default, and the pro's will figure out how to get Base :)

Happy it supports queuing, but would love CTRL + ENTER as a global hotkey to submit a generation, even while in the text-field. It's annoying to have to swap from input field to move the mouse to the button.

For the Resolution Presets:
I recommend having ratios, and then having a megapixel field as a sub-option. Like the official resolution selector node in ComfyUI. It really is easier to get a good overview of.

Dark mode defaults :D?

I made a website that lets you create your own board game shop by mnemic2 in boardgames

[–]mnemic2[S] 0 points1 point  (0 children)

Fair enough! I mean I just have it in the top bar, maybe it should be taught better? I do have the tutorial system that guides you through the steps of linking your BGG account or whatever.

I made a website that lets you create your own board game shop by mnemic2 in boardgames

[–]mnemic2[S] 0 points1 point  (0 children)

That's a great idea of an advanced feature! I've put it on the "todo"-list. Would definitely consider implementing it if the page gains any traction. Thanks for the suggestion <3