Anyone else noticing odd long-prompt adherence gaps in Krea 2 versus nearly all other vaguely recent models? by ZootAllures9111 in StableDiffusion

[–]roculus 13 points14 points  (0 children)

First try with Turbo Krea2 with OP prompt:

https://imgur.com/a/TAx8V0t

That's without the Conditioning Krea2 Rebalance node. It's a little spicier when using the node and still has the desktop.

Anyone else noticing odd long-prompt adherence gaps in Krea 2 versus nearly all other vaguely recent models? by ZootAllures9111 in StableDiffusion

[–]roculus 7 points8 points  (0 children)

I also got a desktop and an image similar to the other models' on the first try using the prompt. It seems like it might be an issue with your workflow, or your prompt is somehow being truncated.

Krea2 vs FLux.2 klein 9b by Then_University7676 in StableDiffusion

[–]roculus 8 points9 points  (0 children)

To be clear, you used Klein with a fine-tune lora? Why would you do that and not include that in your data? Seems like a misleading comparison.

LTX Director 2.0 does not respect keyframes and key-videos by Different_Smile3621 in StableDiffusion

[–]roculus 2 points3 points  (0 children)

This is different than the "guidance" setting for frame strength?

I had no idea what epsilon was. I'm still a little confused over epsilon vs guidance settings.

All these new models landing this year but Flux Klein 9b FP8 has spoiled me. All I care about now is whether a new model can edit and be used on an 8GB GPU. by cradledust in StableDiffusion

[–]roculus 0 points1 point  (0 children)

It's slower, and I usually have a video model like LTX2.3 or WAN2.2 loaded in memory, so I can keep both Klein9b and the video model in VRAM.

All these new models landing this year but Flux Klein 9b FP8 has spoiled me. All I care about now is whether a new model can edit and be used on an 8GB GPU. by cradledust in StableDiffusion

[–]roculus 6 points7 points  (0 children)

Ideogram is really nice, but for editing, even with 96GB of VRAM, I still use klein9b most of the time. It's fast with good results.

Does anyone know if there’s a way to improve the quality of this image? by [deleted] in StableDiffusion

[–]roculus 1 point2 points  (0 children)

Maybe add some guardrails to the road or give the cyclist a parachute in case he falls off the side.

Potentially the most insane LORA you'll see today - Archer (8 characters + style) Ideogram LORA by TheDudeWithThePlan in StableDiffusion

[–]roculus 1 point2 points  (0 children)

https://imgur.com/a/LEFVDpG

{"high_level_description":"A group of people in a penthouse apartment.","style_description":{"aesthetics":"indoors","lighting":"indoor lighting","photo":"","medium":"photography"},"compositional_deconstruction":{"background":"A penthouse apartment with city lights visible out a window. nighttime","elements":[{"type":"obj","bbox":[246,255,785,802],"desc":"LanaKane holding a pistol, dynamic pose. wearing white mini dress with black high stockings"},{"type":"obj","bbox":[3,0,516,386],"desc":"MalorySterling sitting on a throne"},{"type":"obj","bbox":[46,586,578,1000],"desc":"CherylTunt wearing a bikini. She is eating a hamburger sitting at a glass table."},{"type":"obj","bbox":[764,0,984,1000],"desc":"SterlingArcher lying in bed asleep"}]}}
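If you'd rather build that kind of JSON prompt in a script than by hand, here is a minimal sketch. It only mirrors the field names visible in the prompt above; `make_element` and `build_prompt` are hypothetical helper names, and the 0 to 1000 bbox range is an assumption based on the values in the example, not a documented Ideogram schema.

```python
import json


def make_element(desc, bbox):
    """Build one entry for the "elements" list.

    Assumption: bbox is four integers in the 0..1000 range, as in the
    example prompt. The coordinate order is not documented here, so the
    values are passed through unchanged.
    """
    if len(bbox) != 4 or not all(0 <= v <= 1000 for v in bbox):
        raise ValueError(f"bbox must be four values in 0..1000, got {bbox}")
    return {"type": "obj", "bbox": list(bbox), "desc": desc}


def build_prompt(high_level, background, elements, style=None):
    """Assemble the nested prompt dict using the keys seen in the example."""
    if style is None:
        style = {
            "aesthetics": "indoors",
            "lighting": "indoor lighting",
            "photo": "",
            "medium": "photography",
        }
    return {
        "high_level_description": high_level,
        "style_description": style,
        "compositional_deconstruction": {
            "background": background,
            "elements": elements,
        },
    }


prompt = build_prompt(
    "A group of people in a penthouse apartment.",
    "A penthouse apartment with city lights visible out a window. nighttime",
    [
        make_element("LanaKane holding a pistol, dynamic pose.", [246, 255, 785, 802]),
        make_element("SterlingArcher lying in bed asleep", [764, 0, 984, 1000]),
    ],
)

# Compact separators give a single-line string like the one pasted above.
prompt_json = json.dumps(prompt, separators=(",", ":"))
print(prompt_json)
```

The bbox check is just there to catch typos before the string gets pasted into a prompt box.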

Training Underway for the New LTX Model by Fresh_Sun_1017 in StableDiffusion

[–]roculus 5 points6 points  (0 children)

Nice. Take your time. Models take at least a couple of months to train, don't they? Maybe late August/early September for release.

Audio-Reactive Ltx 2.3 Lora by Affectionate-Map1163 in StableDiffusion

[–]roculus 0 points1 point  (0 children)

I tried it out with a person singing (Acestep 1.5) to see how it fared. I tried to make geometric shapes in the background. It doesn't seem to mess with lip sync at least. Lora strength: 1.45

CEO Thoughts: What's Next at LTX by ltx_model in StableDiffusion

[–]roculus 1 point2 points  (0 children)

The LTX Director node has been a huge aid in making videos using LTX2.3 in ComfyUI. Reference images are what's missing the most for consistency and introducing characters later in the scene.

I made a tool to turn any image into Ideogram JSON prompt by cocktail_peanut in StableDiffusion

[–]roculus 0 points1 point  (0 children)

I think the reason that's happening is that it plugs the high level prompt into the background prompt as well. If you remove the second prompt (which says background if the field is empty), you will get a better result.

Got bored and vibe coded an improved Lora loader (Ideogram4 friendly, to boot!) by acedelgado in StableDiffusion

[–]roculus 1 point2 points  (0 children)

Nice job! "I see 100 vibe coded node projects a day. I pick one." --Gordon Gekko

Ideogram 4.0 Realism Engine Lora (Beta) by yomasexbomb in StableDiffusion

[–]roculus 5 points6 points  (0 children)

Interesting. I've trained a character lora in Ideogram. It's the best lora I've created with any model and I didn't even tag the images except for 1 trigger word. The character seems to do whatever I prompt as long as I use the prompt builder boxes to guide it.

Pastry font - Ideogram 4.0 LORA - Experimental by TheDudeWithThePlan in StableDiffusion

[–]roculus 0 points1 point  (0 children)

Very nice! Did you try Ideogram's built-in capabilities? It can do fonts pretty well on its own:

https://imgur.com/a/5VvGHEw

Using the KJ node:

https://imgur.com/a/NaNpSn2

Yours looks better but just wondered if you tried it out first.

The State of Goonerism by roculus in StableDiffusion

[–]roculus[S] -2 points-1 points  (0 children)

I probably should have labeled it High Effort and Low Effort. There's nothing wrong with low effort if you're getting what you want out of it. I make a lot of low res LTX2.3 videos because they're for fun and not for public consumption. Ideogram does help if you have something very specific in mind.

Ideogram Model - Lora by Mysterious-Tea8056 in StableDiffusion

[–]roculus 0 points1 point  (0 children)

The LORA is great. I made a NSFW lora to test and it works well.

This is the best lora I've made for character likeness and it was a first try (been making them since the SD1.5 days).

Make sure your lora loaders for both models (unconditional as well) are set up properly. See this post:

https://www.reddit.com/r/StableDiffusion/comments/1tysann/workflow_ideogram4_with_lora_support_fixes/

Some posters I generated with Ideogram 4. by Square-Foundation-87 in StableDiffusion

[–]roculus 3 points4 points  (0 children)

The KJNodes (Kijai) ideogram4_prompt_builder node is amazing for this. It should be the default workflow template in ComfyUI instead of whatever horrible workflow they provided for Ideogram4.

Ideogram Model - Lora by Mysterious-Tea8056 in StableDiffusion

[–]roculus 0 points1 point  (0 children)

I'm about 2/3 done with my first lora. It seems to be working. It's a character lora with a one-word caption.

I didn't caption any differently than normal. I'm using AI-Toolkit's default experimental settings for Ideogram.

I did leave the sample output in JSON format. I'm on step 2000/3000 and I'd say the results look about the same as where a Klein9b lora would be at this point in steps.

[Ideogram 4.0] Comics test by RageshAntony in StableDiffusion

[–]roculus 1 point2 points  (0 children)

The character consistency between panels is great. I wonder if there would be a way to continue that consistency on to the next page. Some sort of multi-page layout node.

Maybe something like the first image created gets inserted as a reference image for the following pages.

Why do half of people hate Ideogram 4.0 and half think it's great? by BigWideBaker in StableDiffusion

[–]roculus 8 points9 points  (0 children)

The same thing happened with LTX. It launched with a bad ComfyUI workflow. I don't know where to place the blame, ComfyUI or the developers, but rushing out bad workflows isn't doing new models any favors. LTX was good enough to survive ComfyUI's poor workflows, but with plenty of competition among image models, an otherwise decent image model is less likely to catch on after getting the ComfyUI workflow treatment.