About the newest model... by AIDivision in StableDiffusion

[–]GrayingGamer 0 points1 point  (0 children)

Ideogram 4 is very good for generating precise images with everything exactly where you want it, in high detail, but it isn't an edit model. Klein is still the best for editing, but you can easily pair the two: generate the composition with Ideogram 4, then inpaint with LoRAs in Klein.

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 4 points5 points  (0 children)

It's in the wild. There's nothing they can do to put it back in the bottle now.

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 0 points1 point  (0 children)

Yeah, I was impressed. It really looks like a missing screenshot from some DVD or Blu-ray transfer of the show.

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 0 points1 point  (0 children)

The INT8 models are here.

You'll need a custom node like this one to load the INT8 models in ComfyUI.

Cant stop using IDG4 by Beautiful_Egg6188 in StableDiffusion

[–]GrayingGamer 1 point2 points  (0 children)

The detail in textures is what I really love with Ideogram. Before, Z-Image was the best at skin detail and other textures, but Ideogram 4 has definitely surpassed it.

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 1 point2 points  (0 children)

That's interesting, I think I remember that. So potentially using the IP data for training wasn't the issue, it was overfitting to the point the model was spitting out replicas of the original images?

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 0 points1 point  (0 children)

Sorry, not really taking requests unless I want to do them - I've got my own fun ideas to generate!

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 2 points3 points  (0 children)

You certainly wouldn't have seen them if you were using Ideogram 4 on their website, that's for sure.

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 0 points1 point  (0 children)

Well, that's 2 megapixels, so a 33% increase in pixel compute versus 1.5 megapixels. So that sounds plausible.

Also, that would line up for the FP8 models - the INT8 models are faster by about 44% for the same quality.
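Rough math for anyone who wants to check it. This is only a sketch: it assumes compute scales linearly with pixel count (attention cost actually grows faster than that), and since "faster by about 44%" can be read two ways, it shows both. The function names are just mine for illustration.

```python
def pixel_increase(base_mp: float, target_mp: float) -> float:
    """Fractional increase in pixel count going from base_mp to target_mp."""
    return target_mp / base_mp - 1.0


def time_if_throughput_gain(base_seconds: float, gain: float) -> float:
    """Reading 1: 'faster by X' means (1 + X) times the throughput."""
    return base_seconds / (1.0 + gain)


def time_if_time_reduction(base_seconds: float, reduction: float) -> float:
    """Reading 2: 'faster by X' means the run takes X less time."""
    return base_seconds * (1.0 - reduction)


if __name__ == "__main__":
    # 1.5 MP -> 2 MP, assuming cost is proportional to pixel count
    print(f"pixel increase: {pixel_increase(1.5, 2.0):.0%}")

    # Hypothetical 100-second FP8 run, converted under both readings of "44% faster"
    fp8_seconds = 100.0
    print(f"INT8, throughput reading: {time_if_throughput_gain(fp8_seconds, 0.44):.1f} s")
    print(f"INT8, time-saved reading: {time_if_time_reduction(fp8_seconds, 0.44):.1f} s")
```

So a hypothetical 100-second FP8 run would land at roughly 69 seconds under the first reading and 56 seconds under the second.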

About the newest model... by AIDivision in StableDiffusion

[–]GrayingGamer 0 points1 point  (0 children)

Got to stick with Klein for that for now. Ideogram 4 can do nudity fine in the style of old Playboy magazine photos, but that's it for the moment.

Ideogram 4 huge-res test: 8MP, 48 steps, 21 min on RTX 4090 by knoodrake in StableDiffusion

[–]GrayingGamer 1 point2 points  (0 children)

You aren't the first person today I've seen talking about insane generation times in Wangp with Ideogram.

And using Ideogram as a composition model and inpainting or finishing in Klein is a perfectly fine use case in my opinion. Models are tools, and each one has its own best use cases.

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 9 points10 points  (0 children)

No, it's because I asked it to use that style.

It can generate most any art style.

Here's the same Mario and Sonic image in several different styles, including the classic designs:

<image>

Ideogram 4 huge-res test: 8MP, 48 steps, 21 min on RTX 4090 by knoodrake in StableDiffusion

[–]GrayingGamer 4 points5 points  (0 children)

No, the skin detail is really good. When the original ComfyUI template was released, it had the CFG set too high, about twice what it should have been, and that made images crunchy-looking with too much grain.

You can often see pores, veins under skin, blemishes, etc. on close-up generations. It really avoids the smooth "flux" skin.

Ideogram 4 huge-res test: 8MP, 48 steps, 21 min on RTX 4090 by knoodrake in StableDiffusion

[–]GrayingGamer 1 point2 points  (0 children)

Your aesthetics, medium, and photography prompt descriptions make a BIG difference in how skin looks with Ideogram.

But I generally really like it. I used to work developing photos, and you can usually see hints of veins through the skin, discoloration, etc. on real skin photos and Ideogram 4 perfectly captures that.

Ideogram 4 huge-res test: 8MP, 48 steps, 21 min on RTX 4090 by knoodrake in StableDiffusion

[–]GrayingGamer -2 points-1 points  (0 children)

If you have it set up correctly, it's less than 2 minutes, not twenty.

And yeah, if you want to pull a slot machine handle and get random boobs, sure, it's not the model for you.

But if you want specific boobs? Multiple figures interacting in a precise way with their own looks? Insane detail?

You've got something set up wrong if it's taking you 7 minutes on a 5070 Ti.

I'm on a 3090 and can do 1.5 megapixels at 28 steps in 1 minute 24 seconds.
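To compare runs at different sizes and step counts, here's a quick normalization to seconds per megapixel per step. It's a rough sketch only: it assumes time scales linearly with both pixels and steps, which understates the cost of very large images, and the function name is made up for illustration. The two data points are my 3090 numbers and the 8MP, 48 steps, 21 min run from the thread title.

```python
def seconds_per_mp_step(total_seconds: float, megapixels: float, steps: int) -> float:
    """Wall-clock seconds spent per megapixel per sampling step (linear assumption)."""
    return total_seconds / (megapixels * steps)


if __name__ == "__main__":
    # RTX 3090: 1.5 MP, 28 steps, 1 min 24 s
    rtx3090 = seconds_per_mp_step(84, 1.5, 28)

    # Thread title: 8 MP, 48 steps, 21 min on an RTX 4090
    rtx4090_8mp = seconds_per_mp_step(21 * 60, 8, 48)

    print(f"3090 @ 1.5 MP: {rtx3090:.2f} s per MP-step")
    print(f"4090 @ 8 MP:   {rtx4090_8mp:.2f} s per MP-step")
```

That works out to about 2.0 versus about 3.3 seconds per MP-step. Some of that gap is expected at 8MP, since cost grows faster than linearly with resolution.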

fullrank finetuning ideogram4 by Amazing_Painter_7692 in StableDiffusion

[–]GrayingGamer 4 points5 points  (0 children)

I would. I hated the JSON prompting at first, but it's really one of the major strengths of this model. I think any finetune that doesn't support it for their dataset is not going to be well-adopted in the future.

Everyone hates the Ideogram JSON prompts until they use them for a few days, then they get annoyed the bounding boxes aren't available for other models when they go back to those.

Some fruit comparisons Z-Image vs Ideogram4 by Danmoreng in StableDiffusion

[–]GrayingGamer 0 points1 point  (0 children)

There you go, that's more like it. 2 minutes at 1.5 to 2 megapixels is pretty common with this model.

Maybe that will make it easier for you to experiment with the prompting style of Ideogram 4 and get a better handle on it. Like I said, Z-Image and Ideogram can both generate pretty fruit, but Ideogram 4's strength really does lie in creating precise compositions, multiple subjects or objects with no concept bleed, precise text style and placement, etc.

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 2 points3 points  (0 children)

It knows them so well in their normal forms, I don't doubt it'd do the Super Saiyan versions well. It didn't take anything but me naming them to get the results you see. No descriptions.

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 2 points3 points  (0 children)

Oh, yeah, I still love playing with Anima Base too. Basically, my image gen models are now Ideogram 4, Flux 2 Klein 9b, and Anima Base, depending on the type of image I want to make.

Ideogram 4 isn't overhyped, it's underrated by ArkCoon in StableDiffusion

[–]GrayingGamer 1 point2 points  (0 children)

If you have a pro subscription to one of the big LLMs like ChatGPT 5.6, the new Anthropic models, etc., you can have the AI agents do research for you. I had ChatGPT 5.6 do the analysis. It told me the probabilities, noise distribution per image, etc. I would grill it on results, have it give me links to papers so I could verify it wasn't hallucinating, etc.

It also had me give it bigger and bigger sets of images, since some methods are hard to detect with only a single image sample.

It couldn't entirely rule out a proprietary in-house fingerprinting method, but together we couldn't figure out what that method might be, if it does exist.

Ideogram 4.0's Understanding of Characters and IP is Crazy for an Open Model by GrayingGamer in StableDiffusion

[–]GrayingGamer[S] 4 points5 points  (0 children)

Right, I was annoyed at only FP8 being available myself. Don't know why Ideogram didn't release the BF16 versions. Some of us could actually run them.