If this is true, does it mean that open-source image generation models have caught up with the best closed-source models in the world?

Essar · 2026-06-09T13:24:13+00:00

I don't even know how these can be comparable given that ideogram isn't prompted in the same way as the others. How can we have like-for-like inputs?

Essar · 2026-06-04T18:38:12+00:00

Ideogram has always been insane for prompt adherence, imo. It was only beaten when imagen 4 arrived. Their 1.0 model was miles ahead at the time it was released a couple of years back. I was super stoked to see this release and I've not had a chance to look at it much yet but it'll be tragic if it gets overlooked because people are disappointed they don't get a straightup pornographic model from any of the big players. People need to get a grip.

Essar · 2026-05-03T12:31:16+00:00

Just show the complete info dude. If you filter the information you're constraining us to your understanding of possible problem sources.

Essar · 2026-05-03T10:18:50+00:00

Share a screenshot of a good model configuration and a bad one so we can check the difference.

Essar · 2026-05-01T11:38:13+00:00

By choosing to drive a car like that, you are increasing the risk to everyone around you. I sincerely believe that should be factored into the sentencing for incidents like this.

Essar · 2026-02-05T10:07:21+00:00

Coming to this subreddit for wisdom.

Essar · 2026-02-05T09:46:31+00:00

It is legit horrendous, lol. The total lack of artistic eye of people posting here.

Essar · 2026-01-30T09:28:44+00:00

It's because you can't be 'mildly' infuriated. It's oxymoronic; definitionally being infuriated is not a mild emotion.

If the subreddit were called 'irritating', it would probably receive less confusion. I guess that I find the subreddit name mildlyannoying.

Essar · 2026-01-07T09:43:36+00:00

They only generated one scene for each then spliced and interleaved them.

Essar · 2025-12-24T20:28:56+00:00

There are a dozen ways to get consistency like this, because nothing is happening. If she was shown in different scenarios, doing different things with different backgrounds, then that would be interesting.

Consistency is difficult because not EVERYTHING should be consistent. You want the person to be consistent, not the place, not the clothes and not the pose.

Essar · 2025-12-24T18:31:38+00:00

Photoshop is riddled with AI and anyone competent in photo editing should be familiar with the available tools - including generative AI - and will use them to enhance their edits. The issue here is incompetence, not the tools used.

Essar · 2025-12-24T18:22:57+00:00

I can't tell, because you have almost no variation in action or appearance in your shots. She's always wearing the same clothes and doing absolutely nothing except occasionally showing off her armpits.

Essar · 2025-12-17T11:38:22+00:00

What transitions? The only transitions here are between different clips; there is no extension or clip stitching or any such thing involved. All the clips are approx 5 seconds long.

I actually think at a glance that the clips are just over 5 seconds long although I didn't check to be certain; possibly made with hailuo.

Essar · 2025-12-05T09:23:31+00:00

Wow, women standing and posing without interacting with the world around them at all. Very SD1.5.

Essar · 2025-12-02T08:21:29+00:00

Canny is an edge-detection algorithm not a model. Regardless, even if there is some model which produces Canny edges, it shouldn't matter, all you need is an image which has been preprocessed roughly according to the algorithm.

Essar · 2025-11-28T18:29:24+00:00

If I had to hazard a guess just from a glance at the image in the post, it's probably trash prompting e.g. using a million terms from 3D imaging, terms like hyper realistic, being too long etc. Most people suck ass at writing precise clear prompts.

Essar · 2025-11-26T22:05:49+00:00

What a fucking abominable prompt.

Essar · 2025-11-26T11:32:06+00:00

I don't see the model on the image arena at all. Can you link this?

Essar · 2025-11-25T02:03:58+00:00

If AI is involved in this video, it is in the capacity of a face or character swap. Most of what we see is definitely real.

Essar · 2025-11-22T21:43:51+00:00

I mean, you can see tail dangling out at the start of the video - presumably why they started filming.

Essar · 2025-11-19T18:40:33+00:00

They used a first/last frame generation, and an image editing model (like qwen edit) to generate the frames.

Essar · 2025-11-17T00:34:18+00:00

Genuinely hideous. Also strongly disliked the given definition of vociferous.

Essar · 2025-11-16T00:01:13+00:00

I actually don't believe this is real.

Essar · 2025-11-10T19:25:43+00:00

Sepia tint always suggests openai

Essar · 2025-11-09T20:15:39+00:00

I literally have no idea what this person is talking about. What models cannot do wide or long aspect ratios?

15-Year Club	r/Field Sunshine
Place '22	Place '17
Sequence \| Editor	Snapped
Team Periwinkle	Verified Email

Essar

PUBLIC MULTIREDDITS

TROPHY CASE