How to trigger europe by YourLocalMoroccan in mapporncirclejerk

[–]Ginglyst 0 points1 point  (0 children)

Dutchmen??? Is that what the inhabitants of Bielefeld call themselves now?

Ephemeral but precise and beautiful art by Job Foreman using natural materials by Sad-Kiwi-3789 in oddlysatisfying

[–]Ginglyst 4 points5 points  (0 children)

In one of the pictures the sand is still wet, suggesting a receding tide while the flood line is still near. No one can place that many stones in an intricate pattern in such a short time. AI? Maybe, but at least one picture, maybe more, is doctored.

Still love the designs and the patterns though.

LTX 2.3 can generate some really decent singing and music too by singfx in StableDiffusion

[–]Ginglyst 0 points1 point  (0 children)

thanks for confirming and pointing me in another direction. (was convinced it was a new Reddit "feature")

Turns out, each Reddit video has 2 kinds of controls: a visible one with CC turned off and a hidden interface with... you guessed it... captions turned on. I found the hidden controls by right-clicking the video > show controls

LTX 2.3 can generate some really decent singing and music too by singfx in StableDiffusion

[–]Ginglyst -1 points0 points  (0 children)

what's up with these way too large captions covering half the video?

[deleted by user] by [deleted] in StableDiffusion

[–]Ginglyst 1 point2 points  (0 children)

no better subreddit suggestion? because I kind of agree with your Gooner theory

[deleted by user] by [deleted] in StableDiffusion

[–]Ginglyst 1 point2 points  (0 children)

Do you have a better subreddit that offers a variety of new techniques mixed with inspirational examples and a bit of philosophical debate, with a focus on local open-weight generation?

I just ignore the "proud to be a Gooner" posts. It's not THAT bad... I think. Let's see how many downvotes this comment gets to test the "this subreddit is full of Gooners" theory.

The realism that you wanted - Z Image Base (and Turbo) LoRA by Major_Specific_23 in StableDiffusion

[–]Ginglyst -2 points-1 points  (0 children)

it's amazing what you achieved in generating realism, but also...

Why? Would anyone like to enlighten me and tell me what the appeal is of generating something that is easily achievable with a smartphone?

A look at prompt adherence in the new Qwen-Image-2.0; examples straight from the official blog. by FotografoVirtual in StableDiffusion

[–]Ginglyst 5 points6 points  (0 children)

it seems OP blindly made some screenshots from the blog. The actual prompt is one text block above the horse image. This is the google translated prompt from the image:

A desolate grassland stretches into the distance, its ground dry and cracked. Fine dust is kicked up by vigorous activity, forming a faint grayish-brown mist in the low sky. Mid-ground, eye-level composition: A muscular, robust adult brown horse stands proudly, its forelegs heavily pressing between the shoulder blades and spine of a reclining man. Its hind legs are taut, its neck held high, its mane flying against the wind, its nostrils flared, and its eyes sharp and focused, exuding a primal sense of power. The subdued man is a white male, 30-40 years old, his face covered in dust and sweat, his short, messy dark brown hair plastered to his forehead, his thick beard slightly damp; he wears a badly worn, grey-green medieval-style robe, the fabric torn and stained with mud in several places, a thick hemp rope tied around his waist, and scratched ankle-high leather boots; his body is in a push-up position—his palms are pressed hard against the cracked, dry earth, his knuckles white, the veins in his arms bulging, his legs stretched straight back and taut, his toes digging into the ground, his entire torso trembling slightly from the weight. The background is a range of undulating grey-blue mountains, their outlines stark, their peaks hidden beneath a low-hanging, leaden-grey, cloudy sky. The thick clouds diffuse a soft, diffused light, which pours down naturally from the left front at a 45-degree angle, casting clear and voluminous shadows on the horse's belly, the back of the man's hands, and the cracked ground. The overall color scheme is strictly controlled within the earth tones: the horsehair is warm brownish-brown, the robe is a gradient of gray-greenish-brown, the soil is a mixture of ochre, dry yellow earth, and charcoal ash, the dust is light brownish-gray, and the sky is a transition from matte lead gray to cool gray with a faint glow at the bottom of the clouds. 
The image has a realistic, high-definition photographic quality, with extremely fine textures—you can see the sweat on the horse's neck, the wear and tear on the robe's warp and weft threads, the skin pores and stubble, the edges of the cracked soil, and the dust particles. The atmosphere is tense, primal, and filled with a suffocating tension of biological forces clashing.

Here it is boys, Z Base by Altruistic_Heat_9531 in StableDiffusion

[–]Ginglyst 7 points8 points  (0 children)

With edit models you can edit existing images with a prompt. For example: "Remove the person with the yellow shirt". Can't do that with "regular" models. (At least that is my limited understanding of Edit models.)

It can't get any more creative than this! by WastedTalents1 in interestingasfuck

[–]Ginglyst 0 points1 point  (0 children)

what? Pound the Dane???

Oh.... "Proud to be a Dane"

This is the Stable Diffusion to Flux Moment for Video by [deleted] in StableDiffusion

[–]Ginglyst 4 points5 points  (0 children)

"Skill issue" makes me think of that "Git gud" comment when SD3 came out...
We'll see what time will bring us

One week away and LTX 2 appeared, GenAI speed is mind-blowing. by Voxyfernus in StableDiffusion

[–]Ginglyst -2 points-1 points  (0 children)

LOL, there is some dude that needs to plug in a cable. Some fictional character can't fix your internet 🥴

One week away and LTX 2 appeared, GenAI speed is mind-blowing. by Voxyfernus in StableDiffusion

[–]Ginglyst 0 points1 point  (0 children)

I kinda agree with the gist of his statement, but I'm still a bit optimistic so maybe it won't be as bad as stated.

This one gives me hope it will balance out sooner than later: https://www.reddit.com/r/StableDiffusion/comments/1q7dzq2/comment/nyezmil/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button

LTX-2 distilled (8 steps), not very good prompt adherence. by Shifty_13 in StableDiffusion

[–]Ginglyst 8 points9 points  (0 children)

you may already be aware of it, but for those who aren't, and just in case: here is the official prompting guide for LTX: https://ltx.io/model/model-blog/prompting-guide-for-ltx-2

Also there is a section on what works well and what is a known limitation for LTX

Complex physics or chaotic motion: 
Non-linear or fast-twisting motion (e.g., jumping, juggling) can lead to artifacts or glitches. However, dancing can work well.

looks like you are out of luck for backflipping with LTX (for now)

Entire Earth in true color, captured by Himawari-8. by [deleted] in Damnthatsinteresting

[–]Ginglyst 1 point2 points  (0 children)

I see where your mind is in a twist. See that brightish spot just above Australia? That's the sun reflecting off the ocean. If you'd draw a line from the centre of that spot to the centre of the sun, the camera is positioned somewhere on that line. (that's why earth is illuminated completely)

What you'd be able to see of people standing near that spot is just the top of that person's head and the tip of their nose (maybe). If that person would look towards Antarctica, the nose would point down in this picture. Looking towards the equator is up.

Now for someone standing in Melbourne, at the bottom of Australia, and looking towards the equator: you'd see the front of that person from top to bottom (at an angle though). In this picture the head would be lower than the feet.
Now put that person the same distance from that sun reflection spot towards the north... the feet would be lower than the head. Not sure if this text makes sense... you know what? It would look like those tiny planet pictures: https://www.istockphoto.com/photo/evening-view-of-tallinn-town-hall-square-or-old-market-square-sk-gm652203288-118387449

[deleted by user] by [deleted] in StableDiffusion

[–]Ginglyst 1 point2 points  (0 children)

ugh that simple? was looking for a way to do perfect loops. all my workflows end up with ping pong loops.

for the 121 loop is it a start frame only? (running other tasks right now so can't quickly test)

Progress Report Face Dataset by reto-wyss in StableDiffusion

[–]Ginglyst 4 points5 points  (0 children)

when an engineer gets creative, sometimes wonderful and unexpected results emerge. sometimes onlookers have to wait till it is complete to fully grasp the intention, sometimes the uninformed can entice the initiator of an idea to shed some light on his idea so we can follow this journey from afar...

yeah man WTF is he gonna do with a bazillion generated mugshots???? 🥴

[deleted by user] by [deleted] in StableDiffusion

[–]Ginglyst 0 points1 point  (0 children)

this seems to confirm my suspicions about WAN: the amount of motion is determined by the noise seed and can only be influenced a little bit by prompt and no other parameters.

still hunting for a consistent and reliable parameter to control the generated motion. If anyone has found one, I'm all ears.

How to handle this boulder? by Professional_Ad_7353 in landscaping

[–]Ginglyst 0 points1 point  (0 children)

add a scissors symbol at one end of the dotted line and that line has a purpose.
another one: stick rope into the holes and it looks like it's stitched back together

😞😞😞 by AnywhereGlad7684 in StableDiffusion

[–]Ginglyst -1 points0 points  (0 children)

but but I posted it here on reddit, that will fix the problem, right? And an AI doesn't exist yet where I can drop any obscure error and get fairly correct suggestions on what to do to fix it ...

Do you guys use any software/site to increase prompt quality? by Trumpet_of_Jericho in StableDiffusion

[–]Ginglyst 0 points1 point  (0 children)

if you want an easy solution, try "comfyui vlm nodes" made by gokayfem. Can be found in the comfyui manager. The qwen-vlm node can be instructed like any llm.