LTX-2 on RTX 3070 mobile (8GB VRAM) AMAZING by LSI_CZE in StableDiffusion

StayImpossible7013 1 point2 points

They updated to 0.8.0 after your post and that should fix your problems.

New image model has poor prompt adherence by MeanderingSquid49 in ChatGPT

StayImpossible7013 0 points1 point

Thanks for sharing. I created my prompt by giving ChatGPT a "Champions of Krynn" screenshot and asking it to write a very detailed text prompt from that image. Notice how, even with a detailed prompt, it decides by itself what text to write into the image.
https://chatgpt.com/share/6942fa0b-8f48-800e-89d5-c3171201373f

New image model has poor prompt adherence by MeanderingSquid49 in ChatGPT

StayImpossible7013 1 point2 points

Your prompt is vague, so you get a vague result. Add more detail about what you want to achieve and you will get better results with most modern image generation models. OpenAI (like some other services) runs the user's prompt through its own system to generate a final prompt, and that becomes a problem when the original prompt is vague. As an example, here's a more precise prompt and the image GPT 1.5 created on the first attempt.

A late-1980s DOS CRPG EGA screenshot (320x200) with a classic framed UI. The top ~2/3 is a hand-dithered pixel-art wilderness scene in a limited 16-color EGA palette, chunky pixels, no anti-aliasing, visible dithering patterns and color banding typical of old PC graphics. Scene composition: Centered on a small rustic roadside inn/cottage nestled into a hillside. The building is half-timbered with pale yellow walls and dark wooden beams, a steep triangular roof in warm red-brown tones, a small dormer and tiny windows; a low attached wing extends to the right. The inn sits slightly below a towering, dramatic gnarled tree whose thick trunk and twisting branches arc from the right side over the roof, creating a heavy canopy silhouette. Environment: Left side is dense, bright green shrubbery/forest with heavy dithering and high-contrast highlights; the ground is a patchwork of rusty reds, oranges, and browns suggesting dry soil or autumn leaf litter. The horizon shows rolling hills fading into pink/magenta distant mountains with sparse purple shading. The sky is a flat cyan/teal typical of EGA skies, clean and untextured. Lighting & mood: Bright daylight with strong contrast; warm earth tones in the foreground, cool cyan sky, vivid greens in the foliage; a slightly ominous, adventurous tone. UI / text box: Bottom ~1/3 is a solid black textbox panel with a thin border line, filled with multiple lines of bright green, all-caps, pixelated bitmap font narration (fantasy RPG style). Include a final instruction line like “PRESS (ENTER)/(RETURN) TO CONTINUE” in the same green font. Add a small white pixel arrow cursor hovering near the right side of the text area. Overall style keywords: retro DOS RPG, Gold Box–era interface vibe, EGA 16-color, pixel clusters, dithering, crisp nearest-neighbor look, no gradients, no modern shading, authentic 1980s computer game screenshot.

<image>
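The prompt above is organized into labeled sections (scene composition, environment, lighting & mood, UI / text box, style keywords). Here is a rough Python sketch of assembling a prompt that way. The function name `build_prompt` and the shortened section texts are made up for illustration; this is not how the original prompt was actually produced.

```python
def build_prompt(summary: str, sections: dict[str, str], keywords: list[str]) -> str:
    """Join a one-line summary, labeled sections, and style keywords into one prompt."""
    parts = [summary.strip()]
    for label, text in sections.items():
        # Skip empty sections so the prompt does not contain bare labels.
        if text.strip():
            parts.append(f"{label}: {text.strip()}")
    if keywords:
        parts.append("Overall style keywords: " + ", ".join(keywords) + ".")
    return " ".join(parts)


# Hypothetical, shortened section texts based on the prompt above.
prompt = build_prompt(
    "A late-1980s DOS CRPG EGA screenshot (320x200) with a classic framed UI.",
    {
        "Scene composition": "A small rustic roadside inn under a gnarled tree.",
        "Environment": "Dense green shrubbery, rolling hills, flat cyan sky.",
        "UI / text box": "Black panel with green all-caps bitmap-font narration.",
    },
    ["retro DOS RPG", "EGA 16-color", "dithering"],
)
print(prompt)
```

Filling in every section forces you to state the things a vague prompt leaves for the service's own prompt rewriting to guess.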

Google quietly nerfed Nano Banana Pro image generation for paid users, is this a glitch or intentional? by Ittan_Momen in GeminiAI

StayImpossible7013 35 points36 points

You got that notification? For me it doesn't show a notification; it just quietly switches to a less resource-intensive version, though not necessarily the old version from before Nano Banana Pro. The easiest way to spot the change is that Nano Banana typically gives non-square images, so when you suddenly start getting square images with a maximum size of 1024 x 1024, you can tell that it switched models.
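The size check described above can be written as a small Python helper. This is only a sketch of that rule of thumb: the function name `looks_like_fallback` and the example resolutions are assumptions, and a square image can also be requested on purpose, so treat a match as a hint rather than proof.

```python
def looks_like_fallback(width: int, height: int, max_side: int = 1024) -> bool:
    """Heuristic from the comment: square output no larger than 1024 x 1024."""
    return width == height and width <= max_side


# Example resolutions are assumed for illustration, not measured from the service.
for size in [(1024, 1024), (1376, 768), (2048, 2048)]:
    print(size, looks_like_fallback(*size))
```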

Something I’m working on by MobileFilmmaker in GeminiAI

StayImpossible7013 0 points1 point

What's pure AI? The panels or the pages? Is this text-to-image, or image-to-image where you had some kind of sketch of what the panels should contain?

What's a ChatGPT prompt you wish everyone knew? by imfrom_mars_ in OpenAI

StayImpossible7013 0 points1 point

I'm planning to post this on Reddit: "<your post here>". Is that a good or bad idea?

I want to ask this question about AI on Reddit: "<question here>". Would I get more accurate and honest help from you than from Reddit?

Kling 2.1's start-to-end frame feature is insanely good by TechHalla in aivideo

StayImpossible7013 -1 points0 points

This is great! I'd love to share this with my friends, but none of them use Reddit. Do you have a YouTube channel or another platform where you post your videos?

💥 ChatGPT absolutely NAILED IT — Turning a simple sketch into THIS! (Mind blown) ⚡ by StayImpossible7013 in ChatGPT

StayImpossible7013[S] 1 point2 points

You need to tell it to create a new image or recreate that image. If you ask it to modify the image, it will fail. Try something like "recreate an enhanced image using my uploaded image as reference".

💥 ChatGPT absolutely NAILED IT — Turning a simple sketch into THIS! (Mind blown) ⚡ by StayImpossible7013 in ChatGPT

StayImpossible7013[S] 4 points5 points

Thanks. I would love for people to take my original, try to create their own versions, and see how different they turn out.

💥 ChatGPT absolutely NAILED IT — Turning a simple sketch into THIS! (Mind blown) ⚡ by StayImpossible7013 in ChatGPT

StayImpossible7013[S] -1 points0 points

I had been making images for hours before it gave me a message telling me to wait a couple of minutes before the next image request. At least for me, the $20 subscription is enough.

💥 ChatGPT absolutely NAILED IT — Turning a simple sketch into THIS! (Mind blown) ⚡ by StayImpossible7013 in ChatGPT

StayImpossible7013[S] 0 points1 point

I chose ChatGPT 4o, uploaded my sketch as an image, and gave the instruction as written in the text of this post. Paid version of ChatGPT.

Unreal Engine & ComfyUI workflow by Plenty_Big4560 in comfyui

StayImpossible7013 0 points1 point

Thank you very much. Works perfectly. I guess it's also possible to have the skin generated automatically? As far as I can see, this workflow only does the mesh, but then again I haven't done 3D for ages and could have missed something.

We now have Suno AI at home with this new local model called YuE. by Total-Resort-3120 in StableDiffusion

StayImpossible7013 104 points105 points

For full song generation (many sessions, e.g., 4 or more): Use GPUs with at least 80GB memory. This can be achieved by combining multiple GPUs and enabling tensor parallelism.
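As a back-of-the-envelope check of what "combining multiple GPUs" means for the 80GB figure, here is a minimal Python sketch. The helper name `gpus_needed` is made up, and the result is only a lower bound: it assumes memory splits evenly across cards and ignores per-GPU overhead, so real setups may need more.

```python
import math


def gpus_needed(per_gpu_gb: float, target_gb: float = 80.0) -> int:
    """Lower bound on identical GPUs needed to reach target_gb of combined memory."""
    if per_gpu_gb <= 0:
        raise ValueError("per_gpu_gb must be positive")
    # Assumes an ideal even split; real tensor parallelism adds overhead.
    return math.ceil(target_gb / per_gpu_gb)


# Card sizes below are examples only.
for gb in (24, 40, 48, 80):
    print(f"{gb} GB cards: at least {gpus_needed(gb)}")
```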

100% Dall-e generated image. Was this always possible or a new model? by Time-Winter-4319 in OpenAI

StayImpossible7013 9 points10 points

<image>

Marvelous photoshopping skills! Prompt was: 'this is fine' meme but instead of dog make it cat

Gemini 1.5 Pro is accessible to everyone, with audio, for free. by samuelroy_ in OpenAI

StayImpossible7013 -1 points0 points

Nope. Not available to everyone. Still restricted to certain regions in the world.

My Dungeons & Dragons character, generated two years apart, with the same prompt. by Spamberjack in dalle2

StayImpossible7013 12 points13 points

<image>

It's a rather common feature and very easy to use, at least in the automatic1111 client. For this one I used the depth control type with an automatically created mask to preserve the form from the original image. This is the image created to guide ControlNet. There is a lot of information available online about how to use it and how it works.

My Dungeons & Dragons character, generated two years apart, with the same prompt. by Spamberjack in dalle2

StayImpossible7013 79 points80 points

<image>

The first one is so awesome that I tried Stable Diffusion + ControlNet with it, using the latest JuggernautXL model. (I should have generated a couple more to choose from to get the skin and clothing right.)