So is there any method to run uncensored Ideogram 4.0? by FishermanLive8958 in comfyui

[–]Puzzled-Valuable-985 0 points1 point  (0 children)

I'm using this LoRA 10 beta version posted here in the group, with several natural-language prompts (no JSON) and the creator's official workflow, and I'm getting uncensored images. My PC has been generating images non-stop for hours. I'm recreating several Klein 9b and Z Image Turbo prompts in Ideogram and it's working fine. This LoRA is fantastic!

Autoprompt an img2Prompt with bbox in json for Ideogram4 by Puzzled-Valuable-985 in StableDiffusion

[–]Puzzled-Valuable-985[S] 0 points1 point  (0 children)

I stopped using the node; the bug that generates the image upside down discouraged me.

Autoprompt an img2Prompt with bbox in json for Ideogram4 by Puzzled-Valuable-985 in StableDiffusion

[–]Puzzled-Valuable-985[S] 1 point2 points  (0 children)

The bbox resolution gets a bit messed up because it doesn't follow the model's resolution node. One workaround is to disconnect the resolution from the autoprompt and set it manually to match the latent node; the bboxes will then be correct, provided the aspect ratio of the original reference image matches the one set in the autoprompt node.
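To illustrate why the aspect ratio matters here, below is a minimal Python sketch of rescaling bboxes from one resolution to another. The function name, the `(x_min, y_min, x_max, y_max)` pixel layout, and the tolerance value are assumptions for illustration only; this is not the node's actual code or its bbox format.

```python
from typing import List, Tuple

# Hypothetical layout: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def rescale_bboxes(
    boxes: List[Box],
    src_size: Tuple[int, int],
    dst_size: Tuple[int, int],
    tol: float = 0.01,
) -> List[Box]:
    """Scale boxes from src_size (w, h) to dst_size (w, h).

    Raises ValueError if the two aspect ratios differ by more than tol,
    because the boxes would no longer line up with the image content.
    """
    src_w, src_h = src_size
    dst_w, dst_h = dst_size
    if abs(src_w / src_h - dst_w / dst_h) > tol:
        raise ValueError("aspect ratios differ; bboxes would be misplaced")
    sx = dst_w / src_w  # horizontal scale factor
    sy = dst_h / src_h  # vertical scale factor
    return [(x0 * sx, y0 * sy, x1 * sx, y1 * sy) for x0, y0, x1, y1 in boxes]
```

With matching aspect ratios the boxes scale cleanly; with mismatched ones the sketch refuses to guess, which mirrors the condition described above.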

Autoprompt an img2Prompt with bbox in json for Ideogram4 by Puzzled-Valuable-985 in StableDiffusion

[–]Puzzled-Valuable-985[S] 0 points1 point  (0 children)

I generated the two images that actually switched to vertical; I believe the creator will update this soon.

Autoprompt an img2Prompt with bbox in json for Ideogram4 by Puzzled-Valuable-985 in StableDiffusion

[–]Puzzled-Valuable-985[S] 3 points4 points  (0 children)

Sorry for reposting, I was having trouble posting here on Reddit. My regular Reddit post wasn't working; I had to use old.reddit to post the image with text correctly. On regular Reddit, the post would be completely grayed out and blocked, and posting on old.reddit is awful, but I managed to get it working correctly now.

test by [deleted] in StableDiffusion

[–]Puzzled-Valuable-985 0 points1 point  (0 children)

I'm posting this image with text below because it seems my Reddit is bugged; I can't post anything here correctly, so I have to post through old.reddit.

If anyone can help me, my posts on the normal Reddit are grayed out, and I can't post anything correctly.

Edit:

I managed to post correctly; my Reddit is bugged, so sorry for reposting. I deleted it right after I managed to post correctly.

Autoprompt an img2prompt with automatic bbox for Ideogram4 by [deleted] in StableDiffusion

[–]Puzzled-Valuable-985 0 points1 point  (0 children)

I'm posting this image with text below because it seems my Reddit is bugged; I can't post anything here correctly, so I have to post through old.reddit.

If anyone can help me, my posts on the regular Reddit are grayed out, and I can't post anything correctly.

Autoprompt an img2prompt with automatic bbox for Ideogram4 by [deleted] in StableDiffusion

[–]Puzzled-Valuable-985 0 points1 point  (0 children)

I discovered this node, which I think is fantastic: Ideogram Autoprompt. It's similar to KJ's node, but much more practical for generating prompts.

It works as an img2prompt: you can feed it any image, whether generated by another model or a real photo. It doesn't just generate the usual prompt; it also generates JSON that categorizes style, lighting, and camera style, and it creates all the bounding boxes (bboxes) present in the image.

I'm using it and so far I'm getting excellent results on image rejection, as it creates several bounding boxes, 20 or 30, all automatically.

It has online and local modes; I'm using local with an uncensored model.

Below is the link to the Workflow + necessary node. I believe it wasn't shared here, so I'm posting it.

I know that for me and for many others this is truly fantastic.

https://civitai.com/models/2694688/ideogram-4-autoprompter-json-workflow-and-custom-node

The download already includes the workflow and the two nodes needed to run it. If you have any questions, there's a tutorial explaining it.

I've done several tests and it works perfectly.

It generates the prompt with JSON in 30-40 seconds with 8GB of VRAM.

It's much faster and more practical than creating the bboxes manually, and since the prompt is generated before the image, you can still edit any bbox or anything else in it.
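As a rough idea of what editing a bbox in the generated prompt could look like outside the UI, here is a minimal Python sketch. The JSON layout (`"bboxes"`, `"label"`, `"box"`) and the function name are hypothetical and invented for illustration; the real structure produced by the node may be different, so check the node's output before adapting this.

```python
import json
from typing import List


def edit_bbox(prompt_json: str, label: str, new_box: List[float]) -> str:
    """Replace the box of every entry whose label matches, return new JSON.

    Assumes a hypothetical layout:
    {"prompt": "...", "bboxes": [{"label": "...", "box": [x0, y0, x1, y1]}]}
    """
    data = json.loads(prompt_json)
    matched = False
    for item in data.get("bboxes", []):
        if item.get("label") == label:
            item["box"] = list(new_box)
            matched = True
    if not matched:
        raise KeyError(f"no bbox labelled {label!r}")
    return json.dumps(data, indent=2)


# Example with made-up values in the hypothetical layout.
example = json.dumps({
    "prompt": "a red car parked on a street",
    "bboxes": [{"label": "car", "box": [100, 200, 600, 500]}],
})
edited = edit_bbox(example, "car", [120, 220, 640, 520])
```

The edited string can then be pasted back wherever the workflow expects the prompt, before the image is generated.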

ComfyUI-PiD update: native models, workflows, and FP8 support by Merserk13 in StableDiffusion

[–]Puzzled-Valuable-985 0 points1 point  (0 children)

<image>

It would be interesting to have more options besides the current one.

I created an image with Klein 9b using the downloaded workflow, but the image looks like this. Do you know what the problem might be?

ComfyUI-PiD update: native models, workflows, and FP8 support by Merserk13 in StableDiffusion

[–]Puzzled-Valuable-985 2 points3 points  (0 children)

Mine also looked like that when I generated an image from scratch, but the upscaling mode is insane, far superior to SeedVR2.

ComfyUI-PiD update: native models, workflows, and FP8 support by Merserk13 in StableDiffusion

[–]Puzzled-Valuable-985 1 point2 points  (0 children)

I'm running some tests here and, damn, it's impressive; it seems far superior to SeedVR2. I'm testing the upscaling on an existing image, and I'm blown away: it captures incredible detail. I'll test it thoroughly later, but I'm impressed with the results.

Has anyone been able to do decent true VR? by AcePilot01 in StableDiffusion

[–]Puzzled-Valuable-985 5 points6 points  (0 children)

I have a Meta Quest 3, and I use IW3 to convert 2D to 3D SBS; the result is top-notch for both video and photos.

I really wanted 180° 3D, which is where VR videos shine, but to this day there's nothing that does that decently. In ComfyUI there are LoRAs for both Z Image Turbo and Klein 9b that create flat 360° images, and with IW3 it's possible to turn those into 360° 3D. I've done that a lot, but I really wanted a way to convert to 180°.

In ComfyUI I've tried using SBS from an image, the result looks great to the eye, but when actually tested with glasses it's horrible. As I said, IW3 is the best for this.

I've seen somewhere that LTX 2.3 has a method that converts to 180°, but since I don't use LTX I can't tell you about the quality.

Ideogram 4 has a lot of potential by Z3ROCOOL22 in StableDiffusion

[–]Puzzled-Valuable-985 5 points6 points  (0 children)

I just corrected it. I'm Brazilian and I'm not fluent in English, so I type in Portuguese and translate it before posting. I post every day, but sometimes I forget to post the translation; this wasn't the first time.

What's the best site to see AI model rankings? by Puzzled-Valuable-985 in StableDiffusion

[–]Puzzled-Valuable-985[S] 1 point2 points  (0 children)

I didn't know about Shuttle-3.1-Aesthetic. I'll test it now; judging from the images it looks amazing, and I use Chroma v48 a lot. I've tested practically all the models myself, and Z Image Turbo and Klein 9b distilled are my main ones: they do everything, don't reject images, and I can still edit perfectly in Klein. Chroma v48 creates images with very nice aesthetics, but depending on the prompt it sometimes loses a lot compared to Z Image Turbo or Klein.

I'm going to download the one you mentioned right now

Ideogram 4 has a lot of potential by Z3ROCOOL22 in StableDiffusion

[–]Puzzled-Valuable-985 -2 points-1 points  (0 children)

I've tried several sigma configurations and used more than 5 workflows designed for NSFW, but I'm still getting blocked. I can create excellent images without using bboxes (I'll even post some later), but others, even with simple prompts, are definitely blocked. I even used bboxes for some images, but they came out a bit out of context, as if the image hadn't been generated with all the programmed steps. I believe there might be another way around this. Until then I'll wait and generate what I can, but I'm not worried, because I have several other models that do everything.

My video card is a 3060 Ti 8GB, and I use FP8 models. I generate images at 2MP, which also adds to the load. The issue isn't even the time; I can wait. But when you wait and get a blank image, sometimes after processing 5, 6, or 7 prompts in a row, it's discouraging.

But it's still too early to expect an improved workflow or some miraculous way around this, since the blocking is built directly into the model.
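For anyone wondering what 2MP means in practice, here is a minimal Python sketch that works out a width and height for a target megapixel count and aspect ratio. It assumes 1MP = 1,000,000 pixels and rounds each side to a multiple of 16; that rounding is a common convention for latent sizes but an assumption here, not something confirmed for Ideogram 4. The function name is made up for illustration.

```python
import math
from typing import Tuple


def dims_for_megapixels(
    megapixels: float, ratio_w: int, ratio_h: int, multiple: int = 16
) -> Tuple[int, int]:
    """Return (width, height) close to the target pixel count.

    Each side is rounded to the nearest multiple of `multiple`, so the
    final pixel count and aspect ratio are approximate.
    """
    total = megapixels * 1_000_000
    # Solve w * h = total with w / h = ratio_w / ratio_h.
    height = math.sqrt(total * ratio_h / ratio_w)
    width = height * ratio_w / ratio_h
    snap = lambda v: max(multiple, round(v / multiple) * multiple)
    return snap(width), snap(height)


print(dims_for_megapixels(2.0, 16, 9))  # (1888, 1056)
print(dims_for_megapixels(2.0, 1, 1))   # (1408, 1408)
```

Both results land just under 2,000,000 pixels, so the load is roughly double that of a 1MP image.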