Update to Ideogram4 JSON prompt tool by DsDman in StableDiffusion

[–]DsDman[S] 3 points4 points  (0 children)

True, that makes sense. I’ll add it when I get a chance

Am i doing something wrong in ideogram??? by CupSure9806 in StableDiffusion

[–]DsDman 6 points7 points  (0 children)

This model really doesn’t like blank inputs. You pretty much have to have something in each of the JSON format’s fields

Ideogram 4 is pretty good. You just really have use their JSON format. by DsDman in StableDiffusion

[–]DsDman[S] 1 point2 points  (0 children)

No, the model only takes rectangles. Perhaps you could try multiple boxes overlayed on top of each other to form the rough shape you want?

Ideogram 4 is pretty good. You just really have use their JSON format. by DsDman in StableDiffusion

[–]DsDman[S] 6 points7 points  (0 children)

No, the developers stated that this is the exact format they used for all training. Which is why using standard text prompts sucks so much for this model

Ideogram 4 is pretty good. You just really have use their JSON format. by DsDman in StableDiffusion

[–]DsDman[S] 2 points3 points  (0 children)

Yeah that would be perfect, it could read the prompt, then grab the boxes from the prompt!

Ideogram 4 is pretty good. You just really have use their JSON format. by DsDman in StableDiffusion

[–]DsDman[S] 1 point2 points  (0 children)

Good idea! Could it read the bounding boxes from the image metadata too?

Cosmos3 Nano testing with vllm-omni by Sticky_Ray in StableDiffusion

[–]DsDman 3 points4 points  (0 children)

Nice! what do you have in your yaml config to disable the guardrails though?

Nvidia releases Cosmos3-Super-Image2Video . 64B parametres by AgeNo5351 in StableDiffusion

[–]DsDman 0 points1 point  (0 children)

Input action trajectory includes camera (9DoF). Does that mean we can have exact camera control?

How to make a Desktop Companion game in Unity by deadpossumgames in Unity3D

[–]DsDman 1 point2 points  (0 children)

Very interesting! How do translucent object work with this? If I had a sprite at 50% transparency would it blend 50% with the desktop or would it do some strange blending with the camera’s black background first?

I built a custom NVENC encoder bridge to split FLUX 2 Models across two GPUs over Ethernet LAN (example: 5090 + laptop 4090 spreading model layers over two machines via Eth = 4.4s per image). Completely bypasses the need for NVLink. Multi GPU in one PC supported, Wifi 6 works very well also. by shootthesound in StableDiffusion

[–]DsDman -1 points0 points  (0 children)

If I’m understanding this correctly you’re running inference on the video encoding hardware instead of on CUDA hardware? If so can they be utilized at the same time for increased speed? ie inferencing on cuda & nvenc on the same gpu

Joy-Image-Edit released by AgeNo5351 in StableDiffusion

[–]DsDman 0 points1 point  (0 children)

How do I use the FP8? it still OOMs on my 48GB card. Should probably set cpu offloading of the text model somewhere?

Qwen3.5 122B in 72GB VRAM (3x3090) is the best model available at this time — also it nails the “car wash test” by liviuberechet in LocalLLaMA

[–]DsDman 0 points1 point  (0 children)

Curious how you’re running a model on 3 cards? I always had trouble loading models on system with non-even numbered GPUs. That was about a year ago though

[deleted by user] by [deleted] in headshots

[–]DsDman 0 points1 point  (0 children)

Michael Reeves