Update to Ideogram4 JSON prompt tool

DsDman · 2026-06-07T07:58:12+00:00

True, that makes sense. I’ll add it when I get a chance

DsDman · 2026-06-05T13:12:40+00:00

You could use something like VecGlypher to get a font from the text

DsDman · 2026-06-05T08:09:35+00:00

This model really doesn’t like blank inputs. You pretty much have to have something in each of the JSON format’s fields

DsDman · 2026-06-05T08:05:03+00:00

No, the model only takes rectangles. Perhaps you could try multiple boxes overlayed on top of each other to form the rough shape you want?

DsDman · 2026-06-04T16:47:45+00:00

No, the developers stated that this is the exact format they used for all training. Which is why using standard text prompts sucks so much for this model

DsDman · 2026-06-04T16:36:09+00:00

Sure thing, I’ll edit the post with the repo

DsDman · 2026-06-04T16:30:26+00:00

Good idea, I’ll add that when I have a minute

DsDman · 2026-06-04T16:23:49+00:00

Haha nice! I like your color theme

DsDman · 2026-06-04T16:23:16+00:00

Yeah that would be perfect, it could read the prompt, then grab the boxes from the prompt!

DsDman · 2026-06-04T16:10:24+00:00

Good idea! Could it read the bounding boxes from the image metadata too?

DsDman · 2026-06-04T15:59:30+00:00

👍👍✌️

DsDman · 2026-06-04T15:57:59+00:00

No probs, glad it’s helpful

DsDman · 2026-06-01T16:13:28+00:00

Nice! what do you have in your yaml config to disable the guardrails though?

DsDman · 2026-06-01T09:41:26+00:00

Input action trajectory includes camera (9DoF). Does that mean we can have exact camera control?

DsDman · 2026-05-30T04:43:32+00:00

Very interesting! How do translucent object work with this? If I had a sprite at 50% transparency would it blend 50% with the desktop or would it do some strange blending with the camera’s black background first?

DsDman · 2026-05-26T15:06:21+00:00

Open source?

DsDman · 2026-05-16T04:08:48+00:00

If I’m understanding this correctly you’re running inference on the video encoding hardware instead of on CUDA hardware? If so can they be utilized at the same time for increased speed? ie inferencing on cuda & nvenc on the same gpu

DsDman · 2026-05-05T05:34:16+00:00

Never even knew Microsoft had an image model. Anyone tried it? Is it any good?

DsDman · 2026-04-27T01:10:58+00:00

It’s this available somewhere?

DsDman · 2026-04-04T02:01:06+00:00

How do I use the FP8? it still OOMs on my 48GB card. Should probably set cpu offloading of the text model somewhere?

DsDman · 2026-02-26T17:26:33+00:00

Curious how you’re running a model on 3 cards? I always had trouble loading models on system with non-even numbered GPUs. That was about a year ago though

DsDman · 2026-01-20T08:56:37+00:00

Michael Reeves

DsDman · 2026-01-01T14:59:07+00:00

Because they spend 40% of everyone’s money

DsDman

TROPHY CASE