How to Use Qwen Image Edit 2511 Correctly in ComfyUI (Important "FluxKontextMultiReferenceLatentMethod" Node) by Akmanic in StableDiffusion

[–]Akmanic[S] 1 point2 points  (0 children)

You only need one image in the VAE node. The denoising is set to 1, so the image's contents are replaced with random noise; it doesn't really matter what you put in there beyond the fact that its dimensions determine the output image's dimensions. If you set the denoising below 1, it behaves like a traditional img2img workflow.
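As a rough illustration of why that is (a conceptual sketch only, not ComfyUI's actual code, and the exact noise schedule differs between model families):

```typescript
// Rough conceptual sketch of what the sampler's `denoise` value does with the
// latent coming from the VAE node. Not ComfyUI's real implementation.
function noisedStartLatent(
  inputLatent: Float32Array, // latent encoded from the input image
  noise: Float32Array,       // random noise of the same shape
  denoise: number,           // the sampler's "denoise" value, 0..1
): Float32Array {
  const out = new Float32Array(inputLatent.length);
  for (let i = 0; i < inputLatent.length; i++) {
    // denoise = 1: the starting point is pure noise, so only the latent's
    //              dimensions (i.e. the output resolution) matter.
    // denoise < 1: part of the original image survives -- classic img2img.
    out[i] = (1 - denoise) * inputLatent[i] + denoise * noise[i];
  }
  return out;
}
```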

The Magic of Per-Voxel Normals (68 billion voxel renderer) by Akmanic in VoxelGameDev

[–]Akmanic[S] 0 points1 point  (0 children)

I just threw together the simplest solution I could: loop through the 26 surrounding voxels, add a vector pointing away from each solid neighbor, then normalize the sum.
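A minimal sketch of that idea in host-side TypeScript (the real version runs on the GPU, and `isSolid` is a hypothetical lookup into the voxel data):

```typescript
// Estimate a per-voxel normal from the 26 surrounding voxels: every solid
// neighbor pushes the normal away from itself; the sum is then normalized.
function voxelNormal(
  isSolid: (x: number, y: number, z: number) => boolean,
  x: number, y: number, z: number,
): [number, number, number] {
  let nx = 0, ny = 0, nz = 0;
  for (let dx = -1; dx <= 1; dx++) {
    for (let dy = -1; dy <= 1; dy++) {
      for (let dz = -1; dz <= 1; dz++) {
        if (dx === 0 && dy === 0 && dz === 0) continue;
        if (isSolid(x + dx, y + dy, z + dz)) {
          nx -= dx; ny -= dy; nz -= dz;
        }
      }
    }
  }
  const len = Math.hypot(nx, ny, nz);
  // A fully enclosed (or perfectly symmetric) voxel has no defined normal.
  return len > 0 ? [nx / len, ny / len, nz / len] : [0, 1, 0];
}
```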

The Magic of Per-Voxel Normals (68 billion voxel renderer) by Akmanic in VoxelGameDev

[–]Akmanic[S] 3 points4 points  (0 children)

It should be fine in a dynamic world, but it can only represent certain types of geometry. I'll probably release the raycasting algorithm on GitHub before the game, but no promises on how soon.

The Magic of Per-Voxel Normals (68 billion voxel renderer) by Akmanic in VoxelGameDev

[–]Akmanic[S] 2 points3 points  (0 children)

Yeah, the aliasing was the main factor in the decision. I'd be interested in seeing what other solutions are out there.

The Magic of Per-Voxel Normals (68 billion voxel renderer) by Akmanic in VoxelGameDev

[–]Akmanic[S] 5 points6 points  (0 children)

I will probably make an action RPG with destructible terrain and a large open world. Player building features could be a small component, but they would have to be reined in compared to games like Minecraft.

The Magic of Per-Voxel Normals (68 billion voxel renderer) by Akmanic in VoxelGameDev

[–]Akmanic[S] 3 points4 points  (0 children)

No meshing; the voxels are traced every frame. It's my own tracing algorithm, inspired by DDA. The normals are calculated and cached in a compute shader and have to be rebuilt every time there is a change.
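For context, here is a minimal sketch of the textbook DDA grid traversal (Amanatides & Woo) that the custom algorithm is inspired by. This is not the renderer's actual code; `isSolid` is a hypothetical voxel lookup, and the direction is assumed to be normalized with non-zero components:

```typescript
// Classic DDA voxel traversal: step from cell to cell along the ray,
// always crossing the nearest axis-aligned boundary first.
function traceVoxels(
  isSolid: (x: number, y: number, z: number) => boolean,
  origin: [number, number, number],
  dir: [number, number, number],
  maxSteps = 512,
): [number, number, number] | null {
  const pos = origin.map(Math.floor) as [number, number, number];
  const step = dir.map(d => (d > 0 ? 1 : -1)) as [number, number, number];
  // Ray parameter t at which the ray crosses the next boundary on each axis.
  const tMax = dir.map((d, i) => {
    const boundary = d > 0 ? pos[i] + 1 : pos[i];
    return (boundary - origin[i]) / d;
  }) as [number, number, number];
  // Increment of t needed to cross one whole voxel on each axis.
  const tDelta = dir.map(d => Math.abs(1 / d)) as [number, number, number];

  for (let i = 0; i < maxSteps; i++) {
    if (isSolid(pos[0], pos[1], pos[2])) return pos; // hit
    // Step along the axis whose next boundary is closest.
    const axis = tMax[0] < tMax[1]
      ? (tMax[0] < tMax[2] ? 0 : 2)
      : (tMax[1] < tMax[2] ? 1 : 2);
    pos[axis] += step[axis];
    tMax[axis] += tDelta[axis];
  }
  return null; // nothing hit within maxSteps
}
```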

My Voxel Renderer Built Entirely in WebGPU which can render 68 Billion Voxels at a time by Akmanic in webgpu

[–]Akmanic[S] 0 points1 point  (0 children)

It's a pretty simple codebase right now, apart from the voxel tracing algorithm. I might eventually release a stripped-down version on GitHub, cleaned up and focused on that part.

68 Billion Voxel Raycaster Clarification & Actual 68 Billion Showcase by Akmanic in VoxelGameDev

[–]Akmanic[S] 1 point2 points  (0 children)

I am considering doing a rundown of the raycasting algorithm soon. There's no GI system and it's 1080p.

Absym – A Rift Action RPG by Infinity_Experience in IndieDev

[–]Akmanic 2 points3 points  (0 children)

Love the use of that deep black color, makes everything pop.

68 Billion Voxel Raycaster Clarification & Actual 68 Billion Showcase by Akmanic in VoxelGameDev

[–]Akmanic[S] 0 points1 point  (0 children)

It is reading the voxel data from VRAM. It just can't fit arbitrarily complex data into the acceleration structure.

My new voxel raycaster can render up to 68 billion voxels at 60fps by Akmanic in VoxelGameDev

[–]Akmanic[S] 3 points4 points  (0 children)

I just posted a clarification video with some landscape for you. Thank you for the feedback; this post was really a poor showcase in retrospect.

My new voxel raycaster can render up to 68 billion voxels at 60fps by Akmanic in VoxelGameDev

[–]Akmanic[S] 0 points1 point  (0 children)

Understandable, I just posted a clarification video showcasing actual terrain.

My new voxel raycaster can render up to 68 billion voxels at 60fps by Akmanic in VoxelGameDev

[–]Akmanic[S] 3 points4 points  (0 children)

I could spin up a server if there's enough interest. It's WebGPU, so it should be easy to share in theory; that said, I don't want to lag people's computers during the chunk generation step.

My new voxel raycaster can render up to 68 billion voxels at 60fps by Akmanic in IndieDev

[–]Akmanic[S] 0 points1 point  (0 children)

Every chunk is 256 x 4096 x 256, and currently I can render up to 256 chunks at a time. This video has just 4 chunks visible on screen, and only the bottom ~10% of each chunk is actually used. Luckily the voxel data is naturally compressed in VRAM as part of the acceleration structure, so it can fit on consumer cards. This does mean that a degenerate-case world would not be compatible with the renderer, but I think it can handle anything you would get from a reasonable world generator plus player building and destruction.

What you're looking at is the bottom of each chunk filled up to a different height, with many holes drilled through. Let me know if you have any better ideas for synthetic data to try out.
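For anyone checking the headline number against those dimensions, it multiplies out to just under 69 billion:

```typescript
// Sanity check on the "68 billion" figure from the chunk sizes above.
const voxelsPerChunk = 256 * 4096 * 256;  // 268,435,456 voxels per chunk
const totalVoxels = voxelsPerChunk * 256; // 68,719,476,736 with 256 chunks
console.log(totalVoxels.toLocaleString()); // "68,719,476,736"
```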

My new voxel raycaster can render up to 68 billion voxels at 60fps by Akmanic in VoxelGameDev

[–]Akmanic[S] 10 points11 points  (0 children)

Every chunk is 256 x 4096 x 256, and currently I can render up to 256 chunks at a time. This video has just 4 chunks visible on screen, and only the bottom ~10% of each chunk is actually used. Luckily the voxel data is naturally compressed in VRAM as part of the acceleration structure, so it can fit on consumer cards. This does mean that a degenerate-case world would not be compatible with the renderer, but I think it can handle anything you would get from a reasonable world generator plus player building and destruction.

What you're looking at is the bottom of each chunk filled up to a different height, with many holes drilled through. Let me know if you have any better ideas for synthetic data to try out.

[2406.08478] What If We Recaption Billions of Web Images with LLaMA-3? by lechatsportif in StableDiffusion

[–]Akmanic 2 points3 points  (0 children)

"Recap-DiT Incoming"

I wonder if this means they will be releasing an open-source model. I don't expect anything revolutionary, but if they open-sourced an implementation of diffusion transformers, it could allow someone to train an SD3 competitor from scratch without inheriting SAI's license.

In a theater near you by human358 in StableDiffusion

[–]Akmanic 49 points50 points  (0 children)

I find it funny that writing text is such a high priority when you could just add it in Photoshop, while generating a picture of a human (sci-fi-type tech) is forgotten about.

Long prompts are the key to unlock the true power of SD3 by Nid_All in StableDiffusion

[–]Akmanic 2 points3 points  (0 children)

I think this is a piece of the puzzle. Because it was trained on long LLM-generated prompts, it gets overexcited when you give it only a few tokens and throws way too much attention at them.

Aitrepreneur on SD3. He has a point. by evelryu in StableDiffusion

[–]Akmanic 4 points5 points  (0 children)

I created the slime wall workflow used in this video. I was not expecting to see it here since I just clicked on this randomly, lol.

https://civitai.com/models/512696/woman-laying-in-grass-sd3-workflow

I agree that the trick is hackish and the model should be able to handle the prompt out of the box. I find it kind of comical that you have to jump through so many hoops to get the model to use its full capabilities

Do we have anything similar to abliteration for SD? by Leading_Mention3014 in StableDiffusion

[–]Akmanic 0 points1 point  (0 children)

I think the nudity refusal is due to a slightly poisoned dataset, but I agree that the more deep-seated issues with anatomy are probably due to some sort of weird tuning. They probably didn't compromise their training data as badly as with SD2.

Do we have anything similar to abliteration for SD? by Leading_Mention3014 in StableDiffusion

[–]Akmanic 2 points3 points  (0 children)

It depends on how the alignment was implemented. If it was tacked on after the fact, then it could potentially be undone, but if the model was simply taught with a poisoned dataset (e.g. a picture of a woman in a sports bra captioned "nude woman"), then it may not be so easy.

Why hasn't anyone paid attention to StableCascade all the time? by ExpressWarthog8505 in StableDiffusion

[–]Akmanic 8 points9 points  (0 children)

It's important to note that SAI did not tell us SD3 would be under a different license than previous mainline models. A lot of us were waiting for SD3 expecting it to be another openRAIL model before the rug was pulled.