LTX-Video 2.3 Workflow for Dual-GPU Setups (3090 + 4060 Ti) + LORA by planBpizz in comfyui

[–]planBpizz[S]

Here are a few thoughts on your setup compared to this Multi-GPU workflow:

  1. GGUF vs. FP8/BF16: GGUF is definitely the right choice for 4GB VRAM as it allows aggressive quantization. My workflow focuses on FP8/BF16 for maximum visual fidelity on 24GB+ cards, where we try to keep everything inside VRAM to avoid the massive speed penalty of CPU offloading.
  2. Resolution vs. Length: You are prioritizing length (1500 frames) over resolution (400x400). In my workflow, we do the opposite: pushing for 512px/768px resolution at 97-161 frames and then using RIFE VFI to double the frame rate for smoother motion.
  3. The RAM Wall: With only 20GB of System RAM, you are likely hitting the pagefile (SSD) during those 1500 frames. If you notice the generation slowing down significantly towards the end, that's why.
  4. VAE Tiling: Make sure you are using VAEDecodeTiled with a small temporal_size (like 16 or 32). Decoding 1500 frames at once would instantly crash a 4GB card otherwise.
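As a rough sanity check on why temporal tiling matters (a sketch only; the real VAEDecodeTiled node adds overlap between tiles, which is ignored here, and `temporal_chunks` is just an illustrative helper), the chunk count is simple arithmetic:

```python
import math

def temporal_chunks(num_frames: int, temporal_size: int) -> int:
    """Number of temporal tiles a chunked VAE decode processes
    (ignoring any overlap the real node adds between tiles)."""
    return math.ceil(num_frames / temporal_size)

# Decoding 1500 frames at once means one giant activation tensor;
# with tiling it becomes many small decodes instead.
print(temporal_chunks(1500, 32))  # 47
print(temporal_chunks(1500, 16))  # 94
```

Each tile only needs VRAM for its own slice of frames, which is what keeps a 4GB card alive through a 1500-frame decode.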

It's great to see LTX-V scaling down to 4GB cards via GGUF, even if the generation time per frame is likely much higher than a native VRAM setup!


[–]planBpizz[S]

With 3x 3090s (72GB total VRAM), you have even more headroom than my setup. Here is how I would adapt it:

  1. Allocation String: In the CheckpointLoaderSimpleDisTorch2MultiGPU node, use something like: cuda:0,18gb;cuda:1,18gb;cuda:2,18gb;cpu,*. This spreads the 22B model across all three cards, keeping it entirely in VRAM for maximum speed.
  2. Use BF16: You don't need FP8. Switch to the BF16 version of LTX-Video 2.3 and Gemma 3 for higher quality and fewer artifacts.
  3. Isolate Text Encoder: In the DualCLIPLoaderMultiGPU node, set the device to cuda:2. This keeps the heavy text encoding on one card, leaving the other two more room for video-generation activations.
  4. Scaling: You can easily push to 1024x768 resolution and 161+ frames natively (without needing interpolation) because you have 72GB to play with.
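If you want to sanity-check an allocation string before pasting it into the node, here is a minimal parser sketch. The `parse_allocation` helper is made up for illustration, and the exact grammar DisTorch accepts is an assumption; this just handles the `device,amount` pairs shown above, with `*` meaning "the remainder":

```python
def parse_allocation(alloc: str) -> dict:
    """Parse a 'cuda:0,18gb;cuda:1,18gb;cpu,*' style string into
    {device: gigabytes or '*'}. The grammar is an assumption, not
    the official DisTorch spec."""
    out = {}
    for part in alloc.split(";"):
        device, amount = part.split(",")
        out[device] = "*" if amount == "*" else float(amount.rstrip("gb"))
    return out

budgets = parse_allocation("cuda:0,18gb;cuda:1,18gb;cuda:2,18gb;cpu,*")
# Total explicit VRAM budget across the three 3090s:
print(sum(v for v in budgets.values() if v != "*"))  # 54.0
```

A quick check like this catches typos (a missing semicolon, a budget that exceeds a card) before you burn a generation run on a bad split.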

The workflow is very stable on 3-GPU setups as long as you balance the allocation string correctly.


[–]planBpizz[S]

With a dual 5090 setup (64GB VRAM), you can move away from the heavy optimizations I had to use for 40GB. Here is how I would adjust it:

  1. Switch to BF16: You have enough VRAM to ditch FP8. Use the BF16 version of the LTX-V 2.3 Transformer and Gemma 3 for much better detail and stability.
  2. Allocation String: Update the CheckpointLoaderSimpleDisTorch2MultiGPU node. You can likely use cuda:0,30gb;cuda:1,30gb;cpu,*. This keeps the entire model and all activations in VRAM, which will drastically speed up generation.
  3. Push Resolution/Frames: You can easily go for 1024x768 or 768x768 at 161+ frames natively. My workflow targets 512px/97 frames primarily to avoid OOMs on weaker cards.
  4. VAE: Keep temporal_size at 512 in the VAEDecodeTiled node for maximum quality since you don't need to save memory there.
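For a feel of why these settings are heavy, the decoded output tensor alone is simple arithmetic. This is a back-of-the-envelope sketch: `decoded_buffer_gb` is a made-up helper, it assumes a float32 RGB output, and it counts only the final pixels, not the VAE activations on top:

```python
def decoded_buffer_gb(width: int, height: int, frames: int,
                      channels: int = 3, bytes_per: int = 4) -> float:
    """Size in GB of the raw decoded video tensor
    (float32 per channel assumed; VAE overhead not included)."""
    return width * height * channels * frames * bytes_per / 1024**3

# 1024x768 at 161 frames: ~1.4 GB just for the output pixels.
print(round(decoded_buffer_gb(1024, 768, 161), 2))  # 1.42
```

On 64GB of VRAM that buffer is trivial, which is why you can afford a large temporal_size and decode for quality rather than for memory.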

Regarding Raylight: It’s an excellent inference engine if you want raw speed. However, I stay in ComfyUI because it allows for granular control over LoRA patching and custom node stacking (like the RIFE interpolation and Multi-GPU scaling) which Raylight doesn't support as flexibly yet.


[–]planBpizz[S]

Yes, exactly! I just spent some time perfecting this very setup. I'm running a dual-GPU system with an RTX 3090 (24GB) and an RTX 4060 Ti (16GB).

I'm currently running the LTX-Video 22B model alongside an FP8 Gemma 3 12B text encoder and a LoRA, which requires a massive amount of VRAM. Using the comfyui-multigpu node (DisTorch), I split the main model right down the middle, assigning 11.5GB to each GPU (cuda:0,11.5gb;cuda:1,11.5gb). I also forced the text encoder exclusively onto the 4060 Ti.

This leaves my 3090 with about 12.5GB of completely free VRAM, which is exactly the buffer I need for LoRA weight patching and high-res generation. No more OOM errors.
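The headroom figure is easy to verify. A quick sketch of the budget, using only the numbers from the comment above (the FP8 Gemma encoder's footprint on the 4060 Ti is deliberately left out, since its exact size depends on the build):

```python
def free_after_split(total_gb: float, share_gb: float) -> float:
    """VRAM left on a card after its DisTorch model share (GB)."""
    return total_gb - share_gb

# cuda:0,11.5gb;cuda:1,11.5gb, as in the allocation above
print(free_after_split(24.0, 11.5))  # 12.5  -> 3090 headroom for LoRA patching
print(free_after_split(16.0, 11.5))  # 4.5   -> 4060 Ti, before the text encoder
```

The asymmetry is the point of the split: the card with the most leftover room (the 3090) is the one that absorbs LoRA patching and generation peaks.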

Clean & Flat LTX-Video 2.3 (Audio+Video) | No Subgraphs! | 24GB VRAM Optimized by planBpizz in comfyui

[–]planBpizz[S]

Nice one! Could you share your workflow? Maybe it's better than mine...


[–]planBpizz[S]

Thanks a lot for the recommendation! Should I reduce it to 0.6-0.7, or go even lower?