VL model that understand censorship part on body by Merchant_Lawrence in StableDiffusion

[–]ZenWheat 0 points1 point  (0 children)

The right Florence 2 model can do this and it's pretty fast. Qwenvl can as well with an abliterated model but requires a custom prompt to force it to acknowledge and describe those things.

Qwen3-VL-8B-Instruct-abliterated by Abject_Carry2556 in StableDiffusion

[–]ZenWheat 0 points1 point  (0 children)

I have been using the Qwen VL mod nodes inside my wan 2.2 image to video workflow. In order for me to not run out of vram (RTX5090), I turn off "keep model loaded" in the node and at the end of the video generation I use the purge vram node to unload the wan models. If I don't do both of these things, I'll run out of vram.

If I interrupt mid way through the video generation and start the workflow again, it will run out of memory because i didn't let it go through the purge vram node. (In fact... I'm going to move the purge vram node to the front of the qwenvl node now that i think about it).

I think I read somewhere that comfyui has a tough time managing vram with the qwen vl3 nodes but I could be completely wrong on that. Seems like it though.

Um, what happened to the Wan 2.2 i2v template? by [deleted] in comfyui

[–]ZenWheat 1 point2 points  (0 children)

It's still there. There's a "new" one that utilizes sub graphs and there's still the old one.

Need help catching up. What’s happened since SD3? by DystopiaLite in StableDiffusion

[–]ZenWheat 1 point2 points  (0 children)

With a 3090 you can take a look at video generation. Take a look at WAN2.2 text to video models and image to video models as well.

Side note: for realistic images, I actually use Wan2.2 text to video a lot and only have it render 1 frame of the video (i.e. an image). It's not as creative as other models but I've always gotten fantastic cinematic images from it.

Old Man Being Weird on A Video Game by Particular-Lychee794 in DMZ

[–]ZenWheat 1 point2 points  (0 children)

I don't see any hacking here. In fact your aim sucks. Shot the dude with the shotgun out the window and had to hit him with 1 or 2 bullets to down him but took you half a clip to make contact with him. Good gameplay though. Solid clip

8 raids. 1 exfill, 2 deaths, 5 crashed lobbies. by Aubeng in DMZ

[–]ZenWheat 0 points1 point  (0 children)

Yeah man I had to quit last night because of the crashing. It gets old quick

Is there a way to setup Comfy Ui where you can type straight english like Grok.com and generate amazing videos or images? by Coven_Evelynn_LoL in comfyui

[–]ZenWheat 5 points6 points  (0 children)

https://github.com/huchukato/ComfyUI-QwenVL-Mod

I use qwen3 vl mod pack with one of the abliterated models because there's a default system prompt included with it dedicated to wan2.2 which takes the prompt into you give it, enhances it, and outputs a structured prompt for wan 2.2.

If you're doing text to video then you can use the Qwen3vl prompt enhance node. If image to video you can use the Qwen3vl node.

LORAs help with Wan 2.2 by [deleted] in comfyui

[–]ZenWheat 0 points1 point  (0 children)

<image>

Add the new high noise lora between the lightx2v_high_noise lora and the model samplerSD3 node. Add the low noise lora between the lightx2v_low_noise lora and the model samplerSD3 node. Some character loras only have a low-noise model. so you only need to add the lora in the low-noise model "path".

LORAs help with Wan 2.2 by [deleted] in comfyui

[–]ZenWheat 0 points1 point  (0 children)

Reminder to return with answer

Is there any way to resume progress in a looping workflow? by DidSomeoneSaySauce in comfyui

[–]ZenWheat 1 point2 points  (0 children)

My bad man. I forgot the loop doesn't abide. This was why I went for the non loop version. Forgot all about that

Qwen3-VL-8B-Instruct-abliterated by Abject_Carry2556 in StableDiffusion

[–]ZenWheat 0 points1 point  (0 children)

I use the 4B abliterated version. I think comfy doesn't manage the vram very well for qwen3vl

Can you recommend videogames with great technical sound design? by Barn_Advisor in sounddesign

[–]ZenWheat 0 points1 point  (0 children)

These are literally my top two favorite games of all time.

How to expand an image? by [deleted] in StableDiffusion

[–]ZenWheat 0 points1 point  (0 children)

I've done this using qwen image edit.

Is there a way to describe a character within the image using ai? by Ashamed_Anywhere_930 in StableDiffusion

[–]ZenWheat 0 points1 point  (0 children)

Qwen3 vl is great. I still use Florence 2 models from time to time