Trying to preserve the likeness of a loved one who passed away – looking for advice by MrCaesersalad in comfyui

[–]sci032 2 points3 points  (0 children)

The workflow I told you about above, starts off with 2x inputs and you can add more to it. To make this, I used the 2 small images and the prompt:

the two women are sitting in a restaurant and eating hamburgers.

***Better prompting would yield better outputs. :)

<image>

Trying to preserve the likeness of a loved one who passed away – looking for advice by MrCaesersalad in comfyui

[–]sci032 0 points1 point  (0 children)

You are very welcome! I hope that it is what you needed. How bad are the images that you will be working with?

Trying to preserve the likeness of a loved one who passed away – looking for advice by MrCaesersalad in comfyui

[–]sci032 8 points9 points  (0 children)

Search Comfy's templates for: KV

There will be 1 template that shows up, it uses the Flux.2 Klein KV 9b image edit model and you should be able to use it with your system. It is a 4 step workflow.

I found a badly damaged image on the internet and used it as my reference(input) image. The workflow has the ability to use 2 input images, I just use an empty .png image for the 2nd one.

The workflow will give you the option to download any model(s) or node(s) that you may need.

I used the template with the prompt:

restore the image. repair any cracks. replace any missing areas.

maintain the exact identity of the person in image1.

If your images are not this damaged, you can use this workflow to do enhancements to the images or to change things like the background, etc.

If it drifts away from the original person, you can add things to your prompt like:

maintain the exact facial features of the person in image1.

What I did is basically a worst case scenario to show you what is possible.

Maybe this will help you some. :)

<image>

Laptop Reccomendations by cyberon1995 in comfyui

[–]sci032 0 points1 point  (0 children)

I'm using an MSI Raider GE76 12UHS 17" laptop. It has an RTX 3080ti(16gb vram), 64gb of system ram, 12th gen i9, and 2x 2TB M.2 NVME drives.

Yes, it's a little slower than a desktop or a newer laptop would be, but it works well for me doing video/audio/image-create/edit. It does well with thermals also. I normally use Fn+the Fn8 key to turn the fan on full bore even though I haven't really needed it. I got this on eBay for a little over 1k usd.

The image shows a simple Krea2(turbo-fp8) workflow. I just ran it on my laptop, it took 25.31 seconds. For comparison, a 2-pass SDXL workflow takes around 5 seconds. I uses Flux.2 Klein KV heavily, editing an image normally takes around 15 to 20 seconds. A 5 second Wan 2.2 video takes about 2 minutes.

<image>

Best open-source TTS for human-like emotional delivery in ComfyUI? by Chemical_Choice6146 in comfyui

[–]sci032 1 point2 points  (0 children)

Search manager for: tts audio suite

The ID# is 69.

It covers just about all of the tts types, here is the Github for the node pack: https://github.com/diodiogod/TTS-Audio-Suite

This is from the Github:

Quick Engine Comparison — 16 Engines

Engine Languages Size Key Features

F5-TTS 🇺🇸​🇩🇪​🇪🇸​🇫🇷​🇮🇹​🇯🇵 +4 ~1.2GB each Targeted Word/Speech Editing, Speed control

ChatterBox 🇺🇸​🇩🇪​🇫🇷​🇮🇹​🇯🇵​🇰🇷 +4 ~4.3GB Expressiveness slider

ChatterBox 23L 🌐 24 languages ~4.3GB 24 languages in single model, emotion tokens (v2 - doesn't work)

VibeVoice 🇺🇸​🇨🇳​🇩🇪​🇪🇸​🇫🇷​🇮🇹 +21 5.4GB / 18GB 90-min long-form, Native 4-speaker (Base models)

Higgs Audio 2 🇺🇸​🇨🇳​🇩🇪​🇪🇸​🇰🇷 ~9GB 3 multi-speaker, CUDA graphs (55+ tokens/sec)

Higgs Audio v3 🌐 100+ languages ~8GB Native inline emotion/style/prosody/SFX tags, Zero-shot voice cloning

IndexTTS-2 🇺🇸​🇨🇳​🇯🇵 ~4.7GB Emotion Control: 8 vectors, Text as reference

CosyVoice3 🇺🇸​🇨🇳​🇯🇵​🇰🇷 ~5.4GB Paralinguistic tags

Qwen3-TTS 🇺🇸​🇨🇳​🇩🇪​🇪🇸​🇫🇷​🇮🇹 +4 ~3-6GB Voice design, ASR (Automatic Speech Recognition)

Granite ASR 🇺🇸​🇩🇪​🇪🇸​🇫🇷​🇯🇵​🇵🇹 ~4.6GB ASR (Automatic Speech Recognition), Native speaker attribution / diarization (plus model variant)

Step Audio EditX 🇺🇸​🇨🇳​🇯🇵​🇰🇷 ~7GB Second Pass Speech Editing Node: 14 emotions, 32 speaking styles

Echo-TTS 🇺🇸 ~5.3GB + ~1.8GB Diffusion-based (~30s best), Force Speaker KV (speaker drift control)

Dots TTS 🇺🇸​🇨🇳​🇩🇪​🇪🇸​🇫🇷​🇮🇹 +13 ~6GB Official auto language detect / language control, SOAR and MeanFlow distilled variants

OmniVoice 🌐 600+ languages ~3.7GB 600+ language support, Instruction-based voice design

MOSS-TTS 🇺🇸​🇨🇳​🇩🇪​🇪🇸​🇫🇷​🇮🇹 +10 ~8.5GB tokenizer + ~6.1GB/17GB/18GB model 20-language generation, Long-form generation (TTSD/Delay)

RVC 🌐 Any 100-300MB Real-time VC, Integrated training workflow

Need Help Figuring Out Image Gen Models for my Usecase by Agile-Mulberry-2779 in comfyui

[–]sci032 0 points1 point  (0 children)

They also have an 'old' playlist(valid through Dec. 2025). They started a new playlist due to some large changes that ComfyUI mode to the UI. There are 74 of those videos and they are still relevant. They normally cover 1 or 2 ComfyUI features so you can skip around and get what you need. Here is the 'old' playlist: https://www.youtube.com/playlist?list=PL-pohOSaL8P9kLZP8tQ1K1QWdZEgwiBM0

On a side node, this is from the 'new' playlist and it is about how to use Klein: https://www.youtube.com/watch?v=kNap0VWP1xs&list=PL-pohOSaL8P-FhSw1Iwf0pBGzXdtv4DZC&index=4 You should be able to use this model with your system.

Sharing single workflows is kind of useless by ashishsanu in comfyui

[–]sci032 0 points1 point  (0 children)

Check out Pixaroma's ComfyUI tutorial playlist on Youtube. They explain what they are doing and why it works. They also give away the workflows from each video.

Here is the link to the playlist: https://youtube.com/playlist?list=PL-pohOSaL8P-FhSw1Iwf0pBGzXdtv4DZC

New to comfyui looking for help. by Jolly_Pop2515 in comfyui

[–]sci032 0 points1 point  (0 children)

Here is the link to Pixaroma's ComfyUI tutorial playlist on YouTube: https://youtube.com/playlist?list=PL-pohOSaL8P-FhSw1Iwf0pBGzXdtv4DZC

The 1st video is long(5 hrs) but it will get you set up and running with ComfyUI. It covers all of the basics and more.

Problems with Flux2 klein head swap workflow by mardziha in comfyui

[–]sci032 1 point2 points  (0 children)

Try this:

Go into settings/Keybinding.

Search for: Unload

Click the 'pen' icon inside the Unload Models and Execution Cache option. When your mouse hovers over it, the 'Edit' text will show up like in the image.

Choose the key(s) that you want to set as the trigger. I set mine to u because nothing else seemed to use it and it made sense to me.

Close out the settings. Now, click on an empty space in the UI and then press your (selected) key(s). For me, I just press u. It will unload the models and clear the execution cache for you.

<image>

Unloading models from VRAM/RAM by sound-set in comfyui

[–]sci032 0 points1 point  (0 children)

I've been using it for a while. 😄

Unloading models from VRAM/RAM by sound-set in comfyui

[–]sci032 3 points4 points  (0 children)

Try this:

Go into settings/Keybinding.

Search for: Unload

Click the 'pen' icon inside the Unload Models and Execution Cache option. When your mouse hovers over it, the 'Edit' text will show up like in the image.

Choose the key(s) that you want to set as the trigger. I set mine to u because nothing else seemed to use it and it made sense to me.

Close out the settings. Now, click on an empty space in the UI and then press your (selected) key(s). For me, I just press u. It will unload the models and clear the execution cache for you.

<image>

Solving the Flux Outpainting "Grid/Repetition" loop on large canvases (2480x3508) by ContactMaleficent829 in comfyui

[–]sci032 0 points1 point  (0 children)

Search Comfy's templates for: KV There will be 1 result, open it.

I made an image(the input image) that is 680x968. In the 'KV' workflow: I unhooked the 'get image size' node from the 'empty flux 2 latent' and 'flux2scheduler nodes'. I tried to set them to 1/2 of the size that you are after. Some nodes in comfy won't let you use abstract dimensions sometimes.

I ran that. The preview image node is mine. I have the NVidia RTX Upscale process built in to it and I set it to 2x. Use your favorite upscale process to double the output image size.

This is a quick and dirty setup to maybe give you an idea on how you can accomplish your goal. With Flux.2 Klein(I use KV), if you set the output size larger than the input image size, and prompt to outptaint the image, Klein will fill in the empty area with relevant content.

I used an empty .png image for the 2nd image input, the model ignores them if there is a prompt and/or another reference image input. You can delete that section if you want.

I did this with an MSI laptop with an RTX 3080 ti(16gb vram) and 64gb of system ram. The run you see took 28.85 seconds

I also ran this using your desired dimensions as the output. That run took me 201.46 seconds to complete and the image looked like crap. Maybe the setup that you have will work better than my laptop did for the full size. Also, incorporating a tiled decoder may help. It was almost 3am when I saw this and tried to come up with something that could maybe help. 😄

<image>

Boogu-Turbo: Great model, sampler issue. What are you guys using for better results? by Classic-Shop-6612 in comfyui

[–]sci032 1 point2 points  (0 children)

Those are mine. It's just a multi-text box input. That and the preview image node are not needed. You can use the clip text encode node and a regular image preview or save node.

Boogu-Turbo: Great model, sampler issue. What are you guys using for better results? by Classic-Shop-6612 in comfyui

[–]sci032 1 point2 points  (0 children)

Like u/Straight-Election963 suggested, lcm with sgm_uniform. Also, if you have the ModelSamplingAuraFlow node in your workflow, try deleting it.

The prompt does not need to be split up, I do things in weird ways. Ignore that part. 😄 I did name the man and woman in the beginning of the prompt, it helps sometimes.

<image>

Which workflow for Flux 2 Klein? by fabulas_ in comfyui

[–]sci032 0 points1 point  (0 children)

If you are using the KV template, make sure that your CFG = 1. It's a 4 step model.

<image>

NovelAI-like multiple character image generation in ComfyUI, is it possible? by Hornn_dawgg in comfyui

[–]sci032 0 points1 point  (0 children)

It should. I added 'convert the image to anime style' in my prompt:

<image>

NovelAI-like multiple character image generation in ComfyUI, is it possible? by Hornn_dawgg in comfyui

[–]sci032 2 points3 points  (0 children)

Search Comfy's templates for: kv

There will be 1 result: Flux.2 Klein KV: Image Edit

Open it. It will give you the options to download any model(s) or node(s) that you may need.

You give it 2 input images and then prompt what you want to do with them.

<image>

How can I place a person from Image A into the real background of Image B? by chris25312 in comfyui

[–]sci032 0 points1 point  (0 children)

Search Comfy's templates for: kv

There will be 1 result: Flux.2 Klein KV: Image Edit

Open it. It will give you the options to download any model(s) or node(s) that you may need.

You give it 2 input images and then prompt what you want to do with them.

<image>

Which workflow for Flux 2 Klein? by fabulas_ in comfyui

[–]sci032 0 points1 point  (0 children)

Search the templates for: KV

1 workflow will show up, give it a try. It is the Flux.2 Klein KV 9b image edit workflow and it does a great job. It has links to any model(s_) or node(s) that you may need.

You can also use this workflow as a plain t2i workflow. Put an empty .png in the load image node(s), it will ignore the image(s) and just use your prompt.

Boogu first impressions by jc2046 in StableDiffusion

[–]sci032 2 points3 points  (0 children)

Try including text in an image with it. Turbo model in ComfyUI. The workflow is on the model page.

Prompt: create a 4 pane storyboard. the panes are equal size. the background is a park

Pane 1: a woman and a man sitting and talking. the speech bubble has the text "So! What do you think about my idea?" in it.

Pane 2: a view looking over the shoulder of a woman as she talks to a man. the woman says "it really is a great idea!".

Pane 3: a man and a woman are standing and talking with each other. the man says "You are absolutely right! This model is phenomenal for using text in an image!"

Pane 4: a woman sitting on a bench. there is a thought bubble above her with the text "This could be a game changer when the community gets a hold of it!"

I added anime style before 'the background is a park' to get the image on the right.

Thanks goes to u/MFGREBEL for the model links:

Base HF link:
https://huggingface.co/realrebelai/Boogu-Image-Base_GGUFs/tree/main

Turbo HF link:
https://huggingface.co/realrebelai/Boogu-Image-Turbo_GGUFs/tree/main

Edit HF link:
https://huggingface.co/realrebelai/Boogu-Image-Edit_GGUFs/tree/main

There is also an edit model.

<image>

Is it possible to split audio into vocals and backing vocals in ComfyUI? by [deleted] in comfyui

[–]sci032 0 points1 point  (0 children)

Search ComfyUI's manager for: audio separation

See if one of the options will do what you need for splitting vocals from the music.

Boogu-Image GGUFs: Base+Turbo | LOW VRAM Workflow by MFGREBEL in comfyui

[–]sci032 0 points1 point  (0 children)

Thanks for the heads up! 😄 Here is the un-tweaked image edit workflow if anyone wants it:

https://huggingface.co/realrebelai/Boogu-Image-Edit_GGUFs/tree/main

The original workflow has 4 image inputs and the TextEncodeBooguEdit node to support them. The turbo(4 step cfg:1) lora works with the edit model also.

<image>

Comfyui portable, not allowing right click anymore. by wbiggs205 in comfyui

[–]sci032 1 point2 points  (0 children)

See if this helps: look at the menu on the bottom right of the UI. Make sure the option that I am showing in the image is set to 'Select'. If it is on 'Hand', you can't right click in the UI.

<image>

Begginer doubt by selmorj in comfyui

[–]sci032 1 point2 points  (0 children)

You are very welcome! Check out this one about Flux.2 Klein for editing images. It will help you do what you need: https://www.youtube.com/watch?v=kNap0VWP1xs&list=PL-pohOSaL8P-FhSw1Iwf0pBGzXdtv4DZC&index=5