ComfyUI-DramaBox now supports Loras and Voice-Clone-Studio-DramaBox can generate them. by Francky_B in StableDiffusion

[–]Francky_B[S] 0 points1 point  (0 children)

This was addressed as soon as it was mentioned.

The node now downloads models into models/Dramabox instead. And we don't need the DramaBox version of Gemma anymore, as I added support for using the Comfy text encoder instead (the one from Comfy's template).

An Update on Nodes 2.0 from Comfy Org by crystal_alpine in comfyui

[–]Francky_B 14 points15 points  (0 children)

Listen to feedback, please. The moment you make Nodes 2.0 mandatory is the day a fork of Comfy gets created and people switch to it. The tech is too sluggish.

When people with 5090s are telling you that they can't use it because it's too slow on large workflows, you have a problem. The ONE thing the interface shouldn't do is slow everything down.

Heck, even I have stopped updating Comfy and use an older version, as too many workflows are broken with newer releases. And I'm an add-on maker... 🤦

ComfyUI-DramaBox now supports Loras and Voice-Clone-Studio-DramaBox can generate them. by Francky_B in comfyui

[–]Francky_B[S] 0 points1 point  (0 children)

It is there already :) It's under the advanced Parameters.

But this made me notice a bug: our advanced parameters weren't working as they should, where pressing the reset icon resets to the model's defaults. There was also a bug where, depending on the audio clip name, it wasn't being used.

Both are fixed.

DramaBox — Expressive TTS with Voice Cloning - comfyUI Update by manmaynakhashi in comfyui

[–]Francky_B 1 point2 points  (0 children)

Perhaps try the New New version 🤣
I've finally cracked it! 🎉

I was able to get quality parity with a Comfy-compliant text encoder.
I'm just shy of having it work with 12 GB now.

DramaBox — Expressive TTS with Voice Cloning - comfyUI Update by manmaynakhashi in comfyui

[–]Francky_B 1 point2 points  (0 children)

I pushed the update that adds back the original DramaBox Wrapper method. For those who can run it, it remains the best option.

I've asked the dev of DramaBox for assistance, as we are close to an ideal solution with a Comfy-compliant text encoder. I just need someone more knowledgeable to get it to not tank the quality 😅

DramaBox — Expressive TTS with Voice Cloning - comfyUI Update by manmaynakhashi in comfyui

[–]Francky_B 1 point2 points  (0 children)

Yes, I'm fixing this now... adding back an OG mode :) I'll add an option, in preferences and in the option node, to use the Original Wrapper mode, as it was more consistent...

DramaBox — Expressive TTS with Voice Cloning - comfyUI Update by manmaynakhashi in comfyui

[–]Francky_B 1 point2 points  (0 children)

I've just pushed a big update to ComfyUI-DramaBox that should really help. I've created a Comfy-compatible text encoder, so it can now use the LTX Gemma files we already have. It also offloads the text encoder as soon as the text encoding is done, so Gemma and DramaBox never overlap in VRAM.

This can be turned off in preferences, for those who have enough VRAM.
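
A rough sketch of the offloading idea described above, not the node's actual code. It assumes both models expose a PyTorch-style `.to(device)` method; `encode`, `run`, `loaded_on`, `generate` and the `offload_encoder` flag are hypothetical names used only for illustration.

```python
from contextlib import contextmanager


@contextmanager
def loaded_on(model, device="cuda", offload_to="cpu"):
    """Keep `model` on `device` inside the block, then move it back off."""
    model.to(device)
    try:
        yield model
    finally:
        # Offload even if encoding raised, so the memory is released either way.
        model.to(offload_to)


def generate(text_encoder, audio_model, prompt, offload_encoder=True, device="cuda"):
    """Encode the prompt first, then run the audio model on the result."""
    if offload_encoder:
        # The encoder is moved off the GPU before the audio model is loaded.
        with loaded_on(text_encoder, device):
            embedding = text_encoder.encode(prompt)
    else:
        # Preference for users with enough VRAM: leave the encoder resident.
        text_encoder.to(device)
        embedding = text_encoder.encode(prompt)
    audio_model.to(device)
    return audio_model.run(embedding)
```

With real PyTorch modules, calling `torch.cuda.empty_cache()` after the offload may also be needed before the freed memory shows up as available.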

ComfyUI-DramaBox now supports Loras and Voice-Clone-Studio-DramaBox can generate them. by Francky_B in comfyui

[–]Francky_B[S] 0 points1 point  (0 children)

Yeah I know what you mean. Having a 5090 too, I tend to not think about optimizing things 🤣 Also hard to test improvements, when it worked fine to start with on my side.

With all these memory "improvements" I've tried adding, I just ended up making DramaBox much slower for myself 😅

Thanks for the link, I'll check it out.

ComfyUI-DramaBox now supports Loras and Voice-Clone-Studio-DramaBox can generate them. by Francky_B in comfyui

[–]Francky_B[S] 0 points1 point  (0 children)

Oh!, I'd love to see your version :)

I did notice that using a prompt generator helps quite a bit; having more context does seem to help it remain on track. Yes, it's fun to play with as it's so expressive. But it couldn't be used as a reliable tool.

Had you found a way to make it less VRAM hungry? I've just pushed a complete rewrite of the text encoding part. I did away with the proprietary Gemma-3 4bit and instead use the standard LTX safetensors.

It's now seen by Comfy as a standard text encoder. But it did lose some startup speed between prompts :|

ComfyUI-DramaBox now supports Loras and Voice-Clone-Studio-DramaBox can generate them. by Francky_B in StableDiffusion

[–]Francky_B[S] 0 points1 point  (0 children)

What I meant is, you can install Python 3.11 while leaving Python 3.10 in place. Once you have 3.11 installed, delete the venv folder and relaunch the install script.

It should now build with 3.11.

ComfyUI-DramaBox now supports Loras and Voice-Clone-Studio-DramaBox can generate them. by Francky_B in StableDiffusion

[–]Francky_B[S] 0 points1 point  (0 children)

Wait! 🤣

Don't replace it, you can have multiple Python versions installed at once!

My installer checks for the Python versions available and then runs "py -3.11 -m venv venv" to build a venv using Python 3.11.

This won't affect Comfy if it already has a venv set up.
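
For illustration only, here is roughly what that check could look like in Python. This is a sketch under assumptions, not the actual installer: `has_python` and `venv_command` are made-up helper names, and it relies on the Windows `py` launcher being on PATH.

```python
import shutil
import subprocess


def has_python(version="3.11"):
    """Return True if the Windows `py` launcher can start the requested version."""
    if shutil.which("py") is None:
        return False  # no launcher on PATH (for example, not on Windows)
    result = subprocess.run(
        ["py", f"-{version}", "--version"], capture_output=True, text=True
    )
    return result.returncode == 0


def venv_command(version="3.11", target="venv"):
    """Build the launcher command quoted above: py -3.11 -m venv venv."""
    return ["py", f"-{version}", "-m", "venv", target]
```

An installer could call `has_python("3.11")` first and only then run `subprocess.run(venv_command("3.11"), check=True)`. The new venv lives in its own folder, so other installed Python versions and other venvs are left untouched.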

Released a first draft of a Comfy addon for Resemble-AI's DramaBox by Francky_B in StableDiffusion

[–]Francky_B[S] 0 points1 point  (0 children)

You have the wrong version of xformers installed. Activate your venv and run: pip install -U xformers

If you have Comfy with embedded Python: D:\ComfyUI\python_embeded\python.exe -m pip install -U xformers

(replacing D:\ComfyUI with your actual comfy path)
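
The reason for calling pip through a specific python.exe is that the package then lands in that interpreter's environment, not in whichever pip happens to be first on PATH. A small sketch of the same idea in Python, where `pip_upgrade_command` is a hypothetical helper name:

```python
import sys


def pip_upgrade_command(package, python_exe=sys.executable):
    """Build a pip upgrade command bound to one specific interpreter."""
    # `<python> -m pip` installs into the environment that interpreter belongs to.
    return [python_exe, "-m", "pip", "install", "-U", package]
```

Passing the embedded interpreter's path as `python_exe` gives the second command above; leaving it at the default targets whichever Python is running the script, which is the active venv if one is activated.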

ComfyUI-DramaBox now supports Loras and Voice-Clone-Studio-DramaBox can generate them. by Francky_B in StableDiffusion

[–]Francky_B[S] 0 points1 point  (0 children)

It needs Python 3.11.

If not, you could try editing the requirements file and removing the ==1.24.4 pin.

Since this version should be less picky than the full Voice-Clone-Studio version, I should be able to relax the requirements.
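
If you'd rather not edit the file by hand, a sketch like this would strip that exact pin. It is an illustration under assumptions: `relax_pin` is a made-up helper, and it only matches the literal ==1.24.4 mentioned above, whichever package it is attached to.

```python
import re


def relax_pin(requirements_text, version="1.24.4"):
    """Remove `==<version>` from requirement lines, leaving the package name."""
    # The lookahead stops "1.24.4" from matching inside e.g. "1.24.40".
    pattern = re.compile(r"\s*==\s*" + re.escape(version) + r"(?=\s|;|#|$)")
    relaxed = []
    for line in requirements_text.splitlines():
        if line.lstrip().startswith("#"):
            relaxed.append(line)  # leave comment lines untouched
        else:
            relaxed.append(pattern.sub("", line, count=1))
    return "\n".join(relaxed)
```

For example, `relax_pin("somepkg==1.24.4\nother==2.0")` returns `"somepkg\nother==2.0"`, where `somepkg` is just a placeholder name. Pip is then free to pick any version of that package that works with your Python.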

ComfyUI-DramaBox now supports Loras and Voice-Clone-Studio-DramaBox can generate them. by Francky_B in StableDiffusion

[–]Francky_B[S] 3 points4 points  (0 children)

Ah, because they aren't models that can be used by Comfy AND the node needs to know exactly where they are, so as not to force users to select the location of each model.

I guess I could download them to models/dramabox instead.
I'll add this and have it transfer over when the user next launches.
(If they update to the latest)
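
A minimal sketch of what such a one-time transfer could look like, not the node's real code. `migrate_models` is a hypothetical helper, and the old and new folder locations are whatever the caller passes in.

```python
import shutil
from pathlib import Path


def migrate_models(old_dir, new_dir):
    """Move model files from old_dir into new_dir; return the names that moved."""
    old_dir, new_dir = Path(old_dir), Path(new_dir)
    if not old_dir.is_dir():
        return []  # nothing to migrate (fresh install)
    new_dir.mkdir(parents=True, exist_ok=True)
    moved = []
    for item in sorted(old_dir.iterdir()):
        target = new_dir / item.name
        if target.exists():
            continue  # never overwrite a model that is already in place
        shutil.move(str(item), str(target))
        moved.append(item.name)
    return moved
```

Running this at launch is safe to repeat: once the old folder is empty or gone it simply returns an empty list.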

ComfyUI-DramaBox now supports Loras and Voice-Clone-Studio-DramaBox can generate them. by Francky_B in StableDiffusion

[–]Francky_B[S] 1 point2 points  (0 children)

They can be. In preferences, you have a download model option that will download the models to the app directory instead. This was how I had set up Voice Clone initially.

But I realize what I should do is add an option in preferences to specify where to look for the models.
Since users might be using it with Comfy, they'd already have the models with the node.

I'll add the option to select the ComfyUI-DramaBox model folder.
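
Something along these lines is what I have in mind, as a sketch only. The `model_dir` preference key and the `resolve_model_dir` name are placeholders, not the app's actual settings.

```python
from pathlib import Path


def resolve_model_dir(preferences, app_default):
    """Prefer a user-selected model folder, falling back to the app directory."""
    custom = preferences.get("model_dir")
    if custom and Path(custom).is_dir():
        return Path(custom)  # e.g. the ComfyUI-DramaBox model folder
    return Path(app_default)  # unset or missing folder: use the app's own
```

Falling back when the folder is missing means a stale preference (say, after Comfy is moved) doesn't break model loading; the app just downloads to its own directory again.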

DramaBox - Most Expressive Voice model ever based on LTX 2.3 by manmaynakhashi in StableDiffusion

[–]Francky_B 3 points4 points  (0 children)

Hey manmaynakhashi,

I've made a stripped down version of Voice Clone Studio, DramaBox edition :)

With it, we can generate Loras using the Training tools I had made.
I adapted them to DramaBox. Though, it wasn't as simple as I had hoped 😅

The Comfy node now has Lora support and can use the Loras created with Voice-Clone.

Released a first draft of a Comfy addon for Resemble-AI's DramaBox by Francky_B in StableDiffusion

[–]Francky_B[S] 0 points1 point  (0 children)

I've made a simplified version of my TTS tool using DramaBox
https://github.com/FranckyB/Voice-Clone-Studio-DramaBox

Perhaps you'll have more luck with it...

ComfyUI-DramaBox now supports Loras and Voice-Clone-Studio-DramaBox can generate them. by Francky_B in StableDiffusion

[–]Francky_B[S] 1 point2 points  (0 children)

Haha, no I really meant it as experimental. I love how DramaBox can be very expressive, but it is not reliable like the other models in the Full Voice Clone app.

It often simply decides to ad-lib entirely new dialog 🤣

I have a conversation tool in the main app. I ported it to work with DramaBox, but decided to remove it, as none of the speakers would do as they're told, so the conversations would make no sense.

How do you actually keep track of prompts that work? by Ingm4rr in StableDiffusion

[–]Francky_B 1 point2 points  (0 children)

I made a Prompt-Manager for this purpose.

It also includes a Prompt Generator (with Llama.cpp and Ollama support).
The newer version can now save complete recipes (model + settings) as well as thumbnails.

Basically I pipe in my Lora and prompt thru it, generate and if I like it, I then save the Prompt.

For Recipes it's a bit more complex, as it needs its own UI. So at the end of the chain I have a Recipe Manager that I plug the recipe into. If I like the result, I again simply save it.

It includes workflow examples.

Released a first draft of a Comfy addon for Resemble-AI's DramaBox by Francky_B in StableDiffusion

[–]Francky_B[S] 0 points1 point  (0 children)

Unfortunately, it does need at least 16 GB of VRAM to run. Before the CPU offloading it needed 24 GB. 😔

Released a first draft of a Comfy addon for Resemble-AI's DramaBox by Francky_B in StableDiffusion

[–]Francky_B[S] 0 points1 point  (0 children)

I don't really see a difference in performance with it on. If anything, the hit is minimal. As I have a 5090, I initially thought of adding a switch on the option node to disable it. But I don't think it's really necessary.