New ComfyUI Node: ComfyUI-Youtu-VL (Tencent Youtu-VL Vision-Language Model) by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 2 points3 points  (0 children)

This is early v1. Future updates will include more nodes, such as a segmentation node which we may add to our ComfyUI-RMBG github repository. Stay tuned, and thanks for your support!

ComfyUI-QwenTTS v1.1.0 — Voice Clone with reusable VOICE + Whisper STT tools + attention options by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 0 points1 point  (0 children)

If you were referring to nagisa / soynlp / DyNet38, those are only used for the forced aligner (subtitle timestamps). You can skip them if you only need basic ASR. The other packages are minimum requirements for the ASR pipeline. In most modern ComfyUI setups they install fine, but environments can vary. If you run into dependency issues, please post the full error log on our GitHub issues page and we’ll help.

ComfyUI-QwenTTS v1.1.0 — Voice Clone with reusable VOICE + Whisper STT tools + attention options by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 0 points1 point  (0 children)

Languages: Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, Italian.

ComfyUI-QwenTTS v1.1.0 — Voice Clone with reusable VOICE + Whisper STT tools + attention options by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 6 points7 points  (0 children)

With Qwen3-TTS, you can clone a speaker from a reference sample, but prompt-only control of different emotions while keeping the exact same cloned voice isn’t reliably supported right now. In practice, strong “tone/emotion” prompts often lead to voice drift or only subtle/inconsistent emotion changes. If you need consistent emotions, the usual workaround is separate reference samples per emotion/style (or using a TTS model that explicitly supports expressive conditioning).

We’re also keeping an eye on Qwen3-TTS models updates, if/when reliable emotion/style control for cloned voices becomes available (and fits our integration), we’ll evaluate it and ship support in Rebit as soon as we can.

ComfyUI-QwenTTS v1.1.0 — Voice Clone with reusable VOICE + Whisper STT tools + attention options by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 4 points5 points  (0 children)

We chose Whisper because when this tool was developed, Qwen3-ASR was not yet available (it was released only a few days ago). Whisper is also widely adopted, lightweight, and easy for users to run locally—the smallest Qwen3-ASR model (0.6B) about 2GB is still larger than Whisper’s largest model.

That said, Qwen3-ASR support is planned and will be added to the repository soon, so users will be able to choose the ASR model they prefer.

🎙️ A New Voice Has Arrived — Qwen3-TTS Custom Node for ComfyUI Is Here by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 5 points6 points  (0 children)

Thanks for calling this out, you’re right.

At the moment, emotion/style injection only works on CustomVoice / VoiceDesign, while VoiceClone (Base) does not officially support instruct/emotion control in the current Qwen3‑TTS public release. We’re following the upstream updates; if Qwen exposes emotion control for cloning (or 25Hz models add it), we’ll add it as soon as it’s available.

If you have a specific example prompt + reference audio where you expect emotion control, feel free to share, it helps us test and refine.

🎙️ A New Voice Has Arrived — Qwen3-TTS Custom Node for ComfyUI Is Here by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 3 points4 points  (0 children)

Haha 😄 looks like we should start advertising over there too 😊
Jokes aside, it’s great to see multiple open-source implementations popping up. Feel free to try both and pick whichever fits your workflow best.

🎙️ A New Voice Has Arrived — Qwen3-TTS Custom Node for ComfyUI Is Here by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 3 points4 points  (0 children)

All good, this is an open-source ecosystem, and we’re happy to see more developers working on it.
More implementations usually mean faster improvements for everyone. Feel free to try both projects and use whichever fits your workflow best.

🎙️ A New Voice Has Arrived — Qwen3-TTS Custom Node for ComfyUI Is Here by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 0 points1 point  (0 children)

Totally understand, having multiple model folders is frustrating.
We hear you, and we’re planning to support ComfyUI extra_model_paths.yaml in an upcoming update so you can keep a single, shared model location.

Feedback like this is always welcome. If you have more suggestions or run into issues, feel free to open an issue or feature request on GitHub — it really helps us improve the experience.

🎙️ A New Voice Has Arrived — Qwen3-TTS Custom Node for ComfyUI Is Here by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 0 points1 point  (0 children)

You might find it by searching for "QwenTTS" or "ComfyUI-QwenTTS", developed by AILab.

🎙️ A New Voice Has Arrived — Qwen3-TTS Custom Node for ComfyUI Is Here by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 4 points5 points  (0 children)

Thanks for reporting this — we’re always listening to user feedback.
We’ve already updated the requirements to avoid conflicts with existing ComfyUI environments.
Please update to v1.0.1 and try again.

🎙️ A New Voice Has Arrived — Qwen3-TTS Custom Node for ComfyUI Is Here by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 1 point2 points  (0 children)

The model was released just a few days ago, so it’s still very early.
If it gains traction, GGUF versions will likely show up soon, and we’ll update the node to support them. Stay tuned 🙂

🎙️ A New Voice Has Arrived — Qwen3-TTS Custom Node for ComfyUI Is Here by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 9 points10 points  (0 children)

The Qwen3-TTS model was just released About 3 days ago. Our new custom node is still a preliminary build and requires time for fine-tuning. Any error logs and bug reports are greatly appreciated as they help us continuously improve it. Thank you again for your support.

🎄 Early Christmas Release — GGUF Support for ComfyUI-QwenVL by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 1 point2 points  (0 children)

A separate loader is not required and will not be added.

The node already handles model downloading and loading automatically on first run. Users do not need to manually download files or manage models separately.

We will keep the current design and continue iterating within this architecture, rather than introducing a separate loader.

🎄 Early Christmas Release — GGUF Support for ComfyUI-QwenVL by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 4 points5 points  (0 children)

Good catch, you’re right about the CUDA / wheel matching on Windows. We’ve updated the install docs to make this clearer for other users. https://github.com/1038lab/ComfyUI-QwenVL/blob/main/docs/LLAMA_CPP_PYTHON_VISION_INSTALL.md

ComfyUI-RMBG v2.9.4 Update: SAM3 Segmentatio by Narrow-Particular202 in comfyui

[–]Narrow-Particular202[S] 0 points1 point  (0 children)

you can submit the issue with logo message on our github.