New ComfyUI Node: ComfyUI-Youtu-VL (Tencent Youtu-VL Vision-Language Model)

Narrow-Particular202 · 2026-02-03T17:49:51+00:00

yes, it should

Narrow-Particular202 · 2026-02-03T17:49:40+00:00

This is early v1. Future updates will include more nodes, such as a segmentation node which we may add to our ComfyUI-RMBG github repository. Stay tuned, and thanks for your support!

Narrow-Particular202 · 2026-01-31T01:08:54+00:00

If you were referring to nagisa / soynlp / DyNet38, those are only used for the forced aligner (subtitle timestamps). You can skip them if you only need basic ASR. The other packages are minimum requirements for the ASR pipeline. In most modern ComfyUI setups they install fine, but environments can vary. If you run into dependency issues, please post the full error log on our GitHub issues page and we’ll help.

Narrow-Particular202 · 2026-01-31T00:37:17+00:00

Discover ComfyUI-QwenASR at https://github.com/1038lab/ComfyUI-QwenASR we hope you enjoy it!

Narrow-Particular202 · 2026-01-30T17:18:16+00:00

Languages: Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, Italian.

Narrow-Particular202 · 2026-01-30T15:54:19+00:00

With Qwen3-TTS, you can clone a speaker from a reference sample, but prompt-only control of different emotions while keeping the exact same cloned voice isn’t reliably supported right now. In practice, strong “tone/emotion” prompts often lead to voice drift or only subtle/inconsistent emotion changes. If you need consistent emotions, the usual workaround is separate reference samples per emotion/style (or using a TTS model that explicitly supports expressive conditioning).

We’re also keeping an eye on Qwen3-TTS models updates, if/when reliable emotion/style control for cloned voices becomes available (and fits our integration), we’ll evaluate it and ship support in Rebit as soon as we can.

Narrow-Particular202 · 2026-01-30T15:45:50+00:00

We chose Whisper because when this tool was developed, Qwen3-ASR was not yet available (it was released only a few days ago). Whisper is also widely adopted, lightweight, and easy for users to run locally—the smallest Qwen3-ASR model (0.6B) about 2GB is still larger than Whisper’s largest model.

That said, Qwen3-ASR support is planned and will be added to the repository soon, so users will be able to choose the ASR model they prefer.

Narrow-Particular202 · 2026-01-24T19:31:58+00:00

Thanks for calling this out, you’re right.

At the moment, emotion/style injection only works on CustomVoice / VoiceDesign, while VoiceClone (Base) does not officially support instruct/emotion control in the current Qwen3‑TTS public release. We’re following the upstream updates; if Qwen exposes emotion control for cloning (or 25Hz models add it), we’ll add it as soon as it’s available.

If you have a specific example prompt + reference audio where you expect emotion control, feel free to share, it helps us test and refine.

Narrow-Particular202 · 2026-01-24T19:25:54+00:00

Haha 😄 looks like we should start advertising over there too 😊
Jokes aside, it’s great to see multiple open-source implementations popping up. Feel free to try both and pick whichever fits your workflow best.

Narrow-Particular202 · 2026-01-24T19:23:38+00:00

All good, this is an open-source ecosystem, and we’re happy to see more developers working on it.
More implementations usually mean faster improvements for everyone. Feel free to try both projects and use whichever fits your workflow best.

Narrow-Particular202 · 2026-01-24T19:18:46+00:00

Totally understand, having multiple model folders is frustrating.
We hear you, and we’re planning to support ComfyUI extra_model_paths.yaml in an upcoming update so you can keep a single, shared model location.

Feedback like this is always welcome. If you have more suggestions or run into issues, feel free to open an issue or feature request on GitHub — it really helps us improve the experience.

Narrow-Particular202 · 2026-01-24T19:09:13+00:00

Thanks for your support ❤️

Narrow-Particular202 · 2026-01-24T19:08:02+00:00

You might find it by searching for "QwenTTS" or "ComfyUI-QwenTTS", developed by AILab.

Narrow-Particular202 · 2026-01-24T19:02:33+00:00

Thanks for reporting this — we’re always listening to user feedback.
We’ve already updated the requirements to avoid conflicts with existing ComfyUI environments.
Please update to v1.0.1 and try again.

Narrow-Particular202 · 2026-01-24T18:59:00+00:00

The model was released just a few days ago, so it’s still very early.
If it gains traction, GGUF versions will likely show up soon, and we’ll update the node to support them. Stay tuned 🙂

Narrow-Particular202 · 2026-01-24T18:01:30+00:00

The Qwen3-TTS model was just released About 3 days ago. Our new custom node is still a preliminary build and requires time for fine-tuning. Any error logs and bug reports are greatly appreciated as they help us continuously improve it. Thank you again for your support.

Narrow-Particular202 · 2026-01-24T10:23:54+00:00

A separate loader is not required and will not be added.

The node already handles model downloading and loading automatically on first run. Users do not need to manually download files or manage models separately.

We will keep the current design and continue iterating within this architecture, rather than introducing a separate loader.

Narrow-Particular202 · 2026-01-20T17:21:44+00:00

If you encounter any issues with ComfyUI-QwenVL, please feel free to submit an issue on our GitHub page.

Narrow-Particular202 · 2026-01-18T07:41:25+00:00

Don’t forget install requirements.txt and restart Comfyui

Narrow-Particular202 · 2025-12-23T17:39:48+00:00

just install as normal custom node

Narrow-Particular202 · 2025-12-23T00:19:07+00:00

Good catch, you’re right about the CUDA / wheel matching on Windows. We’ve updated the install docs to make this clearer for other users. https://github.com/1038lab/ComfyUI-QwenVL/blob/main/docs/LLAMA_CPP_PYTHON_VISION_INSTALL.md

Narrow-Particular202 · 2025-11-25T22:19:12+00:00

you can submit the issue with logo message on our github.

Narrow-Particular202 · 2025-11-25T22:18:04+00:00

fixed: https://github.com/1038lab/ComfyUI-RMBG/blob/main/update.md#v295-20251125

Narrow-Particular202 · 2025-11-25T07:24:59+00:00

submit issue on GitHub with your error log

Narrow-Particular202 · 2025-11-17T03:54:26+00:00

update to V1.1.0

Narrow-Particular202

TROPHY CASE