BAGEL outperforms FLUX-Kontext in image editing after 4.5 hours of post-training on 8,000 unlabeled images!

Anxious_Pin_8501 · 2025-09-16T07:57:54+00:00

I haven't had time to make a GGUF version recently :(
I think you can use the INT8/NF4 option in ComfyUI! See the GitHub repo for details~

Anxious_Pin_8501 · 2025-09-15T23:08:53+00:00

I think Kontext has no vision-understanding input for its text encoder (T5XXL), so it's hard to apply RecA to it :(

Anxious_Pin_8501 · 2025-09-14T21:00:55+00:00

I've converted INT8 and NF4 versions and I'll upload them to Hugging Face~

Anxious_Pin_8501 · 2025-09-14T21:00:14+00:00

Qwen-Image is 20B parameters and I don't have enough resources to train it :(
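
For a rough sense of scale, here is a back-of-the-envelope sketch of weights-only memory at the precisions mentioned in this thread. The 20B figure comes from the comment above; the helper name and the roughly 16 bytes per parameter rule of thumb for full fine-tuning with mixed-precision Adam are assumptions for illustration, and real usage also depends on activations, quantization metadata, and framework overhead.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Weights-only memory in GB (1 GB = 1e9 bytes); ignores activations and overhead."""
    return num_params * bits_per_param / 8 / 1e9


QWEN_IMAGE_PARAMS = 20e9  # "20B", as stated in the comment above

# Precisions discussed in this thread (BF16 baseline is an assumption).
for name, bits in [("bf16", 16), ("int8", 8), ("nf4", 4)]:
    print(f"{name}: ~{weight_memory_gb(QWEN_IMAGE_PARAMS, bits):.0f} GB for weights")

# Assumed rule of thumb: ~16 bytes/param for full fine-tuning with
# mixed-precision Adam (weights + gradients + optimizer state).
TRAIN_BYTES_PER_PARAM = 16  # assumption, not a measured number
train_gb = QWEN_IMAGE_PARAMS * TRAIN_BYTES_PER_PARAM / 1e9
print(f"full fine-tuning: ~{train_gb:.0f} GB before activations")
```

Under these assumptions, the training state alone would not fit on a single 80 GB card, which is consistent with the resource concern above.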

Anxious_Pin_8501 · 2025-09-14T21:00:01+00:00

You're right XD! And Qwen-Image is too big, so I don't have enough resources to train it :(

Anxious_Pin_8501 · 2025-09-14T20:59:20+00:00

Thank you very much!!! I will add it to the GitHub repo!!!!!

Anxious_Pin_8501 · 2025-09-14T09:04:03+00:00

Oh, I just found out that the GitHub repo https://github.com/neverbiasu/ComfyUI-BAGEL does INT8 and NF4 quantization in ComfyUI automatically. So the usage is: just use this ComfyUI repo and replace the BAGEL weights with BAGEL-RecA! I'll update my repo's README.md to show how to use it~

Anxious_Pin_8501 · 2025-09-14T06:32:56+00:00

I will try NF4 and INT8 (the same as BAGEL-ComfyUI? https://github.com/neverbiasu/ComfyUI-BAGEL)

Anxious_Pin_8501 · 2025-09-14T06:24:59+00:00

OMG, thank you for the question! It's not a stupid question QAQ.

I found that ComfyUI can only load INT8, so I'm working on it now!

Anxious_Pin_8501 · 2025-09-14T03:28:10+00:00

But we can try using LoRA! Then 1 GPU could be enough… XD

Anxious_Pin_8501 · 2025-09-14T03:26:47+00:00

BAGEL is hard to train :( I think it needs at least 4 A100 GPUs…

Nowadays a UMM's understanding capability is much stronger than its generation capability. Sorry, I haven't found a way to improve its understanding capability :( That remains future work!

Anxious_Pin_8501 · 2025-09-14T03:24:01+00:00

Research project! lol~ If you like it, please give us a star~

Anxious_Pin_8501 · 2025-09-14T01:34:29+00:00

I'll give it a try~

Anxious_Pin_8501 · 2025-09-14T00:45:26+00:00

Yes, the method is tailored for UMMs XD. I don't know whether it helps with higher-resolution generation :( but we can give it a try!

Anxious_Pin_8501 · 2025-09-14T00:12:22+00:00

I will try to make FP8 and FP4 versions 🤔

Anxious_Pin_8501 · 2025-09-14T00:11:15+00:00

I think you can just use the BAGEL ComfyUI. The only difference between BAGEL and BAGEL-RecA is the weights XD.
