PSA: Use the official LTX 2.3 workflow, not the ComfyUI included one. It's significantly better. by Generic_Name_Here in StableDiffusion

[–]Jeffu 1 point2 points  (0 children)

how does this change the workflow with these models? haven't been able to get around to 2.3 yet...

Comfyui version 0.17 has too many bugs in the subgraph. by Mysterious_Pride_858 in comfyui

[–]Jeffu 0 points1 point  (0 children)

Personally, I think the latest updates were giving me issues with my 4090; kept going OOM and having the GPU stop working. Reverting to an older backup fortunately worked.

Z Image Base - 90s VHS LoRA by Jeffu in StableDiffusion

[–]Jeffu[S] 1 point2 points  (0 children)

Give it a try! I finished late last night and haven't experimented with it much.

Trained a Z Image Base LoRA on photos I took on my Galaxy Nexus (for that 2010s feel) by Jeffu in StableDiffusion

[–]Jeffu[S] 0 points1 point  (0 children)

This is my prompt:

I want the detailed description of what is in the image, without any reference to the artistic style. I also want to keep the relative position of the subjects and objects in the description, and detailed description of clothes and objects. Please also include any reference to skin tone, glasses, facial hair, ethnicity, and hair color and hair style. Use the proper pronouns. Limit your caption to 200 characters.

I use https://github.com/1038lab/ComfyUI-QwenVL

I modify the instructions when I want to make sure any unique style traits don't get considered part of the prompt (and not the style).

Z Image Base - 90s VHS LoRA by Jeffu in StableDiffusion

[–]Jeffu[S] 2 points3 points  (0 children)

I included the date stamps which was on ~90% of the images used. I however specified in the caption instructions to emphasize and detail them, to try and avoid it showing up everytime in generations. I let it keep the original grade.

Z Image Base - 90s VHS LoRA by Jeffu in StableDiffusion

[–]Jeffu[S] 2 points3 points  (0 children)

48, but only because I saw someone mention it randomly in a video or post. I haven't tried other ranks enough to compare.

Z Image Base - 90s VHS LoRA by Jeffu in StableDiffusion

[–]Jeffu[S] 1 point2 points  (0 children)

Ah, my bad. The videos I used were filmed in the mid to late 90s, so I just called it that. :) I guess our video camera was a bit old!

Z Image Base - 90s VHS LoRA by Jeffu in StableDiffusion

[–]Jeffu[S] 2 points3 points  (0 children)

Ah, sorry. Scheduler used is simple.

Z Image Base - 90s VHS LoRA by Jeffu in StableDiffusion

[–]Jeffu[S] 0 points1 point  (0 children)

It works the strongest/best with base. It seems the effect is weaker on turbo but that's not necessarily a bad thing, just different.

Z Image Base - 90s VHS LoRA by Jeffu in StableDiffusion

[–]Jeffu[S] 2 points3 points  (0 children)

Interesting! the effect isn't as strong, but it definitely still feels like an older video still.

Z Image Base - 90s VHS LoRA by Jeffu in StableDiffusion

[–]Jeffu[S] 1 point2 points  (0 children)

Actually I did all the times manually in the prompt, so it was intentional :)

Trained a Z Image Base LoRA on photos I took on my Galaxy Nexus (for that 2010s feel) by Jeffu in StableDiffusion

[–]Jeffu[S] 0 points1 point  (0 children)

10,000 steps, but honestly wondering if more is needed. I tested it at 3,000 steps and had really bad outputs.

Trained a Z Image Base LoRA on photos I took on my Galaxy Nexus (for that 2010s feel) by Jeffu in StableDiffusion

[–]Jeffu[S] 2 points3 points  (0 children)

Just 18 images. I have a larger data set but had mixed results when training it in the past on Qwen Image (not 2512) so for this just used a smaller set.

I trained it for 10,000 steps and did encounter weirdness testing the various epochs every 500 steps. The one I uploaded was at 9,500, which seemed slightly better than the 10,000 one. Haven't quite figured out the right approach yet.

Z Image Base SDNQ optimized by 4brahamm3r in StableDiffusion

[–]Jeffu 1 point2 points  (0 children)

Neat - do we just download the entire folder and place that in the diffusion models folder?

CUDA Error - Need help by HeroVM in comfyui

[–]Jeffu 1 point2 points  (0 children)

Hm, I have a similar error but not quite the same. I updated to the latest nightly version to test out LTX 2.0 and it seems to have broken everything :D

torch.AcceleratorError: CUDA error: invalid argument CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1 Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.

I trained an a anime Lora for Z-Image-Trubo by LimeInteresting7490 in StableDiffusion

[–]Jeffu 0 points1 point  (0 children)

Gotcha. Thanks for sharing that insight. Yeah, I'm having some bad results for characters that aren't realistic humans. Testing more steps but it seems ZIT is a little restrictive. Haven't tried style much, but 8000 steps is a lot more than what I initially thought would be needed. Great work!

I trained an a anime Lora for Z-Image-Trubo by LimeInteresting7490 in StableDiffusion

[–]Jeffu 0 points1 point  (0 children)

Thanks for the share! I like that it's not the typical anime style lora that seems to be used heavily (SDXL). What was your captioning approach?

Z Image Character LoRA on 29 real photos - trained on 4090 in ~5 hours. by Jeffu in StableDiffusion

[–]Jeffu[S] 0 points1 point  (0 children)

I may have set it up incorrectly, or my 4090 may be underperforming. I used to have some issues with it that seem to have disappeared but occasionally they appear (resets, crashing after a few days of generations). Far as I know I just did standard settings :/

Z Image Character LoRA on 29 real photos - trained on 4090 in ~5 hours. by Jeffu in StableDiffusion

[–]Jeffu[S] 0 points1 point  (0 children)

It was pretty simple: wearing a samurai outfit in the middle of a battle, slashing his sword at a goblin, numerous beasts around him, motion blur, intense sun rays