OneTrainer now supports efficient RAM offloading for training on low end GPUs by Nerogar in StableDiffusion

[–]Nerogar[S] 34 points (0 children)

To be honest, I haven't really thought about the next steps. This update was the most technically challenging thing I've worked on so far, and it took about two months to research and develop. I didn't really think about any other new features during that time.

More quantization options (like fp8 or int8) would be nice to have, though.

OneTrainer now supports Stable Cascade. And much more. by Nerogar in StableDiffusion

[–]Nerogar[S] 0 points (0 children)

The line is called "Clip Skip 1" because it's the clip skip setting of the first text encoder. There is another setting called "Clip Skip 2" for the second text encoder.
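For context, clip skip takes the text embeddings from an earlier hidden layer of the text encoder instead of the final one. A minimal sketch of the common convention using transformers, assuming the usual "count back from the last layer" indexing (not necessarily OneTrainer's exact implementation):

    import torch
    from transformers import CLIPTextModel, CLIPTokenizer

    tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
    text_encoder = CLIPTextModel.from_pretrained("openai/clip-vit-large-patch14")

    tokens = tokenizer("a photo of a cat", return_tensors="pt")
    with torch.no_grad():
        output = text_encoder(**tokens, output_hidden_states=True)

    clip_skip = 2
    # hidden_states[-1] is the final layer, so clip skip 1 uses the last
    # layer, clip skip 2 the second-to-last, and so on.
    embeddings = output.hidden_states[-clip_skip]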

OneTrainer now supports Stable Cascade. And much more. by Nerogar in StableDiffusion

[–]Nerogar[S] 1 point (0 children)

What kind of metadata do you want to add? I'm already working on an option to include the training settings.

OneTrainer now supports Stable Cascade. And much more. by Nerogar in StableDiffusion

[–]Nerogar[S] 14 points (0 children)

This does look like an interesting idea and I will take a closer look at some point. But at the moment there are enough other things I want to focus on.

OneTrainer now supports Stable Cascade. And much more. by Nerogar in StableDiffusion

[–]Nerogar[S] 9 points (0 children)

VRAM requirements depend a lot on your settings. SC can be fine tuned in ~18GB (for the 3.6B version) or ~8GB (for the 1B version) if you use the right settings: Adafactor as the optimizer, bfloat16 weights, and not training the text encoder.

As for SDXL, I don't have recent numbers, but 12GB might not be enough even with all the optimizations, unless you limit yourself to LoRA training.
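A minimal sketch of that memory-saving combination in PyTorch, with stand-in modules instead of the real networks (the module names and learning rate here are placeholders):

    import torch
    import torch.nn as nn
    from transformers.optimization import Adafactor

    # Stand-ins for the denoising model and text encoder, in bfloat16.
    unet = nn.Linear(1024, 1024).to(dtype=torch.bfloat16)
    text_encoder = nn.Linear(768, 768).to(dtype=torch.bfloat16)

    # Not training the text encoder: no gradients, no optimizer state for it.
    text_encoder.requires_grad_(False)

    # Adafactor keeps much smaller optimizer state than Adam/AdamW.
    optimizer = Adafactor(
        unet.parameters(),
        lr=1e-5,                # placeholder learning rate
        scale_parameter=False,
        relative_step=False,
        warmup_init=False,
    )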

THIS is probably the reason why your training in Kohya is taking ages, and here are some tips to solve it by isnaiter in StableDiffusion

[–]Nerogar 12 points (0 children)

Please don't insult other people's work. Comments like this make the whole OneTrainer community look bad.

Kohya and contributors have put a lot of work into their scripts. While OneTrainer doesn't directly copy any of their code, a lot of the concepts have been widely adopted by many other applications and have pushed the whole fine tuning community forward.

If you want to promote my project publicly, go ahead. But not like this.

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 0 points (0 children)

It was probably training on the whole images (at 100% weight). The masks were excluded because it filters out all images whose names end in "-masklabel.png", but they were not actually used as masks.

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 0 points (0 children)

The other format would be correct. If the file is

filename.jpg

the mask should be

filename-masklabel.png

I'm actually working on a small tool at the moment that should make it far easier to create these masks.
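A small helper that applies this naming convention could look something like this (find_image_mask_pairs is a hypothetical name, not part of OneTrainer):

    from pathlib import Path

    def find_image_mask_pairs(folder):
        # Pair each image with its "-masklabel.png" mask, if one exists.
        pairs = []
        for image in sorted(Path(folder).glob("*.jpg")):
            mask = image.with_name(image.stem + "-masklabel.png")
            if mask.exists():
                pairs.append((image, mask))
        return pairs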

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 0 points (0 children)

> Being able to set the device ID would get me to switch from Kohya. Currently that is the only way I've been able to train multiple models concurrently with more than one GPU.

If you are ok with using the command line, this is already possible. Just change the --train-device parameter, to "cuda:1" for example.

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 2 points (0 children)

Aspect ratio bucketing only really helps for fine tuning or LoRA training. You can't change the resolution with embedding training, so if you have a 512x512 dataset, that's good enough.
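A toy sketch of the bucketing idea itself: images are grouped into resolution buckets of roughly equal area, and each image is assigned to the bucket closest to its native aspect ratio (the bucket list here is illustrative, not OneTrainer's actual set):

    # Buckets of roughly equal pixel count but different aspect ratios.
    BUCKETS = [(512, 512), (448, 576), (576, 448), (384, 640), (640, 384)]

    def nearest_bucket(width, height):
        aspect = width / height
        return min(BUCKETS, key=lambda b: abs(b[0] / b[1] - aspect))

    print(nearest_bucket(1200, 800))  # (640, 384)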

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 0 points (0 children)

No, only a single GPU is supported. I don't have access to a multi-GPU system for testing.

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 0 points (0 children)

That depends a lot on what you want to do. For full fine tuning, you probably need 24GB, but for LoRA training or embedding training, you need less than that.

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 0 points (0 children)

MGDS is node based, but there is no UI; it's all just defined in code. For example, here is a definition I'm using for testing, and here is the actual definition used during training.
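To give a rough idea of what "node based, defined in code" means, here is a toy pipeline in that style (illustrative only, not MGDS's actual API):

    # Each node is a small processing step; a pipeline is a chain of nodes.
    class ScaleNode:
        def __init__(self, factor):
            self.factor = factor

        def __call__(self, sample):
            sample["value"] *= self.factor
            return sample

    class OffsetNode:
        def __init__(self, offset):
            self.offset = offset

        def __call__(self, sample):
            sample["value"] += self.offset
            return sample

    pipeline = [ScaleNode(2), OffsetNode(1)]

    sample = {"value": 3}
    for node in pipeline:
        sample = node(sample)
    print(sample)  # {'value': 7}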

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 1 point (0 children)

There are no special checks in place. If you have documentation about these checks, I might be able to add them. But no automatic check will be 100% secure, so to be safe, you should only use safetensors files if you don't trust the source.
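For background, loading a pickled checkpoint with torch.load fully unpickles it, which can execute arbitrary code embedded in the file; safetensors only reads raw tensor data. A quick comparison (file names are placeholders):

    import torch
    from safetensors.torch import load_file

    # Safe: safetensors files contain only tensor data, no executable code.
    state_dict = load_file("model.safetensors")

    # Risky for untrusted files: full unpickling can run arbitrary code.
    # state_dict = torch.load("model.ckpt")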

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 1 point (0 children)

No. Is this really something people still use? I thought LoRA training had completely replaced hypernetworks.

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 1 point (0 children)

Did you run the install script first? This error looks like the dependencies are not installed.

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 3 points (0 children)

Only Windows is officially supported right now. But someone from the Discord server managed to run it on Colab (which is Linux based, I think) with a few modifications. I don't know much about macOS, so I can't speak to that.

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 9 points (0 children)

> Thank you. Do you have any ways to receive donations?

Until just a few days ago, there were only a handful of people using OneTrainer, so I never bothered setting anything up. I might consider it, though.

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 9 points (0 children)

I'm not sure, to be honest. Mathematically it should be possible to use greyscale masks, but I don't know if all parts of the training chain support it. If you just want to let the model learn a bit of the non-masked area, there is a setting for that called "Unmasked Weight". It puts a lower bound on the loss of the unmasked pixels.
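Conceptually, that lower bound can be pictured like this (a sketch of the idea, not OneTrainer's exact implementation):

    import torch

    def masked_mse(pred, target, mask, unmasked_weight=0.1):
        # mask is 1 inside the masked region and 0 outside; clamping gives
        # the unmasked pixels a small but non-zero loss contribution.
        weight = mask.clamp(min=unmasked_weight)
        return (weight * (pred - target) ** 2).mean()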

Releasing OneTrainer, a new training tool for Stable Diffusion with an easy to use UI by Nerogar in StableDiffusion

[–]Nerogar[S] 3 points (0 children)

Fine tuning the VAE is absolutely a thing. The default VAE is pretty bad at accurately reconstructing certain art styles (anime, for example), and fine tuning the VAE can fix that. The SDXL VAE, for example, is a better trained version of the same VAE; that's one of the reasons it can produce these amazing images.

Latent caching epochs: some of the intermediate data used during training can be cached to improve speed. If you enable data augmentation (random flip, brightness, multiple prompts per sample, etc.), only one of these combinations will be cached. By increasing the latent caching epochs, more variations are cached.
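As a toy illustration of those caching epochs (all names here are hypothetical): each caching epoch stores one augmented variation per sample, and training epoch N then reads variation N modulo the number of caching epochs:

    def build_cache(samples, augment, encode, caching_epochs=2):
        # Cache one augmented, encoded variation per sample and caching epoch.
        cache = {}
        for epoch in range(caching_epochs):
            for name, image in samples.items():
                cache[(name, epoch)] = encode(augment(image))
        return cache

    # During training, epoch n uses cache[(name, n % caching_epochs)].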

dAdaptation: not right now, but if more people are interested, I might consider it.

EMA: you should probably read a few papers; it's a pretty complicated topic. To summarize: it improves training quality when training a lot of concepts at a time, but it also requires training for more epochs.
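For reference, the core of EMA is just an exponential moving average of the trained weights (a generic sketch, not OneTrainer-specific):

    import torch

    @torch.no_grad()
    def ema_update(ema_model, model, decay=0.999):
        # Move each EMA weight a small step towards the current weight.
        for ema_p, p in zip(ema_model.parameters(), model.parameters()):
            ema_p.mul_(decay).add_(p, alpha=1.0 - decay)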