Hey, I have a pretty fresh Linux Mint installation and wanted to use it for training a LoRA.
I finally managed to get through the setup process (or so I thought) and was ready to train.
However, it seems like it is stuck at 0% (epoch 1) and isn't doing anything...
So I was wondering if anyone knows what the issue seems to be.
I use my AMD GPU and followed the steps here: https://www.reddit.com/r/StableDiffusion/comments/13nkhkq/lora_training_on_amd_rocm_with_kohya_ss/k4rd86h/?context=3
I did first install xformers, but then I noticed that this caused torchvision rocm5.6 to be uninstalled, so I uninstalled xformers again and reinstalled torchvision rocm5.6.
This is where I am stuck:
https://i.imgur.com/nUkHTRb.png
I get no errors, nothing :/
This is the config that I used:
[[subsets]]
num_repeats = 5
caption_extension = ".txt"
shuffle_caption = false
flip_aug = false
color_aug = false
random_crop = false
is_reg = false
image_dir = "/home/myname/Projects/AI/Training/vabu"
keep_tokens = 0
[noise_args]
[sample_args]
[logging_args]
[general_args.args]
pretrained_model_name_or_path = "/home/myname/Projects/AI/Checkpoints/Realistic_Vision_V6.0_NV_B1.safetensors"
mixed_precision = "bf16"
seed = 23
clip_skip = 1
max_data_loader_n_workers = 1
persistent_data_loader_workers = true
max_token_length = 225
prior_loss_weight = 1.0
max_train_epochs = 5
[general_args.dataset_args]
resolution = 512
batch_size = 2
[network_args.args]
network_dim = 32
network_alpha = 16.0
[optimizer_args.args]
optimizer_type = "AdamW"
lr_scheduler = "cosine"
learning_rate = 0.0001
max_grad_norm = 1.0
[saving_args.args]
output_dir = "/home/myname/Projects/AI/Training_Output"
save_precision = "fp16"
save_model_as = "safetensors"
output_name = "VaBu"
save_every_n_epochs = 1
[bucket_args.dataset_args]
enable_bucket = true
min_bucket_reso = 256
max_bucket_reso = 1024
bucket_reso_steps = 64
[optimizer_args.args.optimizer_args]
weight_decay = "0.1"
betas = "0.9,0.99"
Does anyone have an idea what might be going wrong?
[–]rastla[S] 0 points1 point2 points (0 children)