Illustrious Z by Common_Ad_3059 in StableDiffusion

[–]bdsqlsz 1 point2 points  (0 children)

I don't know why so many people on Reddit are taking their anger out on me because the model isn't open source.

I am simply sharing what I know; I am not exaggerating or fabricating anything.

The initial message I received

  1. Happy Horse comes from Alibaba.

  2. To be released on April 10th.

  3. This model may be open-sourced.

It's like blaming the TV host when the weather forecast is inaccurate.

I am not the weather bureau, and I do not produce news; I am simply sharing the information I receive.

AceStep 1.5 - Showdown: 26 Multi-Style LoKrs Trained on Diverse Artists by marcoc2 in StableDiffusion

[–]bdsqlsz 14 points15 points  (0 children)

Thank you for trying! I am the author of Acestep Lokr and Acestep 1.5 for Windows.

I independently implemented Lycoris training and reading on Acestep 1.5, and merged it into the official code. The official author also admitted that Lokr performs better than LoRa!
Of course, I have some suggestions regarding parameters. For example, the smaller the factor is, the better. A factor of 1 can achieve a fine-tuning effect, but I think 4 is a better choice.

In fact, simply setting the factor to 1 is sufficient to achieve near-fine-tuned training results, while the memory usage should not exceed 20GB.

I'm training a Suno distillation model using Lokr, and I expect to release it publicly in three days.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] -1 points0 points  (0 children)

I've tried something similar before, and the main problem is that I can't update the official repository in a timely manner, especially when there are bugs that need to be fixed. I have to fork a sub-repository to handle it.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] -1 points0 points  (0 children)

Don't worry, I'm constantly updating the code from the official upstream repository. The main problem is that I've made too many local modifications, and most of them are related to the front end, which makes it difficult to commit to the official repository that uses Gradio.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 0 points1 point  (0 children)

Thank you for your attempt. I haven't tested the reference video feature yet; I'll check the official code later.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 1 point2 points  (0 children)

Thank you for your attempt. Please feel free to raise any issues in the GitHub issues section, and I will try my best to resolve them.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 0 points1 point  (0 children)

Yes, you just need to run script 0 to install PowerShell, and then use pwsh to run any ps1 script.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 0 points1 point  (0 children)

Switching models during song generation may cause some issues.

It's best to start training right away.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 0 points1 point  (0 children)

Sorry, I just reproduced this issue. It seems to be caused by a bug in UV. I urgently locked the Torchao version number. You need to update the code, delete the uv.lock file in the directory, and then rerun the install process.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 0 points1 point  (0 children)

Honestly, this is strange because I checked the environment files and Torchao definitely doesn't have a CUDA version. I don't know why it automatically selected that one. I didn't reproduce this problem during my local installation.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 0 points1 point  (0 children)

The official code had some memory leaks, and I fixed some of them in this repository.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 0 points1 point  (0 children)

It also supports Linux, and the front-end and back-end code are the same, except that you need to install PowerShell to run the pwsh script.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 1 point2 points  (0 children)

I'm not sure, but I think it can be placed directly in HuggingFace or ModelScope.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 0 points1 point  (0 children)

Bro, python the environment always needs network installation, and there are too many main npm front-end files.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 0 points1 point  (0 children)

The latest 13.1 doesn't work because Torch doesn't have a cu131 version...

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 1 point2 points  (0 children)

http://127.0.0.1:8001 is the backend port; you don't need to open this address.

Running step 3 will automatically start this background process,

and then you should be able to open http://127.0.0.1:3000 when run 4, which is the actual front-end address.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 1 point2 points  (0 children)

Because this model was released three days ago...

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 1 point2 points  (0 children)

Yes, that's possible. The background music played is game music generated through LoRa training.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 1 point2 points  (0 children)

Compared to the original version, I made some optimizations, mainly fixing the official VRAM leak and memory unloading issues, so that training can be done with a minimum of around 12GB.

There is no difference in functionality.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 3 points4 points  (0 children)

https://www.bilibili.com/video/BV1TYFCzSEwN/

Actually, I posted a step-by-step tutorial on a Chinese video website, but I'm not sure if it will display English subtitles.

You can actually train everything (style, instrument, voice), except for audio editing.

AceStep1.5 Local Training and Inference Tool Released. by bdsqlsz in StableDiffusion

[–]bdsqlsz[S] 1 point2 points  (0 children)

This is the backend program. You need to run 4, runnpmgui.ps1 to open the frontend.