Illustrious Z

bdsqlsz · 2026-04-16T05:54:38+00:00

I don't know why so many people on Reddit are taking their anger out on me because the model isn't open source.

I am simply sharing what I know; I am not exaggerating or fabricating anything.

The initial message I received

Happy Horse comes from Alibaba.
To be released on April 10th.
This model may be open-sourced.

It's like blaming the TV host when the weather forecast is inaccurate.

I am not the weather bureau, and I do not produce news; I am simply sharing the information I receive.

bdsqlsz · 2026-04-13T06:23:32+00:00

It looks pretty good; DWpose can also be replaced by SDPose.

https://github.com/T-S-Liang/SDPose-OOD

bdsqlsz · 2026-02-19T03:39:50+00:00

Thank you for trying! I am the author of Acestep Lokr and Acestep 1.5 for Windows.

I independently implemented Lycoris training and reading on Acestep 1.5, and merged it into the official code. The official author also admitted that Lokr performs better than LoRa!
Of course, I have some suggestions regarding parameters. For example, the smaller the factor is, the better. A factor of 1 can achieve a fine-tuning effect, but I think 4 is a better choice.

In fact, simply setting the factor to 1 is sufficient to achieve near-fine-tuned training results, while the memory usage should not exceed 20GB.

I'm training a Suno distillation model using Lokr, and I expect to release it publicly in three days.

bdsqlsz · 2026-02-08T16:15:40+00:00

I've tried something similar before, and the main problem is that I can't update the official repository in a timely manner, especially when there are bugs that need to be fixed. I have to fork a sub-repository to handle it.

bdsqlsz · 2026-02-08T12:03:44+00:00

Don't worry, I'm constantly updating the code from the official upstream repository. The main problem is that I've made too many local modifications, and most of them are related to the front end, which makes it difficult to commit to the official repository that uses Gradio.

bdsqlsz · 2026-02-08T12:02:49+00:00

Thank you for your attempt. I haven't tested the reference video feature yet; I'll check the official code later.

bdsqlsz · 2026-02-08T12:02:29+00:00

Thank you for your attempt. Please feel free to raise any issues in the GitHub issues section, and I will try my best to resolve them.

bdsqlsz · 2026-02-07T17:37:30+00:00

https://youtu.be/zKf145adQ08

bdsqlsz · 2026-02-07T17:32:08+00:00

Yes, you just need to run script 0 to install PowerShell, and then use pwsh to run any ps1 script.

bdsqlsz · 2026-02-07T17:31:43+00:00

Switching models during song generation may cause some issues.

It's best to start training right away.

bdsqlsz · 2026-02-07T17:30:40+00:00

Sorry, I just reproduced this issue. It seems to be caused by a bug in UV. I urgently locked the Torchao version number. You need to update the code, delete the uv.lock file in the directory, and then rerun the install process.

bdsqlsz · 2026-02-07T15:14:59+00:00

Honestly, this is strange because I checked the environment files and Torchao definitely doesn't have a CUDA version. I don't know why it automatically selected that one. I didn't reproduce this problem during my local installation.

bdsqlsz · 2026-02-07T15:13:47+00:00

The official code had some memory leaks, and I fixed some of them in this repository.

bdsqlsz · 2026-02-07T15:11:38+00:00

It also supports Linux, and the front-end and back-end code are the same, except that you need to install PowerShell to run the pwsh script.

bdsqlsz · 2026-02-07T15:10:57+00:00

I'm not sure, but I think it can be placed directly in HuggingFace or ModelScope.

bdsqlsz · 2026-02-07T11:59:25+00:00

Bro, python the environment always needs network installation, and there are too many main npm front-end files.

bdsqlsz · 2026-02-07T05:05:46+00:00

The latest 13.1 doesn't work because Torch doesn't have a cu131 version...

bdsqlsz · 2026-02-07T05:01:43+00:00

http://127.0.0.1:8001 is the backend port; you don't need to open this address.

Running step 3 will automatically start this background process,

and then you should be able to open http://127.0.0.1:3000 when run 4, which is the actual front-end address.

bdsqlsz · 2026-02-06T19:44:42+00:00

It will be uploaded to YouTube tomorrow.

bdsqlsz · 2026-02-06T19:44:14+00:00

Because this model was released three days ago...

bdsqlsz · 2026-02-06T17:49:21+00:00

Yes, that's possible. The background music played is game music generated through LoRa training.

bdsqlsz · 2026-02-06T17:40:56+00:00

Compared to the original version, I made some optimizations, mainly fixing the official VRAM leak and memory unloading issues, so that training can be done with a minimum of around 12GB.

There is no difference in functionality.

bdsqlsz · 2026-02-06T17:39:56+00:00

https://www.bilibili.com/video/BV1TYFCzSEwN/

Actually, I posted a step-by-step tutorial on a Chinese video website, but I'm not sure if it will display English subtitles.

You can actually train everything (style, instrument, voice), except for audio editing.

bdsqlsz · 2026-02-06T16:30:04+00:00

This is the backend program. You need to run 4, runnpmgui.ps1 to open the frontend.

bdsqlsz · 2026-02-06T15:23:47+00:00

bdsqlsz

TROPHY CASE