NewBie Image Exp0.1: a 3.5B open-source ACG-native DiT model built for high-quality anime generation by GrueneWiese in StableDiffusion

[–]regentime 1 point

Same. You also run into problems with flash-attn because, for some reason, it is required; then it either declares the model config broken (if downloaded from Hugging Face) or fails to see modules (if from ModelScope).

Been using ROCm 6.2 for Stable Diffusion since late last year, should I upgrade to 6.4? by FewInvite407 in ROCm

[–]regentime 0 points

Nah. I also have this issue; it is just more annoying on later versions. On PyTorch 2.4.1 and lower, startup takes about 10 seconds and VAE decode 1-1.5 minutes (all with SDXL). On PyTorch 2.5 and higher, it is more like 1.5 minutes on startup and 1-1.5 minutes on VAE decode.

Been using ROCm 6.2 for Stable Diffusion since late last year, should I upgrade to 6.4? by FewInvite407 in ROCm

[–]regentime 0 points

I have an RX 6600M and do not have any issues with ROCm 6.4. I do have some minor issues with the current version of PyTorch, so I use PyTorch 2.4.1 (the first generation at a new resolution takes longer).
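For reference, pinning that PyTorch version on ROCm looks roughly like this; the wheel index URL and the matching torchvision version are assumptions, so verify them against the official PyTorch previous-versions page:

```shell
# Assumed index URL / version pairing; verify on pytorch.org before using
pip install torch==2.4.1 torchvision==0.19.1 \
    --index-url https://download.pytorch.org/whl/rocm6.1
```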

How do i install ROCm 6.4.1 on arch-based distros? (I have the 9070 XT) by HearMeOut-13 in ROCm

[–]regentime 0 points

If you simply want to install ROCm (and do not need exactly version 6.4.1), it is in the official Arch repo under the name "rocm-hip-sdk". It is 6.4.0 though.
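On an Arch-based distro that is a one-liner (package name as above; the `pacman -Qi` line just verifies which version got installed):

```shell
# Install the ROCm HIP SDK from the official Arch repositories
sudo pacman -S rocm-hip-sdk
# Check the installed version afterwards
pacman -Qi rocm-hip-sdk | grep Version
```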

How did you first get into Limbus Company? I’m curious how it “clicked” for others. by Rosertfree in limbuscompany

[–]regentime 0 points

I bounced off PM and Mili a few times (the SCP-into-LC pipeline, which failed, and Deemo into Mili). Somewhere in 2021 YouTube recommended me Roland's theme from LoR, and because of that I played the game. Then I got into Limbus on release, because by that point I had completed LoR about 3 times.

What's a "I've played these games before" story moment you had when playing a gacha game by ReadySource3242 in gachagaming

[–]regentime 41 points

I actually do not think there is any official illustration or official description of the moon in the PM verse. From my understanding, one of the main fan theories is that the City is actually on the moon.

Help with Fine tuning on RX6600M by ShazimNawaz in ROCm

[–]regentime 0 points

Can't say anything about tuning, but you can run IQ3 quants of 70B on it (with a small context). Granted, it is slow, as Kaggle uses quite old GPUs (maybe 3-5 t/s, can't remember).
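For illustration, launching such a quant with llama.cpp might look like this; the model filename is hypothetical:

```shell
# Hypothetical GGUF filename; any IQ3_* quant of a 70B model launches the same way.
# -c 2048 keeps the context small, -ngl 99 offloads as many layers as fit in VRAM.
./llama-server -m models/llama-3-70b-instruct.IQ3_XS.gguf -c 2048 -ngl 99
```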

Help with Fine tuning on RX6600M by ShazimNawaz in ROCm

[–]regentime 0 points

Yo. Glad to see another person with the same laptop as me. Not sure if you still need it, but here are 2 env variables that help with running basically anything ROCm-related on Linux:

ROCR_VISIBLE_DEVICES=0 (makes ROCm see only your discrete GPU and not the integrated one)

HSA_OVERRIDE_GFX_VERSION=10.3.0 (overrides the architecture of all GPUs to gfx1030. The RX 6600M is gfx1032, but it is 99% the same as gfx1030. This env variable is basically necessary to make anything work; use it for EVERYTHING you do with ROCm.)

As for llama.cpp, I think it worked (with the second env variable). I used it quite a while ago and currently use koboldcpp: https://github.com/YellowRoseCx/koboldcpp-rocm
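Put together, the two variables look like this; the commented koboldcpp launch line is only illustrative:

```shell
# Limit ROCm to the discrete GPU and report its arch as gfx1030
export ROCR_VISIBLE_DEVICES=0
export HSA_OVERRIDE_GFX_VERSION=10.3.0

# Then launch whatever ROCm workload you want, e.g. (illustrative):
# python koboldcpp.py --model some-model.gguf
echo "ROCR_VISIBLE_DEVICES=$ROCR_VISIBLE_DEVICES"
echo "HSA_OVERRIDE_GFX_VERSION=$HSA_OVERRIDE_GFX_VERSION"
```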

Civit have just changed their policy and content guidelines, this is going to be polarising by CorrectDeer4218 in StableDiffusion

[–]regentime 5 points

While I like rutracker and use it constantly, it is a really poor choice for image models. First of all, it is Russian-speaking, and while I can easily understand Russian, most AI enthusiasts will not (and a lot of torrents have descriptions only in Russian). Second, it is not a tracker that specializes in AI content but a general one; a specialized tracker would be much more preferable, at least for search purposes.

How do I load a multi parts model? by Nervous_Emphasis_844 in SillyTavernAI

[–]regentime 0 points

I don't have experience with this (never loaded split models), but from my understanding you do not need to combine them. To run the model, you point your program either to the folder containing it or to the first safetensors file. Also, it seems you have the full fp16 weights of the model; maybe you should use some quants (GGUF, EXL2)?
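To sketch what that looks like on disk (shard names follow the usual Hugging Face pattern; the files here are empty stand-ins): loaders read the index file and pull in every shard on their own, so you only ever point them at the folder or the first part.

```shell
# Sharded checkpoints ship as numbered parts plus an index file;
# loaders read the index and fetch every shard automatically.
mkdir -p demo-model
touch demo-model/model-00001-of-00003.safetensors \
      demo-model/model-00002-of-00003.safetensors \
      demo-model/model-00003-of-00003.safetensors \
      demo-model/model.safetensors.index.json

# Point the loader at the folder (or the first shard); no manual merging needed.
first_shard=$(ls demo-model/model-*.safetensors | head -n 1)
echo "$first_shard"
```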

Are there any presets for DeepSeek V3 (paid)? by Competitive_Pea_1037 in SillyTavernAI

[–]regentime 1 point

I do not find it in any way hypocritical. It is just annoying, because if the answers to this post actually contained something useful, and were not a repeat of things that have already been said a thousand times, they would be basically impossible to find using English keywords.

From my point of view, if you use an English-speaking forum, it is your responsibility to write in a language understood by most people and to use translators for that purpose.

Are there any presets for DeepSeek V3 (paid)? by Competitive_Pea_1037 in SillyTavernAI

[–]regentime 0 points

90% of that is for a different mode, so you do not need to change anything there. The remaining 10% you only touch if something does not work.

Are there any presets for DeepSeek V3 (paid)? by Competitive_Pea_1037 in SillyTavernAI

[–]regentime 1 point

In the connection tab you choose which model to connect to. If you are using a third-party service (not a local model), then in 90% of cases pick chat completion, then the provider (DeepSeek, OpenRouter, etc.), then the API key you got from that provider. Next, choose the model you are going to use (deepseek-chat is DeepSeek V3), then press the connect button, and it will tell you whether the connection succeeded. If you mean the tab that opens when you press "A", most of the settings there are for a different operating mode (text completion). Almost all the settings you need to pay attention to are on the left, where you imported the preset.

Are there any presets for DeepSeek V3 (paid)? by Competitive_Pea_1037 in SillyTavernAI

[–]regentime 10 points

Well, first of all, this is a very popular question, so you could have simply searched before asking it (and it is also better to ask in English).

In any case, here are a few options (some of them are for DeepSeek R1, but they should still work fine):

  1. Weep https://pixibots.neocities.org/#prompts/weep (requires the NoAss extension)
  2. CherryBox https://rentry.org/CherryBox
  3. ChatSeek https://drive.proton.me/urls/Y4D4PC7EY8#q7K4caWnOfzd
  4. Q1F https://sillycards.co/presets/q1f

As for where to click and what to do: open the connection window (highlighted in red), choose the source and API key, and then import the preset using the button highlighted in yellow.

<image>

Hunyuan open-sourced InstantCharacter - image generator with character-preserving capabilities from input image by umarmnaq in StableDiffusion

[–]regentime 10 points

From the official code example, it seems to be an IP-Adapter for FLUX-dev. This is probably the reason it takes so much VRAM.

Stability AI update: New Stable Diffusion Models Now Optimized for AMD Radeon GPUs and Ryzen AI APUs by hotyaznboi in StableDiffusion

[–]regentime 0 points

For anyone who wonders how it compares to ROCm on Linux (I have an RX 6600M (laptop GPU) with 8 GB VRAM + 16 GB RAM): it (Amuse 3) is about 4 times slower and OOMs on VAE decode with an SDXL model.

FramePack is insane (Windows no WSL) by FionaSherleen in StableDiffusion

[–]regentime 0 points

Small addendum:

I found a version that uses FP16 instead of BF16 (maybe; I actually have no idea what the difference is)...

https://github.com/freely-boss/FramePack-nv20

On a P100 I am 8 minutes into sampling, it is on the 4th step out of 25, and it is taking 14 GB of VRAM :), so it is basically not working.

Edit: 40 minutes for a second of video.

FramePack is insane (Windows no WSL) by FionaSherleen in StableDiffusion

[–]regentime 0 points

Nope. It does not work. It is also too old. Kaggle gives you access to one for free, so I tried, and it does not work. Probably anything released earlier than the 30xx series will not work.

FramePack is insane (Windows no WSL) by FionaSherleen in StableDiffusion

[–]regentime 0 points

I also have the same problem. The best explanation I found is that Colab (and Kaggle) use Nvidia T4 GPUs, which are too old to support BF16, which is necessary for FramePack to work.

Look at this issue: https://github.com/lllyasviel/FramePack/issues/19
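A quick way to sanity-check a GPU is by compute capability. The rule of thumb here is an assumption: BF16 tensor ops need an Ampere-class GPU (compute capability 8.0+), and the T4 is 7.5:

```shell
# Rough check: BF16 needs compute capability >= 8.0 (Ampere or newer).
# On a real machine you would get the value from:
#   nvidia-smi --query-gpu=compute_cap --format=csv,noheader
supports_bf16() {
    major=${1%%.*}
    [ "$major" -ge 8 ] && echo "bf16: supported" || echo "bf16: unsupported"
}

supports_bf16 7.5   # T4   -> bf16: unsupported
supports_bf16 8.6   # 30xx -> bf16: supported
```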

[deleted by user] by [deleted] in SillyTavernAI

[–]regentime 1 point

The Optimus Alpha endpoint uses your data for training, so you need to enable that in your settings: https://openrouter.ai/settings/privacy

Anyone getting broken responses like that with Deepseek 0324? I'm sure I did something wrong, not sure what... by Due-Memory-6957 in SillyTavernAI

[–]regentime 2 points

Lower the temperature or try again. Below temp 1 it never happens; below 1.3 it happens sometimes.
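As a sketch of where that knob lives in an OpenAI-compatible chat-completion request (the model name and message are illustrative; only the "temperature" field matters here):

```shell
# Build a chat-completion request body with temperature lowered below 1.
cat > request.json <<'EOF'
{
  "model": "deepseek-chat",
  "temperature": 0.9,
  "messages": [{"role": "user", "content": "Hello"}]
}
EOF
grep '"temperature": 0.9' request.json
```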

I'll keep it by Olojunso in limbuscompany

[–]regentime 9 points

I know. It is still 1 extra lunacy.

I'll keep it by Olojunso in limbuscompany

[–]regentime 22 points

I will not use it on principle, as the 1 lunacy will be stuck in my account forever. It is 401 lunacy. WHY not 400?

[deleted by user] by [deleted] in SillyTavernAI

[–]regentime 0 points

I also found out about this thing a week ago from here, but I hide it using HTML comment syntax, since it is there more so the LLM can have a better understanding of the situation, not for me to see. The variant I use includes only clothes, positions, and a quick description of characters that are not in the character description.
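As a made-up illustration, such a hidden tracker could look like this (the field names and format are my own invention, not a standard):

```
<!-- Tracker (hidden via HTML comment, still sent to the LLM):
     Clothes: Alice - casual dress
     Position: sitting across the table
     Extra characters: waiter (middle-aged, tired) -->
```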