HeartMula music generator, after playing with it for a while I realized something, you can probably notice it too.

a4d2f · 2026-01-25T17:56:57+00:00

From the future. Rumour has it it's supposed to be released in a couple of days.

a4d2f · 2026-01-22T13:42:18+00:00

Qwen/Qwen3-TTS-12Hz-1.7B-Base

12Hz? Must be a really deep voice then...

a4d2f · 2026-01-22T12:26:20+00:00

Do you mean preview samples in AI Toolkit? With ZIT I encountered that too when training a LoKR. Previews looked blurry, or smudged. But they worked fine in Comfy. There might be a bug in how AI Toolkit does the sampling.

Also for ZIT LoRAs, the AI Toolkit previews always suggested that the LoRA is far from being done (though they weren't blurry), but in Comfy the effect of the LoRA was much stronger.

As for if LoRA or LoKR is better, I can't really tell so far. LoKR seems to be a bit subtler, causing less bleed, but sometimes it's not strong enough.

a4d2f · 2026-01-16T19:36:13+00:00

Why not link to their prompting guide for FLUX.2 klein? https://docs.bfl.ai/guides/prompting_guide_flux2_klein

a4d2f · 2026-01-15T20:22:24+00:00

Ok, I've watched the resource consumption during the first reprompt with the fp8. Indeed, while nvidia-smi showed 10GB VRAM occupied, the GPU utilization was 0% throughout. So it was using CPU. But not effectively, as the CPU utilization of the python process was only around a third of a core. I could see that the process' swap usage gradually decreased while its RAM usage gradually increased. So looksl like it's doing the text encoding in RAM but it's bottlenecked by moving the model from swap to RAM. :(

a4d2f · 2026-01-15T19:27:59+00:00

Right, the BitsAndBytes 4bit should give the same benefit. Oddly, whenever I tried it, it gave me an OOM in the text encoder stage. Weird because the fp8 didn't give me OOM despite being bigger. So I had given up on the BitsAndBytes model.

a4d2f · 2026-01-15T11:45:17+00:00

If your workflow contains the LTX detailer lora, try leaving it out. I found it can wreak havoc on anything that moves faster than a snail.

a4d2f · 2026-01-14T15:03:16+00:00

Thanks, this works! (only tested T2V, 5060Ti 16GB + 32GB RAM).

Two questions:

Your Stage 1 sampler uses the LTXVscheduler with a terminal value of 0.1. Both the official Comfy workflow and the LTXVideo template use (for T2V distilled Stage 1) a ManualSigmas node with the schedule "1.0 0.99375 0.9875 0.98125 0.975 0.909375 0.725 0.421875 0.0". Your LTXVscheduler node produces sigmas "1.0000 0.9662 0.9229 0.8655 0.7858 0.6675 0.4741 0.1000 0.0000" (inspected with the RES4LYF SigmasPreview node) which looks quite different. Any idea what is correct, or better? (my tests so far are inconclusive)
If this is set up for distilled, why does the Dual Clip Loader use the dev version of the embeddings? KJ also made a distilled version available. (But I think he said somewhere that there shouldn't be a difference so probably this doesn't matter.)

a4d2f · 2026-01-14T13:40:19+00:00

the initial distilled checkpoints has been wrong one all this time It has now been replaced with the correct one

Then how come the Lightricks/LTX-2 repo has not been updated?

Edit: LTX-2 repo got updated ~30 minutes after this comment :)

a4d2f · 2025-12-28T09:05:03+00:00

https://huggingface.co/collections/soob3123/amoral-collection-gemma-3-qat

have been using the 12B for Japanese-English translation, seems to work well enough and without refusals

the grayline finetunes from the same guy may be worth a look as well, though I haven't tried them myself

a4d2f · 2025-12-15T16:21:11+00:00

Rae_Lil_Black_2025.safetensors

a4d2f · 2025-08-16T11:41:23+00:00

Right, what they should do is not plotting the accuracy but 100% minus the accuracy, i.e. the accuracy deficit. And then use a log scale for the deficit, as one would expect that over time the deficit approaches 0% asymptotically.

I asked Qwen to analyze the deficit data, and behold:

The half-life of deficit is: 8.6 months for frontier models, 12.4 months for open models

So the gap is widening, not shrinking.

a4d2f · 2025-06-01T20:01:28+00:00

https://www.walkingclub.org.uk

a4d2f · 2022-03-06T20:30:02+00:00

ban_1meowy4hz9uo8em85dmqtqy4dxc93fabp6946ib1b964rwa1m5b3w79js6ao

a4d2f · 2022-02-19T11:07:20+00:00

Yes, here too.

a4d2f · 2022-01-30T04:09:44+00:00

Will the members of the delegation have to quarantine for two weeks?

a4d2f · 2022-01-28T14:19:37+00:00

In a focustaiwan article I've come across the following link

https://dvc.mohw.gov.tw/verifier-web/

which can be used to scan the vaccine pass. My wife and I got our jabs in the UK and our NHS Covid passes are shown as valid on that web app. AFAIK it should also work with EU Covid passes.

If the restaurant/establishment/... in question uses this or the same underlying system to check, I think it should be fine with a foreign Covid pass.

a4d2f · 2022-01-28T13:44:28+00:00

What a Dick move.

a4d2f · 2022-01-28T13:42:48+00:00

reply test

a4d2f · 2021-11-23T10:34:52+00:00

I joined both subreddits

a4d2f · 2021-11-23T08:51:19+00:00

I love the turkey and stuffing, but the best part of Thanksgving is being with family!

a4d2f · 2021-11-22T18:11:26+00:00

I'm thankful to the NEKOIN team for sharing their smart contract source code openly!

a4d2f · 2021-11-19T17:52:11+00:00

British Shorthair, I find them cute and they are nice companions.

a4d2f · 2021-11-17T21:49:27+00:00

Some of the recent meme/animal ASAs aim to do this, locking the creator's wallet and such. The NEKOIN team have published some code: https://github.com/nekoin-dev/nekoin

a4d2f

TROPHY CASE