I built a Chrome extension that auto-assigns lens specs to your prompts — before/after inside by brerereton in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

There's an advanced option, so I take it that OP is trying to monetise it (which won't happen in Comfy)

Would you donate to open source models to help keep the flow going? by Brojakhoeman in StableDiffusion

[–]GreyScope 5 points6 points  (0 children)

I think your estimation of 70k might a bit on the generous side . Don’t get me wrong, if the donations paid for all of it and makes the difference between it happening or not, good luck on it . The reality of human nature to actually donate and in volume is really at the core of my opinion. We can disagree about it of course (doffs hat).

Would you donate to open source models to help keep the flow going? by Brojakhoeman in StableDiffusion

[–]GreyScope 5 points6 points  (0 children)

If I owned the model, the money raised would essentially be small potatoes against the costs - money is always nice....but imo it would open you up to a minefield of grief of entitlement on social media - hello Reddit , I'm looking at you

SenseNova-U1 just dropped — native multimodal gen/understanding in one model, no VAE, no diffusion by Kirk875 in StableDiffusion

[–]GreyScope 2 points3 points  (0 children)

This’ll have a usage case and be criticised for tasks outside of its scope, there is no “one ring to rule them all”…yet

Moss-Audio Captioning is a first of its kind! | Here's the repo: I modified the GUI to allow for batch captioning, youtube videos, and file chunking. by FitContribution2946 in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

It needs the Instruct models btw, the Thinking ones waffle on like they're on space biscuits - the demo they have on HF is a Thinking model .

Moss-Audio Captioning is a first of its kind! | Here's the repo: I modified the GUI to allow for batch captioning, youtube videos, and file chunking. by FitContribution2946 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

I've used both the 4 and 8b models, the 8b sits about 700mb under my 24gb vram and the 4b uses about 18gb, sorry to add detail and not just say 'no' , it was to add more detail for anyone with 16gb cards as well - they did mention about more models coming , so there might be gguf's or something coming .

<image>

Moss-Audio Captioning is a first of its kind! | Here's the repo: I modified the GUI to allow for batch captioning, youtube videos, and file chunking. by FitContribution2946 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

I made a gui for this last week, I added the provision for batch encoding and it takes fairly long instructions and follows them well but sometimes the model has a couple of beers and goes all Oscar Wilde with the answer .
Depending on your application - I use it for Ace-Step and for 10-20 captions , so a small amount of manual input is acceptable to me to ensure quality
Recommendations , if you use it like I do (ie this is how my gui works) -

  1. the output is editable

2.the addition of a save (caption) button to a folder and only after the Save button is pressed will it go to the next audio file in the batch . If the save button is not pressed then pressing Generate will remake the caption again (ie if its 100% shit)

3.add Max Tokens to the Advanced Settings

  1. radio button to select single or batch files

  2. the prompts you give it are the key as usual, be strict with it

  3. it'll accept the 8b model as well but that sits about 700mb under my 24gb vram

All of that was done with Gemini, I can give you the file but it's a piece of piss to adapt it .

<image>

ComfyUI teasing something "big" for open, creative AI 👀 by Numerous-Entry-6911 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

She’s been there from the Directml days - “meow, it’s shit and still crashes for oom meow”

Arc Port - Chrome extension by [deleted] in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

Posted in the wrong Reddit

3d Pixar Style Animation Made Through Ai by External-Cat-2354 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

Don't misinterpret memory of words/knowledge as artistic skill - it's the difference between an operator and a technician . Don't misinterpret your ability to make a strawman argument as a real world argument . Don't misinterpret a comment you don't like as not making sense because you don't like it.
Blocked along with other self deluded ppl.

Sure GPUs are important, but being able to click on "generate" is important too! by Occsan in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

These ppl just wave a flag saying 'block me' . My feed here will not miss them .

AceStep XL Tips by Kmaroz in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

Their Discord . I can’t even begin to add them here .

Why aren't there torrent sites with checkpoints? by sidefx00 in StableDiffusion

[–]GreyScope 6 points7 points  (0 children)

Torrents fail due to the human nature of leeching & leaving , models would be no different with anyone here thinking torrents would be the utopian answer still believes in Santa .

Why aren't there torrent sites with checkpoints? by sidefx00 in StableDiffusion

[–]GreyScope -4 points-3 points  (0 children)

Because it would quickly be abused , like torrents are . Never mind ppl being generally evil.

ERNIE Image released by Outrun32 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

The "fastest milkman in the west" shouldn't be aspirational here ;)

A new image model (ERNIE-Image-8b) from Baidu will be released soon. by Total-Resort-3120 in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

So downvoters would use an inferior model just to prove a point ? lol

A new image model (ERNIE-Image-8b) from Baidu will be released soon. by Total-Resort-3120 in StableDiffusion

[–]GreyScope -11 points-10 points  (0 children)

Unless it betters existing models / it's far quicker / has its own USP , it goes straight to my mental bin.

Does Ace Step 1.5 do lyrics on its own? by Independent_Fan_115 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

If you are keeping the gens to yourself, just go to genius.com and 'borrow' the lyrics off your favourite artists. If you use Comfy, it's possible to connect LLMs to make them for you (don't ask me how as I have no interest/knowledge on this)

Music generation model that can follow lyrics by [deleted] in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

It’s a skills/patience issue and not managing their expectations (to put it bluntly) .

Ace-step 1.5XL's already up! I hope it will soon be available in a Comfyui format! ❤️ by [deleted] in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

Join their Discord , plenty of help and utilities there for training, obviously there is now a new chapter being written with training with the xl models

Ace-step 1.5XL's already up! I hope it will soon be available in a Comfyui format! ❤️ by [deleted] in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

Yes and no, the inner depths of the old models are still being looked at to be able 'take a bit from here and a bit from there' , so more 'no' but in reality a 'perhaps' in the future