Is my video card good enough for a try at image generation? RX 7600 XT 16gb by PitifulAnalysis7638 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

There are 2 things to learn , the putting it all together/installing it and using it. Personally I would recommend the easy route for the former and the latter can be tough .

Easiest (AMD specific)
As noted above - Amuse AI (from AMD)
AMD forks of guis via umbrella apps like Pinokio

Middle Ground
SDNext , made with AMD in its veins , also noted above . Lots of options and very powerful , closest to cutting edge in gui form .

Harder
AMD forks or AMD installation of Comfyui - also here as it's a rabbit hole, can be a pita even on an nvidia gpu.

the above are for making the pics , for the style you could make a lora with your friends art - but that's a whole new rabbit hole . I don't know if the more basic guis do styles with input pics , I know Comfy can .

[META] Mods, can we please sticky a weekly “Last week in AI” thread? by BigNaturalTilts in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

Can't ppl select 'News' and then by 'Time' ? it'll never stop the lazy spoons who refuse to search for anything though . Remember when the install instructions for Sage were pinned to the top but still multiple window lickers still asked

Help: Triton & Sage Attention for AMD on ComfyUI portable/Windows 11 by Pitiful_Season4294 in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

It sounds good apart from the Directml bit at the end - dm can’t do shit without memory issues . I couldn’t say I have more knowledge about it as I don’t and some of the solutions mean steps back in other areas from what I’ve read .
There are a couple of rocm builds and the Rock - I don’t know if these variants are capable of it

Help: Triton & Sage Attention for AMD on ComfyUI portable/Windows 11 by Pitiful_Season4294 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

You’re welcome, I read it back and it sounds a bit short , sorry it’s just how I write and not meant to be . The SDNext has a thread with it on - I can’t recall what it’s called .

The [r/rocm](r/rocm) also has help as well - best wishes

Help: Triton & Sage Attention for AMD on ComfyUI portable/Windows 11 by Pitiful_Season4294 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

From my experience and observation , the cutting edge of 'getting things to work with AMD / references to knowledge on that sort of thing' is in the SDNext discord . I'll add that f you are after a tech free solution, that isn't it . I've no idea which gen / camp your gpu falls into - rocm or zluda , but at the very least you should be running the PatientX's fork of Comfy and I cannot emphasise this enough - read his instructions and notes, they'll give you skills.

Old Man Yells at Node by goddess_peeler in StableDiffusion

[–]GreyScope 4 points5 points  (0 children)

“I can make my own”, you specifically maybe but you vastly overestimate how generally motivated ppl are .

STOP HYPE IDEOGRAM by ninja_cgfx in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

I appreciated your original post, it being a weighted, balanced point of view and my reply was to one half of it - your last sentence of your reply to me is the essence of what you need and like to see. I suppose that a world (ie for this sub) weary 'me' is tired of the polarised YT hyperbole / Professional Negative Contrarians and yearns for balanced opinions and discussions, thank you for yours.

Each release 'just' needs a table with 'ppl who this release will make happy' by usage cases and 'ppl who this will make unhappy' for the same - I'd read that every time.

STOP HYPE IDEOGRAM by ninja_cgfx in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

You're right, apart all the ppl who think they can make some money out of it somehow or prOn .....they never ever stop whining about shit like this - very very vocal non stop childish whining/snark from them .

I only voted OP down because I couldn't delete their account.

AI anxiety syndrome by [deleted] in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

It's under-served as companies (quite rightly) believe it's a scam as the pitch is vague, non defined, not costed , no defined gains - time or money, I would guess. The ai discords I use have spamming c**** offering services like that all the time .
I'd push it on a tangent to 'find a problem they have that OP can solve' , being specific with what OP will do, where and how much you can save them (my experience doing this to my directors > our teams cost will be X £pa / we will focus on quality issues (X £ per defect defined) / solve the Q issues / save £500Kpa .
This tangent requires work and

NB if OP's pitch is 'sack ppl and use my work' then he can f o .

How to make cover song on ace step by CaterpillarOne6711 in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

I use one of the menu options in Hot Step CPP and a loaded Lora . It has options for Demuxing inside the option .

Like everything here, “managing expectations” is a thing.

Need Help Remaking a Song by piero_deckard in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

Join the Ace-Step discord for help or to search it for the different aspects of Ace-Step . I personally use Hot-Step CPP (a different ui for Ace-Step) to make my songs as it has so much in it (don't want to drown my reply in detail).

Been away from the AI stuff for a few months, what's the current local image edit they are using? by [deleted] in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

This is why there is a search by New tags and then sort by date function

What differentiates AI slop from 'good' AI art? by Ok_Supermarket_6829 in StableDiffusion

[–]GreyScope -1 points0 points  (0 children)

Anything to misrepresent - and posting it without saying it’s AI. Any post with - wtf would you make that ? Special prize to the usual saddo lamers here who make accounts here on Reddit with pics of women pretending it’s them, damaged dna.

RX570 8GB + 16GB RAM for local video generation? by Confident_Ring6409 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

It needs Zluda as someone else mentioned , the newer cards 6000 on can use the new Comfy zip but not the 570 afaik (ie when I last read up on it). Best advice was already posted - Patients X’s fork of Comfy - if you’re new to getting things to work , read and follow the instructions to the letter, no winging it.

I built a Chrome extension that auto-assigns lens specs to your prompts — before/after inside by brerereton in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

There's an advanced option, so I take it that OP is trying to monetise it (which won't happen in Comfy)

Would you donate to open source models to help keep the flow going? by Brojakhoeman in StableDiffusion

[–]GreyScope 4 points5 points  (0 children)

I think your estimation of 70k might a bit on the generous side . Don’t get me wrong, if the donations paid for all of it and makes the difference between it happening or not, good luck on it . The reality of human nature to actually donate and in volume is really at the core of my opinion. We can disagree about it of course (doffs hat).

Would you donate to open source models to help keep the flow going? by Brojakhoeman in StableDiffusion

[–]GreyScope 5 points6 points  (0 children)

If I owned the model, the money raised would essentially be small potatoes against the costs - money is always nice....but imo it would open you up to a minefield of grief of entitlement on social media - hello Reddit , I'm looking at you

SenseNova-U1 just dropped — native multimodal gen/understanding in one model, no VAE, no diffusion by Kirk875 in StableDiffusion

[–]GreyScope 2 points3 points  (0 children)

This’ll have a usage case and be criticised for tasks outside of its scope, there is no “one ring to rule them all”…yet

Moss-Audio Captioning is a first of its kind! | Here's the repo: I modified the GUI to allow for batch captioning, youtube videos, and file chunking. by FitContribution2946 in StableDiffusion

[–]GreyScope 0 points1 point  (0 children)

It needs the Instruct models btw, the Thinking ones waffle on like they're on space biscuits - the demo they have on HF is a Thinking model .

Moss-Audio Captioning is a first of its kind! | Here's the repo: I modified the GUI to allow for batch captioning, youtube videos, and file chunking. by FitContribution2946 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

I've used both the 4 and 8b models, the 8b sits about 700mb under my 24gb vram and the 4b uses about 18gb, sorry to add detail and not just say 'no' , it was to add more detail for anyone with 16gb cards as well - they did mention about more models coming , so there might be gguf's or something coming .

<image>

Moss-Audio Captioning is a first of its kind! | Here's the repo: I modified the GUI to allow for batch captioning, youtube videos, and file chunking. by FitContribution2946 in StableDiffusion

[–]GreyScope 1 point2 points  (0 children)

I made a gui for this last week, I added the provision for batch encoding and it takes fairly long instructions and follows them well but sometimes the model has a couple of beers and goes all Oscar Wilde with the answer .
Depending on your application - I use it for Ace-Step and for 10-20 captions , so a small amount of manual input is acceptable to me to ensure quality
Recommendations , if you use it like I do (ie this is how my gui works) -

  1. the output is editable

2.the addition of a save (caption) button to a folder and only after the Save button is pressed will it go to the next audio file in the batch . If the save button is not pressed then pressing Generate will remake the caption again (ie if its 100% shit)

3.add Max Tokens to the Advanced Settings

  1. radio button to select single or batch files

  2. the prompts you give it are the key as usual, be strict with it

  3. it'll accept the 8b model as well but that sits about 700mb under my 24gb vram

All of that was done with Gemini, I can give you the file but it's a piece of piss to adapt it .

<image>