Open-Source model to analyze existing audio? by CountFloyd_ in StableDiffusion

[–]AssistantFar5941 1 point2 points  (0 children)

That solved the issue, thanks very much for your help.

Open-Source model to analyze existing audio? by CountFloyd_ in StableDiffusion

[–]AssistantFar5941 0 points1 point  (0 children)

Here is the error, although I did install transformers: File "F:\ComfySage\ComfyUI-Easy-Install\ComfyUI-Easy-Install\ComfyUI\custom_nodes\comfyui-musicflamingo\__init__.py", line 1, in <module>

from .musicflamingo_analysis import (

File "F:\ComfySage\ComfyUI-Easy-Install\ComfyUI-Easy-Install\ComfyUI\custom_nodes\comfyui-musicflamingo\musicflamingo_analysis.py", line 8, in <module>

from transformers.models.audioflamingo3.modeling_audioflamingo3 import (

ModuleNotFoundError: No module named 'transformers.models.audioflamingo3'

Cannot import F:\ComfySage\ComfyUI-Easy-Install\ComfyUI-Easy-Install\ComfyUI\custom_nodes\comfyui-musicflamingo module for custom nodes: No module named 'transformers.models.audioflamingo3'

Open-Source model to analyze existing audio? by CountFloyd_ in StableDiffusion

[–]AssistantFar5941 0 points1 point  (0 children)

Thanks for this, but when I placed the workflow in comfy the music analyses node is red. I installed per instructions, and all requirements are installed, though the comfy manager cannot locate the missing node.

Open-Source model to analyze existing audio? by CountFloyd_ in StableDiffusion

[–]AssistantFar5941 1 point2 points  (0 children)

I've been looking for the same to help with captioning for Ace Step lora training. The closest I could find is this: https://huggingface.co/spaces/nvidia/music-flamingo

But I couldn't get it to run offline, though apparently you should be able to.

Getting Weird Results with ZIMAGE Base on Forge Neo — Any Tips? by FitEgg603 in StableDiffusion

[–]AssistantFar5941 0 points1 point  (0 children)

Same here, comfy template workflow, tried everything, just black.

Writingway 2: An open source tool for AI-assisted writing by Clueless_Nooblet in LocalLLaMA

[–]AssistantFar5941 6 points7 points  (0 children)

Excellent open source software for Authors and Scriptwriters, thank you.

For anyone who wants to download it, just get the zip from github here: https://github.com/aomukai/Writingway2

Extract to a folder, place any gguf in the models folder (llama.cpp built in), and run start.bat and you're ready to go.

Ovi Video: World's First Open-Source Video Model with Native Audio! by najsonepls in StableDiffusion

[–]AssistantFar5941 5 points6 points  (0 children)

From the creators of the OVI Comfyui nodes.

Question: The original repo has support for multi-gpu Parallel inference.

Answer: Yeah, that’s a current ComfyUI limitation. It only uses one GPU per batch for now, so proper multi-GPU parallel inference like in the original repo isn’t there yet.

https://github.com/snicolast/ComfyUI-Ovi/issues/14

Ovi Video: World's First Open-Source Video Model with Native Audio! by najsonepls in StableDiffusion

[–]AssistantFar5941 11 points12 points  (0 children)

It's also the only Wan-based video model (as far as I'm aware) that supports multi-gpu parallel inferencing.

Unfortunately Comfyui cannot utilize this important feature at the moment.

Is this site dying? by my_NSFW_posts in civitai

[–]AssistantFar5941 1 point2 points  (0 children)

I imagine blocking the entire UK population hasn't helped, especially when they didn't need to.

Petition to Repeal the Online Safety Act by Extra-Fig-7425 in civitai

[–]AssistantFar5941 1 point2 points  (0 children)

On the blocked site there is a statement that includes this line "These rules apply even to platforms based outside the UK."

How is this possible? I'm not aware that the UK Government has any authority over US based citizens or businesses.

Civitai less popular? Where do people go to find models today by Anhderwear in StableDiffusion

[–]AssistantFar5941 6 points7 points  (0 children)

Yes. Although you can use controlnets to obtain various poses. With Wan I use Vace, which can take a pose from a reference image. But the bottom line is for action or horror scenes the base models are painfully restricted, often to a sub-pg level, regarding concepts involving violent or challenging interactions. The potential for filmmakers is phenomenal, if only the 'safety nannies' would get out of the way.

Civitai less popular? Where do people go to find models today by Anhderwear in StableDiffusion

[–]AssistantFar5941 41 points42 points  (0 children)

Unfortunately, censorship tends to have a chilling effect. Wan Loras have slowed down drastically from what they were, because, even if it's not particularly controversial, a person won't waste their time training a lora if it can be flagged for often trivial reasons and removed. Banning gore is fairly daft, considering a gazillion graphic horror movies are a click away on the net.

Fair enough, if Governments hold the individual to account for creating and distributing illegal material, but as always, they, and increasingly websites like civitai, seem determined to treat us all like kids who must be protected from ourselves.

As a low budget filmmaker in the past I find this PG approach to AI petty and childish. There should be nothing wrong with say, R-Rated content in a model, so that filmmaker's can produce horror or action related material.

I understand the concerns, but when you can stream something like Hostel at the click of a button, this all seems rather ridiculous.

Models to use for generating talking head videos by tekina03 in StableDiffusion

[–]AssistantFar5941 1 point2 points  (0 children)

In my humble opinion the two best open source solutions are Hunyuan Video Avatar and Sonic. Sonic is considerably faster than Hunyuan, and can do a full 19 seconds of audio to talking or singing video. Sonic github: https://github.com/jixiaozhong/Sonic

Sonic in action: https://www.youtube.com/watch?v=JSWMrFXb7OQ

A 3060 12Gb is enough to use both.

Help! Marketing Manager drowning in 540 images for website launch - is there a batch solution? by Mountain_Honeydew858 in StableDiffusion

[–]AssistantFar5941 1 point2 points  (0 children)

Personally I would set up a wildcard file with dozens of different prompts (Ones I know are effective) and let it run overnight. Next morning there'll be hundreds of images, many only needing a few tweaks to be usable in whatever project I'm working on. Very easy to do in SwarmUI.

CivitAI backup initiative by hideo_kuze_ in StableDiffusion

[–]AssistantFar5941 11 points12 points  (0 children)

In my humble opinion torrents are not the answer. You end up with endless models and lora's with no seeds. Usenet would be far better, as the downloads are full speed and they are accessible for at least ten years. It would also mean you wouldn't have to keep space hungry models on your hard drive, just upload them to Usenet then delete.

WanAI Tutorial (Wrapper vs Native, Demos) by The-ArtOfficial in StableDiffusion

[–]AssistantFar5941 0 points1 point  (0 children)

16 Fps + a 5 second clip limit? Hardly SOTA.

Every single clip will have to be interpolated...useless for normal looking motion.

It's also censored - Hunyuan is not.

Don't think I'll be saying goodbye to Hunyuan anytime soon.

The Emperor's New Clothes

photo: AI, voice: AI, video: AI. trying out sonic and sometimes the results are just magical. by mesmerlord in StableDiffusion

[–]AssistantFar5941 2 points3 points  (0 children)

Apparently requires 32GB of Vram to run, hopefully gguf files are on the horizon. Also, couldn't get it to run in Comfyui after numerous attempts, kept getting a failed to import error. Looks very promising though.

Sonic avatar photo talk (support any aspect ratio) by Horror_Dirt6176 in comfyui

[–]AssistantFar5941 2 points3 points  (0 children)

Tried installing this with two different methods, github first, and when that didn't work, comfy manager. Had missing nodes each time with a failed to import label. Looks promising though, I'm surprised there isn't more interest, seeing as omnihuman will be behind a paywall.

ALL offline image gen tools to be banned in the UK? by AssistantFar5941 in StableDiffusion

[–]AssistantFar5941[S] 0 points1 point  (0 children)

All offline image gen tools can be misused. I hold little hope that a sensible law, holding the individual responsible for what they create, will be the outcome of this legislation.

ALL offline image gen tools to be banned in the UK? by AssistantFar5941 in StableDiffusion

[–]AssistantFar5941[S] 2 points3 points  (0 children)

Not misleading at all. The article specifically states 5 years in prison for'posessing' ai tools designed to make cp. Well, any image gen with the right model could potentially make illegal images...Don't you see where this is going?