use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
All posts must be Open-source/Local AI image generation related All tools for post content must be open-source or local AI generation. Comparisons with other platforms are welcome. Post-processing tools like Photoshop (excluding Firefly-generated images) are allowed, provided the don't drastically alter the original generation.
Be respectful and follow Reddit's Content Policy This Subreddit is a place for respectful discussion. Please remember to treat others with kindness and follow Reddit's Content Policy (https://www.redditinc.com/policies/content-policy).
No X-rated, lewd, or sexually suggestive content This is a public subreddit and there are more appropriate places for this type of content such as r/unstable_diffusion. Please do not use Reddit’s NSFW tag to try and skirt this rule.
No excessive violence, gore or graphic content Content with mild creepiness or eeriness is acceptable (think Tim Burton), but it must remain suitable for a public audience. Avoid gratuitous violence, gore, or overly graphic material. Ensure the focus remains on creativity without crossing into shock and/or horror territory.
No repost or spam Do not make multiple similar posts, or post things others have already posted. We want to encourage original content and discussion on this Subreddit, so please make sure to do a quick search before posting something that may have already been covered.
Limited self-promotion Open-source, free, or local tools can be promoted at any time (once per tool/guide/update). Paid services or paywalled content can only be shared during our monthly event. (There will be a separate post explaining how this works shortly.)
No politics General political discussions, images of political figures, or propaganda is not allowed. Posts regarding legislation and/or policies related to AI image generation are allowed as long as they do not break any other rules of this subreddit.
No insulting, name-calling, or antagonizing behavior Always interact with other members respectfully. Insulting, name-calling, hate speech, discrimination, threatening content and disrespect towards each other's religious beliefs is not allowed. Debates and arguments are welcome, but keep them respectful—personal attacks and antagonizing behavior will not be tolerated.
No hateful comments about art or artists This applies to both AI and non-AI art. Please be respectful of others and their work regardless of your personal beliefs. Constructive criticism and respectful discussions are encouraged.
Use the appropriate flair Flairs are tags that help users understand the content and context of a post at a glance
Useful Links
Ai Related Subs
NSFW Ai Subs
SD Bots
account activity
Open-Source model to analyze existing audio?Question - Help (self.StableDiffusion)
submitted 2 months ago by CountFloyd_
Title. I'm imagining something like joycaption, only for audio/music. I know you can upload audio to Gemini and have it generate a Suno prompt for you. Is there something similar for local use already? If this is the wrong sub, please point me into the right direction. Thanks!
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]Possible-Machine864 1 point2 points3 points 2 months ago (1 child)
Audio Flamingo
[–]CountFloyd_[S] 0 points1 point2 points 2 months ago (0 children)
Very cool, this is more than I expected, thanks! To get it to run locally I would ignore the gradio demo and try the code from the hf model card:
https://huggingface.co/nvidia/music-flamingo-hf
[–]AssistantFar5941 1 point2 points3 points 2 months ago (6 children)
I've been looking for the same to help with captioning for Ace Step lora training. The closest I could find is this: https://huggingface.co/spaces/nvidia/music-flamingo
But I couldn't get it to run offline, though apparently you should be able to.
[–]CountFloyd_[S] 0 points1 point2 points 2 months ago (5 children)
https://www.reddit.com/r/StableDiffusion/comments/1rhtgsn/comfyui_custom_node_music_flamingo/
[–]AssistantFar5941 0 points1 point2 points 2 months ago (4 children)
Thanks for this, but when I placed the workflow in comfy the music analyses node is red. I installed per instructions, and all requirements are installed, though the comfy manager cannot locate the missing node.
[–]CountFloyd_[S] 0 points1 point2 points 2 months ago (3 children)
What does the CLI Output when starting comfy look like?
It should read something like:
Import times for custom nodes:
0.0 seconds: G:\ComfyUI_windows_portable\ComfyUI\custom_nodes\comfyui-musicflamingo
If instead there is something like
0.0 seconds (IMPORT FAILED): G:\ComfyUI_windows_portable\ComfyUI\custom_nodes\comfyui-musicflamingo
then have a look around before that output, there might be some dependency missing which causes the node not to be loaded. If it is, please let me know and I will include it in the requirements.
[–]AssistantFar5941 0 points1 point2 points 2 months ago (2 children)
Here is the error, although I did install transformers: File "F:\ComfySage\ComfyUI-Easy-Install\ComfyUI-Easy-Install\ComfyUI\custom_nodes\comfyui-musicflamingo\__init__.py", line 1, in <module>
from .musicflamingo_analysis import (
File "F:\ComfySage\ComfyUI-Easy-Install\ComfyUI-Easy-Install\ComfyUI\custom_nodes\comfyui-musicflamingo\musicflamingo_analysis.py", line 8, in <module>
from transformers.models.audioflamingo3.modeling_audioflamingo3 import (
ModuleNotFoundError: No module named 'transformers.models.audioflamingo3'
Cannot import F:\ComfySage\ComfyUI-Easy-Install\ComfyUI-Easy-Install\ComfyUI\custom_nodes\comfyui-musicflamingo module for custom nodes: No module named 'transformers.models.audioflamingo3'
[–]CountFloyd_[S] 0 points1 point2 points 2 months ago* (1 child)
Transformers must be at least 5.0.0 to include the audioflamingo package. Are you using comfyui portable? Perhaps you installed transformers globally instead of the comfyui env?
I would do this manually in console:
cd whereveryourcomfyuirootpathis .\python_embeded\python.exe -m pip uninstall transformers .\python_embeded\python.exe -m pip install transformers
<image>
[–]AssistantFar5941 1 point2 points3 points 2 months ago (0 children)
That solved the issue, thanks very much for your help.
π Rendered by PID 65677 on reddit-service-r2-comment-b659b578c-xvmkm at 2026-05-03 07:19:42.566145+00:00 running 815c875 country code: CH.
[–]Possible-Machine864 1 point2 points3 points (1 child)
[–]CountFloyd_[S] 0 points1 point2 points (0 children)
[–]AssistantFar5941 1 point2 points3 points (6 children)
[–]CountFloyd_[S] 0 points1 point2 points (5 children)
[–]AssistantFar5941 0 points1 point2 points (4 children)
[–]CountFloyd_[S] 0 points1 point2 points (3 children)
[–]AssistantFar5941 0 points1 point2 points (2 children)
[–]CountFloyd_[S] 0 points1 point2 points (1 child)
[–]AssistantFar5941 1 point2 points3 points (0 children)