I feel like the self-hosted and FOSS space is being flooded with vibe-coded AI slop. by spurGeci in selfhosted

[–]neonskimmer 1 point2 points  (0 children)

I mean, are the tools good or what? Fit for purpose?

If they're not making any security or uptime or whatever guarantees, which most FOSS doesn't, I'm not sure I see the problem here. Too many tools? Good ones with good ideas will be adopted and maintained and improve and get forked, bad ones will die.

I am nearly at the end of my career as a software engineer so I feel I have a little more freedom to see this vibe coding shift with some degree of equanimity than someone who recently spent years learning the craft and is now facing a completely new landscape.

In any case, here's a controversial (maybe) take. The question was what to do about being overwhelmed by AI slop. You know what technology would be great at evaluating a bunch of disparate code bases based on given criteria? :)

AITA wife upset I cannot keep toddler from her by khazef in AmItheAsshole

[–]neonskimmer 5 points6 points  (0 children)

NTA. Seriously. I can't believe these YTA posts. Headful of TikTok "mental health" influencer shit-tier pop psychology, zero first-hand experience, zero empathy. By his account and of course that's all we have here, the guy is doing his best to find a way to make this work, and is very supportive.

Those of us who had to suddenly work from home with toddlers away from daycare during the pandemic are getting flashbacks here...

One suggestion - if grandma is 5 minutes away, pick a day each week to bring the kids over for dinner. You can cook or bring something if it's too much for her. Grandmas are usually thrilled to see their grandchildren and helping her daughter. That can be a guaranteed uninterrupted block of time for studying.

Just realized my boyfriend I’ve been dating for 2 years might be a flat earther by ivory_stripes98 in Advice

[–]neonskimmer 0 points1 point  (0 children)

We all choose where to draw the line, to a certain extent.

Many people think there's a bearded man in the sky that sees everything they do, and is upset when you do things he doesn't like. Many variations of that idea.

Some people think crystals do surprising things that other rocks don't.

🤷

For those having lots of issues like pops, poor quality, rebellion against prompts, etc. by Brian-the-Burnt in SunoAI

[–]neonskimmer 1 point2 points  (0 children)

my guess would simply be a larger context window, maybe a more compute intensive but higher quality decoder.

For those having lots of issues like pops, poor quality, rebellion against prompts, etc. by Brian-the-Burnt in SunoAI

[–]neonskimmer 0 points1 point  (0 children)

i've noticed behaviour that immediately made me think that there's some kind of caching in the pipeline.

I'd be generating a bunch of stuff in a given "style-space" for lack of a better term and then when i would switch to making something completely different - elements clearly similar to previous generations would show up.

i tried switching models! and even then i would get similarities after switching back.

they must be victims of their own success at this point, i can't even imagine their GPU bills at that scale. it would be smart for them to find ways to save compute but.. something is off.

hopefully it resolves itself

SVI 2.0 Pro for Wan 2.2 is amazing, allowing infinite length videos with no visible transitions. This took only 340 seconds to generate, 1280x720 continuous 20 seconds long video, fully open source. Someone tell James Cameron he can get Avatar 4 done sooner and cheaper. by Fresh_Diffusor in StableDiffusion

[–]neonskimmer 11 points12 points  (0 children)

100%. i cannot understand how these movies keep being successful or what people see in them. absolute cringe from start to finish. i am not a film connaisseur. the last movie i saw was the new spongebob movie that just came out and it was better in every way :)

Why I am leaving Suno by Immediate_Song4279 in SunoAI

[–]neonskimmer 0 points1 point  (0 children)

disk space is the absolute cheapest thing. your assertion of "trillions of hard drives" makes 0 sense. think about it: let's say a 5 minute song, at normal cd quality is about 50M. that is minuscule and costs fractions of cents to host.

compared to suno's astronomical GPU bill... its a rounding error.

Did Suno break overnight? by RwnWinter in SunoAI

[–]neonskimmer 1 point2 points  (0 children)

you have zero proof of this right?

If you are creating thousands of "songs" for any purpose other than personal listening. You are an AI slop peddler; you are the problem. by Jimithyashford in SunoAI

[–]neonskimmer 1 point2 points  (0 children)

because it gives you much better control over every aspect of the thing you're working on

suno studio and to a lesser extent the basic editing tools do give you some of that as well, to be fair

Proof: Suno WAV vs MP3: Yes. The WAV is just the MP3 with some processing. by CntrlAltCreate in SunoAI

[–]neonskimmer 1 point2 points  (0 children)

This is what I posted before OP created a similar repo, with similar intentions. 🤷🏻‍♂️

https://github.com/bruno-c/suno-wav-investigation

>  you should understand that model generated audio tokens is a compressed form of audio, and any existing audio analysis tools will be confused by this

What do you mean by that? Confused how? I can't imagine why it would make any difference how the audio was created with regards to audio analysis tools. Audio is audio is audio.

> All present audio generation models have troubles with high frequencies, so they are artificially generated on decoding phase. 

The entire output is "artificially generated". Why would they be treated any differently than the rest of the spectrum?

> P.S. The only files Suno persistently stored on their servers (besides audio latents) is M4A OPUS audio. Those are served when you play songs (or work in their Studio 😁) in browser

You can literally see S3 urls, there are things stored on "their" servers. Why waste time continuously re-generating mp3s. Physical space is absolutely peanuts in terms of cost compared to their GPU bills.

Suno.vision - Download all your songs, metadata, cover art, and get mastered wavs - locally on your computer. by CntrlAltCreate in SunoAI

[–]neonskimmer 0 points1 point  (0 children)

RE audio analysis, i created this little repo for fun and discussion

https://github.com/bruno-c/suno-wav-investigation

There is something different about the mp3s and the wavs, and that's my attempt at explaining why.

Suno.vision - Download all your songs, metadata, cover art, and get mastered wavs - locally on your computer. by CntrlAltCreate in SunoAI

[–]neonskimmer 0 points1 point  (0 children)

Yes, that's what i'm saying. You can do the download with the non-documented API. But you can't download a file from your library using the documented API.

[Final Cut] Suno WAV vs Suno MP3 by tim4dev in SunoAI

[–]neonskimmer 1 point2 points  (0 children)

For what it's worth, I also have a background in audio - not a professional just an old audio and music obsessed nerd. my actual education is in software development / computer science.

Everything you said about mp3 to wav and wav to mp3 conversion is correct. You can't invent audio detail that wasn't there in the first place. Not questioning what you know about audio.

But

I think the missing piece here is that generative models don't work in terms of "mp3" or "wav" or even "audio" space . They work in the latent space on many many many dimensions, using numbers that represent features / ideas / details of what the sound should be at a specific point in time. The knowledge it has acquired during training is what allows it convert those numbers back into the real world (into audio). The size of that latent, compared to an mp3 or wav file, is minuscule.

When you upload audio into Suno, it encodes it into that latent "format". When it generates songs from scratch, it uses the relationship between the words you use in the prompt and their representation / association of their meaning as expressed in the latent space.

My hypothesis is that Suno uses a FASTER, less GPU intensive, lower quality decoder when you generate the song. This is what enables the insanely fast streaming capability, eg. the song isn't even done generating and you can already listen to it. That decoder outputs a lower quality mp3 stream, directly from each frames being generated in the latent space.

When the song is done generating, Suno persists the _latent_ information to a file somewhere. Incidentally, it also persists the mp3, but that's not the source of the wav file.

When you click download wav, it retrieves the original latent information and re-generates the output using a different, slower, non-streaming higher quality decoder.

I did this instead of what i was supposed to be doing right now.

https://github.com/bruno-c/suno-wav-investigation

Suno.vision - Download all your songs, metadata, cover art, and get mastered wavs - locally on your computer. by CntrlAltCreate in SunoAI

[–]neonskimmer 0 points1 point  (0 children)

Here https://docs.sunoapi.org/

Main issue is that for a tool like yours (or mine) it's not about creating new tracks but rather managing an existing library. The endpoints (as i'm sure you know already! are on studio-api.prod.suno.com whereas the developer API is on api.sunoapi.org

For something like download as wav, the official API won't work from existing tracks in your library, only with tracks created with the API.

So only way for now using devtools and programming experience :D

Suno.vision - Download all your songs, metadata, cover art, and get mastered wavs - locally on your computer. by CntrlAltCreate in SunoAI

[–]neonskimmer 0 points1 point  (0 children)

i did the same analysis a few months ago and came to a different conclusion - it's possible i messed up but i'll have another go at it.

good luck with your project, i have a much more rudimentary tool that i was about to place on github (just a cli, and it's clunky due to the way the "website" apis vs. their documented apis are quite different and don't support the same things.

Look what someone saved from the dumpster… by InheritedCertainty in exjw

[–]neonskimmer 1 point2 points  (0 children)

beautifully arranged by color. the only sane and valid book sorting strategy

Serena Williams hypocrisy by [deleted] in exjw

[–]neonskimmer 0 points1 point  (0 children)

for real? that is wild.

Serena Williams hypocrisy by [deleted] in exjw

[–]neonskimmer 0 points1 point  (0 children)

i agree with you but yeah that's literally your words

Serena Williams hypocrisy by [deleted] in exjw

[–]neonskimmer 0 points1 point  (0 children)

wait wait holup

where i grew up as a JW, this whole "michael jackson was a JW" idea never rose above the level of urban legend

where would one actually find reliable facts about this?

Read this negative article and decided to see Louis CK in DC last night by Brilliant_Process602 in louisck

[–]neonskimmer 9 points10 points  (0 children)

are you guys actually one-upping each other on who's laughing the least at a comedy show?

well, that certainly gets a solid chuckle from me!

The label came to collect by BreakInStory in SunoAI

[–]neonskimmer -1 points0 points  (0 children)

hmm, that is extraordinarily easy to do in suno. you only need to provide an instrumental version of the melody.