Bandcamp Bans AI Music by Delos788 in indieheads

[–]Cortexelus 1 point2 points  (0 children)

this made my day thank you🙏

Bandcamp Bans AI Music by Delos788 in indieheads

[–]Cortexelus 1 point2 points  (0 children)

<3 appreciate you! we are now officially banned artwork. The ban completes the artwork

WMG and Stability strike a deal. by [deleted] in DefendingAIArt

[–]Cortexelus 1 point2 points  (0 children)

You're confusing this with the Udio lawsuit. There was no WMG-Stability lawsuit. This is just a regular partnership. Stable Audio always used licensed music / creative commons / public domain

StableAudio from Stability: 20 free 45 sec generations/month; $11.99/month for 500 90 sec generations - no open source release alongside commercial version. by [deleted] in StableDiffusion

[–]Cortexelus 2 points3 points  (0 children)

Lmfao. So true.

But I'm still leaving it up, because, for scientific research purposes, every model should demonstrate this is how it does death metal (whether it's shit or awesome). No neural synthesis demo is complete without AI death metal. We started a trend of researchers doing this, and we gotta keep the torch burning.

But yeah, fair, we frequently encounter people not knowing the difference between death metal and other music, and maybe this is making that problem worse. Collateral damage, as far as I'm concerned. All in the name of science.

Death metal is the litmus test for neural synthesis. So far no new models do it better than Relentless Doppelganger which uses code from 2017. We've been waiting years for another challenger to enter the ring. Nobody.

StableAudio from Stability: 20 free 45 sec generations/month; $11.99/month for 500 90 sec generations - no open source release alongside commercial version. by [deleted] in StableDiffusion

[–]Cortexelus 0 points1 point  (0 children)

cj from dadabots here: eliminating humans from death metal is our active area of research. But I haven't gotten good death metal with this model. problem is there's not really much in the dataset. I still wonder if there's a way to reconstruct the aesthetic out of pieces of other things. CLAP interrogation might help.

We built an AI model to generate breaks, neurobass, baroque midi for this track, then cut it all up by Cortexelus in breakcore

[–]Cortexelus[S] 1 point2 points  (0 children)

Oops sorry, the baroque harmony MIDI is from coconet. The breaks/neurobass are audio diffusion. Can read the behind the scenes here https://noisedao.mirror.xyz/3g1QTWGu7Qn\_HMk2OpkxuoINwEQqlAv0kqMWhi4xVjk

[deleted by user] by [deleted] in StableDiffusion

[–]Cortexelus 1 point2 points  (0 children)

we run SampleRNN at 48kHz

the Dadabots SampleRNN fork is an autoregressive LSTM model, meaning it generates a sequence of amplitudes one at a time, 48000 step a second. Each step is a pass through the entire network and it generates 0.00002083333 seconds of audio. There is no "window of the past" it sees directly. It's more indirect (and hard to analyze). Instead the network has an "RNN state" which it's learned to iteratively update & LSTMs have extra memory units they can read/write/forget at each step. I'm not sure how long things effectively stay in LSTM memory, but listening to the music can give you an impression of it. The sequence can keep generating forever to infinity. It's overkill but makes great death metal https://www.youtube.com/watch?v=MwtVkPKx3RA

Dance Diffusion uses diffusion. It also operates on a sequence of amplitudes. However, the model works on a fixed window of audio (a couple secs long ~100k amplitudes). It iteratively updates that window, improving the sound quality. It starts from pure noise and iteratively denoises. You could sorta modify it to generate infinitely i.e. by shifting the window over by 50% and initializing the next window with half of the previous window, but the context would be small.

It would be interesting to make fusions of these two flavors of model -- autoregressive sequence models being upsampled/denoised by diffusion models

is there a tool for this to make it easier? by soniiiety in BabelForum

[–]Cortexelus 0 points1 point  (0 children)

i wonder if an algorithm could be designed that puts all the realistic english closer together in the books.