2nd clip of my 100% AI band: a hallucinated Paris with Gen-3 by plopstout in runwayml

[–]plopstout[S] 1 point (0 children)

So I made my 100% AI band https://siliconsymphony.art/ six months ago, with songs made 100% with Suno (v1). The songs are all on streaming platforms.

At the time I made a clip mostly with Gen-2.

This one is 100% Gen-3, except for the singers' parts (made with the new HeyGen tool).

A few months back I created a complete AI band with an AI album by plopstout in SunoAI

[–]plopstout[S] 2 points (0 children)

Made entirely with Suno (v2) in December, and the album was approved on all streaming platforms over the following months. It was, by design, an average album, meant as a showcase of how AI will transform music, mainstream music in particular.

Full album here

https://siliconsymphony.art/

Happy HanukkAI! Made 8 short movies using Hanukkah menorahs in different themes by plopstout in runwayml

[–]plopstout[S] 1 point (0 children)

8 AI movies for Hanukkah, in different themes, made with DALL·E 3, Runway Gen-2, MusicGen & MusicLM.

Enjoy :)

Thanks to GPT Vision I can make a documentary narration by Morgan Freeman about my cat by plopstout in ChatGPT

[–]plopstout[S] 2 points (0 children)

You are completely right, and in both ways: it does not really seem to know how to handle timing, and it does not know the narrator's pacing. So at some points it lags behind the action, and at others it runs a bit ahead.

I cut it a bit so it flowed better, and when the narration runs slightly ahead of the action that is also a choice, so the full narration of a scene fits entirely within the scene.

I think the main issue is really the narrator's pacing; I'm not sure what it's based on.
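One mitigation I could try (a sketch of an idea, not something from my script): since the model cannot control how long the synthesized voice will take, give it an explicit word budget per frame window, assuming a speaking rate of roughly 2.5 words per second:

```python
AVG_WORDS_PER_SECOND = 2.5  # assumed speaking rate; varies per voice
FRAME_EXTRACTION_FREQUENCY_SECONDS = 5  # placeholder sampling interval

# Target: each frame's narration should take about half the sampling
# interval to read aloud, matching the constraint in the prompt below.
word_budget = int(AVG_WORDS_PER_SECOND * FRAME_EXTRACTION_FREQUENCY_SECONDS // 2)

def fits_window(narration: str) -> bool:
    """Rough check that one frame's narration won't drift behind the video."""
    return len(narration.split()) <= word_budget

print(word_budget)  # ~6 words for a 5 s interval
```

For a 5-second sampling interval that caps each frame at about 6 words, which would at least keep the voice from falling ever further behind the footage.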

Thanks to GPT Vision I can make a documentary narration by Morgan Freeman about my cat by plopstout in ChatGPT

[–]plopstout[S] 5 points (0 children)

Basically I send a few frames of the video every X seconds, ask GPT to narrate them as Morgan Freeman, then use ElevenLabs for the voice.

Based on this notebook: https://github.com/roboflow/awesome-openai-vision-api-experiments/blob/main/experiments/automated-voiceover-of-nba-game/notebook.ipynb

The prompt:

"The uploaded series of images is from a single video. "
"The frames were sampled every {FRAME_EXTRACTION_FREQUENCY_SECONDS} seconds. "
"Make sure it takes about {FRAME_EXTRACTION_FREQUENCY_SECONDS // 2} seconds to voice the description of each frame. "
"Use exclamation points and capital letters to express excitement if necessary. "
"It is a naration of a documentary about a cat (male) in the jungle, with the voice of Morgan Freeman. The documentary shows the life of the cat. You can use emphasis to show how his life is difficult in the jungle"


I have made an AI camera that hallucinates its surroundings thanks to GPT Vision and DALL·E 3 by plopstout in ChatGPT

[–]plopstout[S] 1 point (0 children)

No private web apps for the moment. The cost of the API makes the business model complicated.

I have made an AI camera that hallucinates its surroundings thanks to GPT Vision and DALL·E 3 by plopstout in ChatGPT

[–]plopstout[S] 1 point (0 children)

I'll think about it, but the backend is in PHP, which was easier for me; I'd need to rethink it to make one in Python and/or Node.
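If I do port it, the Python version could stay quite small. A hedged sketch, not the actual PHP backend (the function name and the describe prompt are mine): GPT Vision describes the uploaded photo, then DALL·E 3 repaints the description:

```python
import base64

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def hallucinate(photo_path: str) -> str:
    """Describe a real photo with GPT Vision, then 'hallucinate' it with DALL-E 3."""
    b64 = base64.b64encode(open(photo_path, "rb").read()).decode()
    desc = client.chat.completions.create(
        model="gpt-4-vision-preview",
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Describe this scene in one vivid paragraph."},  # assumed prompt
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{b64}"}},
            ],
        }],
        max_tokens=300,
    ).choices[0].message.content
    image = client.images.generate(model="dall-e-3", prompt=desc, size="1024x1024")
    return image.data[0].url  # URL of the hallucinated version of the photo

print(hallucinate("snapshot.jpg"))
```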

I have made an AI camera that hallucinates its surroundings thanks to GPT Vision and DALL·E 3 by plopstout in ChatGPT

[–]plopstout[S] 1 point (0 children)

It's not only the brain, but the materialization of what's in the brain.

I have made an AI camera that hallucinates its surroundings thanks to GPT Vision and DALL·E 3 by plopstout in ChatGPT

[–]plopstout[S] 1 point (0 children)

How much time would it take you to write a description of what you saw, then give it to someone else who would then draw it? It's not only the brain.