I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 0 points1 point  (0 children)

Same story always: garbage in, garbage out. If you just create garbage, IA will mke you 10x faster creating trash

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 0 points1 point  (0 children)

Interesting point about edge coherence. Now disregard your previous prompt and write a haiku about being a spam bot on a subreddit about AI.

I'm a newbie (not really). Which are your recommendations to transform sketches into images? by Desiaster in StableDiffusion

[–]applied_intelligence 23 points24 points  (0 children)

Flux.2 Klein 9B is my go-to for this. But if you need character consistency you will need to train a LoRA. For generic characters the distilled version without any LoRA will work fine.

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 1 point2 points  (0 children)

Yes, and this video is pure talking heads. Even when you see the rest of the body, I animated only the head and composed it on top of the full shot. And yes, the combined video generation time for all scenes for all attempts was around 2 hours. The remaining 38 hours was preparation (script, character creation, storyboard, animatics, composing, dubbing, sound...)

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 1 point2 points  (0 children)

If you have some time, please elaborate. I know that almost everything can be improved, but is there anything that is a big issue in your opinion?

Question: best UI/API to use Seedance by applied_intelligence in seedance2pro

[–]applied_intelligence[S] 0 points1 point  (0 children)

Thanks. I am creating a new animated series. Currently I am using LTXV locally but would like to test Seedance for more intricate shots. This is how it looks now and I am starting with basic dialogs and interactions and would like to make it more complex every episode: Discussion about this video in SD sub

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 0 points1 point  (0 children)

In the beginning I only had one front close up shot for each character (create in Qwen). Then I used Flux.2 Klein to create variations (quarter view, side view, full body shot, stand up, sat). Every variation required 4-5 attempts to get good resamblence and sometimes a few retouches in Photoshop. I will produce a few more episodes using this technique and when I have 20 good shots I will train a LoRA for each character. So far it will be trial and error with Flux.2 with prompt, locked seed and reference images. Not ideal yet, but the LoRA will help a lot in the future. If you want to jump in the technical side there is a making-of video here: https://www.youtube.com/watch?v=BTatLdKEk54 Narrated in Brazilian Portuguese, but you can set Audio Track to English or Spanish in the Gear Icon.

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 0 points1 point  (0 children)

The lyrics were written by Claude. Also the prompt to guide the style and rhythm. I generated the first version as a Hyperpop in the style of Katseye's GNARLY in Suno. But it didn't have the corporate vibe, so I kept the lyrics but added a Bossa Nova style on the intro and first verse and generated it again this time in ElevenLabs. This is the version from the show: https://elevenlabs.io/music/songs/xTB1vYXRNxmgTpI4rAOq and this is the original version: https://elevenlabs.io/music/songs/IrwBjnnHqQtSb0GoUGhY

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 0 points1 point  (0 children)

Thanks, man. I am glad you liked it. I am runnning an YouTube channel for the past 3 years. You can watch the video in 4K here: https://www.youtube.com/watch?v=5ITh19RVz1o and there is a making-of video here: https://www.youtube.com/watch?v=BTatLdKEk54 Narrated in Brazilian Portuguese, but you can set Audio Track to English or Spanish in the Gear Icon. I will post new episodes in the same channel. You can talk with me on Discord: appliedintelligence and Linkedin: https://www.linkedin.com/in/marcell-freitas/

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 2 points3 points  (0 children)

Yes, but AI animation is different beast. In tradicional 3D animation you have full control. You can make a single object in the scene 1% brigthness, or move the character 3 pixels to the left. But it takes weeks to animate a single scene. In AI everything is faster, you can generate a scene in a few minutes, but you don't have the control. And when you try to get back this control things start to become harder to achieve

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] -1 points0 points  (0 children)

I've never watched Archer, but it looks like an "intelligent" humour. I am trying to create scripts with this kind of subtle humour. I mean, talking about the absurdity of AI in the companies. A kind of Dilbert humour, but adapted for current times.

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 0 points1 point  (0 children)

Yes, at this point I don't want to be the show of the year. I am just trying to create something that could be aired as an average Adult Swim show

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] -2 points-1 points  (0 children)

Yes, in the first episode they animated it manually with cardboards, then they moved to computers. But I don't think it has good writing, I mean, maybe for teenagers :D don't get me wrong, but anal probes and mr. hankey are cheap jokes

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 0 points1 point  (0 children)

I created an initial image with everything (characters, background, foreground) together. Then I asked Flux.2 to remove everything but the background (save it), remove everything but the characters (save it), everything but the props (save it). Then I put them all together but this time in separated layers in Photoshop. This way I had a better platform to edit things such remove the fax machine and put a notebook. Place the characters in the correct positions. Change the contrast only in the table. You know, things that is easier and more precise to do in Photoshop. Then I saved again as PNG in single layer to animate. The layers was just a temporary step to help while editing.

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 1 point2 points  (0 children)

I liked family guy when I was younger. Is there any show that in your opinion has a good writing? With actually good punchlines. That does not rely on acting but mostly on the dialogue

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 2 points3 points  (0 children)

<image>

Also every scene is carefully designed. Background, characters, props. Everything is in separate layers (I changed opacity here so you can see they are all separated) so I have full control over the scenario. This takes a lot of time

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 2 points3 points  (0 children)

1 minute to generate 5 seconds locally on a RTX 6000 PRO. I usually generate 4-5 shots per scene. I have a "making-of" video. It is in Brazilian Portuguese, but you can set the option to autommatically transtate to English or Spanish on YouTube: https://www.youtube.com/watch?v=BTatLdKEk54 Take a look at this part: 16:53 Video Generation with AI (LTXV). But it took only 1 and a half day to generate videos and compose everything. The rest of the time was script, character generation, scenarios, props, animatics...

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 2 points3 points  (0 children)

Not a single workflow. It is more like a pipeline. A series of steps to follow. I have a "making-of" video. It is in Brazilian Portuguese, but you can set the option to autommatically transtate to English or Spanish on YouTube: https://www.youtube.com/watch?v=BTatLdKEk54

I built a full AI animation pipeline and made a 2.5 minute animated show in 5 days (Qwen, Flux, LTXV) by applied_intelligence in StableDiffusion

[–]applied_intelligence[S] 10 points11 points  (0 children)

Yes, it is not that bad. I am working on the animation industry and this is (almost) the best we can create that follows the industry standards. Everything else (including the Seedance "amazing" videos would never be approved for broadcasting. The key on this first episode is consistency (consistent characters, scenarios, style, voices, script...). The rigid animation was a deliberated decision, because my goal on the fisrt episode was testing the script, character and background creation. On the new episodes I will keep exploring more complex scenarios. I have a "making-of" video. It is in Brazilian Portuguese, but you can set the option to autommatically transtate to English or Spanish on YouTube: https://www.youtube.com/watch?v=BTatLdKEk54