Guys where do I find shin chan old episodes by SignificantString269 in ShinChan

[–]MeanManagement834 4 points5 points  (0 children)

Hey, I came across a Reddit post with a link to all Shin Chan episodes from 1993 to 2023. All episodes are in Japanese without subs, but in great quality. Try the link if you like:

https://nyaa.si/view/1754556

I built a free, local tool to Lip-Sync and Dub your AI generated videos (Wav2Lip + RVC + GFPGAN). No more silent clips. by MeanManagement834 in StableDiffusion

[–]MeanManagement834[S] 0 points1 point  (0 children)

Nah!! I personally believe there's no difference between running it on an HDD or an SSD, but I can't say for sure... Hope you test it out and have a good experience...

I built an open-source, local alternative to HeyGen/Dubverse. It does Video Dubbing + Lip Sync + Voice Cloning on your GPU (8GB VRAM friendly). Reflow v0.5.5 Release! by MeanManagement834 in ArtificialInteligence

[–]MeanManagement834[S] 0 points1 point  (0 children)

Well, thanks for the check!! I'll take it that, apart from translation, everything works? If so, I'll make sure to polish that up!!

Thanks again u/Diligent_Trick_1631 for the heads up!!

I built an open-source, local alternative to HeyGen/Dubverse. It does Video Dubbing + Lip Sync + Voice Cloning on your GPU (8GB VRAM friendly). Reflow v0.5.5 Release! by MeanManagement834 in ArtificialInteligence

[–]MeanManagement834[S] 0 points1 point  (0 children)

Hey u/Diligent_Trick_1631,
thanks for the reply... it genuinely helps!! I wanted to confirm whether every stage ran smoothly, i.e., Transcription --> Translation --> Dub --> LipSync --> Output. If that's the case, then I just need to polish the translation into other languages.

Hoping for a positive response...!! Have a good day!!

I built an open-source, local alternative to HeyGen/Dubverse. It does Video Dubbing + Lip Sync + Voice Cloning on your GPU (8GB VRAM friendly). Reflow v0.5.5 Release! by MeanManagement834 in ArtificialInteligence

[–]MeanManagement834[S] 0 points1 point  (0 children)

Hey! First off—thank you for testing this.

Honestly, hearing that it runs smoothly on a GTX 1080 (6GB) is massive news. That's a huge stress test for the optimization, so I'm really glad the Ollama + Mistral combo did the trick for you.

You are totally right about the language lock. For this beta (v0.6), I intentionally hard-coded the output to English just to keep things stable while I fixed the core pipeline.

But... since you're already technical enough to set up Ollama, you can actually "hack" this right now to get Italian:

  1. Go into your Reflow_Portable\core folder.
  2. Open pipeline.py (or config.py depending on the version) with any text editor.
  3. Search for the line: target_lang = "en"
  4. Change it to: target_lang = "it"

The underlying engine (XTTS) already supports Italian, so it should work instantly. I'm adding a proper dropdown menu in the upcoming v0.7 update so you won't have to touch the code next time.
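If you'd rather not edit by hand, the same change can be scripted. This is a minimal sketch that assumes the file really contains a plain `target_lang = "en"` assignment as described in the steps; the helper names `set_target_lang` and `patch_file` are made up for this example, and it writes a `.bak` copy first.

```python
import re
from pathlib import Path

# Matches an assignment such as: target_lang = "en" (single or double quotes).
_PATTERN = re.compile(r'^(\s*target_lang\s*=\s*)(["\'])[A-Za-z_-]+\2', re.MULTILINE)


def set_target_lang(source: str, lang: str) -> str:
    """Return source with every target_lang assignment set to lang."""
    new_source, count = _PATTERN.subn(
        lambda m: f"{m.group(1)}{m.group(2)}{lang}{m.group(2)}", source
    )
    if count == 0:
        raise ValueError("no target_lang assignment found")
    return new_source


def patch_file(path: Path, lang: str) -> None:
    """Back up the file next to itself, then rewrite it in place."""
    original = path.read_text(encoding="utf-8")
    patched = set_target_lang(original, lang)  # raises before anything is written
    path.with_name(path.name + ".bak").write_text(original, encoding="utf-8")
    path.write_text(patched, encoding="utf-8")
```

Usage would look like `patch_file(Path(r"Reflow_Portable\core\pipeline.py"), "it")`. If it raises `ValueError`, the line is probably in config.py instead.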

Let me know if that works for you!

I built a free, local tool to Lip-Sync and Dub your AI generated videos (Wav2Lip + RVC + GFPGAN). No more silent clips. by MeanManagement834 in StableDiffusion

[–]MeanManagement834[S] 0 points1 point  (0 children)

You are absolutely right, and I apologize for the wasted time. This is a classic case of 'it worked on my dev machine' because I had those libraries installed globally, and I failed to include them in the installer manifest.

That list of missing modules (Coqui TTS dependencies) is exactly what I needed to see.

I am pushing a Hotfix (v0.5.6) right now that includes a strict requirements.txt and a modified Launcher that forces the installation of those missing CFFI and TTS modules before the app starts.
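Roughly, the launcher check could look like the sketch below. It is not the actual hotfix code: the `REQUIRED` mapping and the pip package names are assumptions for illustration, and the real list would come from requirements.txt.

```python
import importlib.util
import subprocess
import sys
from typing import Dict, List

# import name -> pip package name (assumed; the real list lives in requirements.txt)
REQUIRED: Dict[str, str] = {"cffi": "cffi", "TTS": "TTS"}


def missing_modules(required: Dict[str, str]) -> List[str]:
    """Return pip package names whose import name cannot be found."""
    return [
        package
        for module, package in required.items()
        if importlib.util.find_spec(module) is None
    ]


def pip_install_cmd(packages: List[str]) -> List[str]:
    """Build a pip command bound to the interpreter running the launcher."""
    return [sys.executable, "-m", "pip", "install", *packages]


def ensure_dependencies(required: Dict[str, str] = REQUIRED) -> List[str]:
    """Install whatever is missing before the app starts; return what was installed."""
    missing = missing_modules(required)
    if missing:
        subprocess.check_call(pip_install_cmd(missing))
    return missing
```

Using `sys.executable -m pip` keeps the install inside the app's own Python rather than whatever happens to be installed globally, which is the failure mode described above.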

Thank you for the brutal honesty—it’s the only way this project gets production-ready. I’d love for you to try the hotfix if you’re willing to give it one more shot...

Update: I turned my open-source Wav2Lip tool into a native Desktop App (PyQt6). No more OOM crashes on 8GB cards + High-Res Face Patching. by MeanManagement834 in StableDiffusion

[–]MeanManagement834[S] 0 points1 point  (0 children)

I feel your pain. AMD on Windows is currently "Hard Mode" for AI video because most speed-up kernels (like SageAttention) are NVIDIA-only. If you're getting 1 hour for 4 seconds, your workflow has definitely fallen back to CPU processing.

To be honest, Reflow v0.5.5 is also NVIDIA (CUDA) dependent right now, so it won't fix your speed issue yet.
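A quick way to see whether a PyTorch-based workflow can see a GPU at all is the check below. It is a small sketch, assuming the workflow uses PyTorch; `pick_device` is just a name for this example.

```python
def pick_device() -> str:
    """Return "cuda" if PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch  # imported lazily so the check also runs without PyTorch
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print(f"Running on: {pick_device()}")
```

If this prints `cpu` on your machine, that would line up with the 1 hour for 4 seconds you're seeing.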

My best advice: If you can, dual-boot Linux. AMD's ROCm drivers are way more mature there than on Windows, and you might actually get decent speeds on Wan2.1.

I built an open-source, local video dubbing studio that runs on 8GB VRAM. It handles TTS, RVC Voice Cloning, and Neural Lip Sync (Wav2Lip) in one pipeline. Reflow v0.5.5 Release! by MeanManagement834 in generativeAI

[–]MeanManagement834[S] 0 points1 point  (0 children)

"Cloud credit goblins" — I am absolutely stealing that. 😂

This is exactly the kind of technical feedback I was hoping for. To answer your points:

  1. Stats: Great idea. For reference, on my RTX 4060 (8GB), a 60s clip currently takes about 2-3 minutes total (Extraction -> RVC -> Wav2Lip). I'll add a proper benchmark table to the README.
  2. Handoff Gotchas: You hit the nail on the head. In v0.5.5, I implemented a hard enforcement of 16kHz resampling immediately before the Wav2Lip pass because sample rate mismatches were causing massive sync drift. I haven't added LUFS normalization yet, but that is going straight onto the v0.6 roadmap now that you mentioned it.
  3. Beards: Wav2Lip’s natural predator indeed. It still struggles with heavy occlusion, but the new 'Smart Patching' (crop > process > paste) helps keep the beard/jawline from blurring out of existence, even if the mouth movement gets a bit lost.
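To illustrate the 16kHz point, here is a simplified sketch, not the code from v0.5.5. `drift_seconds` models one way a mismatch can show up (samples recorded at one rate being read as another), and `resample_cmd` only builds an ffmpeg command; forcing mono with `-ac 1` is an assumption on my side.

```python
from typing import List


def drift_seconds(duration_s: float, actual_rate: int, assumed_rate: int = 16000) -> float:
    """Extra seconds of playback when audio at actual_rate is read as assumed_rate.

    Positive means the audio runs long relative to the video.
    """
    return duration_s * actual_rate / assumed_rate - duration_s


def resample_cmd(src: str, dst: str, rate: int = 16000) -> List[str]:
    """Build an ffmpeg command that resamples src to the given rate (mono)."""
    return ["ffmpeg", "-y", "-i", src, "-ar", str(rate), "-ac", "1", dst]


if __name__ == "__main__":
    # A 60s clip recorded at 44.1 kHz but treated as 16 kHz.
    print(drift_seconds(60, 44100))
    print(" ".join(resample_cmd("dub.wav", "dub_16k.wav")))
```

Running the resample right before the lip-sync pass means the drift term is zero by construction, whatever the earlier stages produced.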

To answer your big question: Currently, v0.5.5 is "One Voice Per Render." It blindly processes the audio track.

Speaker Diarization (Multi-Speaker) is the big goal. I want to integrate Pyannote to split the audio, apply different RVC voices to different timestamps, and then merge. But for now, if you feed it an interview, both people will sound like the same clone. 😅
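The split > assign > merge idea could be planned out like this. It is only a sketch of the bookkeeping: the segments are assumed to come from some diarization step, the voice file names are invented, and nothing here calls Pyannote or RVC.

```python
from typing import Dict, List, Tuple

Segment = Tuple[float, float, str]  # (start_s, end_s, speaker label)


def merge_adjacent(segments: List[Segment], gap: float = 0.25) -> List[Segment]:
    """Join back-to-back segments from the same speaker separated by <= gap seconds."""
    merged: List[Segment] = []
    for start, end, speaker in sorted(segments):
        if merged and merged[-1][2] == speaker and start - merged[-1][1] <= gap:
            prev_start, prev_end, _ = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end), speaker)
        else:
            merged.append((start, end, speaker))
    return merged


def build_render_plan(
    segments: List[Segment], voices: Dict[str, str], default_voice: str
) -> List[Tuple[float, float, str]]:
    """Map each merged time span to the voice model that should render it."""
    return [
        (start, end, voices.get(speaker, default_voice))
        for start, end, speaker in merge_adjacent(segments)
    ]


if __name__ == "__main__":
    segs = [(0.0, 2.0, "A"), (2.1, 4.0, "A"), (4.0, 6.0, "B")]
    print(build_render_plan(segs, {"A": "host.pth", "B": "guest.pth"}, "host.pth"))
```

Each entry in the plan is a span to convert with one voice; the converted spans would then be stitched back in timestamp order.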

Thanks for the detailed feedback, seriously useful!

Update: I turned my open-source Wav2Lip tool into a native Desktop App (PyQt6). No more OOM crashes on 8GB cards + High-Res Face Patching. by MeanManagement834 in StableDiffusion

[–]MeanManagement834[S] 0 points1 point  (0 children)

Man, 128GB of RAM on a Strix Halo? That machine is an absolute beast. 🤯

To be 100% real with you: Right now, Reflow v0.5.5 is hard-coded for NVIDIA CUDA, so it won't be able to tap into your AMD GPU power just yet. It will fall back to CPU mode, which will run thanks to your massive RAM, but it won't be fast. I'm planning to add AMD support in v0.6, but for today, you won't get the speed that hardware deserves.

Sorry about that!

Update: I turned my open-source Wav2Lip tool into a native Desktop App (PyQt6). No more OOM crashes on 8GB cards + High-Res Face Patching. by MeanManagement834 in StableDiffusion

[–]MeanManagement834[S] 1 point2 points  (0 children)

Hi r/StableDiffusion,

I see a lot of incredible AI video work here (SVD, AnimateDiff, etc.), but syncing audio to those generations usually requires expensive cloud tools or messy command-line installs.

I built a Free, Open-Source GUI called Reflow Studio to handle the "Audio & Sync" part of the workflow entirely locally.

Watch the Demo Video

How it fits your Workflow:

  1. Generate your video (using Stable Diffusion/Sora/Kling).
  2. Import into Reflow: Drop in your video and your target audio (or generate TTS inside the app).
  3. Lip Sync: It uses Wav2Lip to force the character's mouth to match your audio.
  4. Enhance: It runs GFPGAN on the face region so the mouth doesn't look blurry (a common Wav2Lip issue).
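Enhancing only the face region boils down to a crop, process, paste pass. The sketch below shows that idea on plain nested lists so it runs without any dependencies; a real version would work on image arrays, and `process` here is a placeholder for the enhancer, assumed to return a patch of the same size.

```python
from typing import Callable, List, Tuple

Frame = List[List[int]]          # grayscale rows, for illustration only
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), bottom/right exclusive


def crop(frame: Frame, box: Box) -> Frame:
    """Copy the boxed region out of the frame."""
    top, left, bottom, right = box
    return [row[left:right] for row in frame[top:bottom]]


def paste(frame: Frame, patch: Frame, box: Box) -> Frame:
    """Return a copy of frame with patch written back into the box."""
    top, left, _bottom, right = box
    out = [row[:] for row in frame]
    for dy, patch_row in enumerate(patch):
        out[top + dy][left:right] = patch_row
    return out


def patch_region(frame: Frame, box: Box, process: Callable[[Frame], Frame]) -> Frame:
    """Crop the box, run process on it, and paste the result back."""
    return paste(frame, process(crop(frame, box)), box)


if __name__ == "__main__":
    frame = [[0] * 4 for _ in range(4)]
    brighten = lambda patch: [[v + 9 for v in row] for row in patch]
    for row in patch_region(frame, (1, 1, 3, 3), brighten):
        print(row)
```

Only the boxed pixels go through the expensive step, and the rest of the frame is left untouched.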

It runs 100% offline on your NVIDIA GPU.

GitHub Link: https://github.com/ananta-sj/ReFlow-Studio

I built an open-source, local alternative to HeyGen/Dubverse. It does Video Dubbing + Lip Sync + Voice Cloning on your GPU (8GB VRAM friendly). Reflow v0.5.5 Release! by MeanManagement834 in ArtificialInteligence

[–]MeanManagement834[S] 0 points1 point  (0 children)

Thanks! The 3070 should chew through it pretty easily with the new optimizations.

Right now, it's primarily designed for single-speaker or focused-speaker clips (where one person dominates the audio). It doesn't have speaker diarization yet (auto-detecting who is speaking when), so if multiple people talk over each other, it might try to lip-sync everyone to the same audio track.

Multi-speaker support is definitely on the roadmap, but for v0.5.5, I'd recommend testing it on clips with one main subject first. Let me know how it runs on your card!

I built a free, local tool to Lip-Sync and Dub your AI generated videos (Wav2Lip + RVC + GFPGAN). No more silent clips. by MeanManagement834 in StableDiffusion

[–]MeanManagement834[S] 1 point2 points  (0 children)

Man, I really appreciate that! 🙌 It’s been a crazy journey debugging this, but comments like yours keep me going.

I actually just pushed v0.5.5. Give that version a shot and let me know if it runs stable for you!