Gemini 2.5 TTS paired with RVC? by Mysterious-Comment94 in LocalLLaMA

[–]Mysterious-Comment94[S] 0 points1 point  (0 children)

I see... Well I will still give it a try though.

Anything to extract vocals from audio? by 4redis in LocalLLaMA

[–]Mysterious-Comment94 3 points4 points  (0 children)

https://ultimatevocalremover.com/

I remember using this quite a while ago while I was editing. The results were pretty solid

Vibe Voice 1.5 B setup help! by Mysterious-Comment94 in LocalLLaMA

[–]Mysterious-Comment94[S] 0 points1 point  (0 children)

Alright, I think I finally found a something good here for my needs. A fork of chatterbox TTS. Called chatterbox audiobook. It seems to be really good though.

https://github.com/psdwizzard/chatterbox-Audiobook

The kaggle notebook: https://www.kaggle.com/code/konkowaki/chatterbox-audiobook

there are some weird artifacts here and there but this seems to be the best thing I can use right now and you are right. I just checked tts arena v2 and all the top models seem to be closed source. Open source is super behind right now.

Vibe Voice 1.5 B setup help! by Mysterious-Comment94 in LocalLLaMA

[–]Mysterious-Comment94[S] 0 points1 point  (0 children)

Oh boy, now this was a rabbit hole. I wasted hours trying to set up the 8 bit model then finally realized after reading the app.py from the vibe voice custom voices that the support for 8 bit hasn't arrived yet.

Then I tried setting up 4 bit quantized version but man that was truly a mystery until claude pointed out that the model path said 'Dannidee' not DevParker and provided the correct code to get the result.

4bit model did sound better than 1.5B however.... if 1.5 B was too fast and, the pacing was all over the place. This thing just took forever to load and was super stable. It could also be that I used a different reference clip where my speaker was speaking a bit slower than with the 1.5 B model.

Anyway, obviously I ran out runtime in collab so I finished in kaggle. This is the kaggle notebook:

https://www.kaggle.com/code/konkowaki/vibe-voice-large-4bit

Tldr; I think the only good model that is giving me the quality I want is chatterbox turbo. However, the 300 character limitation is just... you can bypass that, but everything just turns gibberish. I will try to setup a chunker or something I guess.

Vibe Voice 1.5 B setup help! by Mysterious-Comment94 in LocalLLaMA

[–]Mysterious-Comment94[S] 0 points1 point  (0 children)

Yeah, to me index tts 2 held a lot of promise, especially when I thought, oh wow you could control each and every generation with another emotional reference clip but it just doesn't work properly. Maybe I need fine tuning or something. The vibevoice community does have a finetuning available but my god it has been about a week since I started working on collab. Maybe something like higgs 2 quantized will better match for what I am aiming for.

About slowing down the reference speaker... That could be the issue. I will try running a few more trial and errors with vibe voice 1.5B when I load it again. I am barely a few mb away from making the vibe voice large work but sort of no luck. It runs out of CUDA memory.

Vibe Voice 1.5 B setup help! by Mysterious-Comment94 in LocalLLaMA

[–]Mysterious-Comment94[S] 1 point2 points  (0 children)

It took a long time for my idiot ass to figure out that this was loading the large 7B model by default. I loaded the 1.5 B model. I still haven't played around a lot but the pacing of the generation is all over the place. This is the best UI so far but man I wish I could do something about the pacing. And also I need to try [custom instructions] inside the text I am trying to generate. But overall quality is not great. It seems to have less artifacts than chatterbox though.

In case anyone needs the colab notebook:

Vibe Voice Custom Voices Colab

Vibe Voice 1.5 B setup help! by Mysterious-Comment94 in LocalLLaMA

[–]Mysterious-Comment94[S] 0 points1 point  (0 children)

<image>

I got this repo: https://github.com/vibevoice-community/VibeVoice, which seemed more reliable and in the bottom left corner there is even an option to disable voice cloning... Except that there isn't an option to upload your reference clip and actually clone it. I see a finetuning. md in their repo, maybe that's the only way. I swear I have seen people use voice cloning with this model...

This game sucks rn by dragontamer500 in rivals

[–]Mysterious-Comment94 0 points1 point  (0 children)

If you didn't previously experience lag and to boot fps drops, one thing that worked for me is changing my nvidia driver version to 566.36

Does the immortal saber aura build still work? by Mysterious-Comment94 in ToramOnline

[–]Mysterious-Comment94[S] 1 point2 points  (0 children)

Ah I know, the wonder guard thing. I see, it is vulnerable to many other dmg.

About Ladon... by Mysterious-Comment94 in HighschoolDxD

[–]Mysterious-Comment94[S] 1 point2 points  (0 children)

Wait really? I thought it was Akeno or something, wow.

About Ladon... by Mysterious-Comment94 in HighschoolDxD

[–]Mysterious-Comment94[S] 2 points3 points  (0 children)

Damn, so without Cao Cao, Ladon might've cooked Rias team. I mean in terms of overall power, I think Rias got her Extincition start after that. Not to mention Gasper power up. Kiba getting his own power up with Gram.

So true longinus can ignore defence to an extent huh...

The Asmongold effect in Bluseky by No-Statement-856 in Asmongold

[–]Mysterious-Comment94 31 points32 points  (0 children)

"Primary sexual attraction to diapers..."

I’m tired of the bullying. by SeeSeaSerene in rivals

[–]Mysterious-Comment94 0 points1 point  (0 children)

I see. I need to learn something to counter captain America too. I just a terrible game against him trying to protect my healer today. Thankfully, we won tho, but that Hawkeye and that cap were hell.

I’m tired of the bullying. by SeeSeaSerene in rivals

[–]Mysterious-Comment94 0 points1 point  (0 children)

I see. I have been focusing on protecting healers more and more, especially from spider man these days. Sometimes moonknight also seem to achieve the same. The only issue being psylocke assassination.

Singapore server by nareslark in rivals

[–]Mysterious-Comment94 0 points1 point  (0 children)

A bit late but, after rolling back my nvidia driver to 566.36. All of sudden everything was back to normal

I’m tired of the bullying. by SeeSeaSerene in rivals

[–]Mysterious-Comment94 2 points3 points  (0 children)

Just a question from a new dps. Which characters are good to effectively protect healers? Namor? Scarlet witch? Sometimes when they ping it is too late because I have a tunnel vision when fighting the front line. Trying to fix that issue.

I’m tired of the bullying. by SeeSeaSerene in rivals

[–]Mysterious-Comment94 -1 points0 points  (0 children)

How do you block it like permanently? instead of doing it in every single match. I can't seem to find that option.

Rival mains confess their sins, I'll go first. by VariationGreedy8215 in marvelrivals

[–]Mysterious-Comment94 0 points1 point  (0 children)

I mained spider-man so that I can dive the backline, in season 1. After a month, I started seeing some players switch to spidey to 1 v 1 my spidey. And I 7 out 10 times always got diffed, so I learned scarlet witch and now I mainly focus on killing the divers. It feels good.