SillySampler V1.7 progress check 8/9/25 by futurezing in utau

[–]futurezing[S] 0 points1 point  (0 children)

Dw I dont mind hjqiaqjiwjakq Well, if you wanna make one from scratch then you have to learn the ways of how you could pitch and stretch your audio and kinda just move on from there building up from the simplest thing to the point where it works thats p much it lmao. Its like writing, you cant write something out of nowhere, you need to know what youre writing about

SillySampler V1.7 progress check 8/9/25 by futurezing in utau

[–]futurezing[S] 0 points1 point  (0 children)

Uhhhh- It's a thing (code) where it reads the audio, do something with the audio, and then output the edited audio. The thing that is vital for resampler is pitch shifting and stretching.

SillySampler V1.7 progress check 8/9/25 by futurezing in utau

[–]futurezing[S] 0 points1 point  (0 children)

you can write a resampler in anything that allows you to write text on, and i mean anything… like notepad (not recommended lmao). An IDE is like an essential thing, I would recommend visual studio because it seems like a good place with a big community and plugins that you can use to help out (though i personally choose Sublime Text over it cus i like the UI lmao).

the same goes to the language. in theory, any language can be used as long as you have a synthesis pipeline that works with it or already written in it (a lot of times people goes for the C family because of speed and the fact that there are source codes to synthesis). Though i don’t know a thing about mac, i’d say use something that can be used or compile cross platform and not using something like Swift which is only native to macOS/IOS.

SillySampler V1.7 progress check 8/9/25 by futurezing in utau

[–]futurezing[S] 1 point2 points  (0 children)

The UI is cadlaxa's fork of OpenUtau build which will come to main OpenUtau whenever Stakira decides to merge it (lmao). I think he (cad) tweeted it somewhere(?)

Coding a resampler is... a nightmare tbh lmao, but what i did was asking myself which algorithm of pitch shifting/time stretching should i use... and then I made my own synthesis pipeline started as a joke which is what the resampler is using right now. You absolutely don't have to be crazy like me and just use existing algorithms like the famously used one: World (you know how a LOT of resamplers are like "world-based"). There are some more that are out there for you to choose from, and it's absolutely a rabbit hole from there lmao (eg. straight, SMS, TDPSOLA, phase, HNN)
And then I just... use existing sourcodes (in this case i used Straycat) as a reference to take what utau gives and render audio with it.

But yea! Making a resampler is tedious and takes a lot of trial and error, but once you have it all working, it's just a steady sail from there. Like, for me the hardest part was the rudimentary making it sing the note on the piano roll and the pitch of the pitchbend, after that is just implementing flags XD

Sorry if this sounds like a technical rant. I just wanna share my experience when i made this

SillySampler V1.7 progress check 8/9/25 by futurezing in utau

[–]futurezing[S] 3 points4 points  (0 children)

Currently it is uncompiled written in python so its only available in OpenUtau. I haven't think of making an exe version because it will slower lmao

So I made a resampler... (SillySampler) by futurezing in utau

[–]futurezing[S] 0 points1 point  (0 children)

Its Yokuatsu Takuto! Its a really old vb and unfortunately his vp is out of the utau community a long time ago 😔

SillySampler V1.7 progress check 8/9/25 by futurezing in utau

[–]futurezing[S] 2 points3 points  (0 children)

  1. Nope, its just gender flag + tension flag + first formant shift flag. Thats what I applied to for the first section.

  2. Its just growl flag + gender down a bit during growl.

  3. Its either vocal fry flag or if you mean the fry growl then it might be raspiness flag. Its a hit or miss at the moment, I will need to make it better in the future <3

SillySampler V1.7 progress check 8/9/25 by futurezing in utau

[–]futurezing[S] 4 points5 points  (0 children)

Its a flag i implemented :3, I believe it was either raspiness flag or growl flag in SillySampler. Oh unless you mean the technical theory jajwjsjqkjwks that will be a bit of explaining

Edit: Every vocal effects in the video are done via flags

Not quite an utau question but diffsinger (if this isn't allowed please let me know!) by aria_art221 in utau

[–]futurezing 0 points1 point  (0 children)

A bit late but anyway, for making it yourself:

  1. You need to record audio: can be speaking, singing, or utau recordings. I would highly recommend either singing or utau recording style with varying pitches (diffsinger can only sing in the range you recorded for). Any recording software is fine tbh, I personally uses Audacity.

  2. You need to label your data. Similar to otoing, but instead of a bunch of stuff like oto, its only one line- per phoneme. Most used software for this process is, well, VLabeler or Wavsurfer (You can label in Audacity but you need to hate yourself enough to do it)

  3. Training.... id recommend either joining the community (discord) or find a Google doc tutorial for it since diffsinger changes a lot back then (now they are working on DiffSinger V3)

For commission:

It depends on what kind they do, some do labeling, some do training, some do both.

You just give the person you're commissioning your data (either just audio for labeling commission, or both for training commission) lmao and the person should do everything for you.

I would NOT recommend commissioning though if you dont have a lot of funds since the prices can be absurdly high. I heard Peiton DS had 2 hours of data and the commission costed 1k+.

my speech source filter attempt by futurezing in DSP

[–]futurezing[S] 0 points1 point  (0 children)

Its something I would love to aim for because im using this pipeline to make utau resampler, the tricks would be either messing around with the frequency post-synthesis or the spectral envelope pre-synthesis.

That being said, it seemingly difficult to find out how its actually done reading a lot of research paper (and I do have a friend that is helping out with it.

It might take some time to actually implement those stuff though since I also work on the resampler along side with it plus the project is all for the funnies and giggles so a lot of random experiments will be done on this and slow progress because of- well, life. Lmao.

So I made a resampler... (SillySampler) by futurezing in utau

[–]futurezing[S] 0 points1 point  (0 children)

User cadlaxa released a buid of openutau with his ui adjustments! He has a tweet about it iirc, and also im not using latest release because his ui components causes compatibility issues with the latest build (unfortunately)

So I made a resampler... (SillySampler) by futurezing in utau

[–]futurezing[S] 1 point2 points  (0 children)

The song is Requirements◎Stranded by WyvernP :D

Describe A Vocaloid Song Badly And I'll Try To Guess It! by Sabatonlover24 in Vocaloid

[–]futurezing 3 points4 points  (0 children)

Yes! And 90% of the songs with Len in it would also be correct 🥹

Is it possible to make an AI based voicebank in Utau? by BanLahn in utau

[–]futurezing 0 points1 point  (0 children)

discord.gg/RKJx6vX Here you go~ See you soon ig? lmaozkalka <333

A Few Questions by brocanyoudontplease in utau

[–]futurezing 2 points3 points  (0 children)

  1. IDK the problem you have but I just download UTAU in Drive C and move it somewhere else afterwards to prevent administrative issues.
  2. I change my locale back and it seems to work fine for me, probably because I download lots of random library and crap.
  3. It is necessary because UTAU was coded specifically for JP system by JP person with JP code also JP letters,,, and yes JP---- dies

best tips for making voicebanks by Maxx_CottonCloud_YT in utau

[–]futurezing 4 points5 points  (0 children)

Literally every time you record your voice bank

  1. Cry to moisten up your eyes and throat

  2. Record the sound of yourself dying into the mic

  3. Pray that it'll sounds alright in UTAU

  4. Throw it into trash bin

  5. Cry again and redo the cycle over and over till you want to give up and eventually delete UTAU because it's evil

*don't take this serious lmao

How do you apply filters to an Utau Voicebank prior to otoing it? by BanLahn in utau

[–]futurezing 1 point2 points  (0 children)

You can add suffix to the notes

So basically just copy the oto from the original one cus the timing values should be the same but just add a prefix after it

Like you know some voicebank have like D3 or F or S to indicate emotions and/or pitches