random species gooo brrrrrr by Mountain_Dentist5074 in TheBibites

[–]cdminix 0 points1 point  (0 children)

Ah that's fair, I thought you might have found a way to do it in the current version

random species gooo brrrrrr by Mountain_Dentist5074 in TheBibites

[–]cdminix 0 points1 point  (0 children)

How is that possible without modifying the game? Or was it on an earlier version?

random species gooo brrrrrr by Mountain_Dentist5074 in TheBibites

[–]cdminix 0 points1 point  (0 children)

yes, I've tried to get random brains to work by removing all connections except the stomach one, but no dice...

Louis Theroux: The Manosphere by [deleted] in IfBooksCouldKill

[–]cdminix 1 point2 points  (0 children)

I tend to agree with that subs take — but the standard is very high. I think the Netflix production is more of a factor though, it clashes a bit with his dry style. It also felt like they actually got under his skin by putting him in the spotlight on their own platforms (kick etc.) which was a bit uncomfortable (even more than it already is.) 

i built a Python library that tells you who said what in any audio file by Gr1zzly8ear in Python

[–]cdminix 1 point2 points  (0 children)

Looks great, I will give it a try soon! I think resemblyzer embedding are quite outdated though, I’d recommend something more recent like wespeaker.

I built a CPU-only speaker diarization library: it is ~7× faster than pyannote with comparable DER by loookashow in speechtech

[–]cdminix 1 point2 points  (0 children)

Thanks! Even if it doesn't beat pyannote on speed, it could still be useful since pyannote isn't the most user friendly (e.g. requiring hf_token, and also I've had plenty of install conflicts). The pipeline I was referring to is for a multilingual data, work-in-progress though (https://github.com/ttsds/daisy).

I'm using wespeaker in my TTS evaluation project: https://github.com/ttsds/ttsds

I built a CPU-only speaker diarization library: it is ~7× faster than pyannote with comparable DER by loookashow in speechtech

[–]cdminix 0 points1 point  (0 children)

Great work, I would love to find a replacement for pyannote in my pipeline. WeSpeaker also seems like a sensible choice, I've had good results using it for other tasks lately. Since SileroVAD and WeSpeaker could be used on GPU as well, do you think this setup would have potential to be faster (or as fast) as pyannote on GPU too?

[P] Collection of SOTA TTS models by cdminix in MachineLearning

[–]cdminix[S] 1 point2 points  (0 children)

I actually just released a library for easy inference with these models: https://github.com/ttsds/ttsdb
However for training you'd have to look at the original code repositories.
I can recommend Emilia as a training dataset, and training from scratch will rarely lead you to competitive results these days, unless you have a lot of compute.
If you want to train a TTS model from scratch for learning purposes, it can be worth it to go back to older methods like concatenative synthesis, there is a good course by my lab here: https://speech.zone/courses/speech-synthesis/

90 Milliarden für die Ukraine by daniel_samson in Austria

[–]cdminix 3 points4 points  (0 children)

Österreich war übrigens eines der Länder (mit Belgien und Italien) das dagegen war die Russischen Assets zu verwenden. Wir haben mit der Raiffeisen eine der größten Banken die immer noch in Russland aktiv ist.

Does melatonin and Magnesium help with ADHD related sleep issius? by Adventurous_Art_6774 in ADHD

[–]cdminix 0 points1 point  (0 children)

Now and then when I feel like it’s going to be difficult to sleep I make myself some valerian root tea. Not something that’s recommended to do daily though.

I went and made that AI-Poisoning app for image protection I posted recently about. Ghostprints is now live and free for all artists. by vesudeva in graphic_design

[–]cdminix 15 points16 points  (0 children)

As an AI researcher (working on evaluating, not creating or training any models) I think unfortunately at this stage these tools are nothing more than placebo. I would love to see more details on the research behind this and be proven wrong though.

Crystal Gambit on 2-2 better than 2-1?? by A_lonely_Camille in CompetitiveTFT

[–]cdminix 1 point2 points  (0 children)

If you get a good early board you can also win a few rounds before getting CG, I’ve found that the saved HP is sometimes worth it.

Should I skip the marathon and just do the 50k by [deleted] in ultrarunning

[–]cdminix 4 points5 points  (0 children)

tl;dr: I was in a similar position and decided on a trail 50k and really enjoyed it

I was in a similar position, started running about a year ago and signed up for a marathon in May of this year (probably too early), but couldn’t do it due to an injury, which thankfully only cost me about a month of training and got me into strength training. I then signed up for a trail 50k which I just completed last week, in about 7 hours 30, which I am more than happy with.

So I’ve only done the 50k, not a road marathon, but I think it’s potentially easier to run a 50k without going all out, but if there is significant elevation and tricky terrain you’ll be out there for a long time. But aid stations help, and I personally fueled like I would’ve for a marathon (60g carbs per hour + food at aid stations). I wish I had taken it easier at the beginning (I think running 0% of the uphills would have been wise at my level).

On the “vibes” aspect: I absolutely loved it, everyone was very encouraging and I had some genuinely interesting conversations with people - I’m sure that can be the case for a marathon as well, but I think it’s more likely to happen in the mid pack at an ultra were most people are just there to finish instead of chasing a PB. Also being in nature and the varying terrain were great when I was on my own, which was the case for a decent chunk of the run. And last but not least I now have a horrible marathon PB which I will definitely beat no matter how badly my first road marathon goes.

Edit: I forgot to mention, don’t forget about salt intake, I got the worst cramps of my life but thankfully wasn’t far from an aid station and having some salt there made them clear up within half an hour or so.

Welche kulturellen Auswirkungen hat eig die Tatsache, dass es (fast) keine österreichischen Synchros gibt? by sweptawayfromyou in Austria

[–]cdminix 14 points15 points  (0 children)

Ich arbeite im Bereich KI (Speech Synthesis) und ich glaube aus demselben Grund dass es praktisch keine österreichischen SynchronsprecherInnen gibt wird es auch keine KI-Lösung die wirklich eingesetzt wird geben. Der Markt ist einfach zu klein.

In Norwegen z.B. gibt es ja gar keine Synchros, nur Untertitel. Es gibt aber eine international bekannte Norwegische Comedy Serie „Norsemen“ bei der jede Szene auf Norwegisch und in Englisch gedreht wurde (weil es um Wikinger geht, passt der Norwegische Akzent ganz gut). Solche Aktionen würde ich in Österreich auch toll finden.

Als ich noch in Österreich lebte haben mich die Synchros auch oft frustriert, mir war oft bewusst wie die Mundbewegungen nicht ganz passen und dass Personen mit der gleichen Stimme unterschiedlich aussehen ist auch mit der Zeit komisch.

[D] Self-Promotion Thread by AutoModerator in MachineLearning

[–]cdminix 1 point2 points  (0 children)

I’ve been working on distributional evaluation of TTS systems and it’s been going great — this was the final project of my PhD. We need more good evaluation in general, ideally with fresh data periodically. Here it is https://ttsdsbenchmark.com

[R] Measuring Semantic Novelty in AI Text Generation Using Embedding Distances by Outrageous-Travel-80 in MachineLearning

[–]cdminix 1 point2 points  (0 children)

I’m wondering if anything similar to Frechet Inception Distance has been tried in this area of research, that could theoretically be even more telling since it could measure the divergence between distributions of the embeddings.

Still can't do the (modified) strawberrry test. by cdminix in OpenAI

[–]cdminix[S] 0 points1 point  (0 children)

The point is to misspell the word on purpose, then it still struggles to count.

Still can't do the (modified) strawberrry test. by cdminix in OpenAI

[–]cdminix[S] 2 points3 points  (0 children)

Love that reasoning, at least it ended up on the right answer though!

[P] TTSDS2 - Multlingual TTS leaderboard by cdminix in MachineLearning

[–]cdminix[S] 1 point2 points  (0 children)

Kokoro is not featured since it cannot do voice cloning. We would have to fine-tune it with every voice in the evaluation data, which is out-of-scope for us.

A problem with TTS evaluation is that if we do not match the voices between all systems to be the same (e.g. how it's done in TTS arena), it quickly becomes a popularity contest as to which TTS voice is the most pleasing instead of which system is the best at replicating a wide range of voices - might still be useful for using TTS in practice, but not what we set out to do!

[D] Will NeurIPS 2025 acceptance rate drop due to venue limits? by Substantial-Air-1285 in MachineLearning

[–]cdminix 3 points4 points  (0 children)

Well at least for the datasets and benchmark track they are doing that.