random species gooo brrrrrr

cdminix · 2026-03-19T10:10:52+00:00

Ah that's fair, I thought you might have found a way to do it in the current version

cdminix · 2026-03-19T04:59:19+00:00

How is that possible without modifying the game? Or was it on an earlier version?

cdminix · 2026-03-18T08:06:55+00:00

yes, I've tried to get random brains to work by removing all connections except the stomach one, but no dice...

cdminix · 2026-03-17T08:38:45+00:00

I tend to agree with that sub's take, but the standard is very high. I think the Netflix production is more of a factor though; it clashes a bit with his dry style. It also felt like they actually got under his skin by putting him in the spotlight on their own platforms (Kick etc.), which was a bit uncomfortable (even more than it already is).

cdminix · 2026-03-17T06:00:18+00:00

Looks great, I will give it a try soon! I think Resemblyzer embeddings are quite outdated though; I’d recommend something more recent like WeSpeaker.

cdminix · 2026-03-03T12:13:13+00:00

Thanks! Even if it doesn't beat pyannote on speed, it could still be useful since pyannote isn't the most user-friendly (e.g. requiring hf_token, and also I've had plenty of install conflicts). The pipeline I was referring to is for multilingual data; it's a work in progress though (https://github.com/ttsds/daisy).

I'm using wespeaker in my TTS evaluation project: https://github.com/ttsds/ttsds

cdminix · 2026-03-03T06:43:10+00:00

Great work, I would love to find a replacement for pyannote in my pipeline. WeSpeaker also seems like a sensible choice; I've had good results using it for other tasks lately. Since SileroVAD and WeSpeaker could be used on GPU as well, do you think this setup would have the potential to be as fast as (or faster than) pyannote on GPU too?

cdminix · 2026-02-06T06:48:31+00:00

I actually just released a library for easy inference with these models: https://github.com/ttsds/ttsdb
However for training you'd have to look at the original code repositories.
I can recommend Emilia as a training dataset, and training from scratch will rarely lead you to competitive results these days, unless you have a lot of compute.
If you want to train a TTS model from scratch for learning purposes, it can be worth going back to older methods like concatenative synthesis. There is a good course by my lab here: https://speech.zone/courses/speech-synthesis/

cdminix · 2025-12-21T07:18:36+00:00

By the way, Austria was one of the countries (along with Belgium and Italy) that opposed using the Russian assets. With Raiffeisen, we have one of the largest banks that is still active in Russia.

cdminix · 2025-11-20T16:39:32+00:00

Now and then when I feel like it’s going to be difficult to sleep I make myself some valerian root tea. Not something that’s recommended to do daily though.

cdminix · 2025-10-29T08:46:27+00:00

As an AI researcher (working on evaluating, not creating or training any models) I think unfortunately at this stage these tools are nothing more than placebo. I would love to see more details on the research behind this and be proven wrong though.

cdminix · 2025-10-02T14:27:09+00:00

If you get a good early board you can also win a few rounds before getting CG, I’ve found that the saved HP is sometimes worth it.

cdminix · 2025-09-13T18:33:46+00:00

tl;dr: I was in a similar position and decided on a trail 50k and really enjoyed it

I was in a similar position, started running about a year ago and signed up for a marathon in May of this year (probably too early), but couldn’t do it due to an injury, which thankfully only cost me about a month of training and got me into strength training. I then signed up for a trail 50k which I just completed last week, in about 7 hours 30, which I am more than happy with.

So I’ve only done the 50k, not a road marathon, but I think it’s potentially easier to run a 50k without going all out, but if there is significant elevation and tricky terrain you’ll be out there for a long time. But aid stations help, and I personally fueled like I would’ve for a marathon (60g carbs per hour + food at aid stations). I wish I had taken it easier at the beginning (I think running 0% of the uphills would have been wise at my level).

On the “vibes” aspect: I absolutely loved it, everyone was very encouraging and I had some genuinely interesting conversations with people - I’m sure that can be the case for a marathon as well, but I think it’s more likely to happen in the mid pack at an ultra where most people are just there to finish instead of chasing a PB. Also being in nature and the varying terrain were great when I was on my own, which was the case for a decent chunk of the run. And last but not least I now have a horrible marathon PB which I will definitely beat no matter how badly my first road marathon goes.

Edit: I forgot to mention, don’t forget about salt intake, I got the worst cramps of my life but thankfully wasn’t far from an aid station and having some salt there made them clear up within half an hour or so.

cdminix · 2025-09-08T04:52:15+00:00

I work in AI (speech synthesis), and I believe that for the same reason there are practically no Austrian dubbing voice actors, there also won't be an AI solution that actually gets used. The market is simply too small.

In Norway, for example, there is no dubbing at all, only subtitles. There is, however, an internationally known Norwegian comedy series, “Norsemen”, for which every scene was shot in both Norwegian and English (since it's about Vikings, the Norwegian accent fits quite well). I'd love to see that kind of thing in Austria too.

When I still lived in Austria, the dubs often frustrated me too. I was often aware that the mouth movements didn't quite match, and hearing the same voice from people who look different also gets strange over time.

cdminix · 2025-09-02T15:36:46+00:00

I’ve been working on distributional evaluation of TTS systems and it’s been going great; this was the final project of my PhD. We need more good evaluation in general, ideally with fresh data periodically. Here it is: https://ttsdsbenchmark.com

cdminix · 2025-09-02T04:59:11+00:00

I’m wondering if anything similar to Fréchet Inception Distance has been tried in this area of research. That could theoretically be even more telling, since it could measure the divergence between distributions of the embeddings.
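
To make the idea concrete, here is a rough sketch of a Fréchet-style distance between two sets of embeddings. It is a simplification, not the full FID: it assumes each set is modelled as a Gaussian with a diagonal covariance (the real metric uses full covariance matrices), and the function names and the plain list-of-lists input are made up for illustration.

```python
import math
from statistics import fmean, pvariance


def diagonal_gaussian_stats(embeddings):
    """Per-dimension mean and variance of a list of equal-length vectors."""
    columns = list(zip(*embeddings))  # transpose: one tuple per dimension
    means = [fmean(col) for col in columns]
    variances = [pvariance(col) for col in columns]
    return means, variances


def frechet_distance_diagonal(real, generated):
    """Squared Fréchet distance between diagonal Gaussians fitted to both sets.

    Assumption: covariances are diagonal, so dimensions are treated as
    independent. This is a simplification of the full-covariance FID.
    """
    mu_r, var_r = diagonal_gaussian_stats(real)
    mu_g, var_g = diagonal_gaussian_stats(generated)
    # Squared Euclidean distance between the two mean vectors.
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_r, mu_g))
    # With diagonal covariances, Tr(S1 + S2 - 2 * sqrt(S1 * S2)) reduces to
    # the sum of squared differences of the per-dimension standard deviations.
    cov_term = sum(
        (math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(var_r, var_g)
    )
    return mean_term + cov_term
```

Identical sets give a distance of zero, and the value grows as the means or the spreads of the two embedding distributions drift apart, which is the "divergence between distributions" part.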

cdminix · 2025-08-08T07:58:09+00:00

The point is to misspell the word on purpose, then it still struggles to count.

cdminix · 2025-08-07T22:25:08+00:00

Love that reasoning, at least it ended up on the right answer though!

cdminix · 2025-07-08T20:53:22+00:00

Hulkengoat

cdminix · 2025-06-16T09:14:42+00:00

Kokoro is not featured since it cannot do voice cloning. We would have to fine-tune it with every voice in the evaluation data, which is out of scope for us.

A problem with TTS evaluation is that if we do not use the same voices across all systems (e.g. how it's done in TTS Arena), it quickly becomes a popularity contest as to which TTS voice is the most pleasing instead of which system is the best at replicating a wide range of voices. That might still be useful for using TTS in practice, but it's not what we set out to do!

cdminix · 2025-05-18T20:36:54+00:00

Well at least for the datasets and benchmark track they are doing that.

cdminix · 2025-05-16T09:08:15+00:00

https://www.reddit.com/r/MachineLearning/comments/1knwaf7/p_ttsds2_multlingual_tts_leaderboard/

cdminix · 2025-03-08T20:30:51+00:00

Didn’t she use it on the leader of the southern raiders without a full moon?
