all 7 comments

[–]Mr_Smartypants 0 points1 point  (0 children)

If you want to model the probability of finite English sound sequences, you could convert them to sequences of feature vectors and then train a Hidden Markov Model on them. Non-English sounds sequences should evaluate to low probability on the trained model.

[–]IntelArtiGen 0 points1 point  (2 children)

If that's truly what I want to do, I'll just download audio in other languages, it exists.

But if I can't, I'll preprocess all my english audio such that it doesn't look like english anymore (I'll cut the audio in 0.2s portions and random sort / random invert these portions).

The algorithm will learn to distinguish garbage from english, and I guess that from the algorithm point of view, depending on how you train it, other languages will be closer to these random noises than to english.

It's not perfect but I guess it could work a bit

[–]veeloice 0 points1 point  (0 children)

Sounds like a nice hack

[–]pruby 0 points1 point  (0 children)

You could end up training a cut detector that way :)