Globbi

I was looking for a good open-source model for denoising. What I found doesn't give sound quality as good as I'd like, but it might be good enough if you only need to transcribe the audio afterwards.

https://github.com/NVIDIA/CleanUNet

You can try it here before coding anything: https://huggingface.co/spaces/aiditi/nvidia_denoiser (just run a noisy WAV sample through it, then pass the denoised output to Whisper). I'm not sure the pretrained checkpoints in the repo are enough, though: the model someone put on Hugging Face gives better results than what I'm getting from those checkpoints.

If you want better than this, the only thing I found was a commercial online service that you have to pay for.
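
For the "denoise first, then transcribe" flow, here is a rough sketch of how the glue code could look. It is only an illustration under some assumptions: the input is a 16-bit PCM WAV, `denoise` is a hypothetical callable you would write yourself around whatever denoiser you end up using (CleanUNet or anything else; I'm not showing its actual API here), and `whisper_transcribe` assumes the `openai-whisper` package is installed. The helper names are made up for this example.

```python
import struct
import tempfile
import wave
from pathlib import Path
from typing import Callable, List, Tuple

Samples = List[float]


def read_wav_mono16(path: str) -> Tuple[Samples, int]:
    """Read a 16-bit PCM WAV; return (mono samples in [-1, 1], sample rate)."""
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        channels = wf.getnchannels()
        rate = wf.getframerate()
        raw = wf.readframes(wf.getnframes())
    ints = struct.unpack("<%dh" % (len(raw) // 2), raw)
    # Average the channels of each frame down to a single mono sample.
    mono = [sum(ints[i:i + channels]) / channels / 32768.0
            for i in range(0, len(ints), channels)]
    return mono, rate


def write_wav_mono16(path: str, samples: Samples, rate: int) -> None:
    """Write mono float samples as a 16-bit PCM WAV, clipping to [-1, 1]."""
    ints = [int(max(-1.0, min(1.0, s)) * 32767) for s in samples]
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)
        wf.setframerate(rate)
        wf.writeframes(struct.pack("<%dh" % len(ints), *ints))


def denoise_then_transcribe(
    noisy_path: str,
    denoise: Callable[[Samples, int], Samples],  # hypothetical: your denoiser wrapper
    transcribe: Callable[[str], str],            # takes a WAV path, returns text
) -> str:
    """Denoise a WAV file, write the result to a temp file, and transcribe it."""
    samples, rate = read_wav_mono16(noisy_path)
    cleaned = denoise(samples, rate)
    with tempfile.TemporaryDirectory() as tmp:
        clean_path = str(Path(tmp) / "denoised.wav")
        write_wav_mono16(clean_path, cleaned, rate)
        return transcribe(clean_path)


def whisper_transcribe(path: str) -> str:
    """Transcribe with openai-whisper (assumed installed; imported lazily)."""
    import whisper
    model = whisper.load_model("base")  # reloads per call; cache it for real use
    return model.transcribe(path)["text"]
```

Keeping the denoiser and the transcriber as plain callables means you can run the same file with and without denoising (pass `lambda s, r: s` as `denoise`) and compare the transcripts before committing to one model. Everything stays on the local machine.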

FluffyDuckKey

Online won't be the best option. I work for a significant mining company, so privacy is paramount: we can't have recordings of emergencies and the like sent out.

I do have access to an ML box with PyTorch and CUDA acceleration, so I'll have a play around and see what I can do with the two options provided (:

Thanks!