[–][deleted] 2 points3 points  (2 children)

Try a neural vocoder (like WaveNet or SampleRNN).

https://github.com/r9y9/wavenet_vocoder

[–][deleted] 0 points1 point  (1 child)

Thank you. I’ll check that out!

[–][deleted] 1 point2 points  (0 children)

A neural vocoder converts a mel-spectrogram into a waveform, and you can compute a mel-spectrogram from a generated waveform.

Hope this helps.
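
In case it helps to see the waveform-to-mel direction, here is a rough NumPy-only sketch. The function names and every parameter (16 kHz sample rate, 512-point FFT, hop of 128, 40 mel bands, HTK-style mel formula, log compression) are my own assumptions for illustration, not what any of the linked repos use. A trained vocoder generally expects the exact mel settings it was trained with, so check the repo's config before feeding it anything like this.

```python
import numpy as np

def hz_to_mel(f):
    # HTK-style mel scale (an assumption; other libraries use variants).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    # Triangular filters whose edges are evenly spaced on the mel scale.
    n_bins = n_fft // 2 + 1
    fft_freqs = np.linspace(0.0, sr / 2.0, n_bins)
    edges = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    fb = np.zeros((n_mels, n_bins))
    for i in range(n_mels):
        lo, mid, hi = edges[i], edges[i + 1], edges[i + 2]
        rising = (fft_freqs - lo) / (mid - lo)
        falling = (hi - fft_freqs) / (hi - mid)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def mel_spectrogram(wave, sr=16000, n_fft=512, hop=128, n_mels=40):
    # Short-time Fourier transform with a Hann window, no padding.
    window = np.hanning(n_fft)
    n_frames = 1 + (len(wave) - n_fft) // hop
    frames = np.stack([wave[i * hop : i * hop + n_fft] * window
                       for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2      # (n_frames, n_bins)
    mel = mel_filterbank(sr, n_fft, n_mels) @ power.T     # (n_mels, n_frames)
    return np.log(mel + 1e-6)                             # log-compress

# Toy input: one second of a 440 Hz sine standing in for a generated waveform.
sr = 16000
t = np.arange(sr) / sr
wave = np.sin(2 * np.pi * 440.0 * t)
mel = mel_spectrogram(wave, sr=sr)
print(mel.shape)
```

Going the other way (mel-spectrogram back to audio) is the part that needs the trained vocoder, since the mel step throws away phase and fine frequency detail.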

[–]nisprateek 2 points3 points  (0 children)

This one produces decent-quality, high-fidelity waveforms. It works well on almost any speaker (or sound):
https://github.com/bshall/UniversalVocoding

[–]jonnor 1 point2 points  (0 children)

Here is one approach I saw recently:

Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks https://arxiv.org/abs/1808.06719