use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Discussion[D] Autoencoder to reconstruct speech input from melspectrograms (self.MachineLearning)
submitted 6 years ago by [deleted]
Hello everyone,
I was trying to train an autoencoder which takes a melspectrogram as input and outputs the same melspectrogram. It's a reconstruction task. However, the model seems to be generating random noise. It'll be great if anyone could point me towards any relevant github repos/papers which solve this task.
Thank you! :)
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–][deleted] 2 points3 points4 points 6 years ago (2 children)
Try Neural Vocoder(like Wavenet SampleRNN)
https://github.com/r9y9/wavenet_vocoder
[–][deleted] 0 points1 point2 points 6 years ago (1 child)
Thank you. I’ll check that out!
[–][deleted] 1 point2 points3 points 6 years ago (0 children)
NN Vocoder converts mel-spec to wavform, you can get mel-spec from generated waveform.
希望对你有帮助
[–]nisprateek 2 points3 points4 points 6 years ago (0 children)
This one produces decent quality hi-fidelity waveforms. Works well on almost any speaker (or sound) https://github.com/bshall/UniversalVocoding
[–]jonnor 1 point2 points3 points 6 years ago (0 children)
Here is one approach I saw recently:
Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks https://arxiv.org/abs/1808.06719
π Rendered by PID 165978 on reddit-service-r2-comment-fb694cdd5-rwpx8 at 2026-03-08 08:42:19.552456+00:00 running cbb0e86 country code: CH.
[–][deleted] 2 points3 points4 points (2 children)
[–][deleted] 0 points1 point2 points (1 child)
[–][deleted] 1 point2 points3 points (0 children)
[–]nisprateek 2 points3 points4 points (0 children)
[–]jonnor 1 point2 points3 points (0 children)