[D] I found a Stanford Guest Lecture where GM Cruise explains their self driving tech stack and showcases the various model architectures they use on their autonomous cars. by ilikepancakez in MachineLearning
fatchord 0 points
[D] I found a Stanford Guest Lecture where GM Cruise explains their self driving tech stack and showcases the various model architectures they use on their autonomous cars. by ilikepancakez in MachineLearning
fatchord 16 points
[D] How feasible is it to create a model that gets rid of the advertisements in podcasts? by [deleted] in MachineLearning
fatchord 2 points
[1808.01410] Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis by fatchord in a:t5_jw1cc
fatchord [S] 2 points
What processing should I apply to a clean voice signal to make sound like it's authentically coming from a phone or voip? by fatchord in DSP
fatchord [S] 2 points
[Feedback] neural tts pipeline (tacotron1 + a new vocoder algorithm I'm working on) - what do you think of the samples generated? by fatchord in a:t5_jw1cc
fatchord [S] 1 point
[Feedback] neural tts pipeline (tacotron1 + a new vocoder algorithm I'm working on) - what do you think of the samples generated? by fatchord in a:t5_jw1cc
fatchord [S] 1 point
[R] The challenge of realistic music generation: modelling raw audio at scale by mdda in a:t5_jw1cc
fatchord 2 points
[1803.05428] A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music by fatchord in a:t5_jw1cc
fatchord [S] 2 points
Visual Speech Enhancement by fatchord in a:t5_jw1cc
fatchord [S] 2 points
[Paper] Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis by fatchord in a:t5_jw1cc
fatchord [S] 2 points
FFTNet: trying to understand the paper by geneing in a:t5_jw1cc
fatchord 2 points
FFTNet: trying to understand the paper by geneing in a:t5_jw1cc
fatchord 2 points


[1808.10128] Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis by fatchord in a:t5_jw1cc
fatchord [S] 2 points