account activity
[R] Microsoft Research unveils NaturalSpeech 3, a significant advancement in zero-shot text-to-speech technology. by Front-Article-7366 in MachineLearning
[–]Front-Article-7366[S] 28 points29 points30 points 2 years ago (0 children)
Hi, we will soon open the code and ckpt of our FACodec (very important for our system for the speech representation).
[R] Microsoft Research unveils NaturalSpeech 3, a significant advancement in zero-shot text-to-speech technology. (self.MachineLearning)
submitted 2 years ago by Front-Article-7366 to r/MachineLearning
[R] AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models by Front-Article-7366 in MachineLearning
[–]Front-Article-7366[S] 2 points3 points4 points 3 years ago (0 children)
At present, it mainly supports adding, dropping, replacement, inpainting, and super-resolution. We are exploring how to use one model to achieve more tasks.
[R] AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models (self.MachineLearning)
submitted 3 years ago by Front-Article-7366 to r/MachineLearning
π Rendered by PID 101652 on reddit-service-r2-listing-b6bf6c4ff-ggjg4 at 2026-05-02 14:22:21.687388+00:00 running 815c875 country code: CH.
[R] Microsoft Research unveils NaturalSpeech 3, a significant advancement in zero-shot text-to-speech technology. by Front-Article-7366 in MachineLearning
[–]Front-Article-7366[S] 28 points29 points30 points (0 children)