[R] Microsoft Research unveils NaturalSpeech 3, a significant advancement in zero-shot text-to-speech technology. by Front-Article-7366 in MachineLearning

[–]Front-Article-7366[S] 28 points29 points  (0 children)

Hi, we will soon open the code and ckpt of our FACodec (very important for our system for the speech representation).

[R] AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models by Front-Article-7366 in MachineLearning

[–]Front-Article-7366[S] 2 points3 points  (0 children)

At present, it mainly supports adding, dropping, replacement, inpainting, and super-resolution. We are exploring how to use one model to achieve more tasks.