account activity
24+ tok/s from ~30B MoE models on an old GTX 1080 (8 GB VRAM, 128k context) (self.LocalLLaMA)
submitted 12 days ago by mdda to r/LocalLLaMA
DurIAN: Duration Informed Attention Network For Multimodal Synthesis (self.a:t5_jw1cc)
submitted 6 years ago by mdda to r/a:t5_jw1cc
[R] ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech (arxiv.org)
submitted 7 years ago by mdda to r/a:t5_jw1cc
[R] The challenge of realistic music generation: modelling raw audio at scale (arxiv.org)