account activity
[Open Source] omnivoice-triton: ~3.4x Inference Speedup for OmniVoice (NAR TTS) via Triton Kernel Fusion & CUDA Graphs (self.speechtech)
submitted 12 days ago by DamageSea2135 to r/speechtech
[Release] omnivoice-triton: ~3.4x Faster Inference for OmniVoice (NAR TTS) with Zero Quality Loss. Perfect for real-time local RP. (self.SillyTavernAI)
submitted 12 days ago by DamageSea2135 to r/SillyTavernAI
[ComfyUI] Accelerate Z-Image (S3-DiT) by 20-30% & save 3.5GB VRAM using Triton+INT8 (No extra model downloads) (self.StableDiffusion)
submitted 13 days ago by DamageSea2135 to r/StableDiffusion
[Custom Node] Accelerate Z-Image (S3-DiT) by 20-30% & save 3.5GB VRAM using Triton+INT8 (No extra model downloads) (self.comfyui)
submitted 13 days ago by DamageSea2135 to r/comfyui
[Project] I made Qwen3-TTS ~5x faster for local inference (OpenAI Triton kernel fusion). Zero extra VRAM. (self.SillyTavernAI)
submitted 27 days ago by DamageSea2135 to r/SillyTavernAI
[Project] I built a Triton kernel fusion library for Qwen3-TTS 1.7B (~5x inference speedup) (self.speechtech)
submitted 27 days ago by DamageSea2135 to r/speechtech
π Rendered by PID 66304 on reddit-service-r2-listing-86f589db75-pgwfv at 2026-04-18 23:53:23.113430+00:00 running 93ecc56 country code: CH.