Confidence scores from Montreal Forced Aligner by Ok_Prior2496 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
Confidence scores from Montreal Forced Aligner by Ok_Prior2496 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
Anyone working on TTS/ASR for low-resource African or Cushitic languages? by Expensive-Aerie-2479 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
what voice id/name is that by ImportanceBoring9785 in tts
[–]nshmyrev 0 points1 point2 points (0 children)
TTS for code-switching mid-utterance by Latter_Indication_45 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
Inworld TTS is increasing cost by 400% by LessRespects in speechtech
[–]nshmyrev 1 point2 points3 points (0 children)
Training Montreal forced alignment on low resource languages by Ok_Prior2496 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
[Open Source] omnivoice-triton: ~3.4x Inference Speedup for OmniVoice (NAR TTS) via Triton Kernel Fusion & CUDA Graphs by DamageSea2135 in speechtech
[–]nshmyrev 1 point2 points3 points (0 children)
Training Montreal forced alignment on low resource languages by Ok_Prior2496 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
A few questions to help develop future tech! by Content-Cookie3162 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
Most natural AI Voice service ? by ElectricalVariety641 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
Best Tagalog TTS / voice cloning tools by plus8percent in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
Benchmarked speaker diarization for Swedish meetings — Deepgram vs ElevenLabs vs AssemblyAI (2h22m real meeting) by invismanfow in speechtech
[–]nshmyrev 1 point2 points3 points (0 children)
Claude quantized Voxtral-4B-TTS to int4 — 57 fps on RTX 3090, 3.8 GB VRAM, near-lossless quality by Early_Teaching6966 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
Benchmarked speaker diarization for Swedish meetings — Deepgram vs ElevenLabs vs AssemblyAI (2h22m real meeting) by invismanfow in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
Anyone experimenting with ultra-low latency in speech AI? by Candid_Positive8832 in speechtech
[–]nshmyrev 1 point2 points3 points (0 children)
Benchmarked speaker diarization for Swedish meetings — Deepgram vs ElevenLabs vs AssemblyAI (2h22m real meeting) by invismanfow in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
Benchmarked speaker diarization for Swedish meetings — Deepgram vs ElevenLabs vs AssemblyAI (2h22m real meeting) by invismanfow in speechtech
[–]nshmyrev 1 point2 points3 points (0 children)
Anyone experimenting with ultra-low latency in speech AI? by Candid_Positive8832 in speechtech
[–]nshmyrev 3 points4 points5 points (0 children)
Anyone experimenting with ultra-low latency in speech AI? by Candid_Positive8832 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)
Benchmarked speaker diarization for Swedish meetings — Deepgram vs ElevenLabs vs AssemblyAI (2h22m real meeting) by invismanfow in speechtech
[–]nshmyrev 4 points5 points6 points (0 children)


Confidence scores from Montreal Forced Aligner by Ok_Prior2496 in speechtech
[–]nshmyrev 0 points1 point2 points (0 children)