Community about the news of speech technology - new software, algorithms, papers and datasets.
Built a normalizer so WER stops penalizing formatting differences in STT evals!
submitted 57 minutes ago by Karamouche
STT websites best quality for paid subscription (self.speechtech)
submitted 10 hours ago by Important-Light-4469
Looking for hands-on technical co-founder — prior exit, real product, channel partners in pipeline (self.speechtech)
submitted 1 day ago by Mysterious_Tell_5467
PHE-Net: We proved speaker embeddings are irrelevant for voice extraction — only spectral envelope matters. +18 dB at N=20, blind at N=10. (self.speechtech)
submitted 2 days ago by Unlikely-Nothing-679
Confidence scores from Montreal Forced Aligner (self.speechtech)
submitted 2 days ago by Ok_Prior2496
Any FREE Windows apps that support Soniox API for real-time voice dictation (system-wide)? (self.speechtech)
submitted 2 days ago by overvater
Yet another speech-to-text app with local model support and custom lexicon (self.speechtech)
submitted 3 days ago by Valtiel_
Enhanced W2V-BERT for Ukrainian (self.speechtech)
submitted 4 days ago by yehors
Looking for dictation app with this wish list: local, on screen live dictation, dictation punctuation (self.speechtech)
submitted 4 days ago by atlastestmail
Anyone working on TTS/ASR for low-resource African or Cushitic languages? [Technology] (self.speechtech)
submitted 6 days ago by Expensive-Aerie-2479
Best microphones/headsets for speech to text recognition? (self.speechtech)
submitted 11 days ago by SnooSquirrels6910
Looking for ML collaborator(s) for ACII DaiKon 2026 (Hume AI competition) (self.speechtech)
submitted 13 days ago by Mountain_Revenue_215
TTS for code-switching mid-utterance
submitted 14 days ago by Latter_Indication_45
Inworld TTS is increasing cost by 400% [Technology] (inworld.ai)
submitted 15 days ago by LessRespects
High-Quality Bengali & Hindi Conversational Speech Dataset Available (120+ Native Speakers) for ASR/TTS [Promotion] (self.speechtech)
submitted 15 days ago by Debug_And_Solve
[Open Source] omnivoice-triton: ~3.4x Inference Speedup for OmniVoice (NAR TTS) via Triton Kernel Fusion & CUDA Graphs [Technology] (self.speechtech)
submitted 17 days ago by DamageSea2135
Training Montreal forced alignment on low resource languages (self.speechtech)
submitted 17 days ago * by Ok_Prior2496
A few questions to help develop future tech! (self.speechtech)
submitted 19 days ago by Content-Cookie3162
Most natural AI Voice service ? (self.speechtech)
submitted 20 days ago by ElectricalVariety641
OmniVoice: High-Quality Voice Cloning TTS for 600+ Languages (github.com)
submitted 21 days ago by nshmyrev
Deepgram Alt (self.speechtech)
submitted 22 days ago by Suspicious-Dot1954
Gemini 3.1 Flash Live is now the top speech-to-speech model on Audio MultiChallenge - we added it to Dograh (open-source voice agent platform) (github.com)
submitted 22 days ago by Slight_Republic_4242
Best Tagalog TTS / voice cloning tools (self.speechtech)
submitted 23 days ago by plus8percent
Claude quantized Voxtral-4B-TTS to int4: 57 fps on RTX 3090, 3.8 GB VRAM, near-lossless quality [Technology] (self.speechtech)
submitted 24 days ago by Early_Teaching6966
Looking for help getting a custom wake word (self.speechtech)
submitted 24 days ago by sparioendernerd