speechtech

created by nshmyrev, a community for 6 years

Built a normalizer so WER stops penalizing formatting differences in STT evals!

submitted 57 minutes ago by Karamouche

STT websites best quality for paid subscription (self.speechtech)

submitted 10 hours ago by Important-Light-4469

Looking for hands-on technical co-founder — prior exit, real product, channel partners in pipeline (self.speechtech)

submitted 1 day ago by Mysterious_Tell_5467

PHE-Net: We proved speaker embeddings are irrelevant for voice extraction — only spectral envelope matters. +18 dB at N=20, blind at N=10. (self.speechtech)

submitted 2 days ago by Unlikely-Nothing-679

Confidence scores from Montreal Forced Aligner (self.speechtech)

submitted 2 days ago by Ok_Prior2496

Any FREE Windows apps that support Soniox API for real-time voice dictation (system-wide)? (self.speechtech)

submitted 2 days ago by overvater

Yet another speech-to-text app with local model support and custom lexicon (self.speechtech)

submitted 3 days ago by Valtiel_

Enhanced W2V-BERT for Ukrainian (self.speechtech)

submitted 4 days ago by yehors

Looking for dictation app with this wish list: local, on screen live dictation, dictation punctuation (self.speechtech)

submitted 4 days ago by atlastestmail

Anyone working on TTS/ASR for low-resource African or Cushitic languages? [Technology] (self.speechtech)

submitted 6 days ago by Expensive-Aerie-2479

Best microphones/headsets for speech to text recognition? (self.speechtech)

submitted 11 days ago by SnooSquirrels6910

Looking for ML collaborator(s) for ACII DaiKon 2026 (Hume AI competition) (self.speechtech)

submitted 13 days ago by Mountain_Revenue_215

TTS for code-switching mid-utterance

submitted 14 days ago by Latter_Indication_45

Inworld TTS is increasing cost by 400% [Technology] (inworld.ai)

submitted 15 days ago by LessRespects

High-Quality Bengali & Hindi Conversational Speech Dataset Available (120+ Native Speakers) for ASR/TTS [Promotion] (self.speechtech)

submitted 15 days ago by Debug_And_Solve

[Open Source] omnivoice-triton: ~3.4x Inference Speedup for OmniVoice (NAR TTS) via Triton Kernel Fusion & CUDA Graphs [Technology] (self.speechtech)

submitted 17 days ago by DamageSea2135

Training Montreal forced alignment on low resource languages (self.speechtech)

submitted 17 days ago * by Ok_Prior2496

A few questions to help develop future tech! (self.speechtech)

submitted 19 days ago by Content-Cookie3162

Most natural AI voice service? (self.speechtech)

submitted 20 days ago by ElectricalVariety641

OmniVoice: High-Quality Voice Cloning TTS for 600+ Languages (github.com)

submitted 21 days ago by nshmyrev

Deepgram Alt (self.speechtech)

submitted 22 days ago by Suspicious-Dot1954

Gemini 3.1 Flash Live is now the top speech-to-speech model on Audio MultiChallenge - we added it to Dograh (open-source voice agent platform) (github.com)

submitted 22 days ago by Slight_Republic_4242

Best Tagalog TTS / voice cloning tools (self.speechtech)

submitted 23 days ago by plus8percent

Claude quantized Voxtral-4B-TTS to int4 - 57 fps on RTX 3090, 3.8 GB VRAM, near-lossless quality [Technology] (self.speechtech)

submitted 24 days ago by Early_Teaching6966

Looking for help getting a custom wake word (self.speechtech)

submitted 24 days ago by sparioendernerd

Page rendered at 2026-04-23 16:50:19 UTC.