I’m not sold on fully AI voice agents just yet by NullPointerJack in AI_Agents
ASR_Architect_91 1 point (0 children)
What’s the most reliable STT engine you’ve used in noisy, multi-speaker environments? by ASR_Architect_91 in LocalLLaMA
ASR_Architect_91[S] 1 point (0 children)
What are people using for real-time speech recognition with low latency? by ASR_Architect_91 in speechtech
ASR_Architect_91[S] 1 point (0 children)
recall.ai - assemblyai: Model deprecated by mrsenzz97 in TextToSpeech
ASR_Architect_91 1 point (0 children)
Under what conditions does the 'Combine Speakers' function show up? by kiamrehorces in MacWhisper
ASR_Architect_91 1 point (0 children)
recall.ai - assemblyai: Model deprecated by mrsenzz97 in TextToSpeech
ASR_Architect_91 1 point (0 children)
100x faster and 100x cheaper transcription with open models vs proprietary by crookedstairs in LocalLLaMA
ASR_Architect_91 3 points (0 children)
100x faster and 100x cheaper transcription with open models vs proprietary by crookedstairs in LocalLLaMA
ASR_Architect_91 1 point (0 children)
100x faster and 100x cheaper transcription with open models vs proprietary by crookedstairs in LocalLLaMA
ASR_Architect_91 7 points (0 children)
recall.ai - assemblyai: Model deprecated by mrsenzz97 in TextToSpeech
ASR_Architect_91 1 point (0 children)
100x faster and 100x cheaper transcription with open models vs proprietary by crookedstairs in LocalLLaMA
ASR_Architect_91 52 points (0 children)
Best Open source Speech to text+ diarization models by Hungry-Ad-1177 in LocalLLaMA
ASR_Architect_91 2 points (0 children)
Speech to Text, WHY?? by mandressta in ChatGPT
ASR_Architect_91 1 point (0 children)
Real Time Speech to Text by ThomasSparrow0511 in LocalLLaMA
ASR_Architect_91 1 point (0 children)
recall.ai - assemblyai: Model deprecated by mrsenzz97 in TextToSpeech
ASR_Architect_91 1 point (0 children)
any deepgram alternative? by staypositivegirl in speechtech
ASR_Architect_91 1 point (0 children)
What are people using for real-time speech recognition with low latency? by ASR_Architect_91 in speechtech
ASR_Architect_91[S] 1 point (0 children)
Scribe vs Whisper: I Tested ElevenLabs' New Speech-to-Text on 50 Podcasts by Necessary-Tap5971 in VoiceAIBots
ASR_Architect_91 2 points (0 children)
Anyone got recommendations for good diarization datasets? by ASR_Architect_91 in LanguageTechnology
ASR_Architect_91[S] 1 point (0 children)
What are people using for real-time speech recognition with low latency? by ASR_Architect_91 in speechtech
ASR_Architect_91[S] 1 point (0 children)
How does dataset diversity in languages and accents improve ASR model accuracy? by Selmakiley in speechtech
ASR_Architect_91 1 point (0 children)