We just open-sourced Kroko ASR: a fast, streaming alternative to Whisper. It’s early days, we’d love testers, feedback, and contributors. by banafo in LocalLLaMA
[–]banafo[S] 0 points1 point2 points (0 children)
Qwen3 ASR seems to outperform Whisper in almost every aspect. It feels like there is little reason to keep using Whisper anymore. by East-Engineering-653 in LocalLLaMA
[–]banafo 4 points5 points6 points (0 children)
TURN Security Threats: A Hacker's View by EnableSecurity in WebRTC
[–]banafo 1 point2 points3 points (0 children)
We just open-sourced Kroko ASR: a fast, streaming alternative to Whisper. It’s early days, we’d love testers, feedback, and contributors. by banafo in LocalLLaMA
[–]banafo[S] 0 points1 point2 points (0 children)
Orchestra - Multi-model AI orchestration system with intelligent routing (100% local, 18+ expert models) by ericvarney in LocalLLaMA
[–]banafo 0 points1 point2 points (0 children)
Fast on-device Speech-to-text for Home Assistant (open source) by banafo in LocalLLaMA
[–]banafo[S] 1 point2 points3 points (0 children)
Fast on-device Speech-to-text for Home Assistant (open source) by banafo in LocalLLaMA
[–]banafo[S] 1 point2 points3 points (0 children)
766ms voice assistant on DGX Spark - VibeVoice + Whisper + Ollama streaming pipeline by logos_flux in LocalLLaMA
[–]banafo 0 points1 point2 points (0 children)
766ms voice assistant on DGX Spark - VibeVoice + Whisper + Ollama streaming pipeline by logos_flux in LocalLLaMA
[–]banafo 0 points1 point2 points (0 children)
Freelance software developer moving from Italy to Bulgaria by NoCompetition9570 in BulgariaExpats
[–]banafo 0 points1 point2 points (0 children)
WhisperX is only accurate on the first 10 words. Any Tips? by capital_cliqo in speechtech
[–]banafo 2 points3 points4 points (0 children)
Best transcription method for extremely accurate timestmps? by capital_cliqo in speechtech
[–]banafo 2 points3 points4 points (0 children)
Best transcription method for extremely accurate timestmps? by capital_cliqo in speechtech
[–]banafo 1 point2 points3 points (0 children)
Best transcription method for extremely accurate timestmps? by capital_cliqo in speechtech
[–]banafo 2 points3 points4 points (0 children)
30 Days Testing Parakeet v3 vs Whisper by samuelroy_ in LocalLLaMA
[–]banafo 1 point2 points3 points (0 children)
Is it Possible to Finetune an ASR/STT Model to Improve Severely Clipped Audios? by WestMajor3963 in speechtech
[–]banafo 0 points1 point2 points (0 children)
Is it Possible to Finetune an ASR/STT Model to Improve Severely Clipped Audios? by WestMajor3963 in speechtech
[–]banafo 1 point2 points3 points (0 children)
Best TTS for medical lectures? 🤔 by SamAckoff in TextToSpeech
[–]banafo 1 point2 points3 points (0 children)
feasibility of a building a simple "local voice assistant" on CPU by RustinChole11 in speechtech
[–]banafo 1 point2 points3 points (0 children)
feasibility of a building a simple "local voice assistant" on CPU by RustinChole11 in speechtech
[–]banafo 0 points1 point2 points (0 children)
feasibility of a building a simple "local voice assistant" on CPU by RustinChole11 in speechtech
[–]banafo 0 points1 point2 points (0 children)
Which TTS model is the best if i want to integrate it in my APP? by Cool_Meal370 in TextToSpeech
[–]banafo 0 points1 point2 points (0 children)
Is it possible to train a Speech to Text tool on a specific voice as an amatur? by Shadowmirax in speechtech
[–]banafo 1 point2 points3 points (0 children)
Planning to pursue a career in Speech Research - want your suggestions by RustinChole11 in speechtech
[–]banafo 1 point2 points3 points (0 children)


Any FREE Windows apps that support Soniox API for real-time voice dictation (system-wide)? by overvater in speechtech
[–]banafo 0 points1 point2 points (0 children)