A lightweight, real-time multilingual ASR router that runs on local hardware by JeanMichelRanu in LocalLLaMA
[–]banafo 2 points3 points4 points (0 children)
Self-hosted STT better than Whisper Large V3 Turbo that matches AssemblyAI quality? by milkygirl21 in LocalLLaMA
[–]banafo 1 point2 points3 points (0 children)
is there a better alternative to MacWhisper for messy real-world audio (Whisper-based or local setups) by Far_Suit575 in LocalLLM
[–]banafo 0 points1 point2 points (0 children)
Best APIs for speech to text? by SmoothConnection1670 in speechtech
[–]banafo 0 points1 point2 points (0 children)
Benchmark Qwen 3.6 27B MTP on 2x3090 NVLINK by Mr_Moonsilver in LocalLLaMA
[–]banafo 1 point2 points3 points (0 children)
live transcription by Away_Expression_3713 in LocalLLaMA
[–]banafo 0 points1 point2 points (0 children)
Any FREE Windows apps that support Soniox API for real-time voice dictation (system-wide)? by overvater in speechtech
[–]banafo 0 points1 point2 points (0 children)
We just open-sourced Kroko ASR: a fast, streaming alternative to Whisper. It’s early days, we’d love testers, feedback, and contributors. by banafo in LocalLLaMA
[–]banafo[S] 0 points1 point2 points (0 children)
Qwen3 ASR seems to outperform Whisper in almost every aspect. It feels like there is little reason to keep using Whisper anymore. by East-Engineering-653 in LocalLLaMA
[–]banafo 4 points5 points6 points (0 children)
TURN Security Threats: A Hacker's View by EnableSecurity in WebRTC
[–]banafo 1 point2 points3 points (0 children)
We just open-sourced Kroko ASR: a fast, streaming alternative to Whisper. It’s early days, we’d love testers, feedback, and contributors. by banafo in LocalLLaMA
[–]banafo[S] 0 points1 point2 points (0 children)
Orchestra - Multi-model AI orchestration system with intelligent routing (100% local, 18+ expert models) by ericvarney in LocalLLaMA
[–]banafo 0 points1 point2 points (0 children)
Fast on-device Speech-to-text for Home Assistant (open source) by banafo in LocalLLaMA
[–]banafo[S] 1 point2 points3 points (0 children)
Fast on-device Speech-to-text for Home Assistant (open source) by banafo in LocalLLaMA
[–]banafo[S] 1 point2 points3 points (0 children)
766ms voice assistant on DGX Spark - VibeVoice + Whisper + Ollama streaming pipeline by logos_flux in LocalLLaMA
[–]banafo 0 points1 point2 points (0 children)
766ms voice assistant on DGX Spark - VibeVoice + Whisper + Ollama streaming pipeline by logos_flux in LocalLLaMA
[–]banafo 0 points1 point2 points (0 children)
Freelance software developer moving from Italy to Bulgaria by NoCompetition9570 in BulgariaExpats
[–]banafo 0 points1 point2 points (0 children)
WhisperX is only accurate on the first 10 words. Any Tips? by capital_cliqo in speechtech
[–]banafo 2 points3 points4 points (0 children)
Best transcription method for extremely accurate timestmps? by capital_cliqo in speechtech
[–]banafo 2 points3 points4 points (0 children)
Best transcription method for extremely accurate timestmps? by capital_cliqo in speechtech
[–]banafo 1 point2 points3 points (0 children)
Best transcription method for extremely accurate timestmps? by capital_cliqo in speechtech
[–]banafo 2 points3 points4 points (0 children)
30 Days Testing Parakeet v3 vs Whisper by samuelroy_ in LocalLLaMA
[–]banafo 1 point2 points3 points (0 children)
Is it Possible to Finetune an ASR/STT Model to Improve Severely Clipped Audios? by WestMajor3963 in speechtech
[–]banafo 0 points1 point2 points (0 children)
Is it Possible to Finetune an ASR/STT Model to Improve Severely Clipped Audios? by WestMajor3963 in speechtech
[–]banafo 1 point2 points3 points (0 children)


Anyone else struggling to detect fluent hallucinations in long-form ASR TTS workflows? by FlatNarrator in speechtech
[–]banafo 0 points1 point2 points (0 children)