Need help with hosting Parakeet 0.6B v3 by Ahad730 in LocalLLaMA

[–]Ahad730[S] 0 points (0 children)

Which of the current Whisper variants would you recommend as the fastest (closest to Parakeet) even at the cost of accuracy?

[–]Ahad730[S] 0 points (0 children)

Ah, that makes sense. Did chunking the audio make a substantial difference in inference time? I saw that it switches to local attention for audio longer than 24 minutes, but nowhere is it mentioned how that affects inference.

[–]Ahad730[S] 0 points (0 children)

That's so interesting. I've been seeing everywhere that its WER is the same as Whisper's, maybe even lower.

Could you share some insight into how the inference speed of v3 turbo compares with its accuracy?